est sequencing oligo: Topics by Science.gov

Sample records for est sequencing oligo

LSGermOPA, a custom OPA of 384 EST-derived SNPs for high-throughput lettuce (Lactuca sativa L.) germplasm fingerprinting

USDA-ARS's Scientific Manuscript database

We assessed the genetic diversity and population structure among 148 cultivated lettuce (Lactuca sativa L.) accessions using the high-throughput GoldenGate assay and 384 EST (Expressed Sequence Tag)-derived SNP (single nucleotide polymorphism) markers. A custom OPA (Oligo Pool All), LSGermOPA was fo...
Curation of microarray oligonucleotides and corresponding ESTs/cDNAs used for gene expression analysis in zebra finches.

PubMed

Lovell, Peter V; Huizinga, Nicole A; Getachew, Abel; Mees, Brianna; Friedrich, Samantha R; Wirthlin, Morgan; Mello, Claudio V

2018-05-18

Zebra finches are a major model organism for investigating mechanisms of vocal learning, a trait that enables spoken language in humans. The development of cDNA collections with expressed sequence tags (ESTs) and microarrays has allowed for extensive molecular characterizations of circuitry underlying vocal learning and production. However, poor database curation can lead to errors in transcriptome and bioinformatics analyses, limiting the impact of these resources. Here we used genomic alignments and synteny analysis for orthology verification to curate and reannotate ~35% of the oligonucleotides and corresponding ESTs/cDNAs that make up Agilent microarrays for gene expression analysis in finches. We found that: (1) 5475 out of 43,084 oligos (a) failed to align to the zebra finch genome, (b) aligned to multiple loci, or (c) aligned to Chr_un only, and thus need to be flagged until a better genome assembly is available, or (d) reflect cloning artifacts; (2) out of 9635 valid oligos examined further, 3120 were incorrectly named, including 1533 with no known orthologs; and (3) 2635 oligos required a name update. The resulting curated dataset provides a reference for correcting gene identification errors in previous finch microarray studies and for avoiding such errors in future studies.
Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

PubMed Central

2013-01-01

Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for comparison. There were a total of 567,297 ESTs belonging to 27 cultivars in varying numbers and consequentially yielding different numbers of haplotype-based quality SNPs. Sweet orange (SO) had the most (213,830) ESTs, generating 11,182 quality SNPs in 3,327 out of 4,228 usable contigs. Summed from all the individually mining results, a total of 25,417 quality SNPs were discovered – 15,010 (59.1%) were transitions (AG and CT), 9,114 (35.9%) were transversions (AC, GT, CG, and AT), and 1,293 (5.0%) were insertion/deletions (indels). A vast majority of SNP-containing contigs consisted of only 2 haplotypes, as expected, but the percentages of 2 haplotype contigs varied widely in these citrus cultivars. BLAST of the 25,417 25-mer SNP oligos to the Clementine reference genome scaffolds revealed 2,947 SNPs had “no hits found”, 19,943 had 1 unique hit / alignment, 1,571 had one hit and 2+ alignments per hit, and 956 had 2+ hits and 1+ alignment per hit. Of the total 24,293 scaffold hits, 23,955 (98.6%) were on the main scaffolds 1 to 9, and only 338 were on 87 minor scaffolds. Most alignments had 100% (25/25) or 96% (24/25) nucleotide identities, accounting for 93% of all the alignments. Considering almost all the nucleotide discrepancies in the 24/25 alignments were at the SNP sites, it served well as in silico validation of these SNPs, in addition to and consistent with the rate (81%) validated by sequencing and SNaPshot assay. 
Conclusions High-quality EST-SNPs from different citrus genotypes were detected, and compared to estimate the heterozygosity of each genome. All the SNP oligo sequences were aligned with the Clementine citrus genome to determine their distribution and uniqueness and for in silico validation, in addition to SNaPshot and sequencing validation of selected SNPs. PMID:24175923
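The transition/transversion/indel breakdown reported in the citrus abstract above can be illustrated with a short sketch. This is not the authors' mining pipeline; the function name `classify_snp` and the use of "-" to mark a gap allele are assumptions made for illustration. Only the three counts are taken from the abstract.

```python
# Hypothetical helper: classify a two-allele variant by base chemistry.
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_snp(ref: str, alt: str) -> str:
    """Return 'transition', 'transversion', or 'indel' for a pair of alleles."""
    ref, alt = ref.upper(), alt.upper()
    # Assumption: a gap symbol or a multi-base allele marks an insertion/deletion.
    if len(ref) != 1 or len(alt) != 1 or "-" in (ref, alt):
        return "indel"
    if ref not in PURINES | PYRIMIDINES or alt not in PURINES | PYRIMIDINES:
        raise ValueError("alleles must be A, C, G, or T")
    if ref == alt:
        raise ValueError("ref and alt alleles are identical")
    # Purine<->purine or pyrimidine<->pyrimidine is a transition.
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


# Counts quoted in the abstract; the percentages are recomputed, not copied.
counts = {"transition": 15010, "transversion": 9114, "indel": 1293}
total = sum(counts.values())
for kind, n in counts.items():
    print(f"{kind}: {n} ({100 * n / total:.1f}%)")
```

The three counts sum to the 25,417 quality SNPs stated in the abstract, which is a quick consistency check on the reported figures.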
A functional genomics tool for the Pacific bluefin tuna: Development of a 44K oligonucleotide microarray from whole-genome sequencing data for global transcriptome analysis.

PubMed

Yasuike, Motoshige; Fujiwara, Atushi; Nakamura, Yoji; Iwasaki, Yuki; Nishiki, Issei; Sugaya, Takuma; Shimizu, Akio; Sano, Motohiko; Kobayashi, Takanori; Ototake, Mitsuru

2016-02-01

Bluefin tunas are one of the most important fishery resources worldwide. Because of high market values, bluefin tuna farming has been rapidly growing during recent years. At present, the most common form of the tuna farming is based on the stocking of wild-caught fish. Therefore, concerns have been raised about the negative impact of the tuna farming on wild stocks. Recently, the Pacific bluefin tuna (PBT), Thunnus orientalis, has succeeded in completing the reproduction cycle under aquaculture conditions, but production bottlenecks remain to be solved because of very little biological information on bluefin tunas. Functional genomics approaches promise to rapidly increase our knowledge on biological processes in the bluefin tuna. Here, we describe the development of the first 44K PBT oligonucleotide microarray (oligo-array), based on whole-genome shotgun (WGS) sequencing and large-scale expressed sequence tags (ESTs) data. In addition, we also introduce an initial 44K PBT oligo-array experiment using in vitro grown peripheral blood leukocytes (PBLs) stimulated with immunostimulants such as lipopolysaccharide (LPS: a cell wall component of Gram-negative bacteria) or polyinosinic:polycytidylic acid (poly I:C: a synthetic mimic of viral infection). This pilot 44K PBT oligo-array analysis successfully addressed distinct immune processes between LPS- and poly I:C- stimulated PBLs. Thus, we expect that this oligo-array will provide an excellent opportunity to analyze global gene expression profiles for a better understanding of diseases and stress, as well as for reproduction, development and influence of nutrition on tuna aquaculture production. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Scar-less multi-part DNA assembly design automation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hillson, Nathan J.

The present invention provides a method of designing an implementation of a DNA assembly. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding flanking homology sequences to each of the DNA oligos. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding optimized overhang sequences to each of the DNA oligos.
The Est3 protein associates with yeast telomerase through an OB-fold domain

PubMed Central

Lee, Jaesung S.; Mandell, Edward K.; Tucey, Timothy M.; Morris, Danna K.; Lundblad, Victoria

2009-01-01

The Est3 protein is a small regulatory subunit of yeast telomerase which is dispensable for enzyme catalysis but essential for telomere replication in vivo. Using structure prediction combined with in vivo characterization, we show here that Est3 consists of a predicted OB (oligo-saccharide/oligo-nucleotide binding) fold. Mutagenesis of predicted surface residues was used to generate a functional map of one surface of Est3, which identified a site that mediates association with the telomerase complex. Surprisingly, the predicted OB-fold of Est3 is structurally similar to the OB-fold of the mammalian TPP1 protein, despite the fact that Est3 and TPP1, as components of telomerase and a telomere capping complex, respectively, perform functionally distinct tasks at chromosome ends. The analysis performed on Est3 may be instructive in generating comparable missense mutations on the surface of the OB-fold domain of TPP1. PMID:19172754
Oligo/Polynucleotide-Based Gene Modification: Strategies and Therapeutic Potential

PubMed Central

Sargent, R. Geoffrey; Kim, Soya

2011-01-01

Oligonucleotide- and polynucleotide-based gene modification strategies were developed as an alternative to transgene-based and classical gene targeting-based gene therapy approaches for treatment of genetic disorders. Unlike the transgene-based strategies, oligo/polynucleotide gene targeting approaches maintain gene integrity and the relationship between the protein coding and gene-specific regulatory sequences. Oligo/polynucleotide-based gene modification also has several advantages over classical vector-based homologous recombination approaches. These include essentially complete homology to the target sequence and the potential to rapidly engineer patient-specific oligo/polynucleotide gene modification reagents. Several oligo/polynucleotide-based approaches have been shown to successfully mediate sequence-specific modification of genomic DNA in mammalian cells. The strategies involve the use of polynucleotide small DNA fragments, triplex-forming oligonucleotides, and single-stranded oligodeoxynucleotides to mediate homologous exchange. The primary focus of this review will be on the mechanistic aspects of the small fragment homologous replacement, triplex-forming oligonucleotide-mediated, and single-stranded oligodeoxynucleotide-mediated gene modification strategies as it relates to their therapeutic potential. PMID:21417933
MELOGEN: an EST database for melon functional genomics

PubMed Central

Gonzalez-Ibeas, Daniel; Blanca, José; Roig, Cristina; González-To, Mireia; Picó, Belén; Truniger, Verónica; Gómez, Pedro; Deleu, Wim; Caño-Delgado, Ana; Arús, Pere; Nuez, Fernando; Garcia-Mas, Jordi; Puigdomènech, Pere; Aranda, Miguel A

2007-01-01

Background Melon (Cucumis melo L.) is one of the most important fleshy fruits for fresh consumption. Despite this, few genomic resources exist for this species. To facilitate the discovery of genes involved in essential traits, such as fruit development, fruit maturation and disease resistance, and to speed up the process of breeding new and better adapted melon varieties, we have produced a large collection of expressed sequence tags (ESTs) from eight normalized cDNA libraries from different tissues in different physiological conditions. Results We determined over 30,000 ESTs that were clustered into 16,637 non-redundant sequences or unigenes, comprising 6,023 tentative consensus sequences (contigs) and 10,614 unclustered sequences (singletons). Many potential molecular markers were identified in the melon dataset: 1,052 potential simple sequence repeats (SSRs) and 356 single nucleotide polymorphisms (SNPs) were found. Sixty-nine percent of the melon unigenes showed a significant similarity with proteins in databases. Functional classification of the unigenes was carried out following the Gene Ontology scheme. In total, 9,402 unigenes were mapped to one or more ontology. Remarkably, the distributions of melon and Arabidopsis unigenes followed similar tendencies, suggesting that the melon dataset is representative of the whole melon transcriptome. Bioinformatic analyses primarily focused on potential precursors of melon micro RNAs (miRNAs) in the melon dataset, but many other genes potentially controlling disease resistance and fruit quality traits were also identified. Patterns of transcript accumulation were characterised by Real-Time-qPCR for 20 of these genes. Conclusion The collection of ESTs characterised here represents a substantial increase on the genetic information available for melon. A database (MELOGEN) which contains all EST sequences, contig images and several tools for analysis and data mining has been created. 
This set of sequences constitutes also the basis for an oligo-based microarray for melon that is being used in experiments to further analyse the melon transcriptome. PMID:17767721
Construction of an Ostrea edulis database from genomic and expressed sequence tags (ESTs) obtained from Bonamia ostreae infected haemocytes: Development of an immune-enriched oligo-microarray.

PubMed

Pardo, Belén G; Álvarez-Dios, José Antonio; Cao, Asunción; Ramilo, Andrea; Gómez-Tato, Antonio; Planas, Josep V; Villalba, Antonio; Martínez, Paulino

2016-12-01

The flat oyster, Ostrea edulis, is one of the main farmed oysters, not only in Europe but also in the United States and Canada. Bonamiosis due to the parasite Bonamia ostreae has been associated with high mortality episodes in this species. This parasite is an intracellular protozoan that infects haemocytes, the main cells involved in oyster defence. Due to the economic and ecological importance of flat oyster, genomic data are badly needed for genetic improvement of the species, but they are still very scarce. The objective of this study is to develop a sequence database, OedulisDB, with new genomic and transcriptomic resources, providing new data and convenient tools to improve our knowledge of the oyster's immune mechanisms. Transcriptomic and genomic sequences were obtained using 454 pyrosequencing and compiled into an O. edulis database, OedulisDB, consisting of two sets of 10,318 and 7159 unique sequences that represent the oyster's genome (WG) and de novo haemocyte transcriptome (HT), respectively. The flat oyster transcriptome was obtained from two strains (naïve and tolerant) challenged with B. ostreae, and from their corresponding non-challenged controls. Approximately 78.5% of 5619 HT unique sequences were successfully annotated by Blast search using public databases. A total of 984 sequences were identified as being related to immune response and several key immune genes were identified for the first time in flat oyster. Additionally, transcriptome information was used to design and validate the first oligo-microarray in flat oyster enriched with immune sequences from haemocytes. Our transcriptomic and genomic sequencing and subsequent annotation have largely increased the scarce resources available for this economically important species and have enabled us to develop an OedulisDB database and accompanying tools for gene expression analysis. This study represents the first attempt to characterize in depth the O. edulis haemocyte transcriptome in response to B. ostreae through massive sequencing and has helped improve our knowledge of the immune mechanisms of flat oyster. The validated oligo-microarray and the establishment of a reference transcriptome will be useful for large-scale gene expression studies in this species. Copyright © 2016 Elsevier Ltd. All rights reserved.
Selection of optimal oligonucleotide probes for microarrays using multiple criteria, global alignment and parameter estimation.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Xingyuan; He, Zhili; Zhou, Jizhong

2005-10-30

The oligonucleotide specificity for microarray hybridization can be predicted by its sequence identity to non-targets, continuous stretch to non-targets, and/or binding free energy to non-targets. Most currently available programs only use one or two of these criteria, which may choose 'false' specific oligonucleotides or miss 'true' optimal probes in a considerable proportion. We have developed a software tool, called CommOligo, using new algorithms and all three criteria for selection of optimal oligonucleotide probes. A series of filters, including sequence identity, free energy, continuous stretch, GC content, self-annealing, distance to the 3'-untranslated region (3'-UTR) and melting temperature (Tm), are used to check each possible oligonucleotide. A sequence identity is calculated based on gapped global alignments. A traversal algorithm is used to generate alignments for free energy calculation. The optimal Tm interval is determined based on probe candidates that have passed all other filters. Final probes are picked using a combination of user-configurable piece-wise linear functions and an iterative process. The thresholds for identity, stretch and free energy filters are automatically determined from experimental data by an accessory software tool, CommOligo_PE (CommOligo Parameter Estimator). The program was used to design probes for both whole-genome and highly homologous sequence data. CommOligo and CommOligo_PE are freely available to academic users upon request.
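Two of the filters named in the CommOligo abstract, GC content and continuous stretch to non-targets, are simple enough to sketch. What follows is a minimal illustration, not CommOligo's implementation: the function names, the GC window, and the stretch cutoff are all assumptions, and the identity, free-energy, self-annealing, 3'-UTR distance and Tm filters are omitted.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def longest_common_stretch(probe: str, non_target: str) -> int:
    """Length of the longest contiguous run shared by probe and non-target."""
    probe, non_target = probe.upper(), non_target.upper()
    best = 0
    prev = [0] * (len(non_target) + 1)
    # Dynamic programming over suffix-match lengths (longest common substring).
    for i in range(1, len(probe) + 1):
        curr = [0] * (len(non_target) + 1)
        for j in range(1, len(non_target) + 1):
            if probe[i - 1] == non_target[j - 1]:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def passes_filters(probe, non_targets, gc_range=(0.40, 0.60), max_stretch=15):
    """Apply the two illustrative filters; the thresholds are hypothetical."""
    low, high = gc_range
    if not low <= gc_content(probe) <= high:
        return False
    return all(longest_common_stretch(probe, nt) <= max_stretch
               for nt in non_targets)


# Toy sequences chosen for illustration only.
print(passes_filters("ACGTACGT", ["TTACGTTT"], max_stretch=4))
print(passes_filters("ACGTACGT", ["TTACGTTT"], max_stretch=5))
```

In the abstract the thresholds are estimated from experimental data by CommOligo_PE; here they are plain keyword arguments so that a caller can supply whatever values suit the data.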
Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray

PubMed Central

2010-01-01

Background Flax (Linum usitatissimum L.) has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars) and its cellulose-rich fibres (fibre-flax cultivars) used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products, and our actual understanding of the regulation of both wood fibre production and oil biosynthesis more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Results Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K) fabrication with eight, non-overlapping 25-mers oligos per unigene. 18 independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative qRT-PCR. Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples). A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. 
Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well as between two contrasted flax varieties. Conclusion All results suggest that our high-density flax oligo-microarray platform can be used as a very sensitive tool for analyzing gene expression in a large variety of tissues as well as in different cultivars. Moreover, this highly reliable platform can also be used for the quantification of mRNA transcriptional profiling in different flax tissues. PMID:20964859
Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray.

PubMed

Fenart, Stéphane; Ndong, Yves-Placide Assoumou; Duarte, Jorge; Rivière, Nathalie; Wilmer, Jeroen; van Wuytswinkel, Olivier; Lucau, Anca; Cariou, Emmanuelle; Neutelings, Godfrey; Gutierrez, Laurent; Chabbert, Brigitte; Guillot, Xavier; Tavernier, Reynald; Hawkins, Simon; Thomasset, Brigitte

2010-10-21

Flax (Linum usitatissimum L.) has been cultivated for around 9,000 years and is therefore one of the oldest cultivated species. Today, flax is still grown for its oil (oil-flax or linseed cultivars) and its cellulose-rich fibres (fibre-flax cultivars) used for high-value linen garments and composite materials. Despite the wide industrial use of flax-derived products, and our actual understanding of the regulation of both wood fibre production and oil biosynthesis more information must be acquired in both domains. Recent advances in genomics are now providing opportunities to improve our fundamental knowledge of these complex processes. In this paper we report the development and validation of a high-density oligo microarray platform dedicated to gene expression analyses in flax. Nine different RNA samples obtained from flax inner- and outer-stems, seeds, leaves and roots were used to generate a collection of 1,066,481 ESTs by massive parallel pyrosequencing. Sequences were assembled into 59,626 unigenes and 48,021 sequences were selected for oligo design and high-density microarray (Nimblegen 385K) fabrication with eight, non-overlapping 25-mers oligos per unigene. 18 independent experiments were used to evaluate the hybridization quality, precision, specificity and accuracy and all results confirmed the high technical quality of our microarray platform. Cross-validation of microarray data was carried out using quantitative qRT-PCR. Nine target genes were selected on the basis of microarray results and reflected the whole range of fold change (both up-regulated and down-regulated genes in different samples). A statistically significant positive correlation was obtained comparing expression levels for each target gene across all biological replicates both in qRT-PCR and microarray results. Further experiments illustrated the capacity of our arrays to detect differential gene expression in a variety of flax tissues as well as between two contrasted flax varieties. 
All results suggest that our high-density flax oligo-microarray platform can be used as a very sensitive tool for analyzing gene expression in a large variety of tissues as well as in different cultivars. Moreover, this highly reliable platform can also be used for the quantification of mRNA transcriptional profiling in different flax tissues.
Designing oligo libraries taking alternative splicing into account

NASA Astrophysics Data System (ADS)

Shoshan, Avi; Grebinskiy, Vladimir; Magen, Avner; Scolnicov, Ariel; Fink, Eyal; Lehavi, David; Wasserman, Alon

2001-06-01

We have designed sequences for DNA microarrays and oligo libraries, taking alternative splicing into account. Alternative splicing is a common phenomenon, occurring in more than 25% of the human genes. In many cases, different splice variants have different functions, are expressed in different tissues or may indicate different stages of disease. When designing sequences for DNA microarrays or oligo libraries, it is very important to take into account the sequence information of all the mRNA transcripts. Therefore, when a gene has more than one transcript (as a result of alternative splicing, alternative promoter sites or alternative poly-adenylation sites), it is very important to take all of them into account in the design. We have used the LEADS transcriptome prediction system to cluster and assemble the human sequences in GenBank and design optimal oligonucleotides for all the human genes with a known mRNA sequence based on the LEADS predictions.
j5 v2.8.4

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hillson, Nathan

j5 automates and optimizes the design of the molecular biological process of cloning/constructing DNA. j5 enables users to benefit from (combinatorial) multi-part scar-less SLIC, Gibson, CPEC, Golden Gate assembly, or variants thereof, for which automation software does not currently exist, without the intense labor currently associated with the process. j5 inputs a list of the DNA sequences to be assembled, along with a GenBank, FASTA, jbei-seq, or SBOL v1.1 format sequence file for each DNA source. Given the list of DNA sequences to be assembled, j5 first determines the cost-minimizing assembly strategy for each part (direct synthesis, PCR/SOE, or oligo-embedding), designs DNA oligos with Primer3, adds flanking homology sequences (SLIC, Gibson, and CPEC; optimized with Primer3 for CPEC) or optimized overhang sequences (Golden Gate) to the oligos and direct synthesis pieces, and utilizes BLAST to check against oligo mis-priming and assembly piece incompatibility events. After identifying DNA oligos that are already contained within a local collection for reuse, the program estimates the total cost of direct synthesis and new oligos to be ordered. In the instance that j5 identifies putative assembly piece incompatibilities (multiple pieces with high flanking sequence homology), the program suggests hierarchical subassemblies where possible. The program outputs a comma-separated value (CSV) file, viewable via Excel or other spreadsheet software, that contains assembly design information (such as the PCR/SOE reactions to perform, their anticipated sizes and sequences, etc.) as well as a properly annotated GenBank file containing the sequence resulting from the assembly, and appends the local oligo library with the oligos to be ordered. j5 condenses multiple independent assembly projects into 96-well format for high-throughput liquid-handling robotics platforms, and generates configuration files for the PR-PR biology-friendly robot programming language. 
j5 thus provides a new way to design DNA assembly procedures much more productively and efficiently, not only in terms of time, but also in terms of cost. To a large extent, however, j5 does not allow people to do something that could not be done before by hand given enough time and effort. An exception to this is that, since the very act of using j5 to design the DNA assembly process standardizes the experimental details and workflow, j5 enables a single person to concurrently perform the independent DNA construction tasks of an entire group of researchers. Currently, this is not readily possible, since separate researchers employ disparate design strategies and workflows, and furthermore, their designs and workflows are very infrequently fully captured in an electronic format which is conducive to automation.
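The step of adding flanking homology sequences to oligos, mentioned in the j5 record above, can be sketched for a linear assembly. This is an illustration only and does not reproduce j5: it uses fixed annealing and homology lengths instead of Primer3 optimization, performs no BLAST mis-priming check, and the function `design_primers` and its parameters are hypothetical names.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G, T."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(fragments, anneal_len=20, homology_len=15):
    """Return one (forward, reverse) primer pair per fragment, in order.

    Each forward primer carries the tail of the previous fragment as a 5'
    flank; each reverse primer carries the head of the next fragment.
    The first and last fragments of a linear assembly get no outer flank.
    """
    fragments = [f.upper() for f in fragments]
    primers = []
    for i, frag in enumerate(fragments):
        left_flank = fragments[i - 1][-homology_len:] if i > 0 else ""
        right_flank = (fragments[i + 1][:homology_len]
                       if i < len(fragments) - 1 else "")
        forward = left_flank + frag[:anneal_len]
        reverse = reverse_complement(frag[-anneal_len:] + right_flank)
        primers.append((forward, reverse))
    return primers


# Toy fragments, far shorter than real parts, chosen to be easy to check.
pairs = design_primers(["AAAACCCC", "GGGGTTTT"], anneal_len=4, homology_len=2)
for fwd, rev in pairs:
    print(fwd, rev)
```

With these toy inputs, the reverse primer of the first fragment and the forward primer of the second both span the junction between the two fragments, which is the overlap that homology-based assembly methods rely on.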
Presence of DNA methyltransferase activity and CpC methylation in Drosophila melanogaster.

PubMed

Panikar, Chitra S; Rajpathak, Shriram N; Abhyankar, Varada; Deshmukh, Saniya; Deobagkar, Deepti D

2015-12-01

Drosophila melanogaster lacks DNMT1/DNMT3 based methylation machinery. Despite recent reports confirming the presence of low DNA methylation in Drosophila, little is known about the methyltransferase. Therefore, in this study, we have aimed to investigate the possible functioning of DNA methyltransferase in Drosophila. The 14 K oligo microarray slide was incubated with native cell extract from adult Drosophila to check the presence of the methyltransferase activity. After incubation under appropriate conditions, the methylated oligo sequences were identified by the binding of anti 5-methylcytosine monoclonal antibody. The antibody bound to the methylated oligos was detected using Cy3 labeled secondary antibody. Methylation sensitive restriction enzyme mediated PCR was used to assess the methylation at a few selected loci identified on the array. A few of the oligos were methylated under the assay conditions. Analysis of methylated oligo sequences provides evidence for the presence of de novo methyltransferase activity and allows identification of its sequence specificity in adult Drosophila. With the help of methylation sensitive enzymes we could detect the presence of CpC methylation in the selected genomic regions. This study reports the presence of an active DNA methyltransferase in adult Drosophila, which exhibits sequence specificity confirmed by the presence of asymmetric methylation at corresponding sites in the genomic DNA. It also provides an innovative approach to investigate methylation specificity of a native methyltransferase.
RoboOligo: software for mass spectrometry data to support manual and de novo sequencing of post-transcriptionally modified ribonucleic acids

PubMed Central

Sample, Paul J.; Gaston, Kirk W.; Alfonzo, Juan D.; Limbach, Patrick A.

2015-01-01

Ribosomal ribonucleic acid (RNA), transfer RNA and other biological or synthetic RNA polymers can contain nucleotides that have been modified by the addition of chemical groups. Traditional Sanger sequencing methods cannot establish the chemical nature and sequence of these modified-nucleotide containing oligomers. Mass spectrometry (MS) has become the conventional approach for determining the nucleotide composition, modification status and sequence of modified RNAs. Modified RNAs are analyzed by MS using collision-induced dissociation tandem mass spectrometry (CID MS/MS), which produces a complex dataset of oligomeric fragments that must be interpreted to identify and place modified nucleosides within the RNA sequence. Here we report the development of RoboOligo, an interactive software program for the robust analysis of data generated by CID MS/MS of RNA oligomers. There are three main functions of RoboOligo: (i) automated de novo sequencing via the local search paradigm. (ii) Manual sequencing with real-time spectrum labeling and cumulative intensity scoring. (iii) A hybrid approach, coined ‘variable sequencing’, which combines the user intuition of manual sequencing with the high-throughput sampling of automated de novo sequencing. PMID:25820423
Superimposed Code Theoretic Analysis of DNA Codes and DNA Computing

DTIC Science & Technology

2010-03-01

because only certain collections (partitioned by font type) of sequences are allowed to be in each position (e.g., Arial = position 0, Comic ...rigidity of short oligos and the shape of the polar charge. Oligo movement was modeled by a Brownian motion 3 dimensional random walk. The one...temperature, kB is Boltz he viscosity of the medium. The random walk motion is modeled by assuming the oligo is on a three dimensional lattice and may
Terminator oligo blocking efficiently eliminates rRNA from Drosophila small RNA sequencing libraries.

PubMed

Wickersheim, Michelle L; Blumenstiel, Justin P

2013-11-01

A large number of methods are available to deplete ribosomal RNA reads from high-throughput RNA sequencing experiments. Such methods are critical for sequencing Drosophila small RNAs between 20 and 30 nucleotides because size selection is not typically sufficient to exclude the highly abundant class of 30 nucleotide 2S rRNA. Here we demonstrate that pre-annealing terminator oligos complementary to Drosophila 2S rRNA prior to 5' adapter ligation and reverse transcription efficiently depletes 2S rRNA sequences from the sequencing reaction in a simple and inexpensive way. This depletion is highly specific and is achieved with minimal perturbation of miRNA and piRNA profiles.
Opposite consequences of two transcription pauses caused by an intrinsic terminator oligo(U): antitermination versus termination by bacteriophage T7 RNA polymerase.

PubMed

Lee, Sooncheol; Kang, Changwon

2011-05-06

The RNA oligo(U) sequence, along with an immediately preceding RNA hairpin structure, is an essential cis-acting element for bacterial class I intrinsic termination. This sequence not only causes a pause in transcription during the beginning of the termination process but also facilitates transcript release at the end of the process. In this study, the oligo(U) sequence of the bacteriophage T7 intrinsic terminator Tφ, rather than the hairpin structure, induced pauses of phage T7 RNA polymerase not only at the termination site, triggering a termination process, but also 3 bp upstream, exerting an antitermination effect. The upstream pause presumably allowed RNA to form a thermodynamically more stable secondary structure rather than a terminator hairpin and to persist because the 5'-half of the terminator hairpin-forming sequence could be sequestered by a farther upstream sequence via sequence-specific hybridization, prohibiting formation of the terminator hairpin and termination. The putative antiterminator RNA structure lacked several base pairs essential for termination when probed using RNases A, T1, and V1. When the antiterminator was destabilized by incorporation of IMP into nascent RNA at G residue positions, antitermination was abolished. Furthermore, antitermination strength increased with more stable antiterminator secondary structures and longer pauses. Thus, the oligo(U)-mediated pause prior to the termination site can exert a cis-acting antitermination activity on intrinsic terminator Tφ, and the termination efficiency depends primarily on the termination-interfering pause that precedes the termination-facilitating pause at the termination site.
An ovary transcriptome for all maturational stages of the striped bass (Morone saxatilis), a highly advanced perciform fish.

PubMed

Reading, Benjamin J; Chapman, Robert W; Schaff, Jennifer E; Scholl, Elizabeth H; Opperman, Charles H; Sullivan, Craig V

2012-02-21

The striped bass and its relatives (genus Morone) are important fisheries and aquaculture species native to estuaries and rivers of the Atlantic coast and Gulf of Mexico in North America. To open avenues of gene expression research on reproduction and breeding of striped bass, we generated a collection of expressed sequence tags (ESTs) from a complementary DNA (cDNA) library representative of their ovarian transcriptome. Sequences of a total of 230,151 ESTs (51,259,448 bp) were acquired by Roche 454 pyrosequencing of cDNA pooled from ovarian tissues obtained at all stages of oocyte growth, at ovulation (eggs), and during preovulatory atresia. Quality filtering of ESTs allowed assembly of 11,208 high-quality contigs ≥ 100 bp, including 2,984 contigs 500 bp or longer (average length 895 bp). Blastx comparisons revealed 5,482 gene orthologues (E-value < 10^-3), of which 4,120 (36.7% of total contigs) were annotated with Gene Ontology terms (E-value < 10^-6). There were 5,726 remaining unknown unique sequences (51.1% of total contigs). All of the high-quality EST sequences are available in the National Center for Biotechnology Information (NCBI) Short Read Archive (GenBank: SRX007394). Informative contigs were considered to be abundant if they were assembled from groups of ESTs comprising ≥ 0.15% of the total short read sequences (≥ 345 reads/contig). Approximately 52.5% of these abundant contigs were predicted to have predominant ovary expression through digital differential display in silico comparisons to zebrafish (Danio rerio) UniGene orthologues. Over 1,300 Gene Ontology terms from Biological Process classes of Reproduction, Reproductive process, and Developmental process were assigned to this collection of annotated contigs. This first large reference sequence database available for the ecologically and economically important temperate basses (genus Morone) provides a foundation for gene expression studies in these species.
The predicted predominance of ovary gene expression and assignment of directly relevant Gene Ontology classes suggests a powerful utility of this dataset for analysis of ovarian gene expression related to fundamental questions of oogenesis. Additionally, a high definition Agilent 60-mer oligo ovary 'UniClone' microarray with 8 × 15,000 probe format has been designed based on this striped bass transcriptome (eArray Group: Striper Group, Design ID: 029004).
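The counts quoted in the striped bass record can be cross-checked with a few lines of arithmetic. The sketch below only recomputes figures from the abstract (contig percentages and the 0.15% abundance cutoff). The mean read length is a derived value that the abstract does not state, and all variable names are illustrative.

```python
# Figures quoted in the striped bass ovary transcriptome abstract.
total_ests = 230_151
total_bp = 51_259_448
total_contigs = 11_208
orthologues = 5_482
go_annotated = 4_120
unknown = 5_726


def percent(part: int, whole: int) -> float:
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


# Derived here for illustration; not reported in the abstract.
mean_read_len = total_bp / total_ests

go_pct = percent(go_annotated, total_contigs)
unknown_pct = percent(unknown, total_contigs)

# "Abundant" contigs: at least 0.15% of all reads.
abundance_cutoff_reads = 0.0015 * total_ests

print(f"mean read length: {mean_read_len:.1f} bp")
print(f"GO-annotated contigs: {go_pct:.2f}% of contigs")
print(f"unknown contigs: {unknown_pct:.2f}% of contigs")
print(f"abundance cutoff: {abundance_cutoff_reads:.1f} reads per contig")
```

Orthologue and unknown contigs sum to the contig total (5,482 + 5,726 = 11,208), which is consistent with the two groups partitioning the assembly.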

MS/MS Digital Readout: Analysis of Binary Information Encoded in the Monomer Sequences of Poly(triazole amide)s.

PubMed

Amalian, Jean-Arthur; Trinh, Thanh Tam; Lutz, Jean-François; Charles, Laurence

2016-04-05

Tandem mass spectrometry was evaluated as a reliable sequencing methodology to read codes encrypted in monodisperse sequence-coded oligo(triazole amide)s. The studied oligomers were composed of monomers containing a triazole ring, a short ethylene oxide segment, and an amide group as well as a short alkyl chain (propyl or isobutyl) which defined the 0/1 molecular binary code. Using electrospray ionization, oligo(triazole amide)s were best ionized as protonated molecules and were observed to adopt a single charge state, suggesting that adducted protons were located on every other monomer unit. Upon collisional activation, cleavages of the amide bond and of one ether bond were observed to proceed in each monomer, yielding two sets of complementary product ions. Distribution of protons over the precursor structure was found to remain unchanged upon activation, allowing charge state to be anticipated for product ions in the four series and hence facilitating their assignment for a straightforward characterization of any encoded oligo(triazole amide)s.
Parallel beta/alpha-barrels of alpha-amylase, cyclodextrin glycosyltransferase and oligo-1,6-glucosidase versus the barrel of beta-amylase: evolutionary distance is a reflection of unrelated sequences.

PubMed

Janecek, S

1994-10-17

The structures of functionally related beta/alpha-barrel starch hydrolases, alpha-amylase, beta-amylase, cyclodextrin glycosyltransferase and oligo-1,6-glucosidase, are discussed, their mutual sequence similarities being emphasized. Since these enzymes (except for beta-amylase) along with the predicted set of more than ten beta/alpha-barrels from the alpha-amylase enzyme superfamily fulfil the criteria characteristic of the products of divergent evolution, their unrooted distance tree is presented.
Triple helix purification and sequencing

DOEpatents

Wang, Renfeng; Smith, Lloyd M.; Tong, Xinchun E.

1995-01-01

Disclosed herein are methods, kits, and equipment for purifying single stranded circular DNA and then using the DNA for DNA sequencing purposes. Templates are provided with an insert having a hybridization region. An elongated oligonucleotide has two regions that are complementary to the insert and the oligo is bound to a magnetic anchor. The oligo hybridizes to the insert on two sides to form a stable triple helix complex. The anchor can then be used to drag the template out of solution using a magnet. The system can purify sequencing templates, and if desired the triple helix complex can be opened up to a double helix so that the oligonucleotide will act as a primer for further DNA synthesis.
Triple helix purification and sequencing

DOEpatents

Wang, R.; Smith, L.M.; Tong, X.E.

1995-03-28

Disclosed herein are methods, kits, and equipment for purifying single stranded circular DNA and then using the DNA for DNA sequencing purposes. Templates are provided with an insert having a hybridization region. An elongated oligonucleotide has two regions that are complementary to the insert and the oligo is bound to a magnetic anchor. The oligo hybridizes to the insert on two sides to form a stable triple helix complex. The anchor can then be used to drag the template out of solution using a magnet. The system can purify sequencing templates, and if desired the triple helix complex can be opened up to a double helix so that the oligonucleotide will act as a primer for further DNA synthesis. 4 figures.
Capillary electrophoretic separation-based approach to determine the labeling kinetics of oligodeoxynucleotides

PubMed Central

Kanavarioti, Anastassia; Greenman, Kevin L.; Hamalainen, Mark; Jain, Aakriti; Johns, Adam M.; Melville, Chris R.; Kemmish, Kent; Andregg, William

2014-01-01

With the recent advances in electron microscopy (EM), computation, and nanofabrication, the original idea of reading DNA sequence directly from an image can now be tested. One approach is to develop heavy atom labels that can provide the contrast required for EM imaging. While evaluating tentative labels for the respective nucleobases in synthetic oligodeoxynucleotides (oligos), we developed a streamlined capillary electrophoresis (CE) protocol to assess the label stability, reactivity, and selectivity. We report our protocol using osmium tetroxide 2,2′-bipyridine (Osbipy) as a thymidine (T) specific label. The observed rates show that the labeling process is kinetically independent of both the oligo length, and the base composition. The conditions, i.e. temperature, optimal Osbipy concentration, and molar ratio of reagents, to promote 100% conversion of the starting oligo to labeled product were established. Hence the optimized conditions developed with the oligos could be leveraged to allow osmylation of effectively all Ts in single-stranded (ss) DNA, while achieving minimal mislabeling. In addition, the approach and methods employed here may be adapted to the evaluation of other prospective contrasting agents/labels to facilitate next-generation DNA sequencing by EM. PMID:23147698
'Cold shock' increases the frequency of homology directed repair gene editing in induced pluripotent stem cells.

PubMed

Guo, Q; Mintier, G; Ma-Edmonds, M; Storton, D; Wang, X; Xiao, X; Kienzle, B; Zhao, D; Feder, John N

2018-02-01

Using CRISPR/Cas9 delivered as an RNA modality in conjunction with a lipid specifically formulated for large RNA molecules, we demonstrate that homology directed repair (HDR) rates between 20-40% can be achieved in induced pluripotent stem cells (iPSC). Furthermore, low HDR rates (between 1-20%) can be enhanced two- to ten-fold in both iPSCs and HEK293 cells by 'cold shocking' cells at 32 °C for 24-48 hours following transfection. This method can also increase the proportion of loci that have undergone complete sequence conversion across the donor sequence, or 'perfect HDR', as opposed to partial sequence conversion where nucleotides more distal to the CRISPR cut site are less efficiently incorporated ('partial HDR'). We demonstrate that the structure of the single-stranded DNA oligo donor can influence the fidelity of HDR, with oligos symmetric with respect to the CRISPR cleavage site and complementary to the target strand being more efficient at directing 'perfect HDR' compared to asymmetric non-target strand complementary oligos. Our protocol represents an efficient method for making CRISPR-mediated, specific DNA sequence changes within the genome that will facilitate the rapid generation of genetic models of human disease in iPSCs as well as other genome engineered cell lines.
Signal sequence and keyword trap in silico for selection of full-length human cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries.

PubMed

Otsuki, Tetsuji; Ota, Toshio; Nishikawa, Tetsuo; Hayashi, Koji; Suzuki, Yutaka; Yamamoto, Jun-ichi; Wakamatsu, Ai; Kimura, Kouichi; Sakamoto, Katsuhiko; Hatano, Naoto; Kawai, Yuri; Ishii, Shizuko; Saito, Kaoru; Kojima, Shin-ichi; Sugiyama, Tomoyasu; Ono, Tetsuyoshi; Okano, Kazunori; Yoshikawa, Yoko; Aotsuka, Satoshi; Sasaki, Naokazu; Hattori, Atsushi; Okumura, Koji; Nagai, Keiichi; Sugano, Sumio; Isogai, Takao

2005-01-01

We have developed an in silico method of selection of human full-length cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries. Fullness rates were increased to about 80% by combination of the oligo-capping method and ATGpr, software for prediction of translation start point and the coding potential. Then, using 5'-end single-pass sequences, cDNAs having the signal sequence were selected by PSORT ('signal sequence trap'). We also applied 'secretion or membrane protein-related keyword trap' based on the result of BLAST search against the SWISS-PROT database for the cDNAs which could not be selected by PSORT. Using the above procedures, 789 cDNAs were primarily selected and subjected to full-length sequencing, and 334 of these cDNAs were finally selected as novel. Most of the cDNAs (295 cDNAs: 88.3%) were predicted to encode secretion or membrane proteins. In particular, 165 (80.5%) of the 205 cDNAs selected by PSORT were predicted to have signal sequences, while 70 (54.2%) of the 129 cDNAs selected by 'keyword trap' preserved the secretion or membrane protein-related keywords. Many important cDNAs were obtained, including transporters, receptors, and ligands, involved in significant cellular functions. Thus, an efficient method of selecting secretion or membrane protein-encoding cDNAs was developed by combining the above four procedures.
Generation and analysis of ESTs from strawberry (Fragaria ×ananassa) fruits and evaluation of their utility in genetic and molecular studies

PubMed Central

2010-01-01

Background: Cultivated strawberry is a hybrid octoploid species (Fragaria ×ananassa Duchesne ex. Rozier) whose fruit is highly appreciated due to its organoleptic properties and health benefits. Despite recent studies on the control of its growth and ripening processes, information about the role played by different hormones on these processes remains elusive. Further advancement of this knowledge is hampered by the limited sequence information on genes from this species, despite the abundant information available on genes from the wild diploid relative Fragaria vesca. However, the diploid species, or one ancestor, only partially contributes to the genome of the cultivated octoploid. We have produced a collection of expressed sequence tags (ESTs) from different cDNA libraries prepared from different fruit parts and developmental stages. The collection has been analysed and the sequence information used to explore the involvement of different hormones in fruit developmental processes, and for the comparison of transcripts in the receptacle of ripe fruits of diploid and octoploid species. The study is particularly important since the commercial fruit is indeed an enlarged flower receptacle with the true fruits, the achenes, on the surface and connected through a network of vascular vessels to the central pith. Results: We have sequenced over 4,500 ESTs from Fragaria ×ananassa, thus doubling the number of ESTs available in GenBank for this species. We then assembled this information together with that available from F. ×ananassa, resulting in a total of 7,096 unigenes. The identification of SSRs and SNPs in many of the ESTs allowed their conversion into functional molecular markers. The availability of libraries prepared from green growing fruits has allowed the cloning of cDNAs encoding for genes of auxin, ethylene and brassinosteroid signalling processes, followed by expression studies in selected fruit parts and developmental stages.
In addition, the sequence information generated in the project, jointly with previous information on sequences from both F. ×ananassa and F. vesca, has allowed designing an oligo-based microarray that has been used to compare the transcriptome of the ripe receptacle of the diploid and octoploid species. Comparison of the transcriptomes, grouping the genes by biological processes, points to differences being quantitative rather than qualitative. Conclusions: The present study generates essential knowledge and molecular tools that will be useful in improving investigations at the molecular level in cultivated strawberry (F. ×ananassa). This knowledge is likely to provide useful resources in the ongoing breeding programs. The sequence information has already allowed the development of molecular markers that have been applied to germplasm characterization and could be eventually used in QTL analysis. Massive transcription analysis can be of utility to target specific genes to be further studied, by their involvement in the different plant developmental processes. PMID:20849591
sigReannot: an oligo-set re-annotation pipeline based on similarities with the Ensembl transcripts and Unigene clusters.

PubMed

Casel, Pierrot; Moreews, François; Lagarrigue, Sandrine; Klopp, Christophe

2009-07-16

Microarrays are a powerful technology for monitoring tens of thousands of genes in a single experiment. Most microarrays now use oligo-sets. The design of the oligo-nucleotides is time consuming and error prone. Genome-wide microarray oligo-sets are designed using as large a set of transcripts as possible in order to monitor as many genes as possible. Depending on the genome sequencing state and on the assembly state, the knowledge of the existing transcripts can be very different. This knowledge evolves with the different genome builds and gene builds. Once the design is done, the microarrays are often used for several years. The biologists working in EADGENE expressed the need for up-to-date annotation files for the oligo-sets they share, including information about the orthologous genes of model species, the Gene Ontology, the corresponding pathways and the chromosomal location. The results of SigReannot on a chicken micro-array used in the EADGENE project, compared to the initial annotations, show that 23% of the oligo-nucleotide gene annotations were not confirmed, 2% were modified and 1% were added. The interest of this up-to-date annotation procedure is demonstrated through the analysis of real data previously published. SigReannot uses the oligo-nucleotide design procedure criteria to validate the probe-gene link and the Ensembl transcripts as reference for annotation. It therefore produces a high quality annotation based on reference gene sets.
HTP-OligoDesigner: An Online Primer Design Tool for High-Throughput Gene Cloning and Site-Directed Mutagenesis.

PubMed

Camilo, Cesar M; Lima, Gustavo M A; Maluf, Fernando V; Guido, Rafael V C; Polikarpov, Igor

2016-01-01

Following burgeoning genomic and transcriptomic sequencing data, biochemical and molecular biology groups worldwide are implementing high-throughput cloning and mutagenesis facilities in order to obtain a large number of soluble proteins for structural and functional characterization. Since manual primer design can be a time-consuming and error-generating step, particularly when working with hundreds of targets, the automation of primer design process becomes highly desirable. HTP-OligoDesigner was created to provide the scientific community with a simple and intuitive online primer design tool for both laboratory-scale and high-throughput projects of sequence-independent gene cloning and site-directed mutagenesis and a Tm calculator for quick queries.
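A Tm calculator of the kind mentioned in the record above can be approximated with a widely used GC-content rule of thumb for oligos longer than about 13 nt, Tm ≈ 64.9 + 41 × (G + C − 16.4) / N. The abstract does not say which formula HTP-OligoDesigner uses, so this is a stand-in under that assumption; the function name `basic_tm` is hypothetical, and salt and oligo concentration are ignored.

```python
def basic_tm(primer: str) -> float:
    """Estimate melting temperature (deg C) from GC count and length.

    Uses the common approximation 64.9 + 41 * (G + C - 16.4) / N.
    This is an assumed, simplified formula, not the tool's documented method.
    """
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    gc = seq.count("G") + seq.count("C")
    return 64.9 + 41.0 * (gc - 16.4) / len(seq)


# Illustrative 20-mer with 10 G/C bases (made-up sequence).
example_primer = "ATGCATGCATGCATGCATGC"
print(f"{basic_tm(example_primer):.2f}")
```

For real primer design, a nearest-neighbor thermodynamic model is usually preferred over this kind of quick estimate.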
MerMade: An Oligodeoxyribonucleotide Synthesizer for High Throughput Oligonucleotide Production in Dual 96-Well Plates

PubMed Central

Rayner, Simon; Brignac, Stafford; Bumeister, Ron; Belosludtsev, Yuri; Ward, Travis; Grant, O’dell; O’Brien, Kevin; Evans, Glen A.; Garner, Harold R.

1998-01-01

We have designed and constructed a machine that synthesizes two standard 96-well plates of oligonucleotides in a single run using standard phosphoramidite chemistry. The machine is capable of making a combination of standard, degenerate, or modified oligos in a single plate. The run time is typically 17 hr for two plates of 20-mers and a reaction scale of 40 nm. The reaction vessel is a standard polypropylene 96-well plate with a hole drilled in the bottom of each well. The two plates are placed in separate vacuum chucks and mounted on an xy table. Each well in turn is positioned under the appropriate reagent injection line and the reagent is injected by switching a dedicated valve. All aspects of machine operation are controlled by a Macintosh computer, which also guides the user through the startup and shutdown procedures, provides a continuous update on the status of the run, and facilitates a number of service procedures that need to be carried out periodically. Over 25,000 oligos have been synthesized for use in dye terminator sequencing reactions, polymerase chain reactions (PCRs), hybridization, and RT–PCR. Oligos up to 100 bases in length have been made with a coupling efficiency in excess of 99%. These machines, working in conjunction with our oligo prediction code are particularly well suited to application in automated high throughput genomic sequencing. PMID:9685322
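To put the quoted coupling efficiency in context, the fraction of full-length product in stepwise synthesis is often approximated as the per-step efficiency raised to the number of couplings. The sketch below assumes a uniform efficiency and n − 1 couplings for an n-mer (first base pre-attached to the support); neither detail is stated in the abstract, and the function name is hypothetical.

```python
def full_length_fraction(length_nt: int, coupling_efficiency: float) -> float:
    """Approximate fraction of full-length oligo after stepwise synthesis.

    Assumes every coupling succeeds with the same probability and that an
    n-mer needs n - 1 couplings (first base already on the support).
    """
    if length_nt < 1:
        raise ValueError("length_nt must be at least 1")
    if not 0.0 < coupling_efficiency <= 1.0:
        raise ValueError("coupling_efficiency must be in (0, 1]")
    return coupling_efficiency ** (length_nt - 1)


# 99% is used as a lower bound, since the abstract reports "in excess of 99%".
for n in (20, 100):
    print(n, round(full_length_fraction(n, 0.99), 3))
```

Under these assumptions, roughly four fifths of a 20-mer synthesis and a bit over a third of a 100-mer synthesis would be full length at 99% per step, which is why small gains in coupling efficiency matter for long oligos.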
MerMade: an oligodeoxyribonucleotide synthesizer for high throughput oligonucleotide production in dual 96-well plates.

PubMed

Rayner, S; Brignac, S; Bumeister, R; Belosludtsev, Y; Ward, T; Grant, O; O'Brien, K; Evans, G A; Garner, H R

1998-07-01

We have designed and constructed a machine that synthesizes two standard 96-well plates of oligonucleotides in a single run using standard phosphoramidite chemistry. The machine is capable of making a combination of standard, degenerate, or modified oligos in a single plate. The run time is typically 17 hr for two plates of 20-mers and a reaction scale of 40 nM. The reaction vessel is a standard polypropylene 96-well plate with a hole drilled in the bottom of each well. The two plates are placed in separate vacuum chucks and mounted on an xy table. Each well in turn is positioned under the appropriate reagent injection line and the reagent is injected by switching a dedicated valve. All aspects of machine operation are controlled by a Macintosh computer, which also guides the user through the startup and shutdown procedures, provides a continuous update on the status of the run, and facilitates a number of service procedures that need to be carried out periodically. Over 25,000 oligos have been synthesized for use in dye terminator sequencing reactions, polymerase chain reactions (PCRs), hybridization, and RT-PCR. Oligos up to 100 bases in length have been made with a coupling efficiency in excess of 99%. These machines, working in conjunction with our oligo prediction code are particularly well suited to application in automated high throughput genomic sequencing.
Oligo-DNA Custom Macroarray for Monitoring Major Pathogenic and Non-Pathogenic Fungi and Bacteria in the Phyllosphere of Apple Trees

PubMed Central

He, Ying-Hong; Isono, Sayaka; Shibuya, Makoto; Tsuji, Masaharu; Adkar Purushothama, Charith-Raj; Tanaka, Kazuaki; Sano, Teruo

2012-01-01

Background: To monitor the richness in microbial inhabitants in the phyllosphere of apple trees cultivated under various cultural and environmental conditions, we developed an oligo-DNA macroarray for major pathogenic and non-pathogenic fungi and bacteria inhabiting the phyllosphere of apple trees. Methods and Findings: First, we isolated culturable fungi and bacteria from apple orchards by an agar-plate culture method, and detected 32 fungal and 34 bacterial species. Alternaria, Aureobasidium, Cladosporium, Rhodotorula, Cystofilobasidium, and Epicoccum genera were predominant among the fungi, and Bacillus, Pseudomonas, Sphingomonas, Methylobacterium, and Pantoea genera were predominant among the bacteria. Based on the data, we selected 29 major non-pathogenic and 12 phytopathogenic fungi and bacteria as the targets of macroarray. Forty-one species-specific 40-base pair long oligo-DNA sequences were selected from the nucleotide sequences of rDNA-internal transcribed spacer region for fungi and 16S rDNA for bacteria. The oligo-DNAs were fixed on nylon membrane and hybridized with digoxigenin-labeled cRNA probes prepared for each species. All arrays except those for Alternaria, Bacillus, and their related species, were specifically hybridized. The array was sensitive enough to detect 10^3 CFU for Aureobasidium pullulans and Bacillus cereus. Nucleotide sequencing of 100 each of independent fungal rDNA-ITS and bacterial 16S-rDNA sequences from apple tree was in agreement with the macroarray data obtained using the same sample. Finally, we analyzed the richness in the microbial inhabitants in the samples collected from apple trees in four orchards. Major apple pathogens that cause scab, Alternaria blotch, and Marssonina blotch were detected along with several non-phytopathogenic fungal and bacterial inhabitants.
Conclusions: The macroarray technique presented here is a strong tool to monitor the major microbial species and the community structures in the phyllosphere of apple trees and identify key species antagonistic, supportive or co-operative to specific pathogens in the orchard managed under different environmental conditions. PMID:22479577
Oligo-DNA custom macroarray for monitoring major pathogenic and non-pathogenic fungi and bacteria in the phyllosphere of apple trees.

PubMed

He, Ying-Hong; Isono, Sayaka; Shibuya, Makoto; Tsuji, Masaharu; Adkar Purushothama, Charith-Raj; Tanaka, Kazuaki; Sano, Teruo

2012-01-01

To monitor the richness in microbial inhabitants in the phyllosphere of apple trees cultivated under various cultural and environmental conditions, we developed an oligo-DNA macroarray for major pathogenic and non-pathogenic fungi and bacteria inhabiting the phyllosphere of apple trees. First, we isolated culturable fungi and bacteria from apple orchards by an agar-plate culture method, and detected 32 fungal and 34 bacterial species. Alternaria, Aureobasidium, Cladosporium, Rhodotorula, Cystofilobasidium, and Epicoccum genera were predominant among the fungi, and Bacillus, Pseudomonas, Sphingomonas, Methylobacterium, and Pantoea genera were predominant among the bacteria. Based on the data, we selected 29 major non-pathogenic and 12 phytopathogenic fungi and bacteria as the targets of macroarray. Forty-one species-specific 40-base pair long oligo-DNA sequences were selected from the nucleotide sequences of rDNA-internal transcribed spacer region for fungi and 16S rDNA for bacteria. The oligo-DNAs were fixed on nylon membrane and hybridized with digoxigenin-labeled cRNA probes prepared for each species. All arrays except those for Alternaria, Bacillus, and their related species, were specifically hybridized. The array was sensitive enough to detect 10(3) CFU for Aureobasidium pullulans and Bacillus cereus. Nucleotide sequencing of 100 each of independent fungal rDNA-ITS and bacterial 16S-rDNA sequences from apple tree was in agreement with the macroarray data obtained using the same sample. Finally, we analyzed the richness in the microbial inhabitants in the samples collected from apple trees in four orchards. Major apple pathogens that cause scab, Alternaria blotch, and Marssonina blotch were detected along with several non-phytopathogenic fungal and bacterial inhabitants. 
The macroarray technique presented here is a strong tool to monitor the major microbial species and the community structures in the phyllosphere of apple trees and identify key species antagonistic, supportive or co-operative to specific pathogens in the orchard managed under different environmental conditions.
Transcriptome sequencing of newly molted adult female cattle ticks, Rhipicephalus microplus: Raw Illumina reads.

USDA-ARS's Scientific Manuscript database

Illumina paired end oligo-dT sequencing technology was used to sequence the transcriptome from newly molted adult females from the cattle tick, Rhipicephalus microplus. These samples include newly molted unfed whole adult females, newly molted whole adult females feeding for 2 hours on a bovine host...
Oligonucleotide recombination enabled site-specific mutagenesis in bacteria

USDA-ARS's Scientific Manuscript database

Recombineering refers to a strategy for engineering DNA sequences using a specialized mode of homologous recombination. This technology can be used for rapidly constructing precise changes in bacterial genome sequences in vivo. Oligo recombination is one type of recombineering that uses ssDNA olig...
Effect of Backbone Design on Hybridization Thermodynamics of Oligo-nucleic Acids: A Coarse-Grained Molecular Dynamics Simulation Study

NASA Astrophysics Data System (ADS)

Ghobadi, Ahmadreza F.; Jayaraman, Arthi

DNA hybridization is the basis of various bio-nano technologies, such as DNA origami and assembly of DNA-functionalized nanoparticles. A hybridized double stranded (ds) DNA is formed when complementary nucleobases on hybridizing strands exhibit specific and directional hydrogen bonds through canonical Watson-Crick base-pairing interactions. In recent years, the need for cheaper alternatives and significant synthetic advances have driven design of DNA mimics with new backbone chemistries. However, a fundamental understanding of how these backbone modifications in the oligo-nucleic acids impact the hybridization and melting behavior of the duplex is still lacking. In this talk, we present our recent findings on the impact of varying backbone chemistry on hybridization of oligo-nucleic acid duplexes. We use coarse-grained molecular dynamics simulations to isolate the effect of strand flexibility, electrostatic interactions and nucleobase spacing on the melting curves for duplexes with various strand sequences and concentrations. Since conjugates of oligo-nucleic acids with polymers serve as building blocks for thermo-responsive polymer networks and gels, we also present the effect of such conjugation on hybridization thermodynamics and polymer conformation.
The Limits of Template-Directed Synthesis with Nucleoside-5'-Phosphoro(2-Methyl) Imidazolides

NASA Technical Reports Server (NTRS)

Hill, Aubrey R., Jr.; Orgel, Leslie E.; Wu, Taifeng

1993-01-01

In earlier work we have shown that C-rich templates containing isolated A, T or G residues and short oligo(G) sequences can be copied effectively using nucleoside-5'-phosphoro(2-methyl)imidazolides as substrates. We now show that isolated A or T residues within an oligo(G) sequence are a complete block to copying and that an isolated C residue is copied inefficiently. Replication is possible only if there are two complementary oligonucleotides each of which acts as a template to facilitate the synthesis of the other. We emphasize the severity of the problems that need to be overcome to make possible non-enzymatic replication in homogeneous aqueous solution. We conclude that an efficient catalyst was involved in the origin of polynucleotide replication.
Merlin: Computer-Aided Oligonucleotide Design for Large Scale Genome Engineering with MAGE.

PubMed

Quintin, Michael; Ma, Natalie J; Ahmed, Samir; Bhatia, Swapnil; Lewis, Aaron; Isaacs, Farren J; Densmore, Douglas

2016-06-17

Genome engineering technologies now enable precise manipulation of organism genotype, but can be limited in scalability by their design requirements. Here we describe Merlin (http://merlincad.org), an open-source web-based tool to assist biologists in designing experiments using multiplex automated genome engineering (MAGE). Merlin provides methods to generate pools of single-stranded DNA oligonucleotides (oligos) for MAGE experiments by performing free energy calculation and BLAST scoring on a sliding window spanning the targeted site. These oligos are designed not only to improve recombination efficiency, but also to minimize off-target interactions. The application further assists experiment planning by reporting predicted allelic replacement rates after multiple MAGE cycles, and enables rapid result validation by generating primer sequences for multiplexed allele-specific colony PCR. Here we describe the Merlin oligo and primer design procedures and validate their functionality compared to OptMAGE by eliminating seven AvrII restriction sites from the Escherichia coli genome.
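The sliding-window step described in the Merlin record can be sketched as: enumerate every fixed-length window that covers the targeted position, score each window, and keep the best. The example below is a minimal sketch, not Merlin's implementation. The GC-balance score is a placeholder standing in for the free energy and BLAST scoring named in the abstract, and all function names, the toy sequence and the window length are hypothetical.

```python
from typing import Callable, List, Tuple


def sliding_window_candidates(
    genome: str, target: int, oligo_len: int
) -> List[Tuple[int, str]]:
    """Return (start, oligo) for every window of oligo_len covering index target."""
    if not 0 <= target < len(genome):
        raise ValueError("target index lies outside the sequence")
    first = max(0, target - oligo_len + 1)
    last = min(target, len(genome) - oligo_len)
    return [(s, genome[s:s + oligo_len]) for s in range(first, last + 1)]


def gc_balance_score(oligo: str) -> float:
    """Placeholder score (assumption): 0 at 50% GC, more negative when skewed."""
    gc = sum(base in "GC" for base in oligo) / len(oligo)
    return -abs(gc - 0.5)


def best_candidate(
    genome: str,
    target: int,
    oligo_len: int,
    score: Callable[[str], float] = gc_balance_score,
) -> Tuple[int, str]:
    """Pick the highest-scoring window; ties go to the leftmost window."""
    candidates = sliding_window_candidates(genome, target, oligo_len)
    if not candidates:
        raise ValueError("no window of that length fits around the target")
    return max(candidates, key=lambda c: score(c[1]))


# Toy sequence and a very short window, for illustration only.
toy_genome = "AAAAGCGCAAAATTTT"
print(best_candidate(toy_genome, target=5, oligo_len=4))
```

Passing the scoring function as a parameter keeps the window enumeration separate from the scoring model, so a thermodynamic or alignment-based score could be swapped in without changing the loop.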
Oligo Design: a computer program for development of probes for oligonucleotide microarrays.

PubMed

Herold, Keith E; Rasooly, Avraham

2003-12-01

Oligonucleotide microarrays have demonstrated potential for the analysis of gene expression, genotyping, and mutational analysis. Our work focuses primarily on the detection and identification of bacteria based on known short sequences of DNA. Oligo Design, the software described here, automates several design aspects that enable the improved selection of oligonucleotides for use with microarrays for these applications. Two major features of the program are: (i) a tiling algorithm for the design of short overlapping temperature-matched oligonucleotides of variable length, which are useful for the analysis of single nucleotide polymorphisms and (ii) a set of tools for the analysis of multiple alignments of gene families and related short DNA sequences, which allow for the identification of conserved DNA sequences for PCR primer selection and variable DNA sequences for the selection of unique probes for identification. Note that the program does not address the full genome perspective but, instead, is focused on the genetic analysis of short segments of DNA. The program is Internet-enabled and includes a built-in browser and the automated ability to download sequences from GenBank by specifying the GI number. The program also includes several utilities, including audio recital of a DNA sequence (useful for verifying sequences against a written document), a random sequence generator that provides insight into the relationship between melting temperature and GC content, and a PCR calculator.
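The tiling idea in the Oligo Design record (overlapping, temperature-matched oligos of variable length) can be sketched as: at each start position, extend the oligo one base at a time until its estimated Tm reaches a target. The sketch below is not the program's algorithm. It assumes the crude Wallace rule, Tm = 2 × (A + T) + 4 × (G + C), a fixed step between start positions, and hypothetical function and parameter names.

```python
from typing import List, Tuple


def wallace_tm(seq: str) -> int:
    """Wallace rule estimate (deg C): 2 per A/T plus 4 per G/C."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def tile_temperature_matched(
    seq: str, target_tm: int, step: int, min_len: int = 8, max_len: int = 40
) -> List[Tuple[int, str]]:
    """Return (start, oligo) tiles whose estimated Tm first reaches target_tm.

    Start positions advance by `step`, so neighbouring tiles overlap whenever
    an oligo is longer than `step`. Starts that cannot reach target_tm within
    max_len or before the sequence ends are skipped.
    """
    seq = seq.upper()
    tiles = []
    for start in range(0, len(seq), step):
        longest_end = min(len(seq), start + max_len)
        for end in range(start + min_len, longest_end + 1):
            oligo = seq[start:end]
            if wallace_tm(oligo) >= target_tm:
                tiles.append((start, oligo))
                break
    return tiles


# Made-up sequence with an AT-rich / GC-rich / AT-rich layout.
toy = "ATATATATGCGCGCGCATATATAT"
for start, oligo in tile_temperature_matched(toy, target_tm=30, step=4):
    print(start, oligo, len(oligo), wallace_tm(oligo))
```

Because GC-rich stretches reach the target sooner, tiles over them come out shorter than tiles over AT-rich stretches, which is the sense in which the oligos are variable in length but matched in temperature.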

BioPartsDB: a synthetic biology workflow web-application for education and research.

PubMed

Stracquadanio, Giovanni; Yang, Kun; Boeke, Jef D; Bader, Joel S

2016-11-15

Synthetic biology has become a widely used technology, and expanding applications in research, education and industry require progress tracking for team-based DNA synthesis projects. Although some vendors are beginning to supply multi-kilobase sequence-verified constructs, synthesis workflows starting with short oligos remain important for cost savings and pedagogical benefit. We developed BioPartsDB as an open source, extendable workflow management system for synthetic biology projects with entry points for oligos and larger DNA constructs and ending with sequence-verified clones. BioPartsDB is released under the MIT license and available for download at https://github.com/baderzone/biopartsdb. Additional documentation and video tutorials are available at https://github.com/baderzone/biopartsdb/wiki. An Amazon Web Services image is available from the AWS Market Place (ami-a01d07c8). Contact: joel.bader@jhu.edu. © The Author 2016. Published by Oxford University Press.
Stroma Based Prognosticators Incorporating Differences between African and European Americans

DTIC Science & Technology

2017-10-01

amenable to bisulfite sequencing of more than a few genes. Exploiting the recent three-fold reduction in the cost of sequencing per read , we developed oligo...cards. The ability of the HiSeq 4000 to obtain about three times as many reads as the HiSeq2500, at the same price, means we can stay on track, though...capture, and sequencing (Table 2). We obtain tens of millions of mapped deduplicated reads per sample, while using only 5% of a sequencing lane per sample
Oligo-Miocene reservoir sequence characterization and structuring in the Sisseb El Alem-Kalaa Kebira regions (Northeastern Tunisia)

NASA Astrophysics Data System (ADS)

Houatmia, Faten; Khomsi, Sami; Bédir, Mourad

2015-11-01

The Sisseb El Alem-Enfidha basin is located in northeastern Tunisia. It is bordered by the Nadhour-Saouaf syncline to the north, the Kairouan plain to the south, the Mediterranean Sea to the east and the Tunisian Atlassic "dorsale" to the west. Oligocene and Miocene deltaic deposits represent the main potential deep aquifers in this basin, with high porosity (25%-30%). The interpretation of twenty seismic reflection profiles, calibrated by wireline logging data from twelve oil wells, hydraulic wells and geologic field sections, highlighted the impact of tectonics on the structuring geometry of Oligo-Miocene sandstone reservoirs and their distribution in raised structures and subsurface depressions. Miocene seismostratigraphic analysis from the Ain Ghrab Formation (Langhian) to the Segui Formation (Quaternary) showed five third-order seismic sequence deposits and nine extended lenticular sandy reservoir bodies bounded by toplap and downlap unconformity surfaces. Oligocene deposits also presented five third-order seismic sequences with five extended lenticular sandy reservoir bodies. The depth and thickness maps of these sequence reservoir packages showed the structuring of this basin into sub-basins characterized by important lateral and vertical variations in geometry and thickness. Correlation of petroleum well wireline logs with clay volume calculations showed heterogeneous multilayer Oligocene and Miocene reservoirs formed by the arrangement of fourteen sandstone bodies that can be good reservoirs, separated by impermeable clay packages and affected by faults. Reservoir levels correspond mainly to the lowstand systems tract (LST) of the sequences. Intensive fracturing by deep-seated faults bounding the different sub-basins plays a major role in surface water recharge and inter-layer circulation between the affected reservoirs. The total pore volume of the Oligo-Miocene sandy reservoir bodies in the study area is estimated at about 4 × 10^12 m^3, equivalent to 4 × 10^9 m^3 of deep water reserves.
Biomarker Discovery and Mechanistic Studies of Prostate Cancer using Targeted Proteomic Approaches

DTIC Science & Technology

2012-07-01

basigin in Drosophila) tightly regulates cytoskeleton rearrangement in Drosophila melanogaster [23]. Based on the present results and the existing...from OligoEngine according to the manufacturer’s instruction. Plasmids were amplified in DH5α cells and confirmed by sequencing. Subconfluent cell...electrophoresis and the results are shown in Figure 1 (Panel C). The RT-PCR products were cloned and subjected to DNA sequencing. The sequencing
XET Activity is Found Near Sites of Growth and Cell Elongation in Bryophytes and Some Green Algae: New Insights into the Evolution of Primary Cell Wall Elongation

PubMed Central

Van Sandt, Vicky S. T.; Stieperaere, Herman; Guisez, Yves; Verbelen, Jean-Pierre; Vissenberg, Kris

2007-01-01

Background and Aims In angiosperms xyloglucan endotransglucosylase (XET)/hydrolase (XTH) is involved in reorganization of the cell wall during growth and development. The location of oligo-xyloglucan transglucosylation activity and the presence of XTH expressed sequence tags (ESTs) in the earliest diverging extant plants, i.e. in bryophytes and algae, down to the Phaeophyta was examined. The results provide information on the presence of an XET growth mechanism in bryophytes and algae and contribute to the understanding of the evolution of cell wall elongation in general. Methods Representatives of the different plant lineages were pressed onto an XET test paper and assayed. XET or XET-related activity was visualized as the incorporation of fluorescent signal. The Physcomitrella genome database was screened for the presence of XTHs. In addition, using the 3′ RACE technique searches were made for the presence of possible XTH ESTs in the Charophyta. Key Results XET activity was found in the three major divisions of bryophytes at sites corresponding to growing regions. In the Physcomitrella genome two putative XTH-encoding cDNA sequences were identified that contain all domains crucial for XET activity. Furthermore, XET activity was located at the sites of growth in Chara (Charophyta) and Ulva (Chlorophyta) and a putative XTH ancestral enzyme in Chara was identified. No XET activity was identified in the Rhodophyta or Phaeophyta. Conclusions XET activity was shown to be present in all major groups of green plants. These data suggest that an XET-related growth mechanism originated before the evolutionary divergence of the Chlorobionta and open new insights in the evolution of the mechanisms of primary cell wall expansion. PMID:17098750
Publications - GMC 352 | Alaska Division of Geological & Geophysical

Science.gov Websites

, Alaska as based from core samples from the following wells: North Cook Inlet Unit A-02; Middle Ground , Chemostratigraphy of Oligo-Miocene sequences in Cook Inlet, Alaska as based from core samples from the following
Mechanism of transcription termination by RNA polymerase III utilizes a nontemplate-strand sequence-specific signal element

PubMed Central

Arimbasseri, Aneeshkumar G.; Maraia, Richard J.

2015-01-01

SUMMARY Understanding the mechanism of transcription termination by a eukaryotic RNA polymerase (RNAP) has been limited by lack of a characterizable intermediate that reflects transition from an elongation complex to a true termination event. While other multisubunit RNAPs require multipartite cis-signals and/or ancillary factors to mediate pausing and release of the nascent transcript from the clutches of these enzymes, RNAP III does so with precision and efficiency on a simple oligo(dT) tract, independent of other cis-elements or trans-factors. We report a RNAP III pre-termination complex that reveals termination mechanisms controlled by sequence-specific elements in the non-template strand. Furthermore, the TFIIF-like, RNAP III subunit, C37 is required for this function of the non-template strand signal. The results reveal the RNAP III terminator as an information-rich control element. While the template strand promotes destabilization via a weak oligo(rU:dA) hybrid, the non-template strand provides distinct sequence-specific destabilizing information through interactions with the C37 subunit. PMID:25959395
Telomerase Responsive Delivery of Doxorubicin from Mesoporous Silica Nanoparticles in Multiple Malignancies: Therapeutic Efficacies against Experimental Aggressive Murine Lymphoma.

PubMed

Srivastava, Prateek; Hira, Sumit Kumar; Sharma, Amod; Kashif, Mohammad; Srivastava, Prashant; Srivastava, Divesh N Narayan; Singh, Ram Adhar; Manna, Partha Pratim

2018-05-25

Mammalian telomerase maintains the length and integrity of telomeres by adding telomeric repeats to chromosome ends. This work describes the telomerase-responsive delivery of doxorubicin against telomerase-positive human and murine cancer cells. Wrapping doxorubicin-loaded mesoporous silica nanoparticles with a specific oligonucleotide sequence, containing a telomeric repeat complementary sequence and a telomerase substrate primer sequence, resulted in slow and sustained release of doxorubicin contiguous to the tumor cells. The DNA-wrapped nanoprobe significantly inhibited proliferation and enhanced cytotoxicity in telomerase-positive human and mouse tumor cells, and its function was impeded following exposure to the specific telomerase inhibitor AZT. Entrapping doxorubicin with the telomerase-specific oligo produced enhanced apoptosis and significantly higher uptake of the drug in the tumor cells. Treatment of telomerase-positive Dalton's lymphoma-bearing mice with a newly designed oligo-wrapped nanoprobe, specific for mouse telomerase, significantly enhanced survival and improved the histopathological parameters. In addition, the treatment induced a significant reduction in the number of tumor foci and restored the normal architecture of the vascularised organs, besides preventing metastasis.
The Oligo-Miocene of Eil (NE Somalia): a prograding coral- Lepidocyclina system

NASA Astrophysics Data System (ADS)

Bosellini, A.; Russo, A.; Arush, M. A.; Cabdulqadir, M. M.

The Oligo-Miocene succession of Eil is the product of a depositional regression and constitutes a 120-150 m thick depositional sequence that prograded seaward for at least 20-25 km. Its time-transgressive stratigraphy is documented physically by well exposed tangential clinoforms (previously considered as evidence of a tectonic coastal flexure) and biostratigraphically by the occurrence of calcareous nannoplankton, planktonic and benthonic foraminifera, and a rich coral fauna. The upper boundary of the sequence is indicated by a reefal toplap, which constitutes the flat surface of the Nogal Plateau. Age (Chattian to Burdigalian) and toplap relationships of the sequence indicate clearly that progradation took place after the Late Oligocene flooding which followed the strong fall of sea-level during the Chattian. Because of the horizontal geometry of the entire sedimentary system, it has been possible to make a clear environmental reconstruction and a facies model with original water depths. A worldwide Tertiary facies—the Lepidocyclina beds— was confined to the front of the reef, at depths ranging from 35-40 to 120-130 m.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Smith, O.P.

Potato leafroll virus (PLRV) was aphid-transmitted from potato (Solanum tuberosum cultivar Russet Burbank) to ground cherry (Physalis floridana), where it was maintained by serial aphid transmission. Serological and plant differential tests indicated that the isolate was not contaminated with beet western yellows virus. Purified PLRV RNA was poly(A)-tailed in vitro and used as a template for reverse transcriptase, primed with oligo(dT). Alkaline gel electrophoresis of ³²P-labeled first-strand complementary DNA (cDNA) indicated a major size range of 0.1 to 3.5 kilobases (kb). A small percentage of transcripts corresponded to full-length PLRV RNA. Following RNase H and DNA polymerase I-mediated second-strand synthesis, double-stranded cDNA was cloned into the Pst I site of the plasmid pUC9 using oligo(dC)-oligo(dG) tailing methodology. Escherichia coli JM109 transformants were screened with first-strand ³²P-cDNA in colony hybridization experiments to confirm that recombinants contained PLRV-specific sequences.
Transcriptome analysis of salinity stress responses in common wheat using a 22k oligo-DNA microarray.

PubMed

Kawaura, Kanako; Mochida, Keiichi; Yamazaki, Yukiko; Ogihara, Yasunari

2006-04-01

In this study, we constructed a 22k wheat oligo-DNA microarray. A total of 148,676 expressed sequence tags of common wheat were collected from the database of the Wheat Genomics Consortium of Japan. These were grouped into 34,064 contigs, which were then used to design an oligonucleotide DNA microarray. Following a multistep selection of the sense strand, 21,939 60-mer oligo-DNA probes were selected for attachment on the microarray slide. This 22k oligo-DNA microarray was used to examine the transcriptional response of wheat to salt stress. More than 95% of the probes gave reproducible hybridization signals when targeted with RNAs extracted from salt-treated wheat shoots and roots. With the microarray, we identified 1,811 genes whose expressions changed more than 2-fold in response to salt. These included genes known to mediate response to salt, as well as unknown genes, and they were classified into 12 major groups by hierarchical clustering. These gene expression patterns were also confirmed by real-time reverse transcription-PCR. Many of the genes with unknown function were clustered together with genes known to be involved in response to salt stress. Thus, analysis of gene expression patterns combined with gene ontology should help identify the function of the unknown genes. Also, functional analysis of these wheat genes should provide new insight into the response to salt stress. Finally, these results indicate that the 22k oligo-DNA microarray is a reliable method for monitoring global gene expression patterns in wheat.
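The "changed more than 2-fold" criterion in this abstract can be illustrated with a small filter on treated versus control signal ratios. This is a sketch under assumptions: the function name, the dictionary input format, and the example intensities are hypothetical, and the study's actual normalization and statistics are not described in the abstract.

```python
import math


def changed_more_than(treated, control, fold=2.0):
    """Return sorted probe IDs whose treated/control ratio exceeds `fold`
    in either direction (up- or down-regulated).

    `treated` and `control` map probe ID -> positive signal intensity.
    """
    threshold = math.log2(fold)
    hits = []
    for probe, t in treated.items():
        c = control.get(probe)
        if c is None or c <= 0 or t <= 0:
            continue  # skip probes with missing or non-positive signals
        # A symmetric test on the log2 ratio treats 2-fold up and
        # 2-fold down the same way.
        if abs(math.log2(t / c)) > threshold:
            hits.append(probe)
    return sorted(hits)


if __name__ == "__main__":
    # Made-up intensities for illustration only.
    salt = {"probe_a": 400.0, "probe_b": 150.0, "probe_c": 40.0}
    mock = {"probe_a": 100.0, "probe_b": 100.0, "probe_c": 100.0}
    print(changed_more_than(salt, mock))
```

Working on the log2 scale is a common design choice because it makes induction and repression of the same magnitude comparable.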
Oligo kernels for datamining on biological sequences: a case study on prokaryotic translation initiation sites

PubMed Central

Meinicke, Peter; Tech, Maike; Morgenstern, Burkhard; Merkl, Rainer

2004-01-01

Background Kernel-based learning algorithms are among the most advanced machine learning methods and have been successfully applied to a variety of sequence classification tasks within the field of bioinformatics. Conventional kernels utilized so far do not provide an easy interpretation of the learnt representations in terms of positional and compositional variability of the underlying biological signals. Results We propose a kernel-based approach to datamining on biological sequences. With our method it is possible to model and analyze positional variability of oligomers of any length in a natural way. On one hand this is achieved by mapping the sequences to an intuitive but high-dimensional feature space, well-suited for interpretation of the learnt models. On the other hand, by means of the kernel trick we can provide a general learning algorithm for that high-dimensional representation because all required statistics can be computed without performing an explicit feature space mapping of the sequences. By introducing a kernel parameter that controls the degree of position-dependency, our feature space representation can be tailored to the characteristics of the biological problem at hand. A regularized learning scheme enables application even to biological problems for which only small sets of example sequences are available. Our approach includes a visualization method for transparent representation of characteristic sequence features. Thereby importance of features can be measured in terms of discriminative strength with respect to classification of the underlying sequences. To demonstrate and validate our concept on a biochemically well-defined case, we analyze E. coli translation initiation sites in order to show that we can find biologically relevant signals. For that case, our results clearly show that the Shine-Dalgarno sequence is the most important signal upstream a start codon. 
The variability in position and composition we found for that signal is in accordance with previous biological knowledge. We also find evidence for signals downstream of the start codon, previously introduced as transcriptional enhancers. These signals are mainly characterized by occurrences of adenine in a region of about 4 nucleotides next to the start codon. Conclusions We showed that the oligo kernel can provide a valuable tool for the analysis of relevant signals in biological sequences. In the case of translation initiation sites we could clearly deduce the most discriminative motifs and their positional variation from example sequences. Attractive features of our approach are its flexibility with respect to oligomer length and position conservation. By means of these two parameters oligo kernels can easily be adapted to different biological problems. PMID:15511290
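The position-dependent oligomer comparison described in this abstract can be sketched as a kernel in which a shared k-mer contributes more when it occurs at nearby positions in both sequences, with a parameter controlling how strictly positions must agree. The Gaussian form, the normalization constant, and all names below are assumptions for illustration; the abstract does not give the exact formula used by the authors.

```python
import math
from collections import defaultdict


def oligo_positions(seq, k):
    """Map each k-mer in `seq` to the list of positions where it starts."""
    pos = defaultdict(list)
    for i in range(len(seq) - k + 1):
        pos[seq[i:i + k]].append(i)
    return pos


def oligo_kernel(s, t, k=3, sigma=1.0):
    """Position-smoothed k-mer kernel (illustrative form).

    Every pair of occurrences of the same k-mer, at position p in `s` and
    q in `t`, adds exp(-(p - q)^2 / (4 * sigma^2)). A small `sigma` demands
    near-identical positions; a large `sigma` approaches a plain count of
    shared k-mer occurrences.
    """
    ps, pt = oligo_positions(s, k), oligo_positions(t, k)
    total = 0.0
    for kmer, positions_s in ps.items():
        for p in positions_s:
            for q in pt.get(kmer, ()):
                total += math.exp(-((p - q) ** 2) / (4.0 * sigma ** 2))
    # Assumed scaling constant; it does not affect relative comparisons.
    return math.sqrt(math.pi) * sigma * total


if __name__ == "__main__":
    ref = "GGAGGAAAAAAA"
    print(oligo_kernel(ref, "TGGAGGTTTTTT", k=5))  # motif shifted by 1
    print(oligo_kernel(ref, "TTTTTTGGAGGT", k=5))  # motif shifted by 6
```

Because the value is a sum over matching k-mer occurrences, the contribution of each k-mer and position can be inspected separately, which is the kind of interpretability the abstract emphasizes.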
Oligopeptides and copeptides of homochiral sequence, via beta-sheets, from mixtures of racemic alpha-amino acids, in a one-pot reaction in water; relevance to biochirogenesis.

PubMed

Illos, Roni A; Bisogno, Fabricio R; Clodic, Gilles; Bolbach, Gerard; Weissbuch, Isabelle; Lahav, Meir

2008-07-09

As part of our studies on the biochirogenesis of peptides of homochiral sequence during early evolution, the formation of oligopeptides composed of 14-24 residues of the same handedness in the polymerization of dl-leucine (Leu), dl-phenylalanine (Phe), and dl-valine (Val) in aqueous solutions, by activation with N, N'-carbonyldiimidazole and then initiation with a primary amine, in a one-pot reaction, was demonstrated by MALDI-TOF MS using deuterium enantio-labeled alpha-amino acids. The formation of long isotactic peptides is rationalized by the following steps occurring in tandem: (i) creation of a library of short diasteroisomeric oligopeptides containing isotactic peptides in excess in comparison to a binomial kinetics, as a result of an asymmetric induction exerted by the N-terminal residue of a given handedness; (ii) precipitation of the less soluble racemic isotactic penta- and hexapeptides in the form of beta-sheets that are delineated by homochiral rims; (iii) regio-enantiospecific chain elongation occurring heterogeneously at the beta-sheets/solution interface. Polymerization of l-Leu with l-isoleucine (Ile) or l-Phe with l- (1) N-Me-histidine yielded mixtures of copeptides containing both residues. In contrast, in the polymerization of the corresponding mixtures of l- + d-alpha-amino acids, the long oligopeptides were composed mainly from oligo- l-Leu and oligo- d-Ile in the first system and oligo- d-Phe in the second. Furthermore, in the polymerization of mixtures of hydrophobic racemic alpha-amino acids dl-Leu, dl-Val, and dl-Phe and with added racemic dl-alanine and dl-tyrosine, copeptides of homochiral sequences are most dominantly represented. Possible routes for a spontaneous "mirror-symmetry breaking" process of the racemic mixtures of homochiral peptides are presented.
Enhanced splicing correction effect by an oligo-aspartic acid-PNA conjugate and cationic carrier complexes.

PubMed

Bae, Yun Mi; Kim, Myung Hee; Yu, Gwang Sig; Um, Bong Ho; Park, Hee Kyung; Lee, Hyun-il; Lee, Kang Taek; Suh, Yung Doug; Choi, Joon Sig

2014-02-10

Peptide nucleic acids (PNAs) are synthetic structural analogues of DNA and RNA. They recognize specific cellular nucleic acid sequences and form stable complexes with complementary DNA or RNA. Here, we designed an oligo-aspartic acid-PNA conjugate and showed its enhanced delivery into cells with high gene correction efficiency using conventional cationic carriers, such as polyethylenimine (PEI) and Lipofectamine 2000. The negatively charged oligo-aspartic acid-PNA (Asp(n)-PNA) formed complexes with PEI and Lipofectamine, and the resulting Asp(n)-PNA/PEI and Asp(n)-PNA/Lipofectamine complexes were introduced into cells. We observed significantly enhanced cellular uptake of Asp(n)-PNA by cationic carriers and detected an active splicing correction effect even at nanomolar concentrations. We found that the splicing correction efficiency of the complex depended on the kind of the cationic carriers and on the number of repeating aspartic acid units. By enhancing the cellular uptake efficiency of PNAs, these results may provide a novel platform technology of PNAs as bioactive substances for their biological and therapeutic applications. Copyright © 2013 Elsevier B.V. All rights reserved.
The 'PUCE CAFE' Project: the First 15K Coffee Microarray, a New Tool for Discovering Candidate Genes correlated to Agronomic and Quality Traits

PubMed Central

2011-01-01

Background Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. Results The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. Conclusion We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research. PMID:21208403
The 'PUCE CAFE' Project: the first 15K coffee microarray, a new tool for discovering candidate genes correlated to agronomic and quality traits.

PubMed

Privat, Isabelle; Bardil, Amélie; Gomez, Aureliano Bombarely; Severac, Dany; Dantec, Christelle; Fuentes, Ivanna; Mueller, Lukas; Joët, Thierry; Pot, David; Foucrier, Séverine; Dussert, Stéphane; Leroy, Thierry; Journot, Laurent; de Kochko, Alexandre; Campa, Claudine; Combes, Marie-Christine; Lashermes, Philippe; Bertrand, Benoit

2011-01-05

Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research.
Mango: multiple alignment with N gapped oligos.

PubMed

Zhang, Zefeng; Lin, Hao; Li, Ming

2008-06-01

Multiple sequence alignment is a classical and challenging task. The problem is NP-hard. The full dynamic programming takes too much time. The progressive alignment heuristics adopted by most state-of-the-art works suffer from the "once a gap, always a gap" phenomenon. Is there a radically new way to do multiple sequence alignment? In this paper, we introduce a novel and orthogonal multiple sequence alignment method, using both multiple optimized spaced seeds and new algorithms to handle these seeds efficiently. Our new algorithm processes information of all sequences as a whole and tries to build the alignment vertically, avoiding problems caused by the popular progressive approaches. Because the optimized spaced seeds have proved significantly more sensitive than the consecutive k-mers, the new approach promises to be more accurate and reliable. To validate our new approach, we have implemented MANGO: Multiple Alignment with N Gapped Oligos. Experiments were carried out on large 16S RNA benchmarks, showing that MANGO compares favorably, in both accuracy and speed, against state-of-the-art multiple sequence alignment methods, including ClustalW 1.83, MUSCLE 3.6, MAFFT 5.861, ProbConsRNA 1.11, Dialign 2.2.1, DIALIGN-T 0.2.1, T-Coffee 4.85, POA 2.0, and Kalign 2.0. We have further demonstrated the scalability of MANGO on very large datasets of repeat elements. MANGO can be downloaded at http://www.bioinfo.org.cn/mango/ and is free for academic usage.
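To make the contrast between spaced seeds and consecutive k-mers concrete, the sketch below finds positions where two sequences agree at every "care" position of a seed pattern, ignoring the "don't care" positions. It illustrates the general spaced-seed idea only; the seed pattern, function names, and pairwise setting are assumptions, and MANGO's optimized multi-seed algorithms are not reproduced here.

```python
from collections import defaultdict


def spaced_kmer(seq, start, seed):
    """Read the bases under the '1' positions of `seed`, skipping '0' positions."""
    return "".join(seq[start + i] for i, c in enumerate(seed) if c == "1")


def seed_hits(a, b, seed="1101"):
    """Return (i, j) pairs where a[i:] and b[j:] agree at every '1' of the seed."""
    span = len(seed)
    # Index every window of b by the bases it shows under the seed.
    index = defaultdict(list)
    for j in range(len(b) - span + 1):
        index[spaced_kmer(b, j, seed)].append(j)
    hits = []
    for i in range(len(a) - span + 1):
        for j in index.get(spaced_kmer(a, i, seed), ()):
            hits.append((i, j))
    return hits


if __name__ == "__main__":
    # The two sequences differ at the third base.
    print(seed_hits("ACGT", "ACTT", seed="1101"))  # spaced seed tolerates it
    print(seed_hits("ACGT", "ACTT", seed="1111"))  # consecutive 4-mer does not
```

A seed with don't-care positions can still register a hit across a mismatch, which is the intuition behind the higher sensitivity claimed for spaced seeds.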
Utilization of paramagnetic microparticles for automated isolation of free circulating mRNA as a new tool in prostate cancer diagnostics.

PubMed

Fojtu, Michaela; Gumulec, Jaromir; Balvan, Jan; Raudenska, Martina; Sztalmachova, Marketa; Polanska, Hana; Smerkova, Kristyna; Adam, Vojtech; Kizek, Rene; Masarik, Michal

2014-02-01

Determination of serum mRNA gained a lot of attention in recent years, particularly from the perspective of disease markers. Streptavidin-modified paramagnetic particles (SMPs) seem an interesting technique, mainly due to possible automated isolation and high efficiency. The aim of this study was to optimize serum isolation protocol to reduce the consumption of chemicals and sample volume. The following factors were optimized: amounts of (i) paramagnetic particles, (ii) oligo(dT)20 probe, (iii) serum, and (iv) the binding sequence (SMPs, oligo(dT)20 , serum vs. oligo(dT)20 , serum and SMPs). RNA content was measured, and the expression of metallothionein-2A as possible prostate cancer marker was analyzed to demonstrate measurable RNA content with ability for RT-PCR detection. Isolation is possible on serum volume range (10-200 μL) without altering of efficiency or purity. Amount of SMPs can be reduced up to 5 μL, with optimal results within 10-30 μL SMPs. Volume of oligo(dT)20 does not affect efficiency, when used within 0.1-0.4 μL. This optimized protocol was also modified to fit needs of automated one-step single-tube analysis with identical efficiency compared to conventional setup. One-step analysis protocol is considered a promising simplification, making RNA isolation suitable for automatable process. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

2013-06-25

A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
New molecular markers and cytogenetic probes enable chromosome identification of wheat-Thinopyrum intermedium introgression lines for improving protein and gluten contents.

PubMed

Li, Guangrong; Wang, Hongjin; Lang, Tao; Li, Jianbo; La, Shixiao; Yang, Ennian; Yang, Zujun

2016-10-01

New molecular markers were developed for targeting Thinopyrum intermedium 1St#2 chromosome, and novel FISH probe representing the terminal repeats was produced for identification of Thinopyrum chromosomes. Thinopyrum intermedium has been used as a valuable resource for improving the disease resistance and yield potential of wheat. A wheat-Th. intermedium ssp. trichophorum chromosome 1St#2 substitution and translocation has displayed superior grain protein and wet gluten content. With the aim to develop a number of chromosome 1St#2 specific molecular and cytogenetic markers, a high throughput, low-cost specific-locus amplified fragment sequencing (SLAF-seq) technology was used to compare the sequences between a wheat-Thinopyrum 1St#2 (1D) substitution and the related species Pseudoroegneria spicata (St genome, 2n = 14). A total of 5142 polymorphic fragments were analyzed and 359 different SLAF markers for 1St#2 were predicted. Thirty-seven specific molecular markers were validated by PCR from 50 randomly selected SLAFs. Meanwhile, the distribution of transposable elements (TEs) at the family level between wheat and St genomes was compared using the SLAFs. A new oligo-nucleotide probe named Oligo-pSt122 from high SLAF reads was produced for fluorescence in situ hybridization (FISH), and was observed to hybridize to the terminal region of 1St#L and also onto the terminal heterochromatic region of Th. intermedium genomes. The genome-wide markers and repetitive based probe Oligo-pSt122 will be valuable for identifying Thinopyrum chromosome segments in wheat backgrounds.

Optimized knock-in of point mutations in zebrafish using CRISPR/Cas9.

PubMed

Prykhozhij, Sergey V; Fuller, Charlotte; Steele, Shelby L; Veinotte, Chansey J; Razaghi, Babak; Robitaille, Johane M; McMaster, Christopher R; Shlien, Adam; Malkin, David; Berman, Jason N

2018-06-14

We have optimized point mutation knock-ins into zebrafish genomic sites using clustered regularly interspaced palindromic repeats (CRISPR)/Cas9 reagents and single-stranded oligodeoxynucleotides. The efficiency of knock-ins was assessed by a novel application of allele-specific polymerase chain reaction and confirmed by high-throughput sequencing. Anti-sense asymmetric oligo design was found to be the most successful optimization strategy. However, cut site proximity to the mutation and phosphorothioate oligo modifications also greatly improved knock-in efficiency. A previously unrecognized risk of off-target trans knock-ins was identified that we obviated through the development of a workflow for correct knock-in detection. Together these strategies greatly facilitate the study of human genetic diseases in zebrafish, with additional applicability to enhance CRISPR-based approaches in other animal model systems.
A systematic comparison of error correction enzymes by next-generation sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lubock, Nathan B.; Zhang, Di; Sidore, Angus M.

Gene synthesis, the process of assembling gene-length fragments from shorter groups of oligonucleotides (oligos), is becoming an increasingly important tool in molecular and synthetic biology. The length, quality and cost of gene synthesis are limited by errors produced during oligo synthesis and subsequent assembly. Enzymatic error correction methods are cost-effective means to ameliorate errors in gene synthesis. Previous analyses of these methods relied on cloning and Sanger sequencing to evaluate their efficiencies, limiting quantitative assessment. Here, we develop a method to quantify errors in synthetic DNA by next-generation sequencing. We analyzed errors in model gene assemblies and systematically compared six different error correction enzymes across 11 conditions. We find that ErrASE and T7 Endonuclease I are the most effective at decreasing average error rates (up to 5.8-fold relative to the input), whereas MutS is the best for increasing the number of perfect assemblies (up to 25.2-fold). We are able to quantify differential specificities: ErrASE preferentially corrects C/G transversions, whereas T7 Endonuclease I preferentially corrects A/T transversions. More generally, this experimental and computational pipeline is a fast, scalable and extensible way to analyze errors in gene assemblies, to profile error correction methods, and to benchmark DNA synthesis methods.
A systematic comparison of error correction enzymes by next-generation sequencing

DOE PAGES

Lubock, Nathan B.; Zhang, Di; Sidore, Angus M.; ...

2017-08-01

Gene synthesis, the process of assembling gene-length fragments from shorter groups of oligonucleotides (oligos), is becoming an increasingly important tool in molecular and synthetic biology. The length, quality and cost of gene synthesis are limited by errors produced during oligo synthesis and subsequent assembly. Enzymatic error correction methods are cost-effective means to ameliorate errors in gene synthesis. Previous analyses of these methods relied on cloning and Sanger sequencing to evaluate their efficiencies, limiting quantitative assessment. Here, we develop a method to quantify errors in synthetic DNA by next-generation sequencing. We analyzed errors in model gene assemblies and systematically compared six different error correction enzymes across 11 conditions. We find that ErrASE and T7 Endonuclease I are the most effective at decreasing average error rates (up to 5.8-fold relative to the input), whereas MutS is the best for increasing the number of perfect assemblies (up to 25.2-fold). We are able to quantify differential specificities: ErrASE preferentially corrects C/G transversions, whereas T7 Endonuclease I preferentially corrects A/T transversions. More generally, this experimental and computational pipeline is a fast, scalable and extensible way to analyze errors in gene assemblies, to profile error correction methods, and to benchmark DNA synthesis methods.
In Ovo and dietary administration of oligosaccharides extracted from palm kernel cake influence general health of pre- and neonatal broiler chicks.

PubMed

Faseleh Jahromi, Mohammad; Shokryazdan, Parisa; Idrus, Zulkifli; Ebrahimi, Rohollah; Liang, Juan Boo

2017-01-01

Palm kernel cake (PKC) is the main byproduct from the palm oil industry in several tropical countries that contains considerable amounts of oligosaccharide. We earlier demonstrated beneficial prebiotic effects of oligosaccharides extract of PKC (OligoPKC) in starter and finisher broiler birds. This study was envisaged to elucidate the effects of in ovo and/or oral administration of the OligoPKC on prenatal and post-hatched broiler chicks. A total of 140 broiler (Cobb500) eggs were randomly divided into two groups (n = 70 each), and on day 12 of incubation, eggs in one group received in ovo injection of 0.1 mL (containing 20 mg) of OligoPKC, while those in the other group received 0.1 mL of saline (placebo) solution. Of these in ovo placebo or OligoPKC injected eggs, after hatching, six chicks from each group were sampled for day-one analysis, while 48 chicks from each group were randomly allocated to two dietary regimes involving either no feeding or feeding of OligoPKC through basal diet for a 14 days experiment forming the experimental groups as: (i) saline-injected (Control, C), (ii) OligoPKC-injected (PREBovo), (iii) saline-injected, but fed 1% OligoPKC (PREBd), and (iv) OligoPKC-injected and also 1% OligoPKC (PREBovo+d). In ovo injection of prebiotic OligoPKC had no effect on body weight and serum immunoglobulins concentrations of day old chicks, except for IgG, which was increased significantly (P<0.05). Body weight and feed conversion ratio of 14 days old chicks were neither affected by in ovo injection nor feeding of OligoPKC. However, populations of cecal total bacteria and major beneficial bacteria of the chicks were markedly enhanced by feeding of OligoPKC (PREBd and PREBovo+d > C and PREBovo), but lesser influenced by in ovo OligoPKC injection. Irrespective of its prior in ovo exposure, chicks fed OligoPKC diets had lower population of pathogenic bacteria. 
Overall serum immunoglobulin status of birds was improved by feeding of OligoPKC but in ovo OligoPKC injection had minor effect on that. In most cases, in ovo OligoPKC injection and feeding of OligoPKC reduced the expression of nutrient transporters in the intestine and improved antioxidant capacity of liver and serum. It is concluded that in ovo injection of OligoPKC increased IgG production and antioxidant capacity in serum and liver of prenatal chicks and had limited carrying-over effects on the post-hatched chicks comparing to the supplementary feeding of OligoPKC.
Oligo-dT anchored cDNA-SCoT: a novel differential display method for analyzing differential gene expression in response to several stress treatments in mango (Mangifera indica L.).

PubMed

Luo, Cong; He, Xin-Hua; Hu, Ying; Yu, Hai-xia; Ou, Shi-Jin; Fang, Zhong-Bin

2014-09-15

Differential display is a powerful technique for analyzing differences in gene expression. The oligo-dT cDNA-start codon targeted marker (cDNA-SCoT) technique is a novel, simple, cheap, rapid, and efficient method for differential gene expression research. In the present study, the oligo-dT anchored cDNA-SCoT technique was exploited to identify differentially expressed genes during several stress treatments in mango. A total of 37 primers combined with oligo-dT anchor primers at the 3′ side amplified approximately 150 fragments of 150 bp to 1500 bp in length. Up to 100 fragments were differentially expressed among the stress treatments and control samples, among which 92 were obtained and sequenced. Out of the 92 transcript-derived fragments (TDFs), 70% were highly homologous to known genes, and 30% encoded unclassified proteins with unknown functions. The expression pattern of nine genes with known functions involved in several abiotic stresses in other species was confirmed by quantitative reverse transcription polymerase chain reaction (qRT-PCR) under cold (4 °C), salinity (NaCl), polyethylene glycol (PEG, MW 6000), and heavy metal treatments in leaves and stems at different time points (0, 24, 48, and 72 h). The expression patterns of the genes (TDF4, TDF7, TDF23, TDF45, TDF49, TDF50, TDF57, TDF91 and TDF92) that had direct or indirect relationships with cold, salinity, drought and heavy metal stress response were analyzed through qRT-PCR. The possible roles of these genes are discussed. This study suggests that the oligo-dT anchored cDNA-SCoT differential display method is a useful initial step for characterizing transcriptional changes induced by abiotic stresses and provides gene information for further study and application in genetic improvement and breeding in mango. Copyright © 2014 Elsevier B.V. All rights reserved.
Spectroscopic studies of the interaction of aspirin and its important metabolite, salicylate ion, with DNA, A·T and G·C rich sequences

NASA Astrophysics Data System (ADS)

Bathaie, S. Z.; Nikfarjam, L.; Rahmanpour, R.; Moosavi-Movahedi, A. A.

2010-12-01

Among the different biological effects of acetylsalicylic acid (ASA), its anticancer property is controversial. Since ASA hydrolyzes rapidly to salicylic acid (SA), especially in the blood, the interaction of both ASA and SA (as small molecules) with ctDNA, oligo(dA·dT)₁₅ and oligo(dG·dC)₁₅, as a possible mechanism of their action, is investigated here. The results show that the rate of ASA hydrolysis in the absence and presence of ctDNA is similar. The spectrophotometric results indicate that both ASA and SA cooperatively bind to ctDNA. The binding constants (K) are (1.7 ± 0.7) × 10³ M⁻¹ and (6.7 ± 0.2) × 10³ M⁻¹ for ASA and SA, respectively. Both ligands quench the fluorescence emission of the ethidium bromide (Et)-ctDNA complex. The Scatchard plots indicate non-displacement based quenching (non-intercalative binding). The circular dichroism (CD) spectra of ASA- or SA-ctDNA complexes show a minor distortion of ctDNA structure, with no characteristic peaks for intercalation of ligands. The Tm of ctDNA is decreased by up to 3 °C upon ASA binding. The CD results also indicate more distortion of the oligo(dG·dC)₁₅ structure due to the binding of both ASA and SA in comparison with oligo(dA·dT)₁₅. All data indicate that SA binds the DNA minor groove with higher affinity than ASA, which has a more hydrophobic character.
Multi-targeted priming for genome-wide gene expression assays.

PubMed

Adomas, Aleksandra B; Lopez-Giraldez, Francesc; Clark, Travis A; Wang, Zheng; Townsend, Jeffrey P

2010-08-17

Complementary approaches to assaying global gene expression are needed to assess gene expression in regions that are poorly assayed by current methodologies. A key component of nearly all gene expression assays is the reverse transcription of transcribed sequences that has traditionally been performed by priming the poly-A tails on many of the transcribed genes in eukaryotes with oligo-dT, or by priming RNA indiscriminately with random hexamers. We designed an algorithm to find common sequence motifs that were present within most protein-coding genes of Saccharomyces cerevisiae and of Neurospora crassa, but that were not present within their ribosomal RNA or transfer RNA genes. We then experimentally tested whether degenerately priming these motifs with multi-targeted primers improved the accuracy and completeness of transcriptomic assays. We discovered two multi-targeted primers that would prime a preponderance of genes in the genomes of Saccharomyces cerevisiae and Neurospora crassa while avoiding priming ribosomal RNA or transfer RNA. Examining the response of Saccharomyces cerevisiae to nitrogen deficiency and profiling Neurospora crassa early sexual development, we demonstrated that using multi-targeted primers in reverse transcription led to superior performance of microarray profiling and next-generation RNA tag sequencing. Priming with multi-targeted primers in addition to oligo-dT resulted in higher sensitivity, a larger number of well-measured genes and greater power to detect differences in gene expression. Our results provide the most complete and detailed expression profiles of the yeast nitrogen starvation response and N. crassa early sexual development to date. Furthermore, our multi-targeting priming methodology for genome-wide gene expression assays provides selective targeting of multiple sequences and counter-selection against undesirable sequences, facilitating a more complete and precise assay of the transcribed sequences within the genome.
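The motif search described in this abstract can be illustrated with a small k-mer screen. The sketch below is a simplified stand-in and not the authors' algorithm: it assumes fixed-length, non-degenerate motifs, ignores reverse complements, and uses made-up toy sequences. The function name `candidate_motifs` and its parameters are hypothetical.

```python
def kmers(seq, k):
    """Return the set of distinct k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def candidate_motifs(targets, excluded, k):
    """Rank k-mers by the fraction of target sequences that contain them,
    discarding any k-mer that occurs in an excluded sequence.

    targets  -- sequences we want to prime (e.g. protein-coding genes)
    excluded -- sequences we want to avoid (e.g. rRNA and tRNA genes)
    """
    banned = set()
    for seq in excluded:
        banned |= kmers(seq, k)

    coverage = {}
    for seq in targets:
        # Use a set so each target counts at most once per k-mer.
        for kmer in kmers(seq, k) - banned:
            coverage[kmer] = coverage.get(kmer, 0) + 1

    n = len(targets)
    return sorted(((count / n, kmer) for kmer, count in coverage.items()),
                  reverse=True)


# Toy data, invented purely for illustration.
toy_targets = ["ACGTAC", "TTACGT", "GGACGA"]
toy_excluded = ["CGTT"]
ranked = candidate_motifs(toy_targets, toy_excluded, k=3)
print(ranked[:3])
```

Ranking by the fraction of targets covered, after counter-selecting against the excluded set, mirrors the two goals stated in the abstract: prime most protein-coding genes while avoiding ribosomal and transfer RNA.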
A dinucleotide motif in oligonucleotides shows potent immunomodulatory activity and overrides species-specific recognition observed with CpG motif.

PubMed

Kandimalla, Ekambar R; Bhagat, Lakshmi; Zhu, Fu-Gang; Yu, Dong; Cong, Yan-Ping; Wang, Daqing; Tang, Jimmy X; Tang, Jin-Yan; Knetter, Cathrine F; Lien, Egil; Agrawal, Sudhir

2003-11-25

Bacterial and synthetic DNAs containing CpG dinucleotides in specific sequence contexts activate the vertebrate immune system through Toll-like receptor 9 (TLR9). In the present study, we used a synthetic nucleoside with a bicyclic heterobase [1-(2'-deoxy-beta-d-ribofuranosyl)-2-oxo-7-deaza-8-methyl-purine; R] to replace the C in CpG, resulting in an RpG dinucleotide. The RpG dinucleotide was incorporated in mouse- and human-specific motifs in oligodeoxynucleotides (oligos) and 3'-3'-linked oligos, referred to as immunomers. Oligos containing the RpG motif induced cytokine secretion in mouse spleen-cell cultures. Immunomers containing RpG dinucleotides showed activity in transfected HEK293 cells stably expressing mouse TLR9, suggesting direct involvement of TLR9 in the recognition of the RpG motif. In J774 macrophages, RpG motifs activated NF-kappa B and mitogen-activated protein kinase pathways. Immunomers containing the RpG dinucleotide induced high levels of IL-12 and IFN-gamma, but lower IL-6, in a time- and concentration-dependent fashion in mouse spleen-cell cultures costimulated with IL-2. Importantly, immunomers containing GTRGTT and GARGTT motifs were recognized to a similar extent by both mouse and human immune systems. Additionally, both mouse- and human-specific RpG immunomers potently stimulated proliferation of peripheral blood mononuclear cells obtained from diverse vertebrate species, including monkey, pig, horse, sheep, goat, rat, and chicken. An immunomer containing the GTRGTT motif prevented conalbumin-induced and ragweed allergen-induced allergic inflammation in mice. We show that a synthetic bicyclic nucleotide is recognized in the C position of a CpG dinucleotide by immune cells from diverse vertebrate species without bias for flanking sequences, suggesting a divergent nucleotide motif recognition pattern of TLR9.
An in vitro study of alginate oligomer therapies on oral biofilms.

PubMed

Roberts, J L; Khan, S; Emanuel, C; Powell, L C; Pritchard, M F; Onsøyen, E; Myrvold, R; Thomas, D W; Hill, K E

2013-10-01

The in vitro effect of a novel, oligosaccharide nanomedicine OligoG against oral pathogen-related biofilms, both alone and in the presence of the conventional anti-bacterial agent triclosan, was evaluated. The effect of OligoG±triclosan was assessed against established Streptococcus mutans and Porphyromonas gingivalis biofilms by bacterial counts and image analysis using LIVE/DEAD(®) staining and atomic force microscopy (AFM). The effect of triclosan and OligoG surface pre-treatments on bacterial attachment to titanium and polymethylmethacrylate was also studied. OligoG potentiated the antimicrobial effect of triclosan, particularly when used in combination at 0.3% against S. mutans grown in artificial saliva. OligoG was less effective against established P. gingivalis biofilms. However, attachment of P. gingivalis, to titanium in particular, was significantly reduced after surface pre-treatment with OligoG and triclosan at 0.01% when compared to controls. Light microscopy and AFM showed that OligoG was biocidal to P. gingivalis, but not S. mutans. OligoG and triclosan when used in combination produced an enhanced antimicrobial effect against two important oral pathogens and reduced bacterial attachment to dental materials such as titanium, even at reduced triclosan concentrations. Whilst the use of triclosan against oral bacteria has been widely documented, its synergistic use with OligoG described here, has not previously been reported. The use of lower concentrations of triclosan, if used in combination therapy with OligoG, could have environmental benefits. The potentiation of antimicrobial agents by naturally occurring oligomers such as OligoG may represent a novel, safe adjunct to conventional oral hygiene and periodontal therapy. The ability of OligoG to inhibit the growth and impair bacterial adherence highlights its potential in the management of peri-implantitis. Copyright © 2013 Elsevier Ltd. All rights reserved.
The Trypanosoma cruzi Satellite DNA OligoC-TesT and Trypanosoma cruzi Kinetoplast DNA OligoC-TesT for Diagnosis of Chagas Disease: A Multi-cohort Comparative Evaluation Study

PubMed Central

De Winne, Koen; Büscher, Philippe; Luquetti, Alejandro O.; Tavares, Suelene B. N.; Oliveira, Rodrigo A.; Solari, Aldo; Zulantay, Ines; Apt, Werner; Diosque, Patricio; Monje Rumi, Mercedes; Gironès, Nuria; Fresno, Manuel; Lopez-Velez, Rogelio; Perez-Molina, José A.; Monge-Maillo, Begoña; Garcia, Lineth; Deborggraeve, Stijn

2014-01-01

Background: The Trypanosoma cruzi satellite DNA (satDNA) OligoC-TesT is a standardised PCR format for diagnosis of Chagas disease. The sensitivity of the test is lower for discrete typing unit (DTU) TcI than for TcII-VI and the test has not been evaluated in chronic Chagas disease patients. Methodology/Principal Findings: We developed a new prototype of the OligoC-TesT based on kinetoplast DNA (kDNA) detection. We evaluated the satDNA and kDNA OligoC-TesTs in a multi-cohort study with 187 chronic Chagas patients and 88 healthy endemic controls recruited in Argentina, Chile and Spain and 26 diseased non-endemic controls from D.R. Congo and Sudan. All specimens were tested in duplicate. The overall specificity in the controls was 99.1% (95% CI 95.2%–99.8%) for the satDNA OligoC-TesT and 97.4% (95% CI 92.6%–99.1%) for the kDNA OligoC-TesT. The overall sensitivity in the patients was 67.9% (95% CI 60.9%–74.2%) for the satDNA OligoC-TesT and 79.1% (95% CI 72.8%–84.4%) for the kDNA OligoC-TesT. Conclusions/Significance: Specificities of the two T. cruzi OligoC-TesT prototypes are high on non-endemic and endemic controls. Sensitivities are moderate but significantly (p = 0.0004) higher for the kDNA OligoC-TesT compared to the satDNA OligoC-TesT. PMID:24392177
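As a worked illustration of how a sensitivity and its 95% confidence interval can be computed, the sketch below applies the Wilson score interval to a binomial proportion. Two things are assumptions and not taken from the abstract: the count of 127 positives among 187 patients is a hypothetical value chosen only because it is consistent with the reported 67.9%, and the abstract does not say which interval method the authors used.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion.

    z = 1.96 corresponds to an approximate 95% confidence level.
    """
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Hypothetical count (assumption): 127 of 187 patients test positive.
positives, patients = 127, 187
low, high = wilson_interval(positives, patients)
print(f"sensitivity = {positives / patients:.1%}, "
      f"95% CI {low:.1%} to {high:.1%}")
```

The same function applies to specificity by passing the number of true negatives and the number of controls.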
EST-PAC a web package for EST annotation and protein sequence prediction

PubMed Central

Strahm, Yvan; Powell, David; Lefèvre, Christophe

2006-01-01

With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST) annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1) searching local or remote biological databases for sequence similarities using Blast services, 2) predicting protein coding sequence from EST data and, 3) annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics. PMID:17147782
Telomere Fragment Induced Amnion Cell Senescence: A Contributor to Parturition?

PubMed Central

Polettini, Jossimara; Behnia, Faranak; Taylor, Brandie D.; Saade, George R.; Taylor, Robert N.; Menon, Ramkumar

2015-01-01

Oxidative stress (OS)-induced senescence of the amniochorion has been associated with parturition at term. We investigated whether telomere fragments shed into the amniotic fluid (AF) correlated with labor status and tested if exogenous telomere fragments (T-oligos) could induce human and murine amnion cell senescence. In a cross-sectional clinical study, AF telomere fragment concentrations quantitated by a validated real-time PCR assay were higher in women in labor at term compared to those not in labor. In vitro treatment of primary human amnion epithelial cells with 40 μM T-oligos ([TTAGGG]2) that mimic telomere fragments, activated p38MAPK, produced senescence-associated (SA) β-gal staining and increased interleukin (IL)-6 and IL-8 production compared to cells treated with complementary DNA sequences (Cont-oligos, [AATCCC]2). T-oligos injected into the uteri of pregnant CD1 mice on day 14 of gestation, led to increased p38MAPK, SA-β-gal (SA β-gal) staining in murine amniotic sacs and higher AF IL-8 levels on day 18, compared to saline treated controls. In summary, term labor AF samples had higher telomere fragments than term not in labor AF. In vitro and in situ telomere fragments increased human and murine amnion p38MAPK, senescence and inflammatory cytokines. We propose that telomere fragments released from senescent fetal cells are indicative of fetal cell aging. Based on our data, these telomere fragments cause oxidative stress associated damages to the term amniotic sac and force them to release other DAMPS, which, in turn, provide a sterile immune response that may be one of the many inflammatory signals required to initiate parturition at term. PMID:26397719
Oligo-Miocene foraminiferal record (Miogypsinidae, Lepidocyclinidae and Nummulitidae) from the Western Taurides (SW Turkey): Biometry and implications for the regional geology

NASA Astrophysics Data System (ADS)

Özcan, Ercan; Less, György; Báldi-Beke, Mária; Kollányi, Katalin; Acar, Ferhat

2009-05-01

The marine Oligo-Miocene units of the western Taurides, deposited under different tectonic regimes (in the Bey Dağları platform in the foreland and coeval sequences in the hinterland), were studied to establish a high-resolution biostratigraphic framework. Biometric study of the full spectrum of larger foraminifera on a regional scale allowed us to correlate them with the shallow benthic zonation (SBZ) system introduced by [Cahuzac, B., Poignant, A., 1997. Essai de biozonation de l'Oligo-Miocène dans les bassins européens à l'aide des grands foraminifères néritiques. Bulletin de la Société géologique de France 168, 155-169], and to determine the ages of these sites with zonal precision for the first time. In correlating these assemblages to standard shallow benthic zones, planktonic data were also used whenever possible. Taxa classified under the genera Nummulites, Miogypsina, Miolepidocyclina, Nephrolepidina, Eulepidina, Heterostegina, Operculina and Cycloclypeus (?), and their assemblages, closely resemble the fauna described from European basins. These groups characterize the SBZ 22B to 25 zones, referring to a time interval from early Chattian to Burdigalian. However, a major gap in the late Chattian (SBZ 23) and in the early part of the Aquitanian (SBZ 24) is also recorded in the platform succession. In the meantime, rare Eulepidina in the Burdigalian levels suggest a clear Indo-Pacific influence. Based on the discovery of early Chattian (SBZ 22B) deposits (previously mapped under Eocene/Miocene units), the Oligo-Miocene stratigraphy of the Bey Dağları platform is also revised. A more precise chronology for the regional Miocene transgression is presented based on the miogypsinid evolutionary scale.
Oligomeric BAX induces mitochondrial permeability transition and complete cytochrome c release without oxidative stress.

PubMed

Li, Tsyregma; Brustovetsky, Tatiana; Antonsson, Bruno; Brustovetsky, Nickolay

2008-11-01

In the present study, we investigated the mechanism of cytochrome c release from isolated brain mitochondria induced by recombinant oligomeric BAX (BAX(oligo)). We found that BAX(oligo) caused a complete release of cytochrome c in a concentration- and time-dependent manner. The release was similar to that induced by alamethicin, which causes maximal mitochondrial swelling and eliminates barrier properties of the outer mitochondrial membrane (OMM). BAX(oligo) also produced large-amplitude mitochondrial swelling as judged by light scattering assay and transmission electron microscopy. In addition, BAX(oligo) caused strong mitochondrial depolarization. ATP or a combination of cyclosporin A and ADP, inhibitors of the mitochondrial permeability transition (mPT), suppressed BAX(oligo)-induced mitochondrial swelling and depolarization as well as cytochrome c release but did not influence BAX(oligo) insertion into the OMM. Both BAX(oligo)- and alamethicin-induced cytochrome c releases were accompanied by inhibition of ROS generation, which was assessed by measuring mitochondrial H₂O₂ release with an Amplex Red assay. The mPT inhibitors antagonized suppression of ROS generation caused by BAX(oligo) but not by alamethicin. Thus, BAX(oligo) resulted in a complete cytochrome c release from isolated brain mitochondria in an mPT-dependent manner without involvement of oxidative stress, by a mechanism requiring mitochondrial remodeling and permeabilization of the OMM.
OligoG CF-5/20 normalizes cystic fibrosis mucus by chelating calcium.

PubMed

Ermund, Anna; Recktenwald, Christian V; Skjåk-Braek, Gudmund; Meiss, Lauren N; Onsøyen, Edvar; Rye, Philip D; Dessen, Arne; Myrset, Astrid Hilde; Hansson, Gunnar C

2017-06-01

The goal of this study was to determine whether the guluronate (G) rich alginate OligoG CF-5/20 (OligoG) could detach cystic fibrosis (CF) mucus by calcium chelation, which is also required for normal mucin unfolding. Since bicarbonate secretion is impaired in CF, leading to insufficient mucin unfolding and thereby attached mucus, and since bicarbonate has the ability to bind calcium, we hypothesized that the calcium chelating property of OligoG would lead to detachment of CF mucus. Indeed, OligoG could compete with the N-terminus of the MUC2 mucin for calcium binding as shown by microscale thermophoresis. Further, effects on mucus thickness and attachment induced by OligoG and other alginate fractions of different length and composition were evaluated in explants of CF mouse ileum mounted in horizontal Ussing-type chambers. OligoG at 1.5% caused effective detachment of CF mucus and the most potent alginate fraction tested, the poly-G fraction of about 12 residues, had similar potency compared to OligoG whereas mannuronate-rich (M) polymers had minimal effect. In conclusion, OligoG binds calcium with appropriate affinity without any overt harmful effect on the tissue and can be exploited for treating mucus stagnation. © 2017 John Wiley & Sons Australia, Ltd.
Counting of oligomers in sequences generated by markov chains for DNA motif discovery.

PubMed

Shan, Gao; Zheng, Wei-Mou

2009-02-01

By means of the technique of the imbedded Markov chain, an efficient algorithm is proposed to exactly calculate first, second moments of word counts and the probability for a word to occur at least once in random texts generated by a Markov chain. A generating function is introduced directly from the imbedded Markov chain to derive asymptotic approximations for the problem. Two Z-scores, one based on the number of sequences with hits and the other on the total number of word hits in a set of sequences, are examined for discovery of motifs on a set of promoter sequences extracted from A. thaliana genome. Source code is available at http://www.itp.ac.cn/zheng/oligo.c.
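The quantities named in this abstract (expected word count, and the probability that a word occurs at least once in a Markov-generated text) can be sketched with a generic automaton-based dynamic program. This is an illustrative reconstruction under stated assumptions, not the authors' code from the URL above: it assumes a first-order chain, tracks the pair (matched prefix length, last letter), and all function names are hypothetical. The expected-count formula additionally assumes `init` is the chain's stationary distribution.

```python
def kmp_automaton(word, alphabet):
    """delta[s][c] = length of the longest prefix of `word` that is a suffix
    of (word[:s] + c); state len(word) means a full match."""
    m = len(word)
    fail = [0] * (m + 1)          # fail[i] = longest proper border of word[:i]
    k = 0
    for i in range(1, m):
        while k and word[i] != word[k]:
            k = fail[k]
        if word[i] == word[k]:
            k += 1
        fail[i + 1] = k
    delta = [dict() for _ in range(m + 1)]
    for s in range(m + 1):
        for c in alphabet:
            if s < m and word[s] == c:
                delta[s][c] = s + 1
            elif s == 0:
                delta[s][c] = 0
            else:
                delta[s][c] = delta[fail[s]][c]   # fall back to a shorter border
    return delta


def prob_at_least_once(word, n, init, trans):
    """P(word occurs at least once in a length-n text) for a first-order
    Markov chain with initial distribution `init` and transitions `trans`."""
    alphabet = sorted(init)
    m = len(word)
    delta = kmp_automaton(word, alphabet)
    hit = 0.0                     # probability mass already absorbed at a match
    dist = {}                     # (automaton state, last letter) -> probability
    for c in alphabet:
        s = delta[0][c]
        if s == m:
            hit += init[c]
        else:
            dist[(s, c)] = dist.get((s, c), 0.0) + init[c]
    for _ in range(n - 1):
        nxt = {}
        for (s, a), p in dist.items():
            for c in alphabet:
                q = p * trans[a][c]
                t = delta[s][c]
                if t == m:
                    hit += q
                else:
                    nxt[(t, c)] = nxt.get((t, c), 0.0) + q
        dist = nxt
    return hit


def expected_count(word, n, init, trans):
    """First moment of the word count, assuming `init` is stationary."""
    p = init[word[0]]
    for a, b in zip(word, word[1:]):
        p *= trans[a][b]
    return max(n - len(word) + 1, 0) * p
```

For example, with a uniform chain over A, C, G and T, `prob_at_least_once("AA", 3, init, trans)` counts the 7 of 64 length-3 texts that contain "AA". Absorbing the probability mass at the first match is what turns the recursion into an "at least once" probability instead of a count.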
Sequence analysis of 497 mouse brain ESTs expressed in the substantia nigra

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stewart, G.J.; Savioz, A.; Davies, R.W.

1997-01-15

The use of subtracted, region-specific cDNA libraries combined with single-pass cDNA sequencing allows the discovery of novel genes and facilitates molecular description of the tissue or region involved. We report the sequence of 497 mouse expressed sequence tags (ESTs) from two subtracted libraries enriched for cDNAs expressed in the substantia nigra, a brain region with important roles in movement control and Parkinson disease. Of these, 238 ESTs give no database matches and therefore derive from novel genes. A further 115 ESTs show sequence similarity to ESTs from other organisms, which themselves do not yield any significant database matches to genes of known function. Fifty-six ESTs show sequence similarity to previously identified genes whose mouse homologues have not been reported. The total number of ESTs reported that are new for the mouse is 407, which, together with the 90 ESTs corresponding to known mouse genes or cDNAs, contributes to the molecular description of the substantia nigra. 21 refs., 4 tabs.
Development of an Expressed Sequence Tag (EST) Resource for Wheat (Triticum aestivum L.)

PubMed Central

Lazo, G. R.; Chao, S.; Hummel, D. D.; Edwards, H.; Crossman, C. C.; Lui, N.; Matthews, D. E.; Carollo, V. L.; Hane, D. L.; You, F. M.; Butler, G. E.; Miller, R. E.; Close, T. J.; Peng, J. H.; Lapitan, N. L. V.; Gustafson, J. P.; Qi, L. L.; Echalier, B.; Gill, B. S.; Dilbirligi, M.; Randhawa, H. S.; Gill, K. S.; Greene, R. A.; Sorrells, M. E.; Akhunov, E. D.; Dvořák, J.; Linkiewicz, A. M.; Dubcovsky, J.; Hossain, K. G.; Kalavacharla, V.; Kianian, S. F.; Mahmoud, A. A.; Miftahudin; Ma, X.-F.; Conley, E. J.; Anderson, J. A.; Pathan, M. S.; Nguyen, H. T.; McGuire, P. E.; Qualset, C. O.; Anderson, O. D.

2004-01-01

This report describes the rationale, approaches, organization, and resource development leading to a large-scale deletion bin map of the hexaploid (2n = 6x = 42) wheat genome (Triticum aestivum L.). Accompanying reports in this issue detail results from chromosome bin-mapping of expressed sequence tags (ESTs) representing genes onto the seven homoeologous chromosome groups and a global analysis of the entire mapped wheat EST data set. Among the resources developed were the first extensive public wheat EST collection (113,220 ESTs). Described are protocols for sequencing, sequence processing, EST nomenclature, and the assembly of ESTs into contigs. These contigs plus singletons (unassembled ESTs) were used for selection of distinct sequence motif unigenes. Selected ESTs were rearrayed, validated by 5′ and 3′ sequencing, and amplified for probing a series of wheat aneuploid and deletion stocks. Images and data for all Southern hybridizations were deposited in databases and were used by the coordinators for each of the seven homoeologous chromosome groups to validate the mapping results. Results from this project have established the foundation for future developments in wheat genomics. PMID:15514037
Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing

PubMed Central

Chandran, Anandhakumar; Syed, Junetha; Taylor, Rhys D.; Kashiwazaki, Gengo; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

2016-01-01

Chemically engineered small molecules targeting specific genomic sequences play an important role in drug development research. Pyrrole-imidazole polyamides (PIPs) are a group of molecules that can bind to the DNA minor-groove and can be engineered to target specific sequences. Their biological effects rely primarily on their selective DNA binding. However, the binding mechanism of PIPs at the chromatinized genome level is poorly understood. Herein, we report a method using high-throughput sequencing to identify the DNA-alkylating sites of PIP-indole-seco-CBI conjugates. High-throughput sequencing analysis of conjugate 2 showed highly similar DNA-alkylating sites on synthetic oligos (histone-free DNA) and on human genomes (chromatinized DNA context). To our knowledge, this is the first report identifying alkylation sites across genomic DNA by alkylating PIP conjugates using high-throughput sequencing. PMID:27098039
Small subunit ribosomal RNA sequence links the myxospore stage of Henneguya mississippiensis n. sp. from channel catfish Ictalurus punctatus to an actinospore released by the benthic oligochaete Dero digitata

USDA-ARS's Scientific Manuscript database

There are more than 200 species of Henneguya described from fish. Of these, only three life cycles have been determined, identifying the actinospore and myxospore stages from their respective hosts. Two of these life cycles involve the channel catfish (Ictalurus punctatus) and the freshwater oligo...

Global gene expression profiles of Phytophthora ramorum strain pr102 in response to plant host and tissue differentiation

Treesearch

Caroline M. Press; Niklaus J. Grunwald

2008-01-01

The release of the draft genome sequence of P. ramorum strain Pr102 enabled the construction of an oligonucleotide microarray of the entire genome of Pr102. The array contains 344,680 features (oligos) that represent the transcriptome of Pr102. P. ramorum RNA was extracted from mycelium and sporangia and used to compare gene...
Genomic overview of mRNA 5′-leader trans-splicing in the ascidian Ciona intestinalis

PubMed Central

Satou, Yutaka; Hamaguchi, Makoto; Takeuchi, Keisuke; Hastings, Kenneth E. M.; Satoh, Nori

2006-01-01

Although spliced leader (SL) trans-splicing in the chordates was discovered in the tunicate Ciona intestinalis there has been no genomic overview analysis of the extent of trans-splicing or the make-up of the trans-spliced and non-trans-spliced gene populations of this model organism. Here we report such an analysis for Ciona based on the oligo-capping full-length cDNA approach. We randomly sampled 2078 5′-full-length ESTs representing 668 genes, or 4.2% of the entire genome. Our results indicate that Ciona contains a single major SL, which is efficiently trans-spliced to mRNAs transcribed from a specific set of genes representing ∼50% of the total number of expressed genes, and that individual trans-spliced mRNA species are, on average, 2–3-fold less abundant than non-trans-spliced mRNA species. Our results also identify a relationship between trans-splicing status and gene functional classification; ribosomal protein genes fall predominantly into the non-trans-spliced category. In addition, our data provide the first evidence for the occurrence of polycistronic transcription in Ciona. An interesting feature of the Ciona polycistronic transcription units is that the great majority entirely lack intercistronic sequences. PMID:16822859
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N [San Leandro, CA]; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA]; Young, Jennifer A [Berkeley, CA]; Clague, David S [Livermore, CA]

2011-01-18

A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
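As a rough illustration of user-specified hybridizing overlaps, the following Python sketch tiles a target sequence with segments whose neighbours share a chosen number of bases. The function name `overlapping_segments`, the fixed segment length, and the single-strand tiling are assumptions made for illustration only; the alternating strands, variable segment lengths, temporal separation, and computational predictions described in the record are not modeled.

```python
def overlapping_segments(target: str, seg_len: int, overlap: int) -> list[str]:
    """Tile `target` with segments of length `seg_len`; neighbours share `overlap` bases.

    Hypothetical helper: the last segment may be shorter than `seg_len`.
    """
    if not 0 < overlap < seg_len:
        raise ValueError("overlap must be positive and shorter than the segment length")
    step = seg_len - overlap          # how far each new segment advances
    segments = []
    start = 0
    while True:
        segments.append(target[start:start + seg_len])
        if start + seg_len >= len(target):
            break                     # the target is fully covered
        start += step
    return segments


if __name__ == "__main__":
    for seg in overlapping_segments("ATGGCCTTAGCAGT", seg_len=6, overlap=2):
        print(seg)
```

Dropping the first `overlap` bases of every segment after the first and concatenating the pieces recovers the target, which is a quick sanity check on any chosen segment length and overlap.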
Template-directed synthesis using the heterogeneous templates produced by montmorillonite catalysis. A possible bridge between the prebiotic and RNA worlds

NASA Technical Reports Server (NTRS)

Ertem, G.; Ferris, J. P.

1997-01-01

The synthesis of oligoguanylates [oligo(G)s] is catalyzed by a template of oligocytidylates [oligo(C)s] containing 2',5'- and 3',5'-linked phosphodiester bonds with and without incorporated C5'ppC groupings. An oligo(C) template containing exclusively 2',5'-phosphodiester bonds also serves as a template for the synthesis of complementary oligo(G)s. The oligo(C) template was prepared by the condensation of the 5'-phosphorimidazolide of cytidine on montmorillonite clay. These studies establish that RNA oligomers prepared by mineral catalysis, or other routes on the primitive earth, did not have to be exclusively 3',5'-linked to catalyze template-directed synthesis, since oligo(C)s containing a variety of linkage isomers serve as templates for the formation of complementary oligo(G)s. These findings support the postulate that origin of the RNA world was initiated by the RNA oligomers produced by polymerization of activated monomers formed by prebiotic processes.
Proteinase K-catalyzed synthesis of linear and star oligo(L-phenylalanine) conjugates.

PubMed

Ageitos, Jose M; Baker, Peter J; Sugahara, Michihiro; Numata, Keiji

2013-10-14

Chemoenzymatic synthesis of peptides is a green and clean chemical reaction that offers high yields without using organic synthesis and serves as an alternative to traditional peptide synthesis methods. This report describes the chemoenzymatic synthesis of oligo(L-phenylalanine) mediated by proteinase K from Tritirachium album, which is one of the most widely used proteases in molecular biological studies. The synthesized linear oligo-phenylalanine showed a unique self-assembly in aqueous solutions. To further functionalize linear oligo(L-phenylalanine) as a low-molecular-weight gelator, it was cosynthesized with tris(2-aminoethyl)amine to obtain star-oligo(L-phenylalanine), which was bioconjugated to demonstrate its self-assembly into fluorescent fibers. The self-assembled fibers of star-oligo(L-phenylalanine) formed fibrous networks with various branching ratios, which depended on the molecular weights and molecular aspect ratios of star-oligo(L-phenylalanine). This is the first study to demonstrate that proteinase K is a suitable enzyme for chemoenzymatic cosynthesis of oligopeptides and star-shaped heteropeptides.
T-oligo as an anticancer agent in colorectal cancer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wojdyla, Luke; Stone, Amanda L.; Sethakorn, Nan

Highlights: • T-oligo induces cell cycle arrest, senescence, apoptosis, and differentiation in CRC. • Treatment with T-oligo downregulates telomere-associated proteins. • T-oligo combined with an EGFR-TKI additively inhibits cellular proliferation. • T-oligo has potential as an effective therapeutic agent for CRC. - Abstract: In the United States, there will be an estimated 96,830 new cases of colorectal cancer (CRC) and 50,310 deaths in 2014. CRC is often detected at late stages of the disease, at which point there is no effective chemotherapy. Thus, there is an urgent need for effective novel therapies that have minimal effects on normal cells. T-oligo, an oligonucleotide homologous to the 3′-telomere overhang, induces potent DNA damage responses in multiple malignant cell types; however, its efficacy in CRC has not been studied. This is the first investigation demonstrating T-oligo-induced anticancer effects in two CRC cell lines, HT-29 and LoVo, which are highly resistant to conventional chemotherapies. In this investigation, we show that T-oligo may mediate its DNA damage responses through the p53/p73 pathway, thereby inhibiting cellular proliferation and inducing apoptosis or senescence. Additionally, upregulation of downstream DNA damage response proteins, including E2F1, p53 or p73, was observed. In LoVo cells, T-oligo induced senescence, decreased clonogenicity, and increased expression of senescence associated proteins p21, p27, and p53. In addition, downregulation of POT1 and TRF2, two components of the shelterin protein complex which protects telomeric ends, was observed. Moreover, we studied the antiproliferative effects of T-oligo in combination with an EGFR tyrosine kinase inhibitor, Gefitinib, which resulted in an additive inhibitory effect on cellular proliferation. Collectively, these data provide evidence that T-oligo alone, or in combination with other molecularly targeted therapies, has potential as an anti-cancer agent in CRC.
JANE: efficient mapping of prokaryotic ESTs and variable length sequence reads on related template genomes

PubMed Central

2009-01-01

Background ESTs or variable sequence reads can be available in prokaryotic studies well before a complete genome is known. Use cases include (i) transcriptome studies or (ii) single cell sequencing of bacteria. Without suitable software their further analysis and mapping would have to await finalization of the corresponding genome. Results The tool JANE rapidly maps ESTs or variable sequence reads in prokaryotic sequencing and transcriptome efforts to related template genomes. It provides an easy-to-use graphics interface for information retrieval and a toolkit for EST or nucleotide sequence function prediction. Furthermore, we developed for rapid mapping an enhanced sequence alignment algorithm which reassembles and evaluates high scoring pairs provided from the BLAST algorithm. Rapid assembly on and replacement of the template genome by sequence reads or mapped ESTs is achieved. This is illustrated (i) by data from Staphylococci as well as from a Blattabacteria sequencing effort, (ii) mapping single cell sequencing reads is shown for poribacteria to sister phylum representative Rhodopirellula Baltica SH1. The algorithm has been implemented in a web-server accessible at http://jane.bioapps.biozentrum.uni-wuerzburg.de. Conclusion Rapid prokaryotic EST mapping or mapping of sequence reads is achieved applying JANE even without knowing the cognate genome sequence. PMID:19943962
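The record does not spell out how JANE reassembles BLAST high scoring pairs (HSPs). One generic way to combine HSPs is to chain those that are collinear in both query and template, which the Python sketch below illustrates. The tuple layout, the half-open coordinates, the simple score sum, and the name `chain_hsps` are all assumptions for illustration and should not be read as JANE's actual algorithm.

```python
def chain_hsps(hsps):
    """Return (total_score, chain) for the best-scoring collinear chain of HSPs.

    Each HSP is a hypothetical tuple (q_start, q_end, s_start, s_end, score)
    with half-open coordinates on the query (q) and the template genome (s).
    """
    ordered = sorted(hsps, key=lambda h: (h[0], h[2]))
    if not ordered:
        return 0, []
    best = [h[4] for h in ordered]        # best chain score ending at each HSP
    prev = [None] * len(ordered)          # back-pointers for the traceback
    for j, (qs_j, _, ss_j, _, score_j) in enumerate(ordered):
        for i in range(j):
            _, qe_i, _, se_i, _ = ordered[i]
            # HSP i may precede HSP j only if it ends before j starts on both axes
            if qe_i <= qs_j and se_i <= ss_j and best[i] + score_j > best[j]:
                best[j] = best[i] + score_j
                prev[j] = i
    j = max(range(len(ordered)), key=best.__getitem__)
    total = best[j]
    chain = []
    while j is not None:
        chain.append(ordered[j])
        j = prev[j]
    return total, chain[::-1]


if __name__ == "__main__":
    example = [
        (0, 50, 100, 150, 40),
        (60, 120, 160, 220, 55),
        (55, 110, 10, 60, 70),    # high score, but out of order on the template
        (130, 180, 230, 280, 30),
    ]
    print(chain_hsps(example))
```

This quadratic dynamic program is adequate for the handful of HSPs a single EST usually produces; a production tool would also need gap penalties and handling of overlapping HSPs, which this sketch leaves out.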
Homooligomeric β3 (R)-valine peptides: Transformation between C14 and C12 helical structures induced by a guest Aib residue.

PubMed

Vasantha, Basavalingappa; George, Gijo; Raghothama, Srinivasarao; Balaram, Padmanabhan

2017-01-01

Novel helical structures, unprecedented in the chemistry of α-polypeptides, may be found in polypeptides containing β and γ amino acids. The structural characterization of C12- and C14-helices in oligo β-peptides was originally achieved using conformationally constrained cyclic β-residues. This study explores the conformational characteristics of proteinogenic β3 residues in homooligomeric sequences and addresses the issue of inducing a transition between C14 and C12 helices by the introduction of a guest α-residue. Folded C14-helical structures are demonstrated for the nonapeptide Boc-[β3(R)Val]9-OMe by NMR methods in CDCl3-DMSO mixtures, while the peptide was found to be aggregated in CDCl3. The insertion of a guest Aib residue into an oligo-β-valine sequence in the octapeptide model Boc-[(β3(R)Val)3-Aib-(β3(R)Val]4-OMe results in a well-dispersed NH region in the NMR spectrum, indicating folded structures in CDCl3. Structure calculations for both peptides using NOE distance constraints support a C14 helical structure in the homooligomer, which transforms into a C12 helix on introduction of the guest Aib residue. © 2016 Wiley Periodicals, Inc.
Single Cell Total RNA Sequencing through Isothermal Amplification in Picoliter-Droplet Emulsion.

PubMed

Fu, Yusi; Chen, He; Liu, Lu; Huang, Yanyi

2016-11-15

Prevalent single cell RNA amplification and sequencing chemistries mainly focus on polyadenylated RNAs in eukaryotic cells by using oligo(dT) primers for reverse transcription. We develop a new RNA amplification method, "easier-seq", to reverse transcribe and amplify the total RNAs, both with and without polyadenylate tails, from a single cell for transcriptome sequencing with high efficiency, reproducibility, and accuracy. By distributing the reverse transcribed cDNA molecules into 1.5 × 10^5 aqueous droplets in oil, the cDNAs are isothermally amplified using random primers in each of these 65-pL reactors separately. This new method greatly improves the ease of single-cell RNA sequencing by reducing the experimental steps. Meanwhile, with less chance to induce errors, this method can easily maintain the quality of single-cell sequencing. In addition, this polyadenylate-tail-independent method can be seamlessly applied to prokaryotic cell RNA sequencing.
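The droplet count and droplet volume quoted above imply a total aqueous reaction volume, worked out in the short Python calculation below. It assumes every droplet has the nominal 65 pL volume; the variable names are illustrative.

```python
droplets = 150_000          # 1.5 x 10^5 droplets, as stated in the abstract
droplet_volume_pl = 65      # picoliters per droplet, as stated in the abstract

total_volume_pl = droplets * droplet_volume_pl
total_volume_ul = total_volume_pl / 1_000_000   # 1 microliter = 10^6 picoliters

print(f"Total aqueous volume: {total_volume_ul:.2f} microliters")
```

Under that assumption the emulsion holds just under 10 microliters of aqueous phase in total.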
SEAN: SNP prediction and display program utilizing EST sequence clusters.

PubMed

Huntley, Derek; Baldo, Angela; Johri, Saurabh; Sergot, Marek

2006-02-15

SEAN is an application that predicts single nucleotide polymorphisms (SNPs) using multiple sequence alignments produced from expressed sequence tag (EST) clusters. The algorithm uses rules of sequence identity and SNP abundance to determine the quality of the prediction. A Java viewer is provided to display the EST alignments and predicted SNPs.
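The record does not give SEAN's exact rules, so the following Python sketch only illustrates the general idea of calling candidate SNPs from an EST multiple sequence alignment using allele abundance. The thresholds `min_minor_count` and `min_depth`, the gap character, and the function name `candidate_snps` are assumptions for illustration.

```python
from collections import Counter


def candidate_snps(alignment, min_minor_count=2, min_depth=4):
    """Return [(column, base_counts)] for alignment columns that look polymorphic.

    `alignment` is a list of equal-length strings; characters other than
    A, C, G, T (gaps, N) are ignored. A column is reported only if the second
    most common base is seen at least `min_minor_count` times, so that a
    single sequencing error is not called as a SNP.
    """
    snps = []
    for col in range(len(alignment[0])):
        bases = [seq[col] for seq in alignment if seq[col] in "ACGT"]
        counts = Counter(bases)
        if len(bases) < min_depth or len(counts) < 2:
            continue
        minor_count = counts.most_common(2)[1][1]
        if minor_count >= min_minor_count:
            snps.append((col, dict(counts)))
    return snps


if __name__ == "__main__":
    cluster = [
        "ACGTAC",
        "ACGTAC",
        "ACCTAC",
        "ACCTAC",
        "ACGTAT",   # lone T in the last column: treated as a likely error
        "ACGT-C",
    ]
    print(candidate_snps(cluster))
```

Requiring the minor allele to appear more than once is a common guard against single-read sequencing errors in EST data; real tools typically also weigh base quality and flanking-sequence identity.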
ASFinder: a tool for genome-wide identification of alternatively splicing transcripts from EST-derived sequences.

PubMed

Min, Xiang Jia

2013-01-01

Expressed Sequence Tags (ESTs) are a rich resource for identifying Alternatively Splicing (AS) genes. The ASFinder webserver is designed to identify AS isoforms from EST-derived sequences. Two approaches are implemented in ASFinder. If no genomic sequences are provided, the server performs a local BLASTN to identify AS isoforms from ESTs having both ends aligned but an internal segment unaligned. Otherwise, ASFinder uses SIM4 to map ESTs to the genome, then the overlapping ESTs that are mapped to the same genomic locus and have internal variable exon/intron boundaries are identified as AS isoforms. The tool is available at http://proteomics.ysu.edu/tools/ASFinder.html.
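The genome-based approach described above can be sketched as a comparison of intron coordinates between ESTs mapped to the same locus. The Python below is a minimal illustration, assuming each mapped EST is given as a sorted list of half-open `(start, end)` exon blocks such as a spliced aligner might report; the helper names and the data layout are hypothetical and are not ASFinder's implementation.

```python
def introns(exons):
    """Introns implied by a sorted list of (start, end) exon blocks."""
    return [(a_end, b_start) for (_, a_end), (b_start, _) in zip(exons, exons[1:])]


def is_as_pair(exons_a, exons_b):
    """True if two ESTs share a genomic span but disagree on an internal intron."""
    lo = max(exons_a[0][0], exons_b[0][0])
    hi = min(exons_a[-1][1], exons_b[-1][1])
    if lo >= hi:
        return False                      # the two ESTs do not overlap on the genome

    def shared(exons):
        # keep only introns that lie entirely inside the span covered by both ESTs
        return {(s, e) for s, e in introns(exons) if s >= lo and e <= hi}

    return shared(exons_a) != shared(exons_b)


if __name__ == "__main__":
    full_length = [(100, 200), (300, 400), (500, 600)]
    exon_skipped = [(100, 200), (500, 600)]
    partial = [(150, 200), (300, 400)]
    print(is_as_pair(full_length, exon_skipped))   # differing internal boundaries
    print(is_as_pair(full_length, partial))        # consistent where they overlap
```

Restricting the comparison to the shared span avoids flagging ESTs that are merely truncated at different ends, which is frequent with single-pass reads.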
[Multiplexing mapping of human cDNAs]. Final report, September 1, 1991--February 28, 1994

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

Using PCR with automated product analysis, 329 human brain cDNA sequences have been assigned to individual human chromosomes. Primers were designed from single-pass cDNA sequences, or expressed sequence tags (ESTs). Primers were used in PCR reactions with DNA from somatic cell hybrid mapping panels as templates, often with multiplexing. Many of the mapped ESTs match sequence database records. To evaluate these matches, the position of the primers relative to the matching region (In), the BLAST scores and the Poisson probability values of the EST/sequence record match were determined. In cases where the gene product stringently identified by the sequence match had already been mapped, the gene locus determined by EST was consistent with the previous position, which strongly supports the validity of assigning unknown genes to human chromosomes based on the EST sequence matches. In the present cases, mapping the ESTs to a chromosome can also be considered to have mapped the known gene product: rolipram-sensitive cAMP phosphodiesterase, chromosome 1; protein phosphatase 2Aβ, chromosome 4; alpha-catenin, chromosome 5; the ELE1 oncogene, chromosome 10q11.2 or q2.1-q23; MXII protein, chromosome 10q24-qter; ribosomal protein L18a homologue, chromosome 14; ribosomal protein L3, chromosome 17; and moesin, Xp11-cen. There were also ESTs mapped that were closely related to non-human sequence records. These matches therefore can be considered to identify human counterparts of known gene products, or members of known gene families. Examples of these include membrane proteins, translation-associated proteins, structural proteins, and enzymes. These data then demonstrate that single pass sequence information is sufficient to design PCR primers useful for assigning cDNA sequences to human chromosomes. When the EST sequence matches previous sequence database records, the chromosome assignments of the EST can be used to make preliminary assignments of the human gene to a chromosome.
Leaderless mRNAs are circularized in Chlamydomonas reinhardtii mitochondria.

PubMed

Cahoon, A Bruce; Qureshi, Ali A

2018-06-01

The mitochondrial genome of Chlamydomonas reinhardtii encodes eight protein coding genes transcribed on two polycistronic primary transcripts. The mRNAs are endonucleolytically cleaved from these transcripts directly upstream of their AUG start codons, creating leaderless mRNAs with 3' untranslated regions (UTR) comprised of most or all of their downstream intergenic regions. In this report, we provide evidence that these processed linear mRNAs are circularized, which places the 3' UTR upstream of the 5' start codon, creating a leader sequence ex post facto. The circular mRNAs were found to be ribosome-associated by polysome profiling experiments, suggesting they are translated. Sequencing of the 3'-5' junctions of the circularized mRNAs found that the intra-molecular ligations occurred between fully processed 5' ends (the start AUG) and a variable 3' terminus. For five genes (cob, cox, nd2, nd4, and nd6), some of the 3' ends maintained an oligonucleotide addition during ligation, and for two of them, cob and nd6, these 3' termini were the most commonly recovered sequence. Previous reports have shown that after cleavage, three untemplated oligonucleotide additions may occur on the 3' termini of these mRNAs: adenylation, uridylylation, or cytidylation. These results suggest oligo(U) and oligo(C) additions may be part of the maturation process since they are maintained in the circular mRNAs. Circular RNAs occur in organisms across the biological spectrum, but their purpose in some systems, such as organelles (mitochondria and chloroplasts), is unclear. We hypothesize that in C. reinhardtii mitochondria circularization may create a leader sequence to facilitate translation initiation, which may negate the need for an alternative translation initiation mechanism in this system, as previously speculated. In addition, circularization may play a protective role against exonucleases, and/or increase translational productivity.
Analysis of SSR information in EST resources of sugarcane

USDA-ARS's Scientific Manuscript database

Expressed sequence tags (ESTs) offer the opportunity to exploit single, low-copy, conserved sequence motifs for the development of simple sequence repeats (SSRs). A total of 262,113 ESTs of sugarcane (Saccharum officinarum) in the NCBI database were downloaded and analyzed, which resulted in...
Effect of dual-type oligosaccharides on constipation in loperamide-treated rats

PubMed Central

Han, Sung Hee; Hong, Ki Bae; Kim, Eun Young; Ahn, So Hyun

2016-01-01

BACKGROUND/OBJECTIVES Constipation is a condition that can result from intestinal deformation. Because humans have an upright posture, the effects of gravity can cause this shape deformation. Oligosaccharides are common prebiotics and their effects on bowel health are well known. However, studies of the physiological functionality of a product that contains both lactulose and galactooligosaccharides are insufficient. We investigated the constipation reduction effect of a dual-type oligosaccharide, Dual-Oligo, in loperamide-treated rats. MATERIALS/METHODS Dual-Oligo consists of galactooligosaccharides (15.80%) and lactulose (51.67%). Animals were randomly divided into four groups: the normal group (normal), control group (control), low concentration of Dual-Oligo (LDO) group, and high concentration of Dual-Oligo (HDO) group. After 7 days of oral administration, fecal pellet amount, fecal weight, and fecal water content were measured. Blood chemistry, short-chain fatty acids (SCFA), gastrointestinal transit ratio and length, and intestinal mucosa were analyzed. RESULTS Dual-Oligo increased the fecal weight and water content of feces in rats with loperamide-induced constipation. Gastrointestinal transit ratio and length and area of intestinal mucosa significantly increased after treatment with Dual-Oligo in loperamide-induced rats. A high concentration of Dual-Oligo tended to produce more acetic acid than that observed for the control group, and Dual-Oligo affected the production of total SCFA. Bifidobacteria concentration of cecal contents in the high-concentration oligosaccharide (HDO) and low-concentration oligosaccharide (LDO) groups was similar to the result of the normal group. CONCLUSIONS These results showed that Dual-Oligo is a functional material that is derived from a natural food product and is effective in ameliorating constipation. PMID:27909555
Chemoenzymatic Synthesis of Oligo(L-cysteine) for Use as a Thermostable Bio-Based Material.

PubMed

Ma, Yinan; Sato, Ryota; Li, Zhibo; Numata, Keiji

2016-01-01

Oligomerization of thiol-unprotected L-cysteine ethyl ester (Cys-OEt) catalyzed by proteinase K in aqueous solution has been used to synthesize oligo(L-cysteine) (OligoCys) with a well-defined chemical structure and relatively large degree of polymerization (DP) up to 16-17 (average 8.8). By using a high concentration of Cys-OEt, 78.0% free thiol content was achieved. The thermal properties of OligoCys are stable, with no glass transition until 200 °C, and the decomposition temperature could be increased by oxidation. Chemoenzymatically synthesized OligoCys has great potential for use as a thermostable bio-based material with resistance to oxidation. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Generation and Analysis of Expressed Sequence Tags from Olea europaea L.

PubMed Central

Ozdemir Ozgenturk, Nehir; Oruç, Fatma; Sezerman, Ugur; Kuçukural, Alper; Vural Korkut, Senay; Toksoz, Feriha; Un, Cemal

2010-01-01

Olive (Olea europaea L.) is an important source of edible oil that originated in the Near East. In this study, two cDNA libraries were constructed from young olive leaves and immature olive fruits to generate ESTs, discover novel genes, and investigate the function of unknown olive genes. A total of 3840 randomly selected colonies from both libraries were sequenced for EST collection. The 2228 readable sequences from olive leaf and 1506 from olive fruit were assembled into 205 and 69 contigs, respectively, whereas 2478 were singletons. Putative functions of all 2752 differentially expressed unique sequences were assigned by gene homology based on BLAST and annotated using BLAST2GO. While 1339 ESTs show no homology to the database, 2024 ESTs have homology (under 80%) with hypothetical proteins, putative proteins, expressed proteins, and unknown proteins in NCBI-GenBank. Unique gene sequences for 635 ESTs were identified by over 80% homology to genes of known function in other species that had not previously been described in the Olea family. Only 3.1% of the total ESTs showed similarity to the olive database existing in NCBI. The EST data and consensus sequences generated here were submitted to NCBI as a valuable resource for functional genomic studies of olive. PMID:21197085
Peanut gene expression profiling in developing seeds at different reproduction stages during Aspergillus parasiticus infection

PubMed Central

Guo, Baozhu; Chen, Xiaoping; Dang, Phat; Scully, Brian T; Liang, Xuanqiang; Holbrook, C Corley; Yu, Jiujiang; Culbreath, Albert K

2008-01-01

Background Peanut (Arachis hypogaea L.) is an important crop economically and nutritionally, and is one of the most susceptible host crops to colonization of Aspergillus parasiticus and subsequent aflatoxin contamination. Knowledge from molecular genetic studies could help to devise strategies in alleviating this problem; however, few peanut DNA sequences are available in the public database. In order to understand the molecular basis of host resistance to aflatoxin contamination, a large-scale project was conducted to generate expressed sequence tags (ESTs) from developing seeds to identify resistance-related genes involved in defense response against Aspergillus infection and subsequent aflatoxin contamination. Results We constructed six different cDNA libraries derived from developing peanut seeds at three reproduction stages (R5, R6 and R7) from a resistant and a susceptible cultivated peanut genotype, 'Tifrunner' (susceptible to Aspergillus infection with higher aflatoxin contamination and resistant to TSWV) and 'GT-C20' (resistant to Aspergillus with reduced aflatoxin contamination and susceptible to TSWV). The developing peanut seed tissues were challenged by A. parasiticus and drought stress in the field. A total of 24,192 randomly selected cDNA clones from six libraries were sequenced. After removing vector sequences and quality trimming, 21,777 high-quality EST sequences were generated. Sequence clustering and assembling resulted in 8,689 unique EST sequences with 1,741 tentative consensus EST sequences (TCs) and 6,948 singleton ESTs. Functional classification was performed according to MIPS functional catalogue criteria. The unique EST sequences were divided into twenty-two categories. A similarity search against the non-redundant protein database available from NCBI indicated that 84.78% of total ESTs showed significant similarity to known proteins, of which 165 genes had been previously reported in peanuts. There were differences in overall expression patterns in different libraries and genotypes. A number of sequences were expressed throughout all of the libraries, representing constitutively expressed sequences. In order to identify resistance-related genes with significantly differential expression, a statistical analysis to estimate the relative abundance (R) was used to compare the relative abundance of each gene's transcripts in each cDNA library. Thirty-six and forty-seven unique EST sequences with a threshold of R > 4 from libraries of 'GT-C20' and 'Tifrunner', respectively, were selected for examination of temporal gene expression patterns according to EST frequencies. Nine and eight resistance-related genes with significant up-regulation were obtained in 'GT-C20' and 'Tifrunner' libraries, respectively. Among them, three genes were common in both genotypes. Furthermore, a comparison of our EST sequences with other plant sequences in the TIGR Gene Indices libraries showed that the percentage of peanut ESTs matched to Arabidopsis thaliana, maize (Zea mays), Medicago truncatula, rapeseed (Brassica napus), rice (Oryza sativa), soybean (Glycine max) and wheat (Triticum aestivum) ESTs ranged from 33.84% to 79.46% with sequence identity ≥ 80%. These results revealed that peanut ESTs are more closely related to legume species than to cereal crops, and more homologous to dicot than to monocot plant species. Conclusion The developed ESTs can be used to discover novel sequences or genes, to identify resistance-related genes and to detect the differences among alleles or markers between these resistant and susceptible peanut genotypes. Additionally, this large collection of cultivated peanut EST sequences will make it possible to construct microarrays for gene expression studies and for further characterization of host resistance mechanisms. It will be a valuable genomic resource for the peanut community. The 21,777 ESTs have been deposited to the NCBI GenBank database with accession numbers ES702769 to ES724546. PMID:18248674
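The abstract does not give the formula behind the relative abundance statistic R. The Python sketch below shows one log-likelihood-ratio form that is often used to compare EST counts for a gene across several cDNA libraries; that the authors used exactly this form, with natural logarithms, is an assumption, and the function name `r_statistic` and the example counts are illustrative.

```python
import math


def r_statistic(counts, library_sizes):
    """Log-likelihood-ratio style statistic for one gene across several libraries.

    counts[i]        : ESTs of this gene observed in library i
    library_sizes[i] : total ESTs sequenced from library i
    Under the null hypothesis the gene has the same frequency f in every
    library, so the expected count in library i is library_sizes[i] * f.
    (Assumed form; the abstract itself does not state the formula.)
    """
    f = sum(counts) / sum(library_sizes)      # pooled frequency under the null
    r = 0.0
    for x, n in zip(counts, library_sizes):
        if x > 0:                             # a zero count contributes nothing
            r += x * math.log(x / (n * f))
    return r


if __name__ == "__main__":
    sizes = [1000, 1000]
    print(r_statistic([15, 15], sizes))   # evenly expressed gene
    print(r_statistic([30, 0], sizes))    # gene found in only one library
```

With a statistic of this kind, a gene whose ESTs are spread in proportion to library size scores near zero, while a gene concentrated in one library scores high, which is how a cut-off such as R > 4 would single out differentially represented transcripts.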
From Conventional to Next Generation Sequencing of Epstein-Barr Virus Genomes.

PubMed

Kwok, Hin; Chiang, Alan Kwok Shing

2016-02-24

Genomic sequences of Epstein-Barr virus (EBV) have been of interest because the virus is associated with cancers, such as nasopharyngeal carcinoma, and conditions such as infectious mononucleosis. The progress of whole-genome EBV sequencing has been limited by the inefficiency and cost of the first-generation sequencing technology. With the advancement of next-generation sequencing (NGS) and target enrichment strategies, an increasing number of EBV genomes have been published. These genomes were sequenced using different approaches, either with or without EBV DNA enrichment. This review provides an overview of the EBV genomes published to date, and a description of the sequencing technology and bioinformatic analyses employed in generating these sequences. We further explore ways through which the quality of sequencing data can be improved, such as using DNA oligos for capture hybridization, and longer insert size and read length in the sequencing runs. These advances will enable large-scale genomic sequencing of EBV, which will facilitate a better understanding of the genetic variations of EBV in different geographic regions and the discovery of potentially pathogenic variants in specific diseases.
OrthoSelect: a protocol for selecting orthologous groups in phylogenomics.

PubMed

Schreiber, Fabian; Pick, Kerstin; Erpenbeck, Dirk; Wörheide, Gert; Morgenstern, Burkhard

2009-07-16

Phylogenetic studies using expressed sequence tags (EST) are becoming a standard approach to answer evolutionary questions. Such studies are usually based on large sets of newly generated, unannotated, and error-prone EST sequences from different species. A first crucial step in EST-based phylogeny reconstruction is to identify groups of orthologous sequences. From these data sets, appropriate target genes are selected, and redundant sequences are eliminated to obtain suitable sequence sets as input data for tree-reconstruction software. Generating such data sets manually can be very time consuming. Thus, software tools are needed that carry out these steps automatically. We developed a flexible and user-friendly software pipeline, running on desktop machines or computer clusters, that constructs data sets for phylogenomic analyses. It automatically searches assembled EST sequences against databases of orthologous groups (OG), assigns ESTs to these predefined OGs, translates the sequences into proteins, eliminates redundant sequences assigned to the same OG, creates multiple sequence alignments of identified orthologous sequences and offers the possibility to further process this alignment in a last step by excluding potentially homoplastic sites and selecting sufficiently conserved parts. Our software pipeline can be used as it is, but it can also be adapted by integrating additional external programs. This makes the pipeline useful for non-bioinformaticians as well as for bioinformatics experts. The software pipeline is especially designed for ESTs, but it can also handle protein sequences. OrthoSelect is a tool that produces orthologous gene alignments from assembled ESTs. Our tests show that OrthoSelect detects orthologs in EST libraries with high accuracy. In the absence of a gold standard for orthology prediction, we compared predictions by OrthoSelect to a manually created and published phylogenomic data set. Our tool was not only able to rebuild the data set with a specificity of 98%, but it detected four percent more orthologous sequences. Furthermore, the results OrthoSelect produces are in absolute agreement with the results of other programs, but our tool offers a significant speedup and additional functionality, e.g. handling of ESTs, computing sequence alignments, and refining them. To our knowledge, there is currently no fully automated and freely available tool for this purpose. Thus, OrthoSelect is a valuable tool for researchers in the field of phylogenomics who deal with large quantities of EST sequences. OrthoSelect is written in Perl and runs on Linux/Mac OS X. The tool can be downloaded at (http://gobics.de/fabian/orthoselect.php).
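The redundancy-elimination step described above can be pictured as keeping one representative sequence per species within each orthologous group. The Python sketch below does this by keeping the longest translated sequence; the selection criterion, the tuple layout, and the name `pick_representatives` are assumptions for illustration, since the record does not say how OrthoSelect chooses among redundant sequences.

```python
def pick_representatives(assignments):
    """Keep one protein sequence per (orthologous group, species).

    `assignments` is an iterable of hypothetical (species, og_id, protein_seq)
    tuples, e.g. the result of assigning translated ESTs to predefined OGs.
    The longest sequence wins; ties keep the first one seen.
    """
    best = {}
    for species, og_id, protein in assignments:
        key = (og_id, species)
        if key not in best or len(protein) > len(best[key]):
            best[key] = protein
    return best


if __name__ == "__main__":
    records = [
        ("sponge", "OG1", "MKT"),
        ("sponge", "OG1", "MKTLLV"),   # longer fragment of the same gene
        ("coral", "OG1", "MKTL"),
        ("sponge", "OG2", "MA"),
    ]
    for key, seq in sorted(pick_representatives(records).items()):
        print(key, seq)
```

The resulting dictionary has at most one entry per species in each group, which is the shape of input a multiple sequence alignment step would expect.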

A large scale analysis of cDNA in Arabidopsis thaliana: generation of 12,028 non-redundant expressed sequence tags from normalized and size-selected cDNA libraries.

PubMed

Asamizu, E; Nakamura, Y; Sato, S; Tabata, S

2000-06-30

For comprehensive analysis of genes expressed in the model dicotyledonous plant, Arabidopsis thaliana, expressed sequence tags (ESTs) were accumulated. Normalized and size-selected cDNA libraries were constructed from aboveground organs, flower buds, roots, green siliques and liquid-cultured seedlings, respectively, and a total of 14,026 5'-end ESTs and 39,207 3'-end ESTs were obtained. The 3'-end ESTs could be clustered into 12,028 non-redundant groups. Similarity search of the non-redundant ESTs against the public non-redundant protein database indicated that 4816 groups show similarity to genes of known function, 1864 to hypothetical genes, and the remaining 5348 are novel sequences. Gene coverage by the non-redundant ESTs was analyzed using the annotated genomic sequences of approximately 10 Mb on chromosomes 3 and 5. A total of 923 regions were hit by at least one EST, among which only 499 regions were hit by the ESTs deposited in the public database. The result indicates that the EST source generated in this project complements the EST data in the public database and facilitates new gene discovery.
Generation and analysis of expressed sequence tags from a cDNA library of the fruiting body of Ganoderma lucidum

PubMed Central

2010-01-01

Background Little genomic or transcriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was performed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644
Characterization of the OmyY1 region on the rainbow trout Y chromosome

USGS Publications Warehouse

Phillips, Ruth B.; DeKoning, Jenefer J.; Brunelli, Joseph P.; Faber-Hammond, Joshua J.; Hansen, John D.; Christensen, Kris A.; Renn, Suzy C.P.; Thorgaard, Gary H.

2013-01-01

We characterized the male-specific region on the Y chromosome of rainbow trout, which contains both sdY (the sex-determining gene) and the male-specific genetic marker, OmyY1. Several clones containing the OmyY1 marker were screened from a BAC library from a YY clonal line and found to be part of an 800 kb BAC contig. Using fluorescence in situ hybridization (FISH), these clones were localized to the end of the short arm of the Y chromosome in rainbow trout, with an additional signal on the end of the X chromosome in many cells. We sequenced a minimum tiling path of these clones using Illumina and 454 pyrosequencing. The region is rich in transposons and rDNA, but also appears to contain several single-copy protein-coding genes. Most of these genes are also found on the X chromosome; and in several cases sex-specific SNPs in these genes were identified between the male (YY) and female (XX) homozygous clonal lines. Additional genes were identified by hybridization of the BACs to the cGRASP salmonid 4x44K oligo microarray. By BLASTn evaluations using hypothetical transcripts of OmyY1-linked candidate genes as query against several EST databases, we conclude at least 12 of these candidate genes are likely functional, and expressed.
Characterization of EST-derived and non-EST simple sequence repeats in an F₁ hybrid population of Vitis vinifera L.

PubMed

Kayesh, E; Bilkish, N; Liu, G S; Chen, W; Leng, X P; Fang, J G

2014-03-31

Among different classes of molecular markers, expressed sequence tags (ESTs) are a new resource for developing simple sequence repeat (SSR) functional markers for genotyping and genetic mapping in F1 hybrid populations of Vitis vinifera L. Recently, because of the availability of an enormous amount of data for ESTs in the public domain, the emphasis has shifted from genomic SSRs to EST-SSRs, which belong to transcribed regions of the genome and may have a role in gene expression or function. The objective of this study was to assess the polymorphisms among 94 F1 hybrids from "Early Rose" and "Red Globe" using 25 EST-derived and 25 non-EST SSR markers. A total of 362,375 grape ESTs were retrieved from the National Center for Biotechnology Information (NCBI), and 2522 EST-SSR sequences were identified. From them, 205 primer pairs were randomly selected, including 176 pairs that were EST-derived and 29 non-EST SSR primer pairs, for polymerase chain reaction amplification. A total of 131 alleles were amplified using 50 pairs of primers; 78 alleles were amplified using EST-derived SSR primers and 53 were from non-EST SSR primers. At most, 6 and 5 alleles were amplified by EST-derived and non-EST SSR primers, respectively. The EST-derived SSR markers showed a maximum polymorphic information content (PIC) value of 1 and a minimum of 0.33 while non-EST SSR markers had maximum and minimum PIC values of 1 and 0.25, respectively. The average PIC value was 0.56 for EST-derived SSR markers and 0.45 for non-EST SSR markers.
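As an illustration of how a per-marker PIC value of the kind reported above can be derived from allele frequencies, here is a minimal Python sketch. It assumes the commonly used Botstein et al. (1980) definition, PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j²; the abstract does not state which formula the authors applied, and the function names `pic` and `expected_heterozygosity` are hypothetical.

```python
from itertools import combinations


def _normalize(values):
    """Convert allele counts or frequencies into frequencies summing to 1."""
    total = float(sum(values))
    if total <= 0:
        raise ValueError("allele counts/frequencies must sum to a positive value")
    return [v / total for v in values]


def expected_heterozygosity(values):
    """Expected heterozygosity: He = 1 - sum(p_i^2)."""
    p = _normalize(values)
    return 1.0 - sum(x * x for x in p)


def pic(values):
    """PIC under the assumed Botstein et al. (1980) definition.

    PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    """
    p = _normalize(values)
    cross = sum(2.0 * (a * a) * (b * b) for a, b in combinations(p, 2))
    return expected_heterozygosity(p) - cross


if __name__ == "__main__":
    # Two equally frequent alleles at one SSR locus (illustrative input only).
    print(round(pic([0.5, 0.5]), 3))
    # Allele counts work too, because inputs are normalized first.
    print(round(pic([10, 10, 10, 10]), 3))
```

The input may be raw allele counts or frequencies; both are normalized before the calculation, so the two forms give the same result for the same locus.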
Specific analogues uncouple transport, signalling, oligo-ubiquitination and endocytosis in the yeast Gap1 amino acid transceptor

PubMed Central

Van Zeebroeck, Griet; Rubio-Texeira, Marta; Schothorst, Joep; Thevelein, Johan M

2014-01-01

The Saccharomyces cerevisiae amino acid transceptor Gap1 functions as receptor for signalling to the PKA pathway and concomitantly undergoes substrate-induced oligo-ubiquitination and endocytosis. We have identified specific amino acids and analogues that uncouple to a certain extent signalling, transport, oligo-ubiquitination and endocytosis. l-lysine, l-histidine and l-tryptophan are transported by Gap1 but do not trigger signalling. Unlike l-histidine, l-lysine triggers Gap1 oligo-ubiquitination without substantial induction of endocytosis. Two transported, non-metabolizable signalling agonists, β-alanine and d-histidine, are strong and weak inducers of Gap1 endocytosis, respectively, but both cause Gap1 oligo-ubiquitination. The non-signalling agonist, non-transported competitive inhibitor of Gap1 transport, l-Asp-γ-l-Phe, induces oligo-ubiquitination but no discernible endocytosis. The Km of l-citrulline transport is much lower than the threshold concentration for signalling and endocytosis. These results show that molecules can be transported without triggering signalling or substantial endocytosis, and that oligo-ubiquitination and endocytosis require neither signalling nor metabolism. Oligo-ubiquitination is required, but apparently not sufficient to trigger endocytosis. In addition, we demonstrate intracellular cross-induction of endocytosis of transport-defective Gap1Y395C by ubiquitination- and endocytosis-deficient Gap1K9R,K16R. Our results support the concept that different substrates bind to partially overlapping binding sites in the same general substrate-binding pocket of Gap1, triggering divergent conformations, resulting in different conformation-induced downstream processes. PMID:24852066
Wheat EST resources for functional genomics of abiotic stress

PubMed Central

Houde, Mario; Belcaid, Mahdi; Ouellet, François; Danyluk, Jean; Monroy, Antonio F; Dryanova, Ani; Gulick, Patrick; Bergeron, Anne; Laroche, André; Links, Matthew G; MacCarthy, Luke; Crosby, William L; Sarhan, Fathey

2006-01-01

Background Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS) project. Results We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, the sequence proximity and "bridges" were identified by an e-value distance graph to manually break clusters into smaller groups. Assembly of the resolved ESTs generated a 75,488 unique sequence set (31,580 contigs and 43,908 singletons/singlets). Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology. Conclusion We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in wheat and other cereals. 
PMID:16772040
3' terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing.

PubMed

Goldfarb, Katherine C; Cech, Thomas R

2013-09-21

Post-transcriptional 3' end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3' RACE coupled with high-throughput sequencing to characterize the 3' terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. The 3' terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3' terminus of an in vitro transcribed MRP RNA control and the differing 3' terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). 3' RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3' terminal sequences of noncoding RNAs.
Rapid in silico cloning of genes using expressed sequence tags (ESTs).

PubMed

Gill, R W; Sanseau, P

2000-01-01

Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs are the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathological states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we will review the current status of EST databases, the pros and cons of EST-type data and describe possible strategies to retrieve meaningful information.
Generation and Analysis of Expressed Sequence Tags (ESTs) from Halophyte Atriplex canescens to Explore Salt-Responsive Related Genes

PubMed Central

Li, Jingtao; Sun, Xinhua; Yu, Gang; Jia, Chengguo; Liu, Jinliang; Pan, Hongyu

2014-01-01

Little information is available on gene expression profiling of the halophyte A. canescens. To elucidate the molecular mechanism for stress tolerance in A. canescens, a full-length complementary DNA library was generated from A. canescens exposed to 400 mM NaCl, and provided 343 high-quality ESTs. In an evaluation of 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% ESTs) were identified according to their significant similarities with proteins of known functions. All the 343 EST sequences have been deposited in the dbEST GenBank under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes of the 311 annotated ESTs, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier and later stage salt stress resistance. Additionally, among the 343 unigene sequences, 22 simple sequence repeats (SSRs) were also identified, contributing to the study of A. canescens resources. PMID:24960361
Development of a EST dataset and characterization of EST-SSRs in a traditional Chinese medicinal plant, Epimedium sagittatum (Sieb. Et Zucc.) Maxim

PubMed Central

2010-01-01

Background Epimedium sagittatum (Sieb. Et Zucc.) Maxim, a traditional Chinese medicinal plant species, has been used extensively as genuine medicinal materials. Certain Epimedium species are endangered due to commercial overexploitation, while sustainable application studies, conservation genetics, systematics, and marker-assisted selection (MAS) of Epimedium are less studied due to the lack of molecular markers. Here, we report a set of expressed sequence tags (ESTs) and simple sequence repeats (SSRs) identified in these ESTs for E. sagittatum. Results cDNAs of E. sagittatum are sequenced using 454 GS-FLX pyrosequencing technology. The raw reads are cleaned and assembled into a total of 76,459 consensus sequences comprising 17,231 contigs and 59,228 singlets. About 38.5% (29,466) of the consensus sequences significantly match to the non-redundant protein database (E-value < 1e-10), 22,295 of which are further annotated using Gene Ontology (GO) terms. A total of 2,810 EST-SSRs is identified from the Epimedium EST dataset. Trinucleotide SSR is the dominant repeat type (55.2%) followed by dinucleotide (30.4%), tetranucleotide (7.3%), hexanucleotide (4.9%), and pentanucleotide (2.2%) SSR. The dominant repeat motif is AAG/CTT (23.6%) followed by AG/CT (19.3%), ACC/GGT (11.1%), AT/AT (7.5%), and AAC/GTT (5.9%). Thirty-two SSR-ESTs are randomly selected and primer pairs are synthesized for testing the transferability across 52 Epimedium species. Eighteen primer pairs (85.7%) could be successfully transferred to Epimedium species and sixteen of those show high genetic diversity with 0.35 of observed heterozygosity (Ho) and 0.65 of expected heterozygosity (He) and high number of alleles per locus (11.9). Conclusion A large EST dataset with a total of 76,459 consensus sequences is generated, aiming to provide sequence information for deciphering secondary metabolism, especially for flavonoid pathway in Epimedium. 
A total of 2,810 EST-SSRs is identified from EST dataset and ~1580 EST-SSR markers are transferable. E. sagittatum EST-SSR transferability to the major Epimedium germplasm is up to 85.7%. Therefore, this EST dataset and EST-SSRs will be a powerful resource for further studies such as taxonomy, molecular breeding, genetics, genomics, and secondary metabolism in Epimedium species. PMID:20141623
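The kind of EST-SSR mining summarized above (finding di- to hexanucleotide repeats and tallying them by motif length) can be sketched with a regular expression. This is a minimal illustration only: the abstract does not name the software or the minimum-repeat thresholds used, so the `MIN_REPEATS` values and the function name `find_ssrs` are assumptions.

```python
import re

# Assumed minimum repeat counts per motif length (not taken from the abstract).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq):
    """Return (start, motif, repeat_count) for perfect SSRs in a DNA sequence."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        # A k-mer followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are really a shorter unit repeated (e.g. "AAAA", "AGAG").
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return hits


if __name__ == "__main__":
    est = "TTT" + "AAG" * 6 + "CCC"  # toy sequence with one trinucleotide SSR
    for start, motif, count in find_ssrs(est):
        print(start, motif, count)
```

Counting the hits by `len(motif)` over a whole EST set would give a repeat-type breakdown comparable in form to the percentages quoted in the abstract, though the exact numbers depend on the thresholds chosen.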
The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia Formosa

Treesearch

Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

2014-01-01

Numerous microsatellite markers were developed for Aquilegia formosa from sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...
The gene space in wheat: the complete γ-gliadin gene family from the wheat cultivar Chinese Spring.

PubMed

Anderson, Olin D; Huo, Naxin; Gu, Yong Q

2013-06-01

The complete set of unique γ-gliadin genes is described for the wheat cultivar Chinese Spring using a combination of expressed sequence tag (EST) and Roche 454 DNA sequences. Assemblies of Chinese Spring ESTs yielded 11 different γ-gliadin gene sequences. Two of the sequences encode identical polypeptides and are assumed to be the result of a recent gene duplication. One gene has a 3' coding mutation that changes the reading frame in the final eight codons. A second assembly of Chinese Spring γ-gliadin sequences was generated using Roche 454 total genomic DNA sequences. The 454 assembly confirmed the same 11 active genes as the EST assembly plus two pseudogenes not represented by ESTs. These 13 γ-gliadin sequences represent the complete unique set of γ-gliadin genes for cv Chinese Spring, although not ruled out are additional genes that are exact duplications of these 13 genes. A comparison with the ESTs of two other hexaploid cultivars (Butte 86 and Recital) finds that the most active genes are present in all three cultivars, with exceptions likely due to too few ESTs for detection in Butte 86 and Recital. A comparison of the numbers of ESTs per gene indicates differential levels of expression within the γ-gliadin gene family. Genome assignments were made for 6 of the 13 Chinese Spring γ-gliadin genes, i.e., one assignment from a match to two γ-gliadin genes found within a tetraploid wheat A genome BAC and four genes that match four distinct γ-gliadin sequences assembled from Roche 454 sequences from Aegilops tauschii, the hexaploid wheat D-genome ancestor.
ESTuber db: an online database for Tuber borchii EST sequences.

PubMed

Lazzari, Barbara; Caprera, Andrea; Cosentino, Cristian; Stella, Alessandra; Milanesi, Luciano; Viotti, Angelo

2007-03-08

The ESTuber database (http://www.itb.cnr.it/estuber) includes 3,271 Tuber borchii expressed sequence tags (EST). The dataset consists of 2,389 sequences from an in-house prepared cDNA library from truffle vegetative hyphae, and 882 sequences downloaded from GenBank and representing four libraries from white truffle mycelia and ascocarps at different developmental stages. An automated pipeline was prepared to process EST sequences using public software integrated by in-house developed Perl scripts. Data were collected in a MySQL database, which can be queried via a php-based web interface. Sequences included in the ESTuber db were clustered and annotated against three databases: the GenBank nr database, the UniProtKB database and a third in-house prepared database of fungi genomic sequences. An algorithm was implemented to infer statistical classification among Gene Ontology categories from the ontology occurrences deduced from the annotation procedure against the UniProtKB database. Ontologies were also deduced from the annotation of more than 130,000 EST sequences from five filamentous fungi, for intra-species comparison purposes. Further analyses were performed on the ESTuber db dataset, including tandem repeats search and comparison of the putative protein dataset inferred from the EST sequences to the PROSITE database for protein patterns identification. All the analyses were performed both on the complete sequence dataset and on the contig consensus sequences generated by the EST assembly procedure. The resulting web site is a resource of data and links related to truffle expressed genes. The Sequence Report and Contig Report pages are the web interface core structures which, together with the Text search utility and the Blast utility, allow easy access to the data stored in the database.
Partially esterified oligogalacturonides are the preferred substrates for pectin methylesterase of Aspergillus niger.

PubMed

van Alebeek, Gert-Jan W M; van Scherpenzeel, Katrien; Beldman, Gerrit; Schols, Henk A; Voragen, Alphons G J

2003-05-15

Investigations on the mode of action of Aspergillus niger pectin methylesterase (PME) towards differently C(6)- and C(1)-substituted oligogalacturonides (oligoGalpA) are described. De-esterification of methyl-esterified (un)saturated oligoGalpA proceeds via a specific pattern, depending on the degree of polymerization. Initially, a first methyl ester of the oligomer is hydrolysed, resulting in one free carboxyl group. Subsequently, this first product is preferred as a substrate and is de-esterified for a second time. This product is then accumulated and hereafter de-esterified further to the final product, i.e. oligoGalpA containing one methyl ester located at the non-reducing end residue for both saturated and unsaturated oligoGalpA, as found by post-source decay matrix-assisted laser-desorption/ionization-time-of-flight MS. The saturated hexamer is an exception to this: three methyl esters are removed very rapidly, instead of two methyl esters. When unsaturated oligoGalpA were used, the formation of the end product differed slightly, suggesting that the unsaturated bond at the non-reducing end influences the de-esterification process. In vivo, PME prefers methyl esters, but the enzyme appeared to be tolerant for other C(6)- and C(1)-substituents. Changing the type of ester (ethyl esterification) or addition of a methyl glycoside (C(1)) only reduced the activity or had no effect respectively. The specific product pattern was identical for all methyl- and ethyl-esterified oligoGalpA and methyl-glycosidated oligoGalpA, which strongly indicates that one or perhaps two non-esterified oligoGalpA are preferred in the active-site cleft.
Comparative study of the interaction of meso-tetrakis (N-para-trimethyl-anilium) porphyrin (TMAP) in its free base and Fe derivative form with oligo(dA.dT)15 and oligo(dG.dC)15.

PubMed

Bathaie, S Zahra; Ajloo, Davood; Daraie, Marzieh; Ghadamgahi, Maryam

2015-01-01

Interaction between a cationic porphyrin and its ferric derivative with oligo(dA.dT)15 and oligo(dG.dC)15 was studied by UV-vis spectroscopy, resonance light scattering (RLS), and circular dichroism (CD) at different ionic strengths; molecular docking and molecular dynamics simulation were also used to complete the picture. The following changes were observed in the spectral properties of meso-tetrakis (N-para-trimethyl-anilium) porphyrin (TMAP), as a free-base porphyrin with no axial ligand, and its Fe derivative (FeTMAP) upon interaction with oligo(dA.dT)15 and oligo(dG.dC)15: (1) a substantial red shift and hypochromicity at the Soret maximum in the UV-vis spectra; (2) increased RLS intensity with increasing ionic strength; and (3) an intense bisignate excitonic CD signal. All of these observations indicate that TMAP and FeTMAP bind to oligo(dA.dT)15 and oligo(dG.dC)15 in the outside binding mode, accompanied by self-stacking of the ligands along the oligonucleotide helix. The CD results demonstrated a drastic change from excitonic to monomeric behavior at higher ionic strengths, which indicates groove binding of the ligands to the oligonucleotides. Molecular docking also confirmed the groove binding mode of the ligands and estimated the binding constants and energies of the interactions. Their interaction trend was further confirmed by the molecular dynamics technique and structure parameters obtained from simulation. It showed that TMAP reduced the number of intermolecular hydrogen bonds and increased the solvent accessible surface area in the oligonucleotide. The self-aggregation of ligands at lower concentrations was also confirmed.
Partially esterified oligogalacturonides are the preferred substrates for pectin methylesterase of Aspergillus niger.

PubMed Central

van Alebeek, Gert-Jan W M; van Scherpenzeel, Katrien; Beldman, Gerrit; Schols, Henk A; Voragen, Alphons G J

2003-01-01

Investigations on the mode of action of Aspergillus niger pectin methylesterase (PME) towards differently C(6)- and C(1)-substituted oligogalacturonides (oligoGalpA) are described. De-esterification of methyl-esterified (un)saturated oligoGalpA proceeds via a specific pattern, depending on the degree of polymerization. Initially, a first methyl ester of the oligomer is hydrolysed, resulting in one free carboxyl group. Subsequently, this first product is preferred as a substrate and is de-esterified for a second time. This product is then accumulated and hereafter de-esterified further to the final product, i.e. oligoGalpA containing one methyl ester located at the non-reducing end residue for both saturated and unsaturated oligoGalpA, as found by post-source decay matrix-assisted laser-desorption/ionization-time-of-flight MS. The saturated hexamer is an exception to this: three methyl esters are removed very rapidly, instead of two methyl esters. When unsaturated oligoGalpA were used, the formation of the end product differed slightly, suggesting that the unsaturated bond at the non-reducing end influences the de-esterification process. In vivo, PME prefers methyl esters, but the enzyme appeared to be tolerant for other C(6)- and C(1)-substituents. Changing the type of ester (ethyl esterification) or addition of a methyl glycoside (C(1)) only reduced the activity or had no effect respectively. The specific product pattern was identical for all methyl- and ethyl-esterified oligoGalpA and methyl-glycosidated oligoGalpA, which strongly indicates that one or perhaps two non-esterified oligoGalpA are preferred in the active-site cleft. PMID:12589708
Superimposed Code Theoretic Analysis of Deoxyribonucleic Acid (DNA) Codes and DNA Computing

DTIC Science & Technology

2010-01-01

partitioned by font type) of sequences are allowed to be in each position (e.g., Arial = position 0, Comic = position 1, etc. ) and within each collection...movement was modeled by a Brownian motion 3 dimensional random walk. The one dimensional diffusion coefficient D for the ellipsoid shape with 3...temperature, kB is Boltzmann’s constant, and η is the viscosity of the medium. The random walk motion is modeled by assuming the oligo is on a three
Multi-domain utilization by TUT4 and TUT7 in control of let-7 biogenesis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Faehnle, Christopher R.; Walleshauser, Jack; Joshua-Tor, Leemor

2017-07-03

The uridyl transferases TUT4 and TUT7 (collectively called TUT4(7)) switch between two modes of activity, either promoting expression of let-7 microRNA (monoU) or marking it for degradation (oligoU). Lin28 modulates the switch via recruitment of TUT4(7) to the precursor pre-let-7 in stem cells and human cancers. We found that TUT4(7) utilize two multidomain functional modules during the switch from monoU to oligoU. The catalytic module (CM) is essential for both activities, while the Lin28-interacting module (LIM) is indispensable for oligoU. A TUT7 CM structure trapped in the monoU activity state revealed a duplex-RNA-binding pocket that orients group II pre-let-7 hairpins to favor monoU addition. Conversely, the switch to oligoU requires the ZK domain of Lin28 to drive the formation of a stable ternary complex between pre-let-7 and the inactive LIM. Finally, ZK2 of TUT4(7) aids oligoU addition by engaging the growing oligoU tail through uracil-specific interactions.
Comparison of three microarray probe annotation pipelines: differences in strategies and their effect on downstream analysis

PubMed Central

Neerincx, Pieter BT; Casel, Pierrot; Prickett, Dennis; Nie, Haisheng; Watson, Michael; Leunissen, Jack AM; Groenen, Martien AM; Klopp, Christophe

2009-01-01

Background Reliable annotation linking oligonucleotide probes to target genes is essential for functional biological analysis of microarray experiments. We used the IMAD, OligoRAP and sigReannot pipelines to update the annotation for the ARK-Genomics Chicken 20 K array as part of a joined EADGENE/SABRE workshop. In this manuscript we compare their annotation strategies and results. Furthermore, we analyse the effect of differences in updated annotation on functional analysis for an experiment involving Eimeria infected chickens and finally we propose guidelines for optimal annotation strategies. Results IMAD, OligoRAP and sigReannot update both annotation and estimated target specificity. The 3 pipelines can assign oligos to target specificity categories although with varying degrees of resolution. Target specificity is judged based on the amount and type of oligo versus target-gene alignments (hits), which are determined by filter thresholds that users can adjust based on their experimental conditions. Linking oligos to annotation on the other hand is based on rigid rules, which differ between pipelines. For 52.7% of the oligos from a subset selected for in depth comparison all pipelines linked to one or more Ensembl genes with consensus on 44.0%. In 31.0% of the cases none of the pipelines could assign an Ensembl gene to an oligo and for the remaining 16.3% the coverage differed between pipelines. Differences in updated annotation were mainly due to different thresholds for hybridisation potential filtering of oligo versus target-gene alignments and different policies for expanding annotation using indirect links. The differences in updated annotation packages had a significant effect on GO term enrichment analysis with consensus on only 67.2% of the enriched terms. Conclusion In addition to flexible thresholds to determine target specificity, annotation tools should provide metadata describing the relationships between oligos and the annotation assigned to them. 
These relationships can then be used to judge the varying degrees of reliability allowing users to fine-tune the balance between reliability and coverage. This is important as it can have a significant effect on functional microarray analysis as exemplified by the lack of consensus on almost one third of the terms found with GO term enrichment analysis based on updated IMAD, OligoRAP or sigReannot annotation. PMID:19615109
Comparison of three microarray probe annotation pipelines: differences in strategies and their effect on downstream analysis.

PubMed

Neerincx, Pieter Bt; Casel, Pierrot; Prickett, Dennis; Nie, Haisheng; Watson, Michael; Leunissen, Jack Am; Groenen, Martien Am; Klopp, Christophe

2009-07-16

Reliable annotation linking oligonucleotide probes to target genes is essential for functional biological analysis of microarray experiments. We used the IMAD, OligoRAP and sigReannot pipelines to update the annotation for the ARK-Genomics Chicken 20 K array as part of a joined EADGENE/SABRE workshop. In this manuscript we compare their annotation strategies and results. Furthermore, we analyse the effect of differences in updated annotation on functional analysis for an experiment involving Eimeria infected chickens and finally we propose guidelines for optimal annotation strategies. IMAD, OligoRAP and sigReannot update both annotation and estimated target specificity. The 3 pipelines can assign oligos to target specificity categories although with varying degrees of resolution. Target specificity is judged based on the amount and type of oligo versus target-gene alignments (hits), which are determined by filter thresholds that users can adjust based on their experimental conditions. Linking oligos to annotation on the other hand is based on rigid rules, which differ between pipelines. For 52.7% of the oligos from a subset selected for in depth comparison all pipelines linked to one or more Ensembl genes with consensus on 44.0%. In 31.0% of the cases none of the pipelines could assign an Ensembl gene to an oligo and for the remaining 16.3% the coverage differed between pipelines. Differences in updated annotation were mainly due to different thresholds for hybridisation potential filtering of oligo versus target-gene alignments and different policies for expanding annotation using indirect links. The differences in updated annotation packages had a significant effect on GO term enrichment analysis with consensus on only 67.2% of the enriched terms. In addition to flexible thresholds to determine target specificity, annotation tools should provide metadata describing the relationships between oligos and the annotation assigned to them. 
These relationships can then be used to judge the varying degrees of reliability allowing users to fine-tune the balance between reliability and coverage. This is important as it can have a significant effect on functional microarray analysis as exemplified by the lack of consensus on almost one third of the terms found with GO term enrichment analysis based on updated IMAD, OligoRAP or sigReannot annotation.
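The coverage categories reported in this abstract (all pipelines link a gene, all agree, none link a gene, coverage differs) can be illustrated with a minimal Python sketch. The function name, the input layout (one dictionary per pipeline mapping oligo IDs to sets of gene IDs) and the example identifiers are assumptions made for illustration; they are not the data structures used by IMAD, OligoRAP or sigReannot.

```python
def compare_annotations(oligo_ids, pipelines):
    """Tally oligos by how the pipelines' gene assignments relate.

    pipelines: dict of pipeline name -> dict of oligo ID -> set of gene IDs
    (a hypothetical layout chosen for this sketch).
    """
    tally = {"all_linked": 0, "consensus": 0, "none_linked": 0, "differs": 0}
    for oligo in oligo_ids:
        gene_sets = [p.get(oligo, set()) for p in pipelines.values()]
        linked = [g for g in gene_sets if g]
        if not linked:
            tally["none_linked"] += 1          # no pipeline assigned a gene
        elif len(linked) == len(gene_sets):
            tally["all_linked"] += 1           # every pipeline assigned a gene
            if all(g == gene_sets[0] for g in gene_sets):
                tally["consensus"] += 1        # ...and they all agree
        else:
            tally["differs"] += 1              # only some pipelines assigned a gene
    return tally


# Hypothetical oligo and gene identifiers, for illustration only.
example = {
    "IMAD":       {"o1": {"G1"}, "o2": {"G2"}, "o3": {"G3"}},
    "OligoRAP":   {"o1": {"G1"}, "o2": {"G9"}},
    "sigReannot": {"o1": {"G1"}, "o2": {"G2"}},
}
result = compare_annotations(["o1", "o2", "o3", "o4"], example)
```

Note that "consensus" is counted as a subset of "all_linked", mirroring how the abstract reports 44.0% consensus within the 52.7% of oligos linked by all pipelines.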

Assembly of 500,000 inter-specific catfish expressed sequence tags and large scale gene-associated marker development for whole genome association studies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Catfish Genome Consortium; Wang, Shaolin; Peatman, Eric

2010-03-23

Background: Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results: A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35% of the unique sequences had significant similarities to known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions: This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.
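The high-quality SNP criterion stated in this abstract (at least four sequences in the contig, with the minor allele present in at least two of them) can be sketched as a simple filter. This is a minimal illustration, not the consortium's pipeline: the function name is hypothetical, and applying the depth threshold per alignment column rather than per contig is an assumption.

```python
from collections import Counter


def is_high_quality_snp(column, min_depth=4, min_minor=2):
    """column: bases observed at one aligned contig position, one per EST.

    Assumption: depth is evaluated per alignment column; gaps and
    ambiguous bases (anything other than A, C, G, T) are ignored.
    """
    counts = Counter(base for base in column if base in "ACGT")
    if sum(counts.values()) < min_depth or len(counts) < 2:
        return False  # too shallow, or not polymorphic
    # The second most frequent base is treated as the minor allele.
    minor_count = counts.most_common(2)[1][1]
    return minor_count >= min_minor
```

For example, a column `AAGG` passes, while `AAAG` fails because the minor allele is seen only once, which is the pattern expected of a sequencing error rather than a true polymorphism.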
ESTs and EST-linked polymorphisms for genetic mapping and phylogenetic reconstruction in the guppy, Poecilia reticulata

PubMed Central

Dreyer, Christine; Hoffmann, Margarete; Lanz, Christa; Willing, Eva-Maria; Riester, Markus; Warthmann, Norman; Sprecher, Andrea; Tripathi, Namita; Henz, Stefan R; Weigel, Detlef

2007-01-01

Background The guppy, Poecilia reticulata, is a well-known model organism for studying inheritance and variation of male ornamental traits as well as adaptation to different river habitats. However, genomic resources for studying this important model were not previously widely available. Results With the aim of generating molecular markers for genetic mapping of the guppy, cDNA libraries were constructed from embryos and different adult organs to generate expressed sequence tags (ESTs). About 18,000 ESTs were annotated according to BLASTN and BLASTX results and the sequence information from the 3' UTRs was exploited to generate PCR primers for re-sequencing of genomic DNA from different wild type strains. By comparison of EST-linked genomic sequences from at least four different ecotypes, about 1,700 polymorphisms were identified, representing about 400 distinct genes. Two interconnected MySQL databases were built to organize the ESTs and markers, respectively. A robust phylogeny of the guppy was reconstructed, based on 10 different nuclear genes. Conclusion Our EST and marker databases provide useful tools for genetic mapping and phylogenetic studies of the guppy. PMID:17686157
Iterative divergent/convergent doubling approach to linear conjugated oligomers. A rapid route to a 128 Å long potential molecular wire

NASA Astrophysics Data System (ADS)

Tour, James M.; Schumm, Jeffrey S.; Pearson, Darren L.

1994-06-01

Described is the synthesis of oligo(2-ethylphenylene ethynylene)s and oligo(2-(3'-ethylheptyl)phenylene ethynylene)s via an iterative divergent/convergent approach. Synthesized were the monomer, dimer, tetramer, and octamer of the ethyl derivative and the monomer, dimer, tetramer, octamer, and 16-mer of the ethylheptyl derivative. The 16-mer is 128 Å long. At each stage in the iteration, the length of the framework doubles. Only three sets of reaction conditions are needed for the entire iterative synthetic sequence: an iodination, a protodesilylation, and a Pd/Cu-catalyzed cross coupling. The oligomers were characterized spectroscopically and by mass spectrometry. The optical properties are presented, which show the stage of optical absorbance saturation. The size exclusion chromatography values for the number average weights, relative to polystyrene, illustrate the tremendous differences in the hydrodynamic volume of these rigid rod oligomers versus the random coils of polystyrene. These differences become quite apparent at the octamer stage. These oligomers may act as molecular wires in molecular electronic devices and they also serve as useful models for understanding related bulk polymers.
Trimodal Control of Ion-Transport Activity on Cyclo-oligo-(1→6)-β-D-glucosamine-Based Artificial Ion-Transport Systems.

PubMed

Roy, Arundhati; Saha, Tanmoy; Gening, Marina L; Titov, Denis V; Gerbst, Alexey G; Tsvetkov, Yury E; Nifantiev, Nikolay E; Talukdar, Pinaki

2015-11-23

Cyclo-oligo-(1→6)-β-D-glucosamines functionalized with hydrophobic tails are reported as a new class of transmembrane ion-transport system. These macrocycles with hydrophilic cavities were introduced as an alternative to cyclodextrins, which are supramolecular systems with hydrophobic cavities. The transport activities of these glycoconjugates were manipulated by altering the oligomericity of the macrocycles, as well as the length and number of attached tails. Hydrophobic tails of 3 different sizes were synthesized and coupled with each glucosamine scaffold through the amide linkage to obtain 18 derivatives. The ion-transport activity increased from di- to tetrameric glucosamine macrocycles, but decreased further when flexible pentameric glucosamine was introduced. The ion-transport activity also increased with increasing length of attached linkers. For a fixed length of linkers, the transport activity decreased when the number of such tails was reduced. All glycoconjugates displayed a uniform anion-selectivity sequence: Cl⁻ > Br⁻ > I⁻. From theoretical studies, hydrogen bonding between the macrocycle backbone and the anion bridged through water molecules was observed. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A methods review on use of nonsense suppression to study 3′ end formation and other aspects of tRNA biogenesis

PubMed Central

Rijal, Keshab; Maraia, Richard J.; Arimbasseri, Aneeshkumar G.

2014-01-01

Suppressor tRNAs bear anticodon mutations that allow them to decode premature stop codons in metabolic marker gene mRNAs, that can be used as in vivo reporters of functional tRNA biogenesis. Here, we review key components of a suppressor tRNA system specific to S. pombe and its adaptations for use to study specific steps in tRNA biogenesis. Eukaryotic tRNA biogenesis begins with transcription initiation by RNA polymerase (pol) III. The nascent pre-tRNAs must undergo folding, 5′ and 3′ processing to remove the leader and trailer, nuclear export, and splicing if applicable, while multiple complex chemical modifications occur throughout the process. We review evidence that precursor-tRNA processing begins with transcription termination at the oligo(T) terminator element, which forms a 3′ oligo(U) tract on the nascent RNA, a sequence-specific binding site for the RNA chaperone, La protein. The processing pathway bifurcates depending on a poorly understood property of pol III termination that determines the 3′ oligo(U) length and therefore the affinity for La. We thus review the pol III termination process and the factors involved including advances using gene-specific random mutagenesis by dNTP analogs that identify key residues important for transcription termination in certain pol III subunits. The review ends with a ‘technical approaches’ section that includes a parts list of suppressor-tRNA alleles, strains and plasmids, and graphic examples of its diverse uses. PMID:25447915
In silico analysis of expressed sequence tags from Trichostrongylus vitrinus (Nematoda): comparison of the automated ESTExplorer workflow platform with conventional database searches.

PubMed

Nagaraj, Shivashankar H; Gasser, Robin B; Nisbet, Alasdair J; Ranganathan, Shoba

2008-01-01

The analysis of expressed sequence tags (EST) offers a rapid and cost effective approach to elucidate the transcriptome of an organism, but requires several computational methods for assembly and annotation. Researchers frequently analyse each step manually, which is laborious and time consuming. We have recently developed ESTExplorer, a semi-automated computational workflow system, in order to achieve the rapid analysis of EST datasets. In this study, we evaluated EST data analysis for the parasitic nematode Trichostrongylus vitrinus (order Strongylida) using ESTExplorer, compared with database matching alone. We functionally annotated 1776 ESTs obtained via suppressive-subtractive hybridisation from T. vitrinus, an important parasitic trichostrongylid of small ruminants. Cluster and comparative genomic analyses of the transcripts using ESTExplorer indicated that 290 (41%) sequences had homologues in Caenorhabditis elegans, 329 (42%) in parasitic nematodes, 202 (28%) in organisms other than nematodes, and 218 (31%) had no significant match to any sequence in the current databases. Of the C. elegans homologues, 90 were associated with 'non-wildtype' double-stranded RNA interference (RNAi) phenotypes, including embryonic lethality, maternal sterility, sterile progeny, larval arrest and slow growth. We could functionally classify 267 (38%) sequences using the Gene Ontologies (GO) and establish pathway associations for 230 (33%) sequences using the Kyoto Encyclopedia of Genes and Genomes (KEGG). Further examination of this EST dataset revealed a number of signalling molecules, proteases, protease inhibitors, enzymes, ion channels and immune-related genes. In addition, we identified 40 putative secreted proteins that could represent potential candidates for developing novel anthelmintics or vaccines. We further compared the automated EST sequence annotations, using ESTExplorer, with database search results for individual T. vitrinus ESTs. 
ESTExplorer reliably and rapidly annotated 301 ESTs, with pathway and GO information, eliminating 60 low quality hits from database searches. We evaluated the efficacy of ESTExplorer in analysing EST data, and demonstrate that computational tools can be used to accelerate the process of gene discovery in EST sequencing projects. The present study has elucidated sets of relatively conserved and potentially novel genes for biological investigation, and the annotated EST set provides further insight into the molecular biology of T. vitrinus, towards the identification of novel drug targets.
Biological availability and nuclease resistance extend the in vitro activity of a phosphorothioate-3'hydroxypropylamine oligonucleotide.

PubMed Central

Tam, R C; Li, Y; Noonberg, S; Hwang, D G; Lui, G; Hunt, C A; Garovoy, M R

1994-01-01

Augmented biological activity in vitro has been demonstrated in oligonucleotides (oligos) modified to provide nuclease resistance, to enhance cellular uptake or to increase target affinity. How chemical modification affects the duration of effect of an oligo with potent activity has not been investigated directly. We postulated that modification with internucleotide phosphorothioates and 3' alkylamine provided additional nuclease protection which could significantly extend the biological activity of a 26 mer, (T2). We showed this analog, sT2a, could maximally inhibit interferon gamma-induced HLA-DR mRNA synthesis and surface expression in both HeLa and retinal pigmented epithelial cells and could continue to be effective, in the absence of oligo, 15 days following initial oligo treatment; an effect not observed with its 3'amine counterpart, T2a. In vitro stability studies confirmed that sT2a conferred the greatest stability to nucleases and that cellular accumulation of 32P-sT2a in both cell types was also greater than other T2 oligos. Using confocal microscopy, we revealed that the intracellular distribution of sT2a favored greater nuclear accumulation and release of oligo from cytoplasmic vesicles; a pattern not observed with T2a. These results suggest that phosphorothioate-3'amine modification could increase the duration of effect of T2 oligo by altering nuclease resistance as well as intracellular accumulation and distribution; factors known to affect biological availability. PMID:8152930
Using RSAT oligo-analysis and dyad-analysis tools to discover regulatory signals in nucleic sequences.

PubMed

Defrance, Matthieu; Janky, Rekin's; Sand, Olivier; van Helden, Jacques

2008-01-01

This protocol explains how to discover functional signals in genomic sequences by detecting over- or under-represented oligonucleotides (words) or spaced pairs thereof (dyads) with the Regulatory Sequence Analysis Tools (http://rsat.ulb.ac.be/rsat/). Two typical applications are presented: (i) predicting transcription factor-binding motifs in promoters of coregulated genes and (ii) discovering phylogenetic footprints in promoters of orthologous genes. The steps of this protocol include purging genomic sequences to discard redundant fragments, discovering over-represented patterns and assembling them to obtain degenerate motifs, scanning sequences and drawing feature maps. The main strength of the method is its statistical ground: the binomial significance provides an efficient control on the rate of false positives. In contrast with optimization-based pattern discovery algorithms, the method supports the detection of under- as well as over-represented motifs. Computation times vary from seconds (gene clusters) to minutes (whole genomes). The execution of the whole protocol should take approximately 1 h.
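The idea of scoring word over-representation with a binomial test can be sketched in a few lines of Python. This is a simplified illustration and not RSAT's implementation: the function names are hypothetical, the default background is a uniform word frequency (an assumption; real analyses use calibrated background models), only one strand is counted, overlapping occurrences are not corrected for, and the multiple-testing correction simply multiplies each P-value by the number of possible words.

```python
import math
from collections import Counter


def binom_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), with terms computed in log space."""
    total = 0.0
    for i in range(k, n + 1):
        log_term = (math.lgamma(n + 1) - math.lgamma(i + 1)
                    - math.lgamma(n - i + 1)
                    + i * math.log(p) + (n - i) * math.log1p(-p))
        total += math.exp(log_term)
    return min(total, 1.0)


def over_represented(seqs, k, expected_freq=None, max_evalue=0.01):
    """Return (word, occurrences, e_value) for over-represented k-mers."""
    counts, positions = Counter(), 0
    for seq in seqs:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
            positions += 1
    n_words = 4 ** k
    expected_freq = expected_freq or {}
    hits = []
    for word, occ in counts.items():
        p = expected_freq.get(word, 1.0 / n_words)  # uniform background assumed
        e_value = binom_upper_tail(occ, positions, p) * n_words
        if e_value < max_evalue:
            hits.append((word, occ, e_value))
    return sorted(hits, key=lambda h: h[2])


# Toy promoter-like sequences sharing the hexamer GACGTC (made-up data).
toy = ["TTGACGTCAT", "GGGACGTCCC", "AAGACGTCAA", "CCGACGTCGG"]
hits = over_represented(toy, 6)
```

Under-represented words could be scored the same way by replacing the upper tail with the lower tail P(X <= k).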
Preparing and Analyzing Expressed Sequence Tags (ESTs) Library for the Mammary Tissue of Local Turkish Kivircik Sheep

PubMed Central

Omeroglu Ulu, Zehra; Ulu, Salih; Un, Cemal; Ozdem Oztabak, Kemal; Altunatmaz, Kemal

2017-01-01

The Kivircik is an important local Turkish sheep breed, valued for its meat quality and milk productivity. The aim of this study was to analyze gene expression profiles at both prenatal and postnatal stages in Kivircik sheep. To this end, two cDNA libraries were constructed from mammary gland tissue taken from the same Kivircik sheep at prenatal and postnatal stages. A total of 3072 colonies randomly selected from the two libraries were sequenced to develop a sheep EST collection. The raw ESTs were analyzed with the Phred/Phrap programs, and readable EST sequences were assembled with the CAP3 software. Putative functions of all unique sequences and statistical analyses were determined with Geneious software. A total of 422 ESTs had over 80% similarity to known sequences of other organisms in NCBI and were classified by Gene Ontology (GO) category using the Panther database. By comparing gene expression profiles, we observed some putative genes that may be related to reproductive performance or play important roles in milk synthesis and secretion. A total of 2414 ESTs have been deposited in the NCBI GenBank database (GW996847–GW999260). The EST data in this study provide a new source of information for functional genome studies of sheep. PMID:28239610
Sequence analysis reveals genomic factors affecting EST-SSR primer performance and polymorphism

USDA-ARS's Scientific Manuscript database

Search for simple sequence repeat (SSR) motifs and design of flanking primers in expressed sequence tag (EST) sequences can be easily done at a large scale using bioinformatics programs. However, failed amplification and/or detection, along with lack of polymorphism, is often seen among randomly sel...
Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

PubMed Central

Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

2007-01-01

Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730
The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tagmount, Abderrahmane; Wang, Mei; Lindquist, Erika

2010-01-27

Background: With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences is important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes. Methodology/Principal Findings: A set of ~30K unique sequences (UniSeqs) representing ~19K clusters were generated from ~98K high quality ESTs from a set of tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66% of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases. Conclusions/Significance: The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is equally diverse to the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in EST library sequencing approaches, and thus represent a rich resource for studies of environmental genomics.
Generation, annotation and analysis of ESTs from Trichoderma harzianum CECT 2413

PubMed Central

Vizcaíno, Juan Antonio; González, Francisco Javier; Suárez, M Belén; Redondo, José; Heinrich, Julian; Delgado-Jarana, Jesús; Hermosa, Rosa; Gutiérrez, Santiago; Monte, Enrique; Llobell, Antonio; Rey, Manuel

2006-01-01

Background The filamentous fungus Trichoderma harzianum is used as biological control agent of several plant-pathogenic fungi. In order to study the genome of this fungus, a functional genomics project called "TrichoEST" was developed to give insights into genes involved in biological control activities using an approach based on the generation of expressed sequence tags (ESTs). Results Eight different cDNA libraries from T. harzianum strain CECT 2413 were constructed. Different growth conditions involving mainly different nutrient conditions and/or stresses were used. We here present the analysis of the 8,710 ESTs generated. A total of 3,478 unique sequences were identified of which 81.4% had sequence similarity with GenBank entries, using the BLASTX algorithm. Using the Gene Ontology hierarchy, we performed the annotation of 51.1% of the unique sequences and compared its distribution among the gene libraries. Additionally, the InterProScan algorithm was used in order to further characterize the sequences. The identification of the putatively secreted proteins was also carried out. Later, based on the EST abundance, we examined the highly expressed genes and a hydrophobin was identified as the gene expressed at the highest level. We compared our collection of ESTs with the previous collections obtained from Trichoderma species and we also compared our sequence set with different complete eukaryotic genomes from several animals, plants and fungi. Accordingly, the presence of similar sequences in different kingdoms was also studied. Conclusion This EST collection and its annotation provide a significant resource for basic and applied research on T. harzianum, a fungus with a high biotechnological interest. PMID:16872539
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

PubMed

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
Identification of tissue-specific, abiotic stress-responsive gene expression patterns in wine grape (Vitis vinifera L.) based on curation and mining of large-scale EST data sets

PubMed Central

2011-01-01

Background Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their relative EST abundance in roots relative to other tissues. 
Conclusions The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods highlighting the need for deep-sequencing expression profiling methods. PMID:21592389
Sputnik: a database platform for comparative plant genomics.

PubMed

Rudd, Stephen; Mewes, Hans-Werner; Mayer, Klaus F X

2003-01-01

Two million plant ESTs, from 20 different plant species, and totalling more than 1000 Mbp of DNA sequence, represent a formidable transcriptomic resource. Sputnik uses the potential of this sequence resource to fill some of the information gap in the un-sequenced plant genomes and to serve as the foundation for in silico comparative plant genomics. The complexity of the individual EST collections has been reduced using optimised EST clustering techniques. Annotation of cluster sequences is performed by exploiting and transferring information from the comprehensive knowledgebase already produced for the completed model plant genome (Arabidopsis thaliana) and by performing additional state-of-the-art sequence analyses relevant to today's plant biologist. Functional predictions, comparative analyses and associative annotations for 500 000 plant EST derived peptides make Sputnik (http://mips.gsf.de/proj/sputnik/) a valid platform for contemporary plant genomics.
Sputnik: a database platform for comparative plant genomics

PubMed Central

Rudd, Stephen; Mewes, Hans-Werner; Mayer, Klaus F.X.

2003-01-01

Two million plant ESTs, from 20 different plant species, and totalling more than 1000 Mbp of DNA sequence, represent a formidable transcriptomic resource. Sputnik uses the potential of this sequence resource to fill some of the information gap in the un-sequenced plant genomes and to serve as the foundation for in silico comparative plant genomics. The complexity of the individual EST collections has been reduced using optimised EST clustering techniques. Annotation of cluster sequences is performed by exploiting and transferring information from the comprehensive knowledgebase already produced for the completed model plant genome (Arabidopsis thaliana) and by performing additional state-of-the-art sequence analyses relevant to today's plant biologist. Functional predictions, comparative analyses and associative annotations for 500 000 plant EST derived peptides make Sputnik (http://mips.gsf.de/proj/sputnik/) a valid platform for contemporary plant genomics. PMID:12519965
Spotting optimization for oligo microarrays on aldehyde-glass.

PubMed

Dawson, Erica D; Reppert, Amy E; Rowlen, Kathy L; Kuck, Laura R

2005-06-15

Low-density microarrays that utilize short oligos (<100 nt) for capture are highly attractive for use in diagnostic applications, yet these experiments require strict quality control and meticulous reproducibility. However, a survey of current literature indicates vast inconsistencies in the spotting and processing procedures. In this study, spotting and processing protocols were optimized for aldehyde-functionalized glass substrates. Figures of merit were developed for quantitative comparison of spot quality and reproducibility. Experimental variables examined included oligo concentration in the spotting buffer, composition of the spotting buffer, postspotting "curing" conditions, and postspotting wash conditions. Optimized conditions included the use of 3-4 µM oligo in a 3x standard saline citrate/0.05% sodium dodecyl sulfate/0.001% (3-[(3-cholamidopropyl) dimethylammonia]-1-propane sulfonate) spotting buffer, 24-h postspotting reaction at 100% relative humidity, and a four-step wash procedure. Evaluation of six types of aldehyde-functionalized glass substrates indicated that those manufactured by CEL Associates, Inc. yield the highest oligo coverage.
Differential transferability of EST-SSR primers developed from diploid species Pseudoroegneria spicata, Thinopyrum bessarabicum, and Th. elongatum

USDA-ARS's Scientific Manuscript database

Simple sequence repeat technology based on expressed sequence tag (EST-SSR) is a useful genomic tool for genome mapping, characterizing plant species relationships, elucidating genome evolution, and tracing genes on alien chromosome segments. EST-SSR primers developed from three perennial diploid T...
PineElm_SSRdb: a microsatellite marker database identified from genomic, chloroplast, mitochondrial and EST sequences of pineapple (Ananas comosus (L.) Merrill).

PubMed

Chaudhary, Sakshi; Mishra, Bharat Kumar; Vivek, Thiruvettai; Magadum, Santoshkumar; Yasin, Jeshima Khan

2016-01-01

Simple Sequence Repeats (SSRs), or microsatellites, are resourceful molecular genetic markers. There are only a few reports of SSR identification and development in pineapple. The complete genome sequence of pineapple available in the public domain can be used to develop numerous novel SSRs. Therefore, an attempt was made to identify SSRs from genomic, chloroplast, mitochondrial and EST sequences of pineapple, which will help in deciphering the genetic makeup of its germplasm resources. A total of 359511 SSRs were identified in pineapple (356385 from genome sequence, 45 from chloroplast sequence, 249 from mitochondrial sequence and 2832 from EST sequences). The list of EST-SSR markers and their details are available in the database. PineElm_SSRdb is an open source database available for non-commercial academic purposes at http://app.bioelm.com/ with a mapping tool which can develop circular maps of a selected marker set. This database will be of immense use to breeders, researchers and graduates working on Ananas spp. and to others working on cross-species transferability of markers, investigating diversity, mapping and DNA fingerprinting.
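As a rough illustration of how perfect SSRs can be located in a sequence, the following Python sketch scans for tandem repeats with a regular expression. It is not the tool used to build PineElm_SSRdb: the function names and the minimum repeat thresholds per motif length are assumptions chosen for the example, and compound or imperfect repeats are not handled.

```python
import re

# Assumed minimum repeat counts per motif length (1-6 bp); not from the abstract.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def is_redundant(motif):
    """True if the motif is itself a repeat of a shorter unit (e.g. 'AA', 'ATAT')."""
    n = len(motif)
    return any(n % d == 0 and motif[:d] * (n // d) == motif for d in range(1, n))


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) tuples for perfect SSRs, sorted by start."""
    seq = seq.upper()
    hits = []
    for size, min_rep in min_repeats.items():
        # A motif of `size` bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for match in pattern.finditer(seq):  # non-overlapping, left-to-right scan
            motif = match.group(1)
            if is_redundant(motif):
                continue  # already reported under its shorter motif, if long enough
            hits.append((match.start(), motif, len(match.group(0)) // size))
    return sorted(hits)


# Made-up sequence with an (AT)7 dinucleotide and an (A)12 mononucleotide repeat.
demo = "GGC" + "AT" * 7 + "CCG" + "A" * 12 + "TGC"
ssrs = find_ssrs(demo)
```

Start positions are 0-based. Flanking primers would then be designed against the sequence on either side of each reported repeat.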

ApiEST-DB: analyzing clustered EST data of the apicomplexan parasites.

PubMed

Li, Li; Crabtree, Jonathan; Fischer, Steve; Pinney, Deborah; Stoeckert, Christian J; Sibley, L David; Roos, David S

2004-01-01

ApiEST-DB (http://www.cbil.upenn.edu/paradbs-servlet/) provides integrated access to publicly available EST data from protozoan parasites in the phylum Apicomplexa. The database currently incorporates a total of nearly 100,000 ESTs from several parasite species of clinical and/or veterinary interest, including Eimeria tenella, Neospora caninum, Plasmodium falciparum, Sarcocystis neurona and Toxoplasma gondii. To facilitate analysis of these data, EST sequences were clustered and assembled to form consensus sequences for each organism, and these assemblies were then subjected to automated annotation via similarity searches against protein and domain databases. The underlying relational database infrastructure, Genomics Unified Schema (GUS), enables complex biologically based queries, facilitating validation of gene models, identification of alternative splicing, detection of single nucleotide polymorphisms, identification of stage-specific genes and recognition of phylogenetically conserved and phylogenetically restricted sequences.
JUICE: a data management system that facilitates the analysis of large volumes of information in an EST project workflow.

PubMed

Latorre, Mariano; Silva, Herman; Saba, Juan; Guziolowski, Carito; Vizoso, Paula; Martinez, Veronica; Maldonado, Jonathan; Morales, Andrea; Caroca, Rodrigo; Cambiazo, Veronica; Campos-Vargas, Reinaldo; Gonzalez, Mauricio; Orellana, Ariel; Retamales, Julio; Meisel, Lee A

2006-11-23

Expressed sequence tag (EST) analyses provide a rapid and economical means to identify candidate genes that may be involved in a particular biological process. These ESTs are useful in many Functional Genomics studies. However, the large quantity and complexity of the data generated during an EST sequencing project can make the analysis of this information a daunting task. In an attempt to make this task friendlier, we have developed JUICE, an open source data management system (Apache + PHP + MySQL on Linux), which enables the user to easily upload, organize, visualize and search the different types of data generated in an EST project pipeline. In contrast to other systems, the JUICE data management system allows a branched pipeline to be established, modified and expanded, during the course of an EST project. The web interfaces and tools in JUICE enable the users to visualize the information in a graphical, user-friendly manner. The user may browse or search for sequences and/or sequence information within all the branches of the pipeline. The user can search using terms associated with the sequence name, annotation or other characteristics stored in JUICE and associated with sequences or sequence groups. Groups of sequences can be created by the user, stored in a clipboard and/or downloaded for further analyses. Different user profiles restrict the access of each user depending upon their role in the project. The user may have access exclusively to visualize sequence information, access to annotate sequences and sequence information, or administrative access. JUICE is an open source data management system that has been developed to aid users in organizing and analyzing the large amount of data generated in an EST Project workflow. JUICE has been used in one of the first functional genomics projects in Chile, entitled "Functional Genomics in nectarines: Platform to potentiate the competitiveness of Chile in fruit exportation". 
However, due to its ability to organize and visualize data from external pipelines, JUICE is a flexible data management system that should be useful for other EST/Genome projects. The JUICE data management system is released under the Open Source GNU Lesser General Public License (LGPL). JUICE may be downloaded from http://genoma.unab.cl/juice_system/ or http://www.genomavegetal.cl/juice_system/.
JUICE: a data management system that facilitates the analysis of large volumes of information in an EST project workflow

PubMed Central

Latorre, Mariano; Silva, Herman; Saba, Juan; Guziolowski, Carito; Vizoso, Paula; Martinez, Veronica; Maldonado, Jonathan; Morales, Andrea; Caroca, Rodrigo; Cambiazo, Veronica; Campos-Vargas, Reinaldo; Gonzalez, Mauricio; Orellana, Ariel; Retamales, Julio; Meisel, Lee A

2006-01-01

Background Expressed sequence tag (EST) analyses provide a rapid and economical means to identify candidate genes that may be involved in a particular biological process. These ESTs are useful in many Functional Genomics studies. However, the large quantity and complexity of the data generated during an EST sequencing project can make the analysis of this information a daunting task. Results In an attempt to make this task friendlier, we have developed JUICE, an open source data management system (Apache + PHP + MySQL on Linux), which enables the user to easily upload, organize, visualize and search the different types of data generated in an EST project pipeline. In contrast to other systems, the JUICE data management system allows a branched pipeline to be established, modified and expanded, during the course of an EST project. The web interfaces and tools in JUICE enable the users to visualize the information in a graphical, user-friendly manner. The user may browse or search for sequences and/or sequence information within all the branches of the pipeline. The user can search using terms associated with the sequence name, annotation or other characteristics stored in JUICE and associated with sequences or sequence groups. Groups of sequences can be created by the user, stored in a clipboard and/or downloaded for further analyses. Different user profiles restrict the access of each user depending upon their role in the project. The user may have access exclusively to visualize sequence information, access to annotate sequences and sequence information, or administrative access. Conclusion JUICE is an open source data management system that has been developed to aid users in organizing and analyzing the large amount of data generated in an EST Project workflow. JUICE has been used in one of the first functional genomics projects in Chile, entitled "Functional Genomics in nectarines: Platform to potentiate the competitiveness of Chile in fruit exportation". 
However, due to its ability to organize and visualize data from external pipelines, JUICE is a flexible data management system that should be useful for other EST/Genome projects. The JUICE data management system is released under the Open Source GNU Lesser General Public License (LGPL). JUICE may be downloaded from http://genoma.unab.cl/juice_system/ or http://www.genomavegetal.cl/juice_system/. PMID:17123449
Pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of Plasmodium vivax in human patients.

PubMed

Merino, Emilio F; Fernandez-Becerra, Carmen; Madeira, Alda M B N; Machado, Ariane L; Durham, Alan; Gruber, Arthur; Hall, Neil; del Portillo, Hernando A

2003-07-21

Plasmodium vivax is the most widely distributed human malaria parasite, responsible for 70-80 million clinical cases each year and large socio-economic burdens for countries such as Brazil, where it is the most prevalent species. Unfortunately, due to the impossibility of growing this parasite in continuous in vitro culture, research on P. vivax remains largely neglected. A pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of P. vivax was performed. To do so, 1,184 clones from a cDNA library constructed with parasites obtained from 10 different human patients in the Brazilian Amazon were sequenced. Sequences were automatically processed to remove contaminants and low quality reads. A total of 806 sequences with an average length of 586 bp met such criteria and their clustering revealed 666 distinct events. The consensus sequence of each cluster and the unique sequences of the singlets were used in similarity searches against different databases that included P. vivax, Plasmodium falciparum, Plasmodium yoelii, Plasmodium knowlesi, Apicomplexa and the GenBank non-redundant database. An E-value of <10^-30 was used to define a significant database match. ESTs were manually assigned gene ontology (GO) terms. A total of 769 ESTs could be assigned a putative identity based upon sequence similarity to known proteins in GenBank. Moreover, 292 ESTs were annotated and a GO term was assigned to 164 of them. These are the first ESTs reported for P. vivax and, as such, they represent a valuable resource to assist in the annotation of the P. vivax genome currently being sequenced. Moreover, since the GC-content of the P. vivax genome is strikingly different from that of P. falciparum, these ESTs will help in the validation of gene predictions for P. vivax and in the creation of a gene index of this malaria parasite.
SolEST database: a "one-stop shop" approach to the study of Solanaceae transcriptomes.

PubMed

D'Agostino, Nunzio; Traini, Alessandra; Frusciante, Luigi; Chiusano, Maria Luisa

2009-11-30

Since no genome sequences of solanaceous plants have yet been completed, expressed sequence tag (EST) collections represent a reliable tool for broad sampling of Solanaceae transcriptomes, an attractive route for understanding Solanaceae genome functionality and a powerful reference for the structural annotation of emerging Solanaceae genome sequences. We describe the SolEST database (http://biosrv.cab.unina.it/solestdb), which integrates different EST datasets from both cultivated and wild Solanaceae species and from two species of the genus Coffea. Background as well as processed data contained in the database, extensively linked to external related resources, represent an invaluable source of information for these plant families. Two novel features differentiate SolEST from other resources: i) the option of accessing and then visualizing Solanaceae EST/TC alignments along the emerging tomato and potato genome sequences; ii) the opportunity to compare different Solanaceae assemblies generated by diverse research groups in the attempt to address a common complaint in the SOL community. Different databases have been established worldwide for collecting Solanaceae ESTs and are related in concept, content and utility to the one presented herein. However, the SolEST database has several distinguishing features that make it appealing for the research community and facilitate a "one-stop shop" for the study of Solanaceae transcriptomes.
Development of simple sequence repeat markers and diversity analysis in alfalfa (Medicago sativa L.).

PubMed

Wang, Zan; Yan, Hongwei; Fu, Xinnian; Li, Xuehui; Gao, Hongwen

2013-04-01

Efficient and robust molecular markers are essential for molecular breeding in plants. Compared to dominant and bi-allelic markers, the multiple alleles of simple sequence repeat (SSR) markers are particularly informative and superior for genetic linkage mapping and QTL mapping in autotetraploid species like alfalfa. The objective of this study was to enrich SSR markers directly from alfalfa expressed sequence tags (ESTs). A total of 12,371 alfalfa ESTs were retrieved from the National Center for Biotechnology Information. A total of 774 SSRs were identified in 716 SSR-containing ESTs. On average, one SSR was found per 7.7 kb of EST sequences. Tri-nucleotide repeats (48.8 %) were the most abundant motif type, followed by di- (26.1 %), tetra- (11.5 %), penta- (9.7 %), and hexanucleotide (3.9 %) repeats. One hundred EST-SSR primer pairs were successfully designed and 29 exhibited polymorphism among 28 alfalfa accessions. The allele number per marker ranged from two to 21 with an average of 6.8. The PIC values ranged from 0.195 to 0.896 with an average of 0.608, indicating a high level of polymorphism of the EST-SSR markers. Based on the 29 EST-SSR markers, an assessment of genetic diversity was conducted and found that Medicago sativa ssp. sativa was clearly different from the other subspecies. High transferability of these EST-SSR markers to related species was also found.
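
The PIC (polymorphism information content) values quoted above are commonly computed from allele frequencies with the formula of Botstein et al. (1980), PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2. The abstract does not state which formula or software the authors used, so the Python sketch below is only an assumed illustration; the function name `pic` and the example allele counts are hypothetical.

```python
from itertools import combinations


def pic(allele_counts):
    """Polymorphism information content from per-allele counts or frequencies.

    Uses PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2.
    Inputs are normalised, so raw counts and frequencies both work.
    """
    total = sum(allele_counts)
    if total <= 0:
        raise ValueError("allele counts must sum to a positive value")
    freqs = [count / total for count in allele_counts]
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1 - homozygosity - cross_term


if __name__ == "__main__":
    # Hypothetical marker with four alleles observed 10, 6, 3 and 1 times.
    print(round(pic([10, 6, 3, 1]), 3))
```

Under this formula a marker with two equally frequent alleles has PIC = 0.375, so values such as the reported average of 0.608 require more than two alleles. Estimating allele frequencies in an autotetraploid such as alfalfa is itself non-trivial and is outside the scope of this sketch.
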
Montmorillonite, Oligonucleotides, RNA and Origin of Life

NASA Astrophysics Data System (ADS)

Ertem, Gözen

2004-12-01

Na-montmorillonite prepared from Volclay by the titration method facilitates the self-condensation of ImpA, the 5'-phosphorimidazolide derivative of adenosine. As was shown by AE-HPLC analysis and selective enzymatic hydrolysis of products, oligo(A)s formed in this reaction are 10 monomer units long and contain 67% 3',5'-phosphodiester bonds (Ferris and Ertem, 1992a). Under the same reaction conditions, 5'-phosphorimidazolide derivatives of cytidine, uridine and guanosine also undergo self-condensation producing oligomers containing up to 12-14 monomer units for oligo(C)s to 6 monomer units for oligo(G)s. In oligo(C)s and oligo(U)s, 75-80% of the monomers are linked by 2',5'-phosphodiester bonds. Hexamer and higher oligomers isolated from synthetic oligo(C)s formed by montmorillonite catalysis, which contain both 3',5'- and 2',5'-linkages, serve as catalysts for the non-enzymatic template directed synthesis of oligo(G)s from activated monomer 2-MeImpG, guanosine 5'-phospho-2-methylimidazolide (Ertem and Ferris, 1996). Pentamer and higher oligomers containing exclusively 2',5'-linkages, which were isolated from the synthetic oligo(C)s, also serve as templates and produce oligo(G)s with both 2',5'- and 3',5'-phosphodiester bonds. Kinetic studies on montmorillonite catalyzed elongation rates of oligomers using the computer program SIMFIT demonstrated that the rate constants for the formation of oligo(A)s increased in the order of 2-mer <3-mer <4-mer ... <7-mer (Kawamura and Ferris, 1994). A decameric primer, dA(pdA)8pA bound to montmorillonite was elongated to contain up to 50 monomer units by daily addition of activated monomer ImpA to the reaction mixture (Ferris, Hill and Orgel, 1996). 
Analysis of dimer fractions formed in the montmorillonite catalyzed reaction of binary and quaternary mixtures of ImpA, ImpC, 2-MeImpG and ImpU suggested that only a limited number of oligomers could have formed on the primitive Earth rather than equal amounts of all possible isomers (Ertem and Ferris, 2000). Formation of phosphodiester bonds between mononucleotides by montmorillonite catalysis is a fascinating discovery, and a significant step forward in efforts to find out how the first RNA-like oligomers might have formed in the course of chemical evolution. However, as has been pointed out in several publications, these systems should be regarded as models rather than a literal representation of prebiotic chemistry (Orgel, 1998; Joyce and Orgel, 1999; Schwartz, 1999).
Montmorillonite, oligonucleotides, RNA and origin of life

NASA Technical Reports Server (NTRS)

Ertem, Gozen

2004-01-01

Na-montmorillonite prepared from Volclay by the titration method facilitates the self-condensation of ImpA, the 5'-phosphorimidazolide derivative of adenosine. As was shown by AE-HPLC analysis and selective enzymatic hydrolysis of products, oligo(A)s formed in this reaction are 10 monomer units long and contain 67% 3',5'-phosphodiester bonds (Ferris and Ertem, 1992a). Under the same reaction conditions, 5'-phosphorimidazolide derivatives of cytidine, uridine and guanosine also undergo self-condensation producing oligomers containing up to 12-14 monomer units for oligo(C)s to 6 monomer units for oligo(G)s. In oligo(C)s and oligo(U)s, 75-80% of the monomers are linked by 2',5'-phosphodiester bonds. Hexamer and higher oligomers isolated from synthetic oligo(C)s formed by montmorillonite catalysis, which contain both 3',5'- and 2',5'-linkages, serve as catalysts for the non-enzymatic template directed synthesis of oligo(G)s from activated monomer 2-MeImpG, guanosine 5'-phospho-2-methylimidazolide (Ertem and Ferris, 1996). Pentamer and higher oligomers containing exclusively 2',5'-linkages, which were isolated from the synthetic oligo(C)s, also serve as templates and produce oligo(G)s with both 2',5'- and 3',5'-phosphodiester bonds. Kinetic studies on montmorillonite catalyzed elongation rates of oligomers using the computer program SIMFIT demonstrated that the rate constants for the formation of oligo(A)s increased in the order of 2-mer < 3-mer < 4-mer ... < 7-mer (Kawamura and Ferris, 1994). A decameric primer, dA(pdA)8pA bound to montmorillonite was elongated to contain up to 50 monomer units by daily addition of activated monomer ImpA to the reaction mixture (Ferris, Hill and Orgel, 1996). 
Analysis of dimer fractions formed in the montmorillonite catalyzed reaction of binary and quaternary mixtures of ImpA, ImpC, 2-MeImpG and ImpU suggested that only a limited number of oligomers could have formed on the primitive Earth rather than equal amounts of all possible isomers (Ertem and Ferris, 2000). Formation of phosphodiester bonds between mononucleotides by montmorillonite catalysis is a fascinating discovery, and a significant step forward in efforts to find out how the first RNA-like oligomers might have formed in the course of chemical evolution. However, as has been pointed out in several publications, these systems should be regarded as models rather than a literal representation of prebiotic chemistry (Orgel, 1998; Joyce and Orgel, 1999; Schwartz, 1999).
openSputnik--a database to ESTablish comparative plant genomics using unsaturated sequence collections.

PubMed

Rudd, Stephen

2005-01-01

The public expressed sequence tag collections are continually being enriched with high-quality sequences that represent an ever-expanding range of taxonomically diverse plant species. While these sequence collections provide biased insight into the populations of expressed genes available within individual species and their associated tissues, the information is conceivably of wider relevance in a comparative context. When we consider the available expressed sequence tag (EST) collections of summer 2004, most of the major plant taxonomic clades are at least superficially represented. Investigation of the five million available plant ESTs provides a wealth of information that has applications in modelling the routes of plant genome evolution and the identification of lineage-specific genes and gene families. Over four million ESTs from over 50 distinct plant species have been collated within an EST analysis pipeline called openSputnik. The ESTs were resolved down into approximately one million unigene sequences. These have been annotated using orthology-based annotation transfer from reference plant genomes and using a variety of contemporary bioinformatics methods to assign peptide, structural and functional attributes. The openSputnik database is available at http://sputnik.btk.fi.
Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen).

PubMed

Rambaut, Andrew; Lam, Tommy T; Max Carvalho, Luiz; Pybus, Oliver G

2016-01-01

Gene sequences sampled at different points in time can be used to infer molecular phylogenies on a natural timescale of months or years, provided that the sequences in question undergo measurable amounts of evolutionary change between sampling times. Data sets with this property are termed heterochronous and have become increasingly common in several fields of biology, most notably the molecular epidemiology of rapidly evolving viruses. Here we introduce the cross-platform software tool, TempEst (formerly known as Path-O-Gen), for the visualization and analysis of temporally sampled sequence data. Given a molecular phylogeny and the dates of sampling for each sequence, TempEst uses an interactive regression approach to explore the association between genetic divergence through time and sampling dates. TempEst can be used to (1) assess whether there is sufficient temporal signal in the data to proceed with phylogenetic molecular clock analysis, and (2) identify sequences whose genetic divergence and sampling date are incongruent. Examination of the latter can help identify data quality problems, including errors in data annotation, sample contamination, sequence recombination, or alignment error. We recommend that all users of the molecular clock models implemented in BEAST first check their data using TempEst prior to analysis.
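
The regression idea described above can be sketched in a few lines of Python. This is not TempEst's implementation: it assumes the root-to-tip distances have already been measured on a rooted tree, it uses ordinary least squares, and the function name `root_to_tip_regression` and the example values are hypothetical.

```python
def root_to_tip_regression(sampling_dates, root_to_tip_distances):
    """Ordinary least-squares fit of root-to-tip distance against sampling date.

    Returns the slope (an estimate of the substitution rate), the x-intercept
    (an estimate of the date of the root), R^2 as a rough measure of temporal
    signal, and per-sequence residuals for spotting incongruent samples.
    """
    n = len(sampling_dates)
    if n != len(root_to_tip_distances) or n < 3:
        raise ValueError("need at least three (date, distance) pairs")
    mean_x = sum(sampling_dates) / n
    mean_y = sum(root_to_tip_distances) / n
    sxx = sum((x - mean_x) ** 2 for x in sampling_dates)
    syy = sum((y - mean_y) ** 2 for y in root_to_tip_distances)
    sxy = sum(
        (x - mean_x) * (y - mean_y)
        for x, y in zip(sampling_dates, root_to_tip_distances)
    )
    if sxx == 0:
        raise ValueError("all sequences share one sampling date")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = (sxy * sxy) / (sxx * syy) if syy > 0 else 0.0
    x_intercept = -intercept / slope if slope != 0 else None
    residuals = [
        y - (intercept + slope * x)
        for x, y in zip(sampling_dates, root_to_tip_distances)
    ]
    return {
        "rate": slope,
        "root_date": x_intercept,
        "r_squared": r_squared,
        "residuals": residuals,
    }


if __name__ == "__main__":
    dates = [2000.0, 2002.0, 2004.0, 2006.0]   # hypothetical sampling years
    dists = [0.010, 0.014, 0.018, 0.022]       # hypothetical substitutions/site
    fit = root_to_tip_regression(dates, dists)
    print(fit["rate"], fit["root_date"], fit["r_squared"])
```

A low R^2 or a non-positive slope would suggest too little temporal signal for a molecular clock analysis, and sequences with large residuals are the ones worth checking for annotation or contamination problems.
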
ESTree db: a Tool for Peach Functional Genomics

PubMed Central

Lazzari, Barbara; Caprera, Andrea; Vecchietti, Alberto; Stella, Alessandra; Milanesi, Luciano; Pozzi, Carlo

2005-01-01

Background The ESTree db represents a collection of Prunus persica expressed sequence tags (ESTs) and is intended as a resource for peach functional genomics. A total of 6,155 successful EST sequences were obtained from four in-house prepared cDNA libraries from Prunus persica mesocarps at different developmental stages. Another 12,475 peach EST sequences were downloaded from public databases and added to the ESTree db. An automated pipeline was prepared to process EST sequences using public software integrated by in-house developed Perl scripts and data were collected in a MySQL database. A PHP-based web interface was developed to query the database. Results The ESTree db version as of April 2005 encompasses 18,630 sequences representing eight libraries. Contig assembly was performed with CAP3. Putative single nucleotide polymorphism (SNP) detection was performed with the AutoSNP program and a search engine was implemented to retrieve results. All the sequences and all the contig consensus sequences were annotated both with blastx against the GenBank nr db and with GOblet against the viridiplantae section of the Gene Ontology db. Links to NiceZyme (Expasy) and to the KEGG metabolic pathways were provided. A local BLAST utility is available. A text search utility allows querying and browsing the database. Statistics were provided on Gene Ontology occurrences to assign sequences to Gene Ontology categories. Conclusion The resulting database is a comprehensive resource of data and links related to peach EST sequences. The Sequence Report and Contig Report pages work as the web interface core structures, giving quick access to data related to each sequence/contig. PMID:16351742
ESTree db: a tool for peach functional genomics.

PubMed

Lazzari, Barbara; Caprera, Andrea; Vecchietti, Alberto; Stella, Alessandra; Milanesi, Luciano; Pozzi, Carlo

2005-12-01

The ESTree db (http://www.itb.cnr.it/estree/) represents a collection of Prunus persica expressed sequence tags (ESTs) and is intended as a resource for peach functional genomics. A total of 6,155 successful EST sequences were obtained from four in-house prepared cDNA libraries from Prunus persica mesocarps at different developmental stages. Another 12,475 peach EST sequences were downloaded from public databases and added to the ESTree db. An automated pipeline was prepared to process EST sequences using public software integrated by in-house developed Perl scripts and data were collected in a MySQL database. A PHP-based web interface was developed to query the database. The ESTree db version as of April 2005 encompasses 18,630 sequences representing eight libraries. Contig assembly was performed with CAP3. Putative single nucleotide polymorphism (SNP) detection was performed with the AutoSNP program and a search engine was implemented to retrieve results. All the sequences and all the contig consensus sequences were annotated both with blastx against the GenBank nr db and with GOblet against the viridiplantae section of the Gene Ontology db. Links to NiceZyme (Expasy) and to the KEGG metabolic pathways were provided. A local BLAST utility is available. A text search utility allows querying and browsing the database. Statistics were provided on Gene Ontology occurrences to assign sequences to Gene Ontology categories. The resulting database is a comprehensive resource of data and links related to peach EST sequences. The Sequence Report and Contig Report pages work as the web interface core structures, giving quick access to data related to each sequence/contig.
3′ terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing

PubMed Central

2013-01-01

Background Post-transcriptional 3′ end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3′ RACE coupled with high-throughput sequencing to characterize the 3′ terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. Results The 3′ terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3′ terminus of an in vitro transcribed MRP RNA control and the differing 3′ terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). Conclusions 3′ RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3′ terminal sequences of noncoding RNAs. PMID:24053768
Yellow lupin (Lupinus luteus L.) transcriptome sequencing: molecular marker development and comparative studies

PubMed Central

2012-01-01

Background Yellow lupin (Lupinus luteus L.) is a minor legume crop characterized by its high seed protein content. Although grown in several temperate countries, its orphan condition has limited the generation of genomic tools to aid breeding efforts to improve yield and nutritional quality. In this study, we report the construction of 454 expressed sequence tag (EST) libraries, comparative studies between L. luteus and model legume species, the development of a comprehensive set of EST-simple sequence repeat (SSR) markers, and the validation of their utility in diversity studies and their transferability to related species. Results Two runs of 454 pyrosequencing yielded 205 Mb and 530 Mb of sequence data for the L1 (young leaves, buds and flowers) and L2 (immature seeds) EST libraries, respectively. A combined assembly (L1L2) yielded 71,655 contigs with an average contig length of 632 nucleotides. L1L2 contigs were clustered into 55,309 isotigs. 38,200 isotigs translated into proteins and 8,741 of them were full length. Around 57% of L. luteus sequences had significant similarity with at least one sequence of Medicago, Lotus, Arabidopsis, or Glycine, and 40.17% showed positive matches with all of these species. L. luteus isotigs were also screened for the presence of SSR sequences. A total of 2,572 isotigs contained at least one EST-SSR, with a frequency of one SSR per 17.75 kbp. Empirical evaluation of the EST-SSR candidate markers resulted in 222 polymorphic EST-SSRs. Two hundred and fifty four (65.7%) and 113 (30%) SSR primer pairs were able to amplify fragments from L. hispanicus and L. mutabilis DNA, respectively. Fifty polymorphic EST-SSRs were used to genotype a sample of 64 L. luteus accessions. Neighbor-joining distance analysis detected the existence of several clusters among L. luteus accessions, strongly suggesting the existence of population subdivisions. However, no clear clustering patterns followed the accession’s origin. Conclusion L. luteus deep transcriptome sequencing will facilitate the further development of genomic tools and lupin germplasm. Massive sequencing of cDNA libraries will continue to produce raw materials for gene discovery, identification of polymorphisms (SNPs, EST-SSRs, INDELs, etc.) for marker development, anchoring sequences for genome comparisons and putative gene candidates for QTL detection. PMID:22920992
Yellow lupin (Lupinus luteus L.) transcriptome sequencing: molecular marker development and comparative studies.

PubMed

Parra-González, Lorena B; Aravena-Abarzúa, Gabriela A; Navarro-Navarro, Cristell S; Udall, Joshua; Maughan, Jeff; Peterson, Louis M; Salvo-Garrido, Haroldo E; Maureira-Butler, Iván J

2012-08-24

Yellow lupin (Lupinus luteus L.) is a minor legume crop characterized by its high seed protein content. Although grown in several temperate countries, its orphan condition has limited the generation of genomic tools to aid breeding efforts to improve yield and nutritional quality. In this study, we report the construction of 454 expressed sequence tag (EST) libraries, comparative studies between L. luteus and model legume species, the development of a comprehensive set of EST-simple sequence repeat (SSR) markers, and the validation of their utility in diversity studies and their transferability to related species. Two runs of 454 pyrosequencing yielded 205 Mb and 530 Mb of sequence data for the L1 (young leaves, buds and flowers) and L2 (immature seeds) EST libraries, respectively. A combined assembly (L1L2) yielded 71,655 contigs with an average contig length of 632 nucleotides. L1L2 contigs were clustered into 55,309 isotigs. 38,200 isotigs translated into proteins and 8,741 of them were full length. Around 57% of L. luteus sequences had significant similarity with at least one sequence of Medicago, Lotus, Arabidopsis, or Glycine, and 40.17% showed positive matches with all of these species. L. luteus isotigs were also screened for the presence of SSR sequences. A total of 2,572 isotigs contained at least one EST-SSR, with a frequency of one SSR per 17.75 kbp. Empirical evaluation of the EST-SSR candidate markers resulted in 222 polymorphic EST-SSRs. Two hundred and fifty four (65.7%) and 113 (30%) SSR primer pairs were able to amplify fragments from L. hispanicus and L. mutabilis DNA, respectively. Fifty polymorphic EST-SSRs were used to genotype a sample of 64 L. luteus accessions. Neighbor-joining distance analysis detected the existence of several clusters among L. luteus accessions, strongly suggesting the existence of population subdivisions. However, no clear clustering patterns followed the accession's origin. L. luteus deep transcriptome sequencing will facilitate the further development of genomic tools and lupin germplasm. Massive sequencing of cDNA libraries will continue to produce raw materials for gene discovery, identification of polymorphisms (SNPs, EST-SSRs, INDELs, etc.) for marker development, anchoring sequences for genome comparisons and putative gene candidates for QTL detection.
Mining SNPs from EST sequences using filters and ensemble classifiers.

PubMed

Wang, J; Zou, Q; Guo, M Z

2010-05-04

Abundant single nucleotide polymorphisms (SNPs) provide the most complete information for genome-wide association studies. However, due to the bottleneck of manual discovery of putative SNPs and the inaccessibility of the original sequencing reads, it is essential to develop a more efficient and accurate computational method for automated SNP detection. We propose a novel computational method to rapidly find true SNPs in publicly available EST (expressed sequence tag) databases; this method is implemented as SNPDigger. EST sequences are clustered and aligned. SNP candidates are then obtained according to a measure of redundant frequency. Several new informative biological features, such as the structural neighbor profiles and the physical position of the SNP, were extracted from EST sequences, and the effectiveness of these features was demonstrated. An ensemble classifier, which employs a carefully selected feature set, was included for the imbalanced training data. The sensitivity and specificity of our method both exceeded 80% for human genetic data in the cross validation. Our method enables detection of SNPs from the user's own EST dataset and can be used on species for which there is no genome data. Our tests showed that this method can effectively guide SNP discovery in ESTs and can help reduce the cost of biological analyses.
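
The candidate-selection step mentioned above ("a measure of redundant frequency") can be illustrated with a simple redundancy filter over one aligned EST cluster. This is a generic sketch, not SNPDigger's actual measure, features or classifier: the function name `snp_candidates` and the thresholds `min_coverage` and `min_minor_count` are assumptions made for the example.

```python
from collections import Counter


def snp_candidates(aligned_reads, min_coverage=4, min_minor_count=2):
    """Flag alignment columns whose second most common base recurs.

    aligned_reads: equal-length strings from one EST cluster, '-' for gaps.
    A column is reported as (column, major_base, minor_base, minor_count)
    when at least min_coverage reads carry a base there and the minor base
    is seen in at least min_minor_count reads, so that a mismatch in a
    single read (a likely sequencing error) is ignored.
    """
    if not aligned_reads:
        return []
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to the same length")
    candidates = []
    for column in range(length):
        bases = (read[column].upper() for read in aligned_reads)
        counts = Counter(base for base in bases if base in "ACGT")
        if sum(counts.values()) < min_coverage:
            continue
        ranked = counts.most_common(2)
        if len(ranked) == 2 and ranked[1][1] >= min_minor_count:
            major, minor = ranked[0][0], ranked[1][0]
            candidates.append((column, major, minor, ranked[1][1]))
    return candidates


if __name__ == "__main__":
    cluster = [
        "ACGTACGT",
        "ACGTACGT",
        "ACGAACGT",
        "ACGAACGT",
        "ACGTACGT",
        "ACGTAC-T",
        "ACGTACCT",  # singleton mismatch, expected to be filtered out
    ]
    print(snp_candidates(cluster))
```

Requiring the minor base to recur is the simplest form of a redundancy criterion; a classifier such as the one the abstract describes would then be applied to the surviving candidates.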
Genes expressed during the development and ripening of watermelon fruit.

PubMed

Levi, A; Davis, A; Hernandez, A; Wechter, P; Thimmapuram, J; Trebitsh, T; Tadmor, Y; Katzir, N; Portnoy, V; King, S

2006-11-01

A normalized cDNA library was constructed using watermelon flesh mRNA from three distinct developmental time-points and was subtracted by hybridization with leaf cDNA. Random cDNA clones of the watermelon flesh subtraction library were sequenced from the 5' end in order to identify potentially informative genes associated with fruit setting, development, and ripening. One-thousand and forty-six 5'-end sequences (expressed sequence tags; ESTs) were assembled into 832 non-redundant sequences, designated as "EST-unigenes". Of these 832 "EST-unigenes", 254 (approximately 30%) have no significant homology to sequences published so far for other plant species. Additionally, 168 "EST-unigenes" (approximately 20%) correspond to genes with unknown function, whereas 410 "EST-unigenes" (approximately 50%) correspond to genes with known function in other plant species. These "EST-unigenes" are mainly associated with metabolism, membrane transport, cytoskeleton synthesis and structure, cell wall formation and cell division, signal transduction, nucleic acid binding and transcription factors, defense and stress response, and secondary metabolism. This study provides the scientific community with novel genetic information for watermelon as well as an expanded pool of genes associated with fruit development in watermelon. These genes will be useful targets in future genetic and functional genomic studies of watermelon and its development.
Sensitization of Prostate Cancer Cells to Androgen Deprivation and Radiation via Manipulation of the MDM2 Pathway

DTIC Science & Technology

2005-04-01

cell number apoptosis, and clonogenic assays of LNCaP- MST. Months 1-6. c. Time course experiments of AS effects on AD, RT, and AD+RT in LNCaP and LNCaP...to AS- MDM2, and have not found much of an effect. More recently, we have initiated the measurement of SmRNA expression using the Oligo Pollack...AL, Joon DL, Meistrich M, Hachem P, Pollack A. Effect of sequencing androgen deprivation and radiation on prostate cancer growth. Int J Radiat Oncol
Alginate Oligosaccharides Inhibit Fungal Cell Growth and Potentiate the Activity of Antifungals against Candida and Aspergillus spp

PubMed Central

Tøndervik, Anne; Sletta, Håvard; Klinkenberg, Geir; Emanuel, Charlotte; Powell, Lydia C.; Pritchard, Manon F.; Khan, Saira; Craine, Kieron M.; Onsøyen, Edvar; Rye, Phil D.; Wright, Chris; Thomas, David W.; Hill, Katja E.

2014-01-01

The oligosaccharide OligoG, an alginate derived from seaweed, has been shown to have anti-bacterial and anti-biofilm properties and potentiates the activity of selected antibiotics against multi-drug resistant bacteria. The ability of OligoG to perturb fungal growth and potentiate conventional antifungal agents was evaluated using a range of pathogenic fungal strains. Candida (n = 11) and Aspergillus (n = 3) spp. were tested using germ tube assays, LIVE/DEAD staining, scanning electron microscopy (SEM), atomic force microscopy (AFM) and high-throughput minimum inhibition concentration assays (MICs). In general, the strains tested showed a significant dose-dependent reduction in cell growth at ≥6% OligoG as measured by optical density (OD600; P<0.05). OligoG (>0.5%) also showed a significant inhibitory effect on hyphal growth in germ tube assays, although strain-dependent variations in efficacy were observed (P<0.05). SEM and AFM both showed that OligoG (≥2%) markedly disrupted fungal biofilm formation, both alone, and in combination with fluconazole. Cell surface roughness was also significantly increased by the combination treatment (P<0.001). High-throughput robotic MIC screening demonstrated the potentiating effects of OligoG (2, 6, 10%) with nystatin, amphotericin B, fluconazole, miconazole, voriconazole or terbinafine with the test strains. Potentiating effects were observed for the Aspergillus strains with all six antifungal agents, with an up to 16-fold (nystatin) reduction in MIC. Similarly, all the Candida spp. showed potentiation with nystatin (up to 16-fold) and fluconazole (up to 8-fold). These findings demonstrate the antifungal properties of OligoG and suggest a potential role in the management of fungal infections and possible reduction of antifungal toxicity. PMID:25409186
Preparation of graphite intercalation compounds containing oligo and polyethers

NASA Astrophysics Data System (ADS)

Zhang, Hanyang; Lerner, Michael M.

2016-02-01

Layered host-polymer nanocomposites comprising polymeric guests between inorganic sheets have been prepared with many inorganic hosts, but there is limited evidence for the incorporation of polymeric guests into graphite. Here we report for the first time the preparation, and structural and compositional characterization of graphite intercalation compounds (GICs) containing polyether bilayers. The new GICs are obtained by either (1) reductive intercalation of graphite with an alkali metal in the presence of an oligo or polyether and an electrocatalyst, or (2) co-intercalate exchange of an amine for an oligo or polyether in a donor-type GIC. Structural characterization of products using powder X-ray diffraction, Raman spectroscopy, and thermal analyses supports the formation of well-ordered, first-stage GICs containing alkali metal cations and oligo or polyether bilayers between reduced graphene sheets. Electronic supplementary information (ESI) available: Domain size, additional Raman spectra info, compositional calculation, and packing fractions. See DOI: 10.1039/c5nr08226a

An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

PubMed Central

Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

2004-01-01

Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051
ESAP plus: a web-based server for EST-SSR marker development.

PubMed

Ponyared, Piyarat; Ponsawat, Jiradej; Tongsima, Sissades; Seresangtakul, Pusadee; Akkasaeng, Chutipong; Tantisuwichwong, Nathpapat

2016-12-22

Simple sequence repeats (SSRs) have become widely used as molecular markers in plant genetic studies due to their abundance, high allelic variation at each locus and simplicity to analyze using conventional PCR amplification. To study plants with unknown genome sequence, SSR markers from Expressed Sequence Tags (ESTs), which can be obtained from the plant mRNA (converted to cDNA), must be utilized. With the advent of high-throughput sequencing technology, huge EST sequence data have been generated and are now accessible from many public databases. However, SSR marker identification from a large in-house or public EST collection requires a computational pipeline that makes use of several standard bioinformatic tools to design high quality EST-SSR primers. Some of these computational tools are not user-friendly and must be tightly integrated with reference genomic databases. A web-based bioinformatic pipeline, called EST Analysis Pipeline Plus (ESAP Plus), was constructed for assisting researchers to develop SSR markers from a large EST collection. ESAP Plus incorporates several bioinformatic scripts and some useful standard software tools necessary for the four main procedures of EST-SSR marker development, namely 1) pre-processing, 2) clustering and assembly, 3) SSR mining and 4) SSR primer design. The proposed pipeline also provides two alternative steps for reducing EST redundancy and identifying SSR loci. Using public sugarcane ESTs, ESAP Plus automatically executed the aforementioned computational pipeline via a simple web user interface, which was implemented using standard PHP, HTML, CSS and JavaScript. With ESAP Plus, users can upload raw EST data and choose various filtering options and parameters to analyze each of the four main procedures through this web interface. All input EST data and their predicted SSR results will be stored in the ESAP Plus MySQL database. Users will be notified via e-mail when the automatic process is completed and they can download all the results through the web interface. ESAP Plus is a comprehensive and convenient web-based bioinformatic tool for SSR marker development. ESAP Plus offers all necessary EST-SSR development processes with various adjustable options that users can easily use to identify SSR markers from a large EST collection. With a familiar web interface, users can upload raw ESTs using the data submission page and visualize/download the corresponding EST-SSR information from within ESAP Plus. ESAP Plus can handle considerably large EST datasets. This EST-SSR discovery tool can be accessed directly from: http://gbp.kku.ac.th/esap_plus/ .
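The SSR mining step (procedure 3 above) can be illustrated with a minimal regular-expression sketch. This is not ESAP Plus code and does not reproduce any particular SSR finder: the function name and the minimum repeat counts per motif length are assumptions chosen only for the example.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem repeats.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # A motif of unit_len bases followed by at least (min_rep - 1) copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (for example "AGAG" is really the dinucleotide "AG").
            if any(motif == motif[:k] * (unit_len // k)
                   for k in range(1, unit_len) if unit_len % k == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // unit_len))
    return sorted(hits)

# Invented EST fragment containing an (AG)8 and a (CTT)5 repeat.
est = "GGCT" + "AG" * 8 + "TTCA" + "CTT" * 5 + "GA"
print(find_ssrs(est))
```

In a full pipeline, the reported start positions and repeat lengths would define the flanking regions handed to the primer design step.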
Ferrocene conjugated oligonucleotide for electrochemical detection of DNA base mismatch.

PubMed

Hasegawa, Yusuke; Takada, Tadao; Nakamura, Mitsunobu; Yamana, Kazushige

2017-08-01

We describe the synthesis, binding, and electrochemical properties of ferrocene-conjugated oligonucleotides (Fc-oligos). The key step for the preparation of Fc-oligos contains the coupling of vinylferrocene to 5-iododeoxyuridine via Heck reaction. The Fc-conjugated deoxyuridine phosphoramidite was used in the Fc-oligonucleotide synthesis. We show that thiol-modified Fc-oligos deposited onto gold electrodes possess potential ability in electrochemical detection of DNA base mismatch. Copyright © 2017 Elsevier Ltd. All rights reserved.
Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale Unigene Assembly and SSR Marker Discovery

PubMed Central

Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi

2013-01-01

Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only a few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in the NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotype analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799
Genome-wide characterization and selection of expressed sequence tag simple sequence repeat primers for optimized marker distribution and reliability in peach

USDA-ARS's Scientific Manuscript database

Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...
Exploiting rice-sorghum synteny for targeted development of EST-SSRs to enrich the sorghum genetic linkage map.

PubMed

Ramu, P; Kassahun, B; Senthilvel, S; Ashok Kumar, C; Jayashree, B; Folkertsma, R T; Reddy, L Ananda; Kuruvinashetti, M S; Haussmann, B I G; Hash, C T

2009-11-01

The sequencing and detailed comparative functional analysis of genomes of a number of select botanical models open new doors into comparative genomics among the angiosperms, with potential benefits for improvement of many orphan crops that feed large populations. In this study, a set of simple sequence repeat (SSR) markers was developed by mining the expressed sequence tag (EST) database of sorghum. Among the SSR-containing sequences, only those sharing considerable homology with rice genomic sequences across the lengths of the 12 rice chromosomes were selected. Thus, 600 SSR-containing sorghum EST sequences (50 homologous sequences on each of the 12 rice chromosomes) were selected, with the intention of providing coverage for corresponding homologous regions of the sorghum genome. Primer pairs were designed and polymorphism detection ability was assessed using parental pairs of two existing sorghum mapping populations. About 28% of these new markers detected polymorphism in this 4-entry panel. A subset of 55 polymorphic EST-derived SSR markers were mapped onto the existing skeleton map of a recombinant inbred population derived from cross N13 x E 36-1, which is segregating for Striga resistance and the stay-green component of terminal drought tolerance. These new EST-derived SSR markers mapped across all 10 sorghum linkage groups, mostly to regions expected based on prior knowledge of rice-sorghum synteny. The ESTs from which these markers were derived were then mapped in silico onto the aligned sorghum genome sequence, and 88% of the best hits corresponded to linkage-based positions. This study demonstrates the utility of comparative genomic information in targeted development of markers to fill gaps in linkage maps of related crop species for which sufficient genomic tools are not available.
Two-dimensional honeycomb network through sequence-controlled self-assembly of oligopeptides.

PubMed

Abb, Sabine; Harnau, Ludger; Gutzler, Rico; Rauschenbach, Stephan; Kern, Klaus

2016-01-12

The sequence of a peptide programs its self-assembly and hence the expression of specific properties through non-covalent interactions. A large variety of peptide nanostructures has been designed employing different aspects of these non-covalent interactions, such as dispersive interactions, hydrogen bonding or ionic interactions. Here we demonstrate the sequence-controlled fabrication of molecular nanostructures using peptides as bio-organic building blocks for two-dimensional (2D) self-assembly. Scanning tunnelling microscopy reveals changes from compact or linear assemblies (angiotensin I) to long-range ordered, chiral honeycomb networks (angiotensin II) as a result of removal of steric hindrance by sequence modification. Guided by our observations, molecular dynamic simulations yield atomistic models for the elucidation of interpeptide-binding motifs. This new approach to 2D self-assembly on surfaces grants insight at the atomic level that will enable the use of oligo- and polypeptides as large, multi-functional bio-organic building blocks, and opens a new route towards rationally designed, bio-inspired surfaces.
Development, characterization and cross species amplification of polymorphic microsatellite markers from expressed sequence tags of turmeric (Curcuma longa L.).

PubMed

Siju, S; Dhanya, K; Syamkumar, S; Sasikumar, B; Sheeja, T E; Bhat, A I; Parthasarathy, V A

2010-02-01

Expressed sequence tags (ESTs) from turmeric (Curcuma longa L.) were used for the screening of type and frequency of Class I (hypervariable) simple sequence repeats (SSRs). A total of 231 microsatellite repeats were detected from 12,593 EST sequences of turmeric after redundancy elimination. The average density of Class I SSRs amounts to one SSR per 17.96 kb of EST. Mononucleotides were the most abundant class of microsatellite repeat in turmeric ESTs followed by trinucleotides. A robust set of 17 polymorphic EST-SSRs were developed and used for evaluating 20 turmeric accessions. The number of alleles detected ranged from 3 to 8 per locus. The developed markers were also evaluated in 13 related species of C. longa, confirming a high rate (100%) of cross-species transferability. The polymorphic microsatellite markers generated from this study could be used for genetic diversity analysis and resolving the taxonomic confusion prevailing in the genus.
Genetic variation patterns of American chestnut populations at EST-SSRs

Treesearch

Oliver Gailing; C. Dana Nelson

2017-01-01

The objective of this study is to analyze patterns of genetic variation at genic expressed sequence tag - simple sequence repeats (EST-SSRs) and at chloroplast DNA markers in populations of American chestnut (Castanea dentata Borkh.) to assist in conservation and breeding efforts. Allelic diversity at EST-SSRs decreased significantly from southwest to northeast along...
Boron nitride nanotubes for gene silencing.

PubMed

Şen, Özlem; Çobandede, Zehra; Emanet, Melis; Bayrak, Ömer Faruk; Çulha, Mustafa

2017-09-01

Non-viral gene delivery is increasingly investigated as an alternative to viral vectors due to low toxicity and immunogenicity, easy preparation, tissue specificity, and ability to transfer larger sizes of genes. In this study, boron nitride nanotubes (BNNTs) are functionalized with oligonucleotides (oligo-BNNTs). The morpholinos complementary to the oligonucleotides attached to the BNNTs (morpholino/oligo-BNNTs) are hybridized to silence the luciferase gene. The morpholino/oligo-BNNTs conjugates are administered to luciferase-expressing cells (MDA-MB-231-luc2) and the luciferase activity is monitored. The luciferase activity is decreased when MDA-MB-231-luc2 cells were treated with morpholino/oligo-BNNTs. The study suggests that BNNTs can be used as a potential vector to transfect cells. BNNTs are potential new nanocarriers for gene delivery applications. Copyright © 2017 Elsevier B.V. All rights reserved.
Mapping genes to human chromosome 19

DOE Office of Scientific and Technical Information (OSTI.GOV)

Connolly, Sarah

1996-05-01

For this project, 22 Expressed Sequence Tags (ESTs) were fine mapped to regions of human chromosome 19. An EST is a short DNA sequence that occurs once in the genome and corresponds to a single expressed gene. ³²P-radiolabeled probes were made by polymerase chain reaction for each EST and hybridized to filters containing a chromosome 19-specific cosmid library. The location of the ESTs on the chromosome was determined by the location of the ordered cosmid to which the EST hybridized. Of the 22 ESTs that were sublocalized, 6 correspond to known genes, and 16 correspond to anonymous genes. These localized ESTs may serve as potential candidates for disease genes, as well as markers for future physical mapping.
The Human EST Ontology Explorer: a tissue-oriented visualization system for ontologies distribution in human EST collections.

PubMed

Merelli, Ivan; Caprera, Andrea; Stella, Alessandra; Del Corvo, Marcello; Milanesi, Luciano; Lazzari, Barbara

2009-10-15

The NCBI dbEST currently contains more than eight million human Expressed Sequence Tags (ESTs). This wide collection represents an important source of information for gene expression studies, provided it can be inspected according to biologically relevant criteria. EST data can be browsed using different dedicated web resources, which allow users to investigate library-specific gene expression levels and to make comparisons among libraries, highlighting significant differences in gene expression. Nonetheless, no tool is available to examine distributions of quantitative EST collections in Gene Ontology (GO) categories, nor to retrieve information concerning library-dependent EST involvement in metabolic pathways. In this work we present the Human EST Ontology Explorer (HEOE) http://www.itb.cnr.it/ptp/human_est_explorer, a web facility for comparison of expression levels among libraries from several healthy and diseased tissues. The HEOE provides library-dependent statistics on the distribution of sequences in the GO Directed Acyclic Graph (DAG) that can be browsed at each GO hierarchical level. The tool is based on large-scale BLAST annotation of EST sequences. Due to the huge number of input sequences, this BLAST analysis was performed with the aid of grid computing technology, which is particularly suitable for data-parallel tasks. Relying on the achieved annotation, library-specific distributions of ESTs in the GO Graph were inferred. A pathway-based search interface was also implemented, for a quick evaluation of the representation of libraries in metabolic pathways. EST processing steps were integrated in a semi-automatic procedure that relies on Perl scripts and stores results in a MySQL database. A PHP-based web interface offers the possibility to simultaneously visualize, retrieve and compare data from the different libraries. Statistically significant differences in GO categories among user-selected libraries can also be computed. The HEOE provides an alternative and complementary way to inspect EST expression levels with respect to approaches currently offered by other resources. Furthermore, BLAST computation on the whole human EST dataset was a suitable test of grid scalability in the context of large-scale bioinformatics analysis. The HEOE currently comprises sequence analysis from 70 non-normalized libraries, representing a comprehensive overview of healthy and unhealthy tissues. As the analysis procedure can be easily applied to other libraries, the number of represented tissues is intended to increase.
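The idea of library-dependent statistics that can be browsed at each level of the GO graph can be illustrated with a minimal sketch in which every annotated EST is counted once at its own term and once at each ancestor term. The HEOE itself is described as using Perl, MySQL and PHP; the Python code below, the toy graph and the term identifiers are all hypothetical and serve only to show the propagation logic.

```python
from collections import Counter

# Toy DAG: term -> list of parent terms (identifiers are invented).
PARENTS = {
    "GO:root": [],
    "GO:metabolism": ["GO:root"],
    "GO:signaling": ["GO:root"],
    "GO:kinase_activity": ["GO:metabolism", "GO:signaling"],
}

def ancestors(term):
    """Return the term itself plus every term reachable through parent links."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(PARENTS[current])
    return seen

def library_distribution(est_annotations):
    """Count ESTs per GO term for one library, propagating counts upward.
    Using a set per EST avoids double counting when a term is reachable
    through more than one path in the DAG."""
    counts = Counter()
    for term in est_annotations:
        counts.update(ancestors(term))
    return counts

# One annotation per EST in a hypothetical library.
library = ["GO:kinase_activity", "GO:kinase_activity", "GO:metabolism"]
print(library_distribution(library))
```

Comparing such per-term counts between two libraries is the kind of input a statistical test for differences in GO categories would need.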
Expressed sequence tag (EST) analysis of two subspecies of Metarhizium anisopliae reveals a plethora of secreted proteins with potential activity in insect hosts.

PubMed

Freimoser, Florian M; Screen, Steven; Bagga, Savita; Hu, Gang; St Leger, Raymond J

2003-01-01

Expressed sequence tag (EST) libraries for Metarhizium anisopliae, the causative agent of green muscardine disease, were developed from the broad host-range pathogen Metarhizium anisopliae sf. anisopliae and the specific grasshopper pathogen, M. anisopliae sf. acridum. Approximately 1,700 5' end sequences from each subspecies were generated from cDNA libraries representing fungi grown under conditions that maximize secretion of cuticle-degrading enzymes. Both subspecies had ESTs for virtually all pathogenicity-related genes cloned to date from M. anisopliae, but many novel genes encoding potential virulence factors were also tagged. Enzymes with potential targets in the insect host included proteases, chitinases, phospholipases, lipases, esterases, phosphatases and enzymes producing toxic secondary metabolites. A diverse array of proteases composed 36% of all M. anisopliae sf. anisopliae ESTs. Eighty percent of the ESTs that could be clustered into functional groups had significant matches (E < 10^-5) in other ascomycete fungi. These included genes reported to have specific roles in pathogens with plant or vertebrate hosts. Many of the remaining ESTs had their best BLAST match among animal, plant and bacterial sequences. These include genes with plant and microbial counterparts that produce potent antimicrobials. The abundance of transcripts discovered for different functional groups varied between the two subspecies of M. anisopliae in a manner consistent with ecological adaptations of the two pathogens. By hastening gene discovery this project has enhanced development of improved mycoinsecticides. In addition, the M. anisopliae ESTs represent a significant contribution to the extensive database of sequences from ascomycetes that are saprophytes or plant and vertebrate pathogens. Comparative analysis of these sequences is providing important information about the biology and evolutionary history of this clade.
Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

PubMed

Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

2016-05-23

Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
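The percentages and the gene-flow figure reported above can be checked with a short worked calculation. The abstract does not say which estimator produced Nm; the sketch assumes the widely used relation Nm = 0.5(1 - GST)/GST, and the function names are illustrative.

```python
def pct(part, whole):
    """Percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

def gene_flow_from_gst(gst):
    """Assumed estimator: Nm = 0.5 * (1 - GST) / GST."""
    return 0.5 * (1 - gst) / gst

print(pct(133, 5548))            # microsatellite-containing ESTs among all ESTs
print(pct(56, 133))              # hexanucleotide motifs
print(pct(50, 133))              # trinucleotide motifs
print(pct(7, 24))                # polymorphic primer pairs
print(gene_flow_from_gst(0.25))  # compare with the reported Nm = 1.4996
```

Under this assumption GST = 0.2500 gives Nm = 1.5, which agrees with the reported 1.4996 once rounding of GST is taken into account.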
Identification of true EST alignments for recognising transcribed regions.

PubMed

Ma, Chuang; Wang, Jia; Li, Lun; Duan, Mo-Jie; Zhou, Yan-Hong

2011-01-01

Transcribed regions can be determined by aligning Expressed Sequence Tags (ESTs) with genome sequences. The kernel of this strategy is to effectively distinguish true EST alignments from spurious ones. In this study, three measures including Direction Check, Identity Check and Terminal Check were introduced to more effectively eliminate spurious EST alignments. On the basis of these introduced measures and other widely used measures, a computational tool, named ESTCleanser, has been developed to identify true EST alignments for obtaining reliable transcribed regions. The performance of ESTCleanser has been evaluated on the well-annotated human ENCyclopedia of DNA Elements (ENCODE) regions using human ESTs in the dbEST database. The evaluation results show that the accuracy of ESTCleanser at the exon and intron levels is markedly higher than that of UCSC-spliced EST alignments. This work should be helpful to EST-based research on finding new genes, complementing genome annotation, and recognising alternative splicing events and Single Nucleotide Polymorphisms (SNPs).
A Novel Real-Time PCR Assay of microRNAs Using S-Poly(T), a Specific Oligo(dT) Reverse Transcription Primer with Excellent Sensitivity and Specificity

PubMed Central

Kang, Kang; Zhang, Xiaoying; Liu, Hongtao; Wang, Zhiwei; Zhong, Jiasheng; Huang, Zhenting; Peng, Xiao; Zeng, Yan; Wang, Yuna; Yang, Yi; Luo, Jun; Gou, Deming

2012-01-01

Background MicroRNAs (miRNAs) are small, non-coding RNAs capable of post-transcriptionally regulating gene expression. Accurate expression profiling is crucial for understanding the biological roles of miRNAs, and exploring them as biomarkers of diseases. Methodology/Principal Findings A novel, highly sensitive, and reliable miRNA quantification approach, termed S-Poly(T) miRNA assay, is designed. In this assay, miRNAs are subjected to polyadenylation and reverse transcription with a S-Poly(T) primer that contains a universal reverse primer, a universal Taqman probe, an oligo(dT)11 sequence and six miRNA-specific bases. Individual miRNAs are then amplified by a specific forward primer and a universal reverse primer, and the PCR products are detected by a universal Taqman probe. The S-Poly(T) assay showed a minimum of 4-fold increase in sensitivity as compared with the stem-loop or poly(A)-based methods. A remarkable specificity in discriminating among miRNAs with high sequence similarity was also obtained with this approach. Using this method, we profiled miRNAs in human pulmonary arterial smooth muscle cells (HPASMC) and identified 9 differentially expressed miRNAs associated with hypoxia treatment. Due to its outstanding sensitivity, the number of circulating miRNAs from normal human serum was significantly expanded from 368 to 518. Conclusions/Significance With excellent sensitivity, specificity, and high-throughput, the S-Poly(T) method provides a powerful tool for miRNAs quantification and identification of tissue- or disease-specific miRNA biomarkers. PMID:23152780
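The primer layout described above can be sketched as a small string-building function. This is an illustration under stated assumptions, not the published primer design: the 5'-to-3' order (universal region, then oligo(dT)11, then the six specific bases at the 3' end) is inferred from how such a primer would anneal to a polyadenylated miRNA, the universal region is a placeholder of N's because its sequence is not given here, and the example RNA is arbitrary.

```python
# RNA base -> complementary DNA base.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}

UNIVERSAL_REGION = "N" * 20  # placeholder; the real universal sequence is not given

def reverse_complement_dna(rna):
    """DNA reverse complement of an RNA sequence (written 5' to 3')."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))

def spolyt_like_primer(mirna, dt_len=11, specific_len=6):
    """Assemble universal region + oligo(dT) + bases complementary to the
    3' end of the miRNA, all written 5' to 3'."""
    specific = reverse_complement_dna(mirna[-specific_len:])
    return UNIVERSAL_REGION + "T" * dt_len + specific

example_rna = "UAGCUUAUCAGACUGAUGUUGA"  # arbitrary 22-nt example sequence
primer = spolyt_like_primer(example_rna)
print(primer[-17:])  # oligo(dT)11 followed by the six specific bases
```

Because the six 3'-terminal bases must pair with the end of the miRNA, two miRNAs that differ near their 3' ends would receive different primers, which is one plausible reading of how the assay gains specificity.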
Gene discovery in EST sequences from the wheat leaf rust fungus Puccinia triticina sexual spores, asexual spores and haustoria, compared to other rust and corn smut fungi

PubMed Central

2011-01-01

Background Rust fungi are biotrophic basidiomycete plant pathogens that cause major diseases on plants and trees world-wide, affecting agriculture and forestry. Their biotrophic nature precludes many established molecular genetic manipulations and lines of research. The generation of genomic resources for these microbes is leading to novel insights into biology such as interactions with the hosts and guiding directions for breakthrough research in plant pathology. Results To support gene discovery and gene model verification in the genome of the wheat leaf rust fungus, Puccinia triticina (Pt), we have generated Expressed Sequence Tags (ESTs) by sampling several life cycle stages. We focused on several spore stages and isolated haustorial structures from infected wheat, generating 17,684 ESTs. We produced sequences from both the sexual (pycniospores, aeciospores and teliospores) and asexual (germinated urediniospores) stages of the life cycle. From pycniospores and aeciospores, produced by infecting the alternate host, meadow rue (Thalictrum speciosissimum), 4,869 and 1,292 reads were generated, respectively. We generated 3,703 ESTs from teliospores produced on the senescent primary wheat host. Finally, we generated 6,817 reads from haustoria isolated from infected wheat as well as 1,003 sequences from germinated urediniospores. Along with 25,558 previously generated ESTs, we compiled a database of 13,328 non-redundant sequences (4,506 singlets and 8,822 contigs). Fungal genes were predicted using the EST version of the self-training GeneMarkS algorithm. To refine the EST database, we compared EST sequences by BLASTN to a set of 454 pyrosequencing-generated contigs and Sanger BAC-end sequences derived both from the Pt genome, and to ESTs and genome reads from wheat. A collection of 6,308 fungal genes was identified and compared to sequences of the cereal rusts, Puccinia graminis f. sp. tritici (Pgt) and stripe rust, P. striiformis f. sp. tritici (Pst), and poplar leaf rust Melampsora species, and the corn smut fungus, Ustilago maydis (Um). While extensive homologies were found, many genes appeared novel and species-specific; over 40% of genes did not match any known sequence in existing databases. Focusing on spore stages, direct comparison to Um identified potential functional homologs, possibly allowing heterologous functional analysis in that model fungus. Many potentially secreted protein genes were identified by similarity searches against genes and proteins of Pgt and Melampsora spp., revealing apparent orthologs. Conclusions The current set of Pt unigenes contributes to gene discovery in this major cereal pathogen and will be invaluable for gene model verification in the genome sequence. PMID:21435244
Design of oligonucleotides for microarrays and perspectives for design of multi-transcriptome arrays.

PubMed

Nielsen, Henrik Bjørn; Wernersson, Rasmus; Knudsen, Steen

2003-07-01

Optimal design of oligonucleotides for microarrays involves tedious and laborious work evaluating potential oligonucleotides relative to a series of parameters. The currently available tools for this purpose are limited in their flexibility and do not present the oligonucleotide designer with an overview of these parameters. We present here a flexible tool named OligoWiz for designing oligonucleotides for multiple purposes. OligoWiz presents a set of parameter scores in a graphical interface to facilitate an overview for the user. Additional custom parameter scores can easily be added to the program to extend the default parameters: homology, DeltaTm, low-complexity, position and GATC-only. Furthermore we present an analysis of the limitations in designing oligonucleotide sets that can detect transcripts from multiple organisms. OligoWiz is available at www.cbs.dtu.dk/services/OligoWiz/.
Experimental and statistical post-validation of positive example EST sequences carrying peroxisome targeting signals type 1 (PTS1)

PubMed Central

Lingner, Thomas; Kataya, Amr R. A.; Reumann, Sigrun

2012-01-01

We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which were mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals. PMID:22415050
Experimental and statistical post-validation of positive example EST sequences carrying peroxisome targeting signals type 1 (PTS1).

PubMed

Lingner, Thomas; Kataya, Amr R A; Reumann, Sigrun

2012-02-01

We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which were mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals.
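
The idea of flagging sequences that sit in the low tail of a fitted normal distribution can be illustrated with a simple z-score screen. This is only a minimal sketch: the abstract does not name the three outlier methods that were compared, so the z-score rule, the cutoff of -2.0, the function name and the scores below are all assumptions made for illustration.

```python
from statistics import mean, stdev

def low_score_outliers(scores, z_cutoff=-2.0):
    """Return scores whose z-score lies below z_cutoff (lower tail only).

    The normal fit is approximated by the sample mean and sample
    standard deviation of the group; z_cutoff is a hypothetical value.
    """
    mu = mean(scores)
    sigma = stdev(scores)
    if sigma == 0:
        return []  # all scores identical, nothing can be an outlier
    return [s for s in scores if (s - mu) / sigma < z_cutoff]

# Made-up prediction scores for one orthologous group of PTS1 proteins
group_scores = [0.91, 0.88, 0.93, 0.90, 0.87, 0.92, 0.89, 0.94, 0.90, 0.35]
flagged = low_score_outliers(group_scores)
print(flagged)
```

Because the mean is group-specific, a screen like this would be run once per orthologous group rather than on the pooled scores.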

Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST): A web tool for generating protein sequence similarity networks.

PubMed

Gerlt, John A; Bouvier, Jason T; Davidson, Daniel B; Imker, Heidi J; Sadkhin, Boris; Slater, David R; Whalen, Katie L

2015-08-01

The Enzyme Function Initiative, an NIH/NIGMS-supported Large-Scale Collaborative Project (EFI; U54GM093342; http://enzymefunction.org/), is focused on devising and disseminating bioinformatics and computational tools as well as experimental strategies for the prediction and assignment of functions (in vitro activities and in vivo physiological/metabolic roles) to uncharacterized enzymes discovered in genome projects. Protein sequence similarity networks (SSNs) are visually powerful tools for analyzing sequence relationships in protein families (H.J. Atkinson, J.H. Morris, T.E. Ferrin, and P.C. Babbitt, PLoS One 2009, 4, e4345). However, the members of the biological/biomedical community have not had access to the capability to generate SSNs for their "favorite" protein families. In this article we announce the EFI-EST (Enzyme Function Initiative-Enzyme Similarity Tool) web tool (http://efi.igb.illinois.edu/efi-est/) that is available without cost for the automated generation of SSNs by the community. The tool can create SSNs for the "closest neighbors" of a user-supplied protein sequence from the UniProt database (Option A) or of members of any user-supplied Pfam and/or InterPro family (Option B). We provide an introduction to SSNs, a description of EFI-EST, and a demonstration of the use of EFI-EST to explore sequence-function space in the OMP decarboxylase superfamily (PF00215). This article is designed as a tutorial that will allow members of the community to use the EFI-EST web tool for exploring sequence/function space in protein families. Copyright © 2015 Elsevier B.V. All rights reserved.
Identification and biochemical characterization of a GDSL-motif carboxylester hydrolase from Carica papaya latex.

PubMed

Abdelkafi, Slim; Ogata, Hiroyuki; Barouh, Nathalie; Fouquet, Benjamin; Lebrun, Régine; Pina, Michel; Scheirlinckx, Frantz; Villeneuve, Pierre; Carrière, Frédéric

2009-11-01

An esterase (CpEst) showing high specific activities on tributyrin and short chain vinyl esters was obtained from Carica papaya latex after an extraction step with zwitterionic detergent and sonication, followed by gel filtration chromatography. Although the protein could not be purified to complete homogeneity due to its presence in high molecular mass aggregates, a major protein band with an apparent molecular mass of 41 kDa was obtained by SDS-PAGE. This material was digested with trypsin and the amino acid sequences of the tryptic peptides were determined by LC/ESI/MS/MS. These sequences were used to identify a partial cDNA (679 bp) from expressed sequence tags (ESTs) of C. papaya. Based upon EST sequences, a full-length gene was identified in the genome of C. papaya, with an open reading frame of 1029 bp encoding a protein of 343 amino acid residues, with a theoretical molecular mass of 38 kDa. From sequence analysis, CpEst was identified as a GDSL-motif carboxylester hydrolase belonging to the SGNH protein family and four potential N-glycosylation sites were identified. The putative catalytic triad was localised (Ser(35)-Asp(307)-His(310)) with the nucleophile serine being part of the GDSL-motif. A 3D-model of CpEst was built from known X-ray structures and sequence alignments and the catalytic triad was found to be exposed at the surface of the molecule, thus confirming the results of CpEst inhibition by tetrahydrolipstatin suggesting a direct accessibility of the inhibitor to the active site.
Characterization of genic microsatellite markers derived from expressed sequence tags in Pacific abalone (Haliotis discus hannai)

NASA Astrophysics Data System (ADS)

Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong

2010-01-01

Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone (Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2 to 17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transferability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.
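
For readers unfamiliar with the statistics quoted above, the following sketch shows how expected heterozygosity is commonly computed from allele counts (one minus the sum of squared allele frequencies) and how the reported percentages follow from the locus counts. The abstract does not say which estimator the authors used, so the uncorrected formula and the function names here are assumptions.

```python
def expected_heterozygosity(allele_counts):
    """Expected heterozygosity, He = 1 - sum(p_i^2), from allele counts.

    This is the simple uncorrected form; small-sample corrections
    are deliberately left out of this sketch.
    """
    total = sum(allele_counts)
    return 1.0 - sum((count / total) ** 2 for count in allele_counts)

def percent(part, whole):
    """Share of `part` in `whole` as a percentage, one decimal place."""
    return round(100.0 * part / whole, 1)

# Two equally frequent alleles give He = 0.5
print(expected_heterozygosity([5, 5]))

# Percentages reported in the abstract
print(percent(12, 17))  # loci amplified in H. diversicolor
print(percent(3, 17))   # loci showing null alleles
```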
A direct detection of Escherichia coli genomic DNA using gold nanoprobes

PubMed Central

2012-01-01

Background In situations such as the diagnosis of clinical and forensic samples, there is a need for highly sensitive, rapid and specific DNA detection methods. Though conventional DNA amplification using PCR can provide fast results, it is not widely practised in diagnostic laboratories, partly because it requires skilled personnel and expensive equipment. To overcome these limitations, nanoparticles have been explored as signalling probes for ultrasensitive DNA detection that can be used in field applications. Among the nanomaterials, gold nanoparticles (AuNPs) have been extensively used, mainly because of their optical properties and ability to be functionalized with a variety of biomolecules. Results We report a protocol for the use of gold nanoparticles functionalized with single-stranded oligonucleotide (AuNP-oligo probe) as visual detection probes for rapid and specific detection of Escherichia coli. The AuNP-oligo probe remains red on hybridization with target DNA containing complementary sequences, whereas test samples without DNA sequences complementary to the probe turn purple due to acid-induced aggregation of AuNP-oligo probes. The color change of the solution is observed visually by the naked eye, demonstrating direct and rapid detection of pathogenic Escherichia coli from its genomic DNA without the need for PCR amplification. The limit of detection was ~54 ng for unamplified genomic DNA. The method requires less than 30 minutes to complete after genomic DNA extraction. By using unamplified, enzymatically digested genomic DNA, however, a detection limit of 11.4 ng was attained. Results of UV-Vis spectroscopic measurement and AFM imaging further support the hypothesis of aggregation-based visual discrimination. To elucidate its utility in medical diagnostics, the assay was validated on clinical strains of pathogenic Escherichia coli obtained from local hospitals and on spiked urine samples. It was found to be 100% sensitive and highly specific, without any cross-reaction with non-Escherichia coli strains. Conclusion This work gives entry into a new class of DNA/gold nanoparticle hybrid materials which might have optical properties that can be controlled for application in diagnostics. We note that it should be possible to extend this strategy easily for developing new types of DNA biosensor for point-of-care detection. The salient features of this approach include low cost, robust reagents and simple colorimetric detection of the pathogen. PMID:22309695
PipeOnline 2.0: automated EST processing and functional data sorting.

PubMed

Ayoubi, Patricia; Jin, Xiaojing; Leite, Saul; Liu, Xianghui; Martajaja, Jeson; Abduraham, Abdurashid; Wan, Qiaolan; Yan, Wei; Misawa, Eduardo; Prade, Rolf A

2002-11-01

Expressed sequence tags (ESTs) are generated and deposited in the public domain, as redundant, unannotated, single-pass reactions, with virtually no biological content. PipeOnline automatically analyses and transforms large collections of raw DNA-sequence data from chromatograms or FASTA files by calling the quality of bases, screening and removing vector sequences, assembling and rewriting consensus sequences of redundant input files into a unigene EST data set and finally through translation, amino acid sequence similarity searches, annotation of public databases and functional data. PipeOnline generates an annotated database, retaining the processed unigene sequence, clone/file history, alignments with similar sequences, and proposed functional classification, if available. Functional annotation is automatic and based on a novel method that relies on homology of amino acid sequence multiplicity within GenBank records. Records are examined through a function ordered browser or keyword queries with automated export of results. PipeOnline offers customization for individual projects (MyPipeOnline), automated updating and alert service. PipeOnline is available at http://stress-genomics.org.
Metformin, oral contraceptives or both to manage oligo-amenorrhea in adolescents with polycystic ovary syndrome? A clinical review.

PubMed

Palomba, Stefano; Materazzo, Caterina; Falbo, Angela; Orio, Francesco; La Sala, Giovanni Battista; Sultan, Charles

2014-05-01

The management of oligo-amenorrhea in adolescent patients with polycystic ovary syndrome (PCOS) represents an important and difficult challenge. Metformin and/or oral contraceptives (OCs) are different strategies widely proposed in these patients. The objective of the current review was to provide an overview on the use of metformin and/or OCs for the management of oligo-amenorrhea in adolescents with PCOS underlining their potential risks and benefits in order to help the clinician to choose the best patients' tailored treatment.
Functional regulation of RNA-induced silencing complex by photoreactive oligonucleotides.

PubMed

Matsuyama, Yohei; Yamayoshi, Asako; Kobori, Akio; Murakami, Akira

2014-02-01

We developed a novel method for regulation of RISC function by photoreactive oligonucleotides (Ps-Oligo) containing 2'-O-psoralenylmethoxyethyl adenosine (Aps). We observed that inhibitory effects of Ps-Oligos on RISC function were enhanced by UV-irradiation compared with 2'-O-methyl-oligonucleotide without Aps. These results suggest Ps-Oligo inhibited RISC function by cross-linking effect, and we propose that the concept described in this report may be promising and applicable one to regulate the small RNA-mediated post-transcriptional regulation. Crown Copyright © 2013. Published by Elsevier Ltd. All rights reserved.
MytiBase: a knowledgebase of mussel (M. galloprovincialis) transcribed sequences

PubMed Central

Venier, Paola; De Pittà, Cristiano; Bernante, Filippo; Varotto, Laura; De Nardi, Barbara; Bovo, Giuseppe; Roch, Philippe; Novoa, Beatriz; Figueras, Antonio; Pallavicini, Alberto; Lanfranchi, Gerolamo

2009-01-01

Background Although bivalves are among the most studied marine organisms due to their ecological role, economic importance and use in pollution biomonitoring, very little information is available on the genome sequences of mussels. This study reports the functional analysis of a large-scale Expressed Sequence Tag (EST) sequencing from different tissues of Mytilus galloprovincialis (the Mediterranean mussel) challenged with toxic pollutants, temperature and potentially pathogenic bacteria. Results We have constructed and sequenced seventeen cDNA libraries from different Mediterranean mussel tissues: gills, digestive gland, foot, anterior and posterior adductor muscle, mantle and haemocytes. A total of 24,939 clones were sequenced from these libraries generating 18,788 high-quality ESTs which were assembled into 2,446 overlapping clusters and 4,666 singletons resulting in a total of 7,112 non-redundant sequences. In particular, a high-quality normalized cDNA library (Nor01) was constructed as determined by the high rate of gene discovery (65.6%). Bioinformatic screening of the non-redundant M. galloprovincialis sequences identified 159 microsatellite-containing ESTs. Clusters, consensuses, related similarities and gene ontology searches have been organized in a dedicated, searchable database. Conclusion We defined the first species-specific catalogue of M. galloprovincialis ESTs including 7,112 unique transcribed sequences. Putative microsatellite markers were identified. This annotated catalogue represents a valuable platform for expression studies, marker validation and genetic linkage analysis for investigations in the biology of Mediterranean mussels. PMID:19203376
Small RNA populations revealed by blocking rRNA fragments in Drosophila melanogaster reproductive tissues

PubMed Central

Dalmay, Tamas

2018-01-01

RNA interference (RNAi) is a complex and highly conserved regulatory mechanism mediated via small RNAs (sRNAs). Recent technical advances in high throughput sequencing have enabled an increasingly detailed analysis of sRNA abundances and profiles in specific body parts and tissues. This enables investigations of the localized roles of microRNAs (miRNAs) and small interfering RNAs (siRNAs). However, variation in the proportions of non-coding RNAs in the samples being compared can hinder these analyses. Specific tissues may vary significantly in the proportions of fragments of longer non-coding RNAs (such as ribosomal RNA or transfer RNA) present, potentially reflecting tissue-specific differences in biological functions. For example, in Drosophila, some tissues contain a highly abundant 30nt rRNA fragment (the 2S rRNA) as well as abundant 5’ and 3’ terminal rRNA fragments. These can pose difficulties for the construction of sRNA libraries as they can swamp the sequencing space and obscure sRNA abundances. Here we addressed this problem and present a modified “rRNA blocking” protocol for the construction of high-definition (HD) adapter sRNA libraries, in D. melanogaster reproductive tissues. The results showed that 2S rRNAs targeted by blocking oligos were reduced from >80% to < 0.01% total reads. In addition, the use of multiple rRNA blocking oligos to bind the most abundant rRNA fragments allowed us to reveal the underlying sRNA populations at increased resolution. Side-by-side comparisons of sequencing libraries of blocked and non-blocked samples revealed that rRNA blocking did not change the miRNA populations present, but instead enhanced their abundances. We suggest that this rRNA blocking procedure offers the potential to improve the in-depth analysis of differentially expressed sRNAs within and across different tissues. PMID:29474379
Construction and EST sequencing of full-length, drought stress cDNA libraries for common beans (Phaseolus vulgaris L.)

PubMed Central

2011-01-01

Background Common bean is an important legume crop with only a moderate number of short expressed sequence tags (ESTs) made with traditional methods. The goal of this research was to use full-length cDNA technology to develop ESTs that would overlap with the beginning of open reading frames and therefore be useful for gene annotation of genomic sequences. The library was also constructed to represent genes expressed under drought, low soil phosphorus and high soil aluminum toxicity. We also undertook comparisons of the full-length cDNA library to two previous non-full clone EST sets for common bean. Results Two full-length cDNA libraries were constructed: one for the drought tolerant Mesoamerican genotype BAT477 and the other one for the acid-soil tolerant Andean genotype G19833 which has been selected for genome sequencing. Plants were grown in three soil types using deep rooting cylinders subjected to drought and non-drought stress and tissues were collected from both roots and above ground parts. A total of 20,000 clones were selected robotically, half from each library. Then, nearly 10,000 clones from the G19833 library were sequenced with an average read length of 850 nucleotides. A total of 4,219 unigenes were identified consisting of 2,981 contigs and 1,238 singletons. These were functionally annotated with gene ontology terms and placed into KEGG pathways. Compared to other EST sequencing efforts in common bean, about half of the sequences were novel or represented the 5' ends of known genes. Conclusions The present full-length cDNA libraries add to the technological toolbox available for common bean and our sequencing of these clones substantially increases the number of unique EST sequences available for the common bean genome. All of this should be useful for functional gene annotation, analysis of splice site variants and intron/exon boundary determination by comparison to soybean genes or with common bean whole-genome sequences. In addition, the library has a large number of transcription factors and will be interesting for discovery and validation of drought or abiotic stress related genes in common bean. PMID:22118559
Sequence evaluation of four specific cDNA libraries for developmental genomics of sunflower.

PubMed

Tamborindeguy, C; Ben, C; Liboz, T; Gentzbittel, L

2004-04-01

Four different cDNA libraries were constructed from sunflower protoplasts growing under embryogenic and non-embryogenic conditions: one standard library from each condition and two subtractive libraries in opposite sense. A total of 22,876 cDNA clones were obtained and 4800 ESTs were sequenced, giving rise to 2479 high quality ESTs representing an unigene set of 1502 sequences. This set was compared with ESTs represented in public databases using the programs BLASTN and BLASTX, and its members were classified according to putative function using the catalog in the Kyoto Encyclopedia of Genes and Genomes (KEGG). Some 33% of sequences failed to align with existing plant ESTs and therefore represent putative novel genes. The libraries show a low level of redundancy and, on average, 50% of the present ESTs have not been previously reported for sunflower. Several potentially interesting genes were identified, based on their homology with genes involved in animal zygotic division or plant embryogenesis. We also identified two ESTs that show significantly different levels of expression under embryogenic and non-embryogenic conditions. The libraries described here represent an original and valuable resource for the discovery of yet unknown genes putatively involved in dicot embryogenesis and improving our knowledge of the mechanisms involved in polarity acquisition by plant embryos.
Analysis of expressed sequence tags from the four main developmental stages of Trypanosoma congolense

PubMed Central

Helm, Jared R.; Hertz-Fowler, Christiane; Aslett, Martin; Berriman, Matthew; Sanders, Mandy; Quail, Michael A.; Soares, Marcelo B.; Bonaldo, Maria F.; Sakurai, Tatsuya; Inoue, Noboru; Donelson, John E.

2009-01-01

Trypanosoma congolense is one of the most economically important pathogens of livestock in Africa. Culture-derived parasites of each of the three main insect stages of the T. congolense life cycle, i.e., the procyclic, epimastigote and metacyclic stages, and bloodstream stage parasites isolated from infected mice, were used to construct stage-specific cDNA libraries and expressed sequence tags (ESTs or cDNA clones) in each library were sequenced. Thirteen EST clusters encoding different variant surface glycoproteins (VSGs) were detected in the metacyclic library and twenty-six VSG EST clusters were found in the bloodstream library, six of which are shared by the metacyclic library. Rare VSG ESTs are present in the epimastigote library, and none were detected in the procyclic library. ESTs encoding enzymes that catalyze oxidative phosphorylation and amino acid metabolism are about twice as abundant in the procyclic and epimastigote stages as in the metacyclic and bloodstream stages. In contrast, ESTs encoding enzymes involved in glycolysis, the citric acid cycle and nucleotide metabolism are about the same in all four developmental stages. Cysteine proteases, kinases and phosphatases are the most abundant enzyme groups represented by the ESTs. All four libraries contain T. congolense-specific expressed sequences not present in the T. brucei and T. cruzi genomes. Normalized cDNA libraries were constructed from the metacyclic and bloodstream stages, and found to be further enriched for T. congolense-specific ESTs. Given that cultured T. congolense offers an experimental advantage over other African trypanosome species, these ESTs provide a basis for further investigation of the molecular properties of these four developmental stages, especially the epimastigote and metacyclic stages for which it is difficult to obtain large quantities of organisms. The T. congolense EST databases are available at: http://www.sanger.ac.uk/Projects/T_congolense/EST_index.shtml. PMID:19559733
Insights into rubber biosynthesis from transcriptome analysis of Hevea brasiliensis latex.

PubMed

Chow, Keng-See; Wan, Kiew-Lian; Isa, Mohd Noor Mat; Bahari, Azlina; Tan, Siang-Hee; Harikrishna, K; Yeang, Hoong-Yeet

2007-01-01

Hevea brasiliensis is the most widely cultivated species for commercial production of natural rubber (cis-polyisoprene). In this study, 10,040 expressed sequence tags (ESTs) were generated from the latex of the rubber tree, which represents the cytoplasmic content of a single cell type, in order to analyse the latex transcription profile with emphasis on rubber biosynthesis-related genes. A total of 3,441 unique transcripts (UTs) were obtained after quality editing and assembly of EST sequences. Functional classification of UTs according to the Gene Ontology convention showed that 73.8% were related to genes of unknown function. Among highly expressed ESTs, a significant proportion encoded proteins related to rubber biosynthesis and stress or defence responses. Sequences encoding rubber particle membrane proteins (RPMPs) belonging to three protein families accounted for 12% of the ESTs. Characterization of these ESTs revealed nine RPMP variants (7.9-27 kDa) including the 14 kDa REF (rubber elongation factor) and 22 kDa SRPP (small rubber particle protein). The expression of multiple RPMP isoforms in latex was shown using antibodies against REF and SRPP. Both EST and quantitative reverse transcription-PCR (QRT-PCR) analyses demonstrated REF and SRPP to be the most abundant transcripts in latex. Besides rubber biosynthesis, comparative sequence analysis showed that the RPMPs are highly similar to sequences in the plant kingdom having stress-related functions. Implications of the RPMP function in cis-polyisoprene biosynthesis in the context of transcript abundance and differential gene expression are discussed.
EvOligo: A Novel Software to Design and Group Libraries of Oligonucleotides Applicable for Nucleic Acid-Based Experiments.

PubMed

Milewski, Marek C; Kamel, Karol; Kurzynska-Kokorniak, Anna; Chmielewski, Marcin K; Figlerowicz, Marek

2017-10-01

Experimental methods based on DNA and RNA hybridization, such as multiplex polymerase chain reaction, multiplex ligation-dependent probe amplification, or microarray analysis, require the use of mixtures of multiple oligonucleotides (primers or probes) in a single test tube. To provide an optimal reaction environment, minimal self- and cross-hybridization must be achieved among these oligonucleotides. To address this problem, we developed EvOligo, which is a software package that provides the means to design and group DNA and RNA molecules with defined lengths. EvOligo combines two modules. The first module performs oligonucleotide design, and the second module performs oligonucleotide grouping. The software applies a nearest-neighbor model of nucleic acid interactions coupled with a parallel evolutionary algorithm to construct individual oligonucleotides, and to group the molecules that are characterized by the weakest possible cross-interactions. To provide optimal solutions, the evolutionary algorithm sorts oligonucleotides into sets, preserves preselected parts of the oligonucleotides, and shapes their remaining parts. In addition, the oligonucleotide sets can be designed and grouped based on their melting temperatures. For the user's convenience, EvOligo is provided with a user-friendly graphical interface. EvOligo was used to design individual oligonucleotides, oligonucleotide pairs, and groups of oligonucleotide pairs that are characterized by the following parameters: (1) weaker cross-interactions between the non-complementary oligonucleotides and (2) more uniform ranges of the oligonucleotide pair melting temperatures than other available software products. In addition, in contrast to other grouping algorithms, EvOligo offers time-efficient sorting of paired and unpaired oligonucleotides based on various parameters defined by the user.
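
Grouping oligonucleotides by melting temperature, as mentioned above, can be sketched in a few lines. Note that this is not EvOligo's method: the software is described as using a nearest-neighbor model with an evolutionary algorithm, whereas the sketch below swaps in the much cruder Wallace rule (2 °C per A/T, 4 °C per G/C) and a greedy sort-and-split. The 4 °C window, the function names and the sequences are hypothetical.

```python
def wallace_tm(oligo):
    """Rough melting temperature (deg C) by the Wallace rule.

    Tm = 2 * (A + T) + 4 * (G + C); only meaningful for short oligos.
    """
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc

def group_by_tm(oligos, window=4):
    """Greedy grouping: sort by Tm and open a new group whenever an
    oligo's Tm exceeds the first (lowest) Tm in the current group by
    more than `window` degrees."""
    groups = []
    for oligo in sorted(oligos, key=wallace_tm):
        if groups and wallace_tm(oligo) - wallace_tm(groups[-1][0]) <= window:
            groups[-1].append(oligo)
        else:
            groups.append([oligo])
    return groups

example = ["GCGCGCGC", "ATATATAT", "ATATATGC"]
print(group_by_tm(example))
```

A grouping step like this only addresses uniformity of melting temperature; it says nothing about cross-hybridization between oligonucleotides, which is the harder part of the problem the abstract describes.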
MANGO: a new approach to multiple sequence alignment.

PubMed

Zhang, Zefeng; Lin, Hao; Li, Ming

2007-01-01

Multiple sequence alignment is a classical and challenging task for biological sequence analysis. The problem is NP-hard. The full dynamic programming takes too much time. The progressive alignment heuristics adopted by most state of the art multiple sequence alignment programs suffer from the 'once a gap, always a gap' phenomenon. Is there a radically new way to do multiple sequence alignment? This paper introduces a novel and orthogonal multiple sequence alignment method, using multiple optimized spaced seeds and new algorithms to handle these seeds efficiently. Our new algorithm processes information of all sequences as a whole, avoiding problems caused by the popular progressive approaches. Because the optimized spaced seeds are provably significantly more sensitive than the consecutive k-mers, the new approach promises to be more accurate and reliable. To validate our new approach, we have implemented MANGO: Multiple Alignment with N Gapped Oligos. Experiments were carried out on large 16S RNA benchmarks showing that MANGO compares favorably, in both accuracy and speed, against state-of-art multiple sequence alignment methods, including ClustalW 1.83, MUSCLE 3.6, MAFFT 5.861, Prob-ConsRNA 1.11, Dialign 2.2.1, DIALIGN-T 0.2.1, T-Coffee 4.85, POA 2.0 and Kalign 2.0.
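
The difference between a spaced seed and a consecutive k-mer can be shown with a toy example. The sketch below is not MANGO's algorithm; it only illustrates the seed concept, where a pattern such as `1101` requires matches at the `1` positions and ignores the `0` position. The seed pattern, function name and sequences are assumptions chosen for illustration.

```python
def spaced_seed_hits(a, b, seed="1101"):
    """Return (i, j) pairs where a[i:] and b[j:] agree at every '1'
    position of the seed; '0' positions are "don't care"."""
    care = [k for k, flag in enumerate(seed) if flag == "1"]
    span = len(seed)

    # Index every window of sequence a by its characters at care positions
    index = {}
    for i in range(len(a) - span + 1):
        key = "".join(a[i + k] for k in care)
        index.setdefault(key, []).append(i)

    # Look up every window of sequence b in that index
    hits = []
    for j in range(len(b) - span + 1):
        key = "".join(b[j + k] for k in care)
        for i in index.get(key, []):
            hits.append((i, j))
    return hits

a = "ACGTAC"
b = "ACTTAC"  # differs from a at the third base only
print(spaced_seed_hits(a, b, seed="1101"))  # spaced seed
print(spaced_seed_hits(a, b, seed="1111"))  # consecutive 4-mer
```

With these inputs the spaced seed still anchors the two sequences at their first window, while the consecutive 4-mer finds no shared window at all, which is the sensitivity gain the abstract refers to.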
Characterization and Amplification of Gene-Based Simple Sequence Repeat (SSR) Markers in Date Palm.

PubMed

Zhao, Yongli; Keremane, Manjunath; Prakash, Channapatna S; He, Guohao

2017-01-01

The paucity of molecular markers limits the application of genetic and genomic research in date palm (Phoenix dactylifera L.). Availability of expressed sequence tag (EST) sequences in date palm may provide a good resource for developing gene-based markers. This study characterizes a substantial fraction of transcriptome sequences containing simple sequence repeats (SSRs) from the EST sequences in date palm. The EST sequences studied are mainly homologous to those of Elaeis guineensis and Musa acuminata. A total of 911 gene-based SSR markers, characterized with functional annotations, have provided a useful basis not only for discovering candidate genes and understanding genetic basis of traits of interest but also for developing genetic and genomic tools for molecular research in date palm, such as diversity study, quantitative trait locus (QTL) mapping, and molecular breeding. The procedures of DNA extraction, polymerase chain reaction (PCR) amplification of these gene-based SSR markers, and gel electrophoresis of PCR products are described in this chapter.
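
Locating SSRs in EST sequences is typically a pattern search for short motifs repeated in tandem. The following is a minimal sketch using a regular expression; the abstract does not state which tool or thresholds were used, so the motif length range (2 to 6 bp), the minimum of five repeat units and the function name are assumptions.

```python
import re

# A motif of 2-6 bases followed by at least four more copies of itself,
# i.e. five or more tandem repeat units in total (hypothetical threshold).
SSR_PATTERN = re.compile(r"([ACGT]{2,6}?)\1{4,}")

def find_ssrs(sequence):
    """Return (start, motif, repeat_count) for each tandem repeat found.

    Limitation of this sketch: a homopolymer run such as AAAAAAAAAA is
    reported as a dinucleotide repeat of "AA".
    """
    results = []
    for match in SSR_PATTERN.finditer(sequence.upper()):
        motif = match.group(1)
        repeats = len(match.group(0)) // len(motif)
        results.append((match.start(), motif, repeats))
    return results

est = "TTG" + "AC" * 6 + "GGT"
print(find_ssrs(est))
```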
Construction of a cDNA library from female adult of Toxocara canis, and analysis of EST and immune-related genes expressions.

PubMed

Zhou, Rongqiong; Xia, Qingyou; Huang, Hancheng; Lai, Min; Wang, Zhenxin

2011-10-01

Toxocara canis is a widespread intestinal nematode parasite of dogs, which can also cause disease in humans. We employed an expressed sequence tag (EST) strategy to study gene expression related to development, digestion and reproduction in T. canis. ESTs provide a rapid way to identify genes, particularly in organisms for which very little molecular information is available. In this study, a cDNA library was constructed from a female adult of T. canis, and 215 high-quality ESTs from the 5'-ends of the cDNA clones, representing 79 unigenes, were obtained. The titer of the primary cDNA library was 1.83×10^6 pfu/mL with a recombination rate of 99.33%. Most of the sequences ranged from 300 to 900 bp with an average length of 656 bp. Cluster analysis of these ESTs allowed identification of 79 unique sequences comprising 28 contigs and 51 singletons. BLASTX searches revealed that 18 unigenes (22.78% of the total), or 70 ESTs (32.56% of the total), were novel genes with no significant matches to any protein sequences in the public databases. The remaining 61 unigenes (77.22% of the total), or 145 ESTs (67.44% of the total), closely matched known genes or sequences deposited in the public databases. These genes were classified into seven groups based on their known or putative biological functions. We also confirmed the gene expression patterns of several immune-related genes using RT-PCR. This work will provide a valuable resource for further investigations of stage-, sex- and tissue-specific gene transcription or expression. Copyright © 2011. Published by Elsevier Inc.
Identification and characterization of gene-based SSR markers in date palm (Phoenix dactylifera L.).

PubMed

Zhao, Yongli; Williams, Roxanne; Prakash, C S; He, Guohao

2012-12-15

Date palm (Phoenix dactylifera L.) is an important tree in the Middle East and North Africa due to the nutritional value of its fruit. Molecular breeding would accelerate genetic improvement of this fruit tree through marker-assisted selection. However, the lack of molecular markers in date palm restricts the application of molecular breeding. In this study, we analyzed 28,889 EST sequences from the date palm genome database to identify simple-sequence repeats (SSRs) and to develop gene-based markers, i.e. expressed sequence tag-SSRs (EST-SSRs). We identified 4,609 ESTs as containing SSRs, among which trinucleotide motifs (69.7%) were the most common, followed by tetranucleotide (10.4%) and dinucleotide motifs (9.6%). The motif AG (85.7%) was most abundant among dinucleotides, while motifs AGG (26.8%), AAG (19.3%), and AGC (16.1%) were most common among trinucleotides. A total of 4,967 primer pairs were designed for EST-SSR markers from the computational data. In a follow-up laboratory study, we tested a sample of 20 randomly selected primer pairs for amplification and polymorphism detection using genomic DNA from date palm cultivars. Nearly one-third of these primer pairs detected DNA polymorphism that differentiated the twelve date palm cultivars used. Functional categorization of EST sequences containing SSRs revealed that 3,108 (67.4%) of such ESTs had homology with known proteins. Date palm EST sequences are a good resource for developing gene-based markers. The genic markers identified in our study may provide a valuable genetic and genomic tool for further genetic research and varietal development in date palm, such as diversity study, QTL mapping, and molecular breeding.
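The SSR-mining step described in this abstract can be sketched with a regular expression that looks for a short unit repeated in tandem. This is a minimal illustration, not the authors' pipeline: the abstract does not give the minimum repeat counts, so the thresholds in `MIN_REPEATS`, the function name `find_ssrs` and the example sequence are all assumptions.

```python
import re

# Hypothetical minimum repeat counts per motif length (1 = mono ... 6 = hexa).
# The abstract does not state the thresholds that were actually used.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq: str):
    """Return sorted (start, motif, repeat_count) tuples for tandem repeats."""
    seq = seq.upper()
    found = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # A unit of `unit_len` bases followed by at least (min_rep - 1) copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are just a shorter motif repeated, e.g. "AGAG".
            if any(
                unit == unit[:d] * (unit_len // d)
                for d in range(1, unit_len)
                if unit_len % d == 0
            ):
                continue
            found.append((m.start(), unit, len(m.group(0)) // unit_len))
    return sorted(found)


# Made-up EST fragment containing one dinucleotide and one trinucleotide SSR.
est = "TTT" + "AG" * 7 + "CCC" + "AGG" * 5 + "T"
for start, motif, count in find_ssrs(est):
    print(start, motif, count)
```

Counting the motifs returned over a whole EST set would give motif-class shares of the kind reported above; changing the thresholds changes those shares, which is why they are flagged as assumptions.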
Systematic sequencing of mRNA from the Antarctic krill (Euphausia superba) and first tissue specific transcriptional signature

PubMed Central

De Pittà, Cristiano; Bertolucci, Cristiano; Mazzotta, Gabriella M; Bernante, Filippo; Rizzo, Giorgia; De Nardi, Barbara; Pallavicini, Alberto; Lanfranchi, Gerolamo; Costa, Rodolfo

2008-01-01

Background Little is known about the genome sequences of Euphausiacea (krill), although these crustaceans are abundant components of the pelagic ecosystems in all oceans and are used in aquaculture and the pharmaceutical industry. This study reports the results of an expressed sequence tag (EST) sequencing project from different tissues of Euphausia superba (the Antarctic krill). Results We have constructed and sequenced five cDNA libraries from different Antarctic krill tissues: head, abdomen, thoracopods and photophores. We have identified 1,770 high-quality ESTs, which were assembled into 216 overlapping clusters and 801 singletons, resulting in a total of 1,017 non-redundant sequences. Quantitative RT-PCR analysis was performed to quantify and validate the expression levels of ten genes presenting different EST counts in krill tissues. In addition, bioinformatic screening of the non-redundant E. superba sequences identified 69 microsatellite-containing ESTs. Clusters, consensuses and related similarity and gene ontology searches were organized in a dedicated E. superba database. Conclusion We defined the first tissue transcriptional signatures of E. superba based on functional categorization among the examined tissues. The analyses of annotated transcripts showed a higher similarity with genes from insects than with genes from Malacostraca, possibly as an effect of the limited number of Malacostraca sequences in the public databases. Our catalogue provides for the first time a genomic tool to investigate the biology of the Antarctic krill. PMID:18226200
Complementary DNA sequencing and identification of mRNAs from the venomous gland of Agkistrodon piscivorus leucostoma.

PubMed

Jia, Ying; Cantu, Bruno A; Sánchez, Elda E; Pérez, John C

2008-06-15

To advance our knowledge of snake venom composition and of the transcripts expressed in the venom gland at the molecular level, we constructed a cDNA library from the venom gland of Agkistrodon piscivorus leucostoma to generate an expressed sequence tag (EST) database. From the 2112 randomly sequenced independent clones, we obtained ESTs for 1309 (62%) cDNAs that showed significant deduced amino acid sequence similarity (scores >80) to previously characterized proteins in the National Center for Biotechnology Information (NCBI) database. Ribosomal proteins make up 47 clones (2%), and the remaining 756 (36%) cDNAs either are of unknown identity or show BLASTX sequence identity scores of <80 with known GenBank accessions. The most highly expressed gene, encoding phospholipase A(2) (PLA(2)) and accounting for 35% of A. p. leucostoma venom gland cDNAs, was identified and further confirmed by sodium dodecyl sulfate/polyacrylamide gel electrophoresis (SDS-PAGE) of crude venom and by protein sequencing. A total of 180 representative genes were obtained from the sequence assemblies and deposited in the EST database. Clones showing sequence identity to disintegrins, thrombin-like enzymes, hemorrhagic toxins, fibrinogen clotting inhibitors and plasminogen activators were also identified in our EST database. These data can be used to develop a research program that will help us identify genes encoding proteins that are of medical importance or proteins involved in the mechanisms of the toxin venom.

Gene identification and analysis of transcripts differentially regulated in fracture healing by EST sequencing in the domestic sheep.

PubMed

Hecht, Jochen; Kuhl, Heiner; Haas, Stefan A; Bauer, Sebastian; Poustka, Albert J; Lienau, Jasmin; Schell, Hanna; Stiege, Asita C; Seitz, Volkhard; Reinhardt, Richard; Duda, Georg N; Mundlos, Stefan; Robinson, Peter N

2006-07-05

The sheep is an important model animal for testing novel fracture treatments and other medical applications. Despite these medical uses and the well known economic and cultural importance of the sheep, relatively little research has been performed into sheep genetics, and DNA sequences are available for only a small number of sheep genes. In this work we have sequenced over 47 thousand expressed sequence tags (ESTs) from libraries developed from healing bone in a sheep model of fracture healing. These ESTs were clustered with the previously available 10 thousand sheep ESTs to a total of 19087 contigs with an average length of 603 nucleotides. We used the newly identified sequences to develop RT-PCR assays for 78 sheep genes and measured differential expression during the course of fracture healing between days 7 and 42 postfracture. All genes showed significant shifts at one or more time points. 23 of the genes were differentially expressed between postfracture days 7 and 10, which could reflect an important role for these genes for the initiation of osteogenesis. The sequences we have identified in this work are a valuable resource for future studies on musculoskeletal healing and regeneration using sheep and represent an important head-start for genomic sequencing projects for Ovis aries, with partial or complete sequences being made available for over 5,800 previously unsequenced sheep genes.
Analysis and Functional Annotation of an Expressed Sequence Tag Collection for Tropical Crop Sugarcane

PubMed Central

Vettore, André L.; da Silva, Felipe R.; Kemper, Edson L.; Souza, Glaucia M.; da Silva, Aline M.; Ferro, Maria Inês T.; Henrique-Silva, Flavio; Giglioti, Éder A.; Lemos, Manoel V.F.; Coutinho, Luiz L.; Nobrega, Marina P.; Carrer, Helaine; França, Suzelei C.; Bacci, Maurício; Goldman, Maria Helena S.; Gomes, Suely L.; Nunes, Luiz R.; Camargo, Luis E.A.; Siqueira, Walter J.; Van Sluys, Marie-Anne; Thiemann, Otavio H.; Kuramae, Eiko E.; Santelli, Roberto V.; Marino, Celso L.; Targon, Maria L.P.N.; Ferro, Jesus A.; Silveira, Henrique C.S.; Marini, Danyelle C.; Lemos, Eliana G.M.; Monteiro-Vitorello, Claudia B.; Tambor, José H.M.; Carraro, Dirce M.; Roberto, Patrícia G.; Martins, Vanderlei G.; Goldman, Gustavo H.; de Oliveira, Regina C.; Truffi, Daniela; Colombo, Carlos A.; Rossi, Magdalena; de Araujo, Paula G.; Sculaccio, Susana A.; Angella, Aline; Lima, Marleide M.A.; de Rosa, Vicente E.; Siviero, Fábio; Coscrato, Virginia E.; Machado, Marcos A.; Grivet, Laurent; Di Mauro, Sonia M.Z.; Nobrega, Francisco G.; Menck, Carlos F.M.; Braga, Marilia D.V.; Telles, Guilherme P.; Cara, Frank A.A.; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo

2003-01-01

To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged. PMID:14613979
In-silico mining, type and frequency analysis of genic microsatellites of finger millet (Eleusine coracana (L.) Gaertn.): a comparative genomic analysis of NBS-LRR regions of finger millet with rice.

PubMed

Kalyana Babu, B; Pandey, Dinesh; Agrawal, P K; Sood, Salej; Kumar, Anil

2014-05-01

In recent years, the increased availability of DNA sequences has made it possible to develop and explore expressed sequence tag (EST)-derived SSR markers. In the present study, a total of 1956 ESTs of finger millet were used to determine microsatellite type, distribution and frequency, and a total of 545 primer pairs were developed from these ESTs. Thirty-two EST sequences had more than two microsatellites and 1357 sequences did not have any SSR repeats. The most frequent repeat type was the trimeric motif, followed by dimeric and then tetra-, hexa- and penta-repeat motifs. The most common dimer repeat motif was GA, and among trimeric SSRs it was CGG. The EST sequences of the NBS-LRR region of finger millet and rice showed higher synteny and were found at nearly the same positions on the rice chromosome map. Eight of 15 EST-based SSR primers were polymorphic among the selected resistant and susceptible finger millet genotypes. The primer FMBLEST5 was able to differentiate them into resistant and susceptible genotypes. The alleles specific to the resistant and susceptible genotypes were sequenced using the ABI 3130XL genetic analyzer; they showed similarity to NBS-LRR regions of rice and finger millet and contained the kinase-2 and kinase 3a motifs characteristic of plant R-genes belonging to the NBS-LRR region. The in silico and comparative analysis showed that the genes responsible for blast resistance can be identified, mapped and further introgressed through molecular breeding approaches to enhance blast resistance in finger millet.
Transcriptome analysis of the desert locust central nervous system: production and annotation of a Schistocerca gregaria EST database.

PubMed

Badisco, Liesbeth; Huybrechts, Jurgen; Simonet, Gert; Verlinden, Heleen; Marchal, Elisabeth; Huybrechts, Roger; Schoofs, Liliane; De Loof, Arnold; Vanden Broeck, Jozef

2011-03-21

The desert locust (Schistocerca gregaria) displays a fascinating type of phenotypic plasticity, designated as 'phase polyphenism'. Depending on environmental conditions, one genome can be translated into two highly divergent phenotypes, termed the solitarious and gregarious (swarming) phase. Although many of the underlying molecular events remain elusive, the central nervous system (CNS) is expected to play a crucial role in the phase transition process. Locusts have also proven to be interesting model organisms in a physiological and neurobiological research context. However, molecular studies in locusts are hampered by the fact that genome/transcriptome sequence information available for this branch of insects is still limited. We have generated 34,672 raw expressed sequence tags (EST) from the CNS of desert locusts in both phases. These ESTs were assembled in 12,709 unique transcript sequences and nearly 4,000 sequences were functionally annotated. Moreover, the obtained S. gregaria EST information is highly complementary to the existing orthopteran transcriptomic data. Since many novel transcripts encode neuronal signaling and signal transduction components, this paper includes an overview of these sequences. Furthermore, several transcripts being differentially represented in solitarious and gregarious locusts were retrieved from this EST database. The findings highlight the involvement of the CNS in the phase transition process and indicate that this novel annotated database may also add to the emerging knowledge of concomitant neuronal signaling and neuroplasticity events. In summary, we met the need for novel sequence data from desert locust CNS. To our knowledge, we hereby also present the first insect EST database that is derived from the complete CNS. The obtained S. gregaria EST data constitute an important new source of information that will be instrumental in further unraveling the molecular principles of phase polyphenism, in further establishing locusts as valuable research model organisms and in molecular evolutionary and comparative entomology.
Identification of single nucleotide polymorphism in ginger using expressed sequence tags

PubMed Central

Chandrasekar, Arumugam; Riju, Aikkal; Sithara, Kandiyl; Anoop, Sahadevan; Eapen, Santhosh J

2009-01-01

Ginger (Zingiber officinale Rosc) (Family: Zingiberaceae) is a herbaceous perennial, the rhizomes of which are used as a spice. Ginger is also well known for its medicinal applications. EST-derived SNPs are a free by-product of the currently expanding EST (Expressed Sequence Tag) databases. The development of high-throughput methods for the detection of SNPs (Single Nucleotide Polymorphisms) and small indels (insertions/deletions) has led to a revolution in their use as molecular markers. The 38,139 available ginger EST sequences were mined from dbEST at NCBI. The CAP3 program was used to assemble the EST sequences into contigs. Candidate SNPs and indel polymorphisms were detected using the Perl script AutoSNP version 1.0, which used 31,905 ESTs for detecting SNP and indel sites. We found 64,026 SNP sites and 7,034 indel polymorphisms, with a frequency of 0.84 SNPs per 100 bp. Among the three tissues from which the EST libraries had been generated, rhizomes had the highest frequency (1.08 SNPs/indels per 100 bp), leaves had the lowest (0.63 per 100 bp), and roots were intermediate (0.82 per 100 bp). The transition-to-transversion ratio was 0.90; that is, transversions outnumbered transitions among the detected SNPs. These detected SNPs can be used as markers for genetic studies. Availability The results of the present study are hosted on our web server www.spices.res.in/spicesnip PMID:20198184
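The two summary statistics quoted in this abstract, the transition/transversion ratio and the SNP frequency per 100 bp, follow from simple definitions. The sketch below is not AutoSNP and does not reproduce the study's data; the function names and the example SNP list are hypothetical.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_snp(ref: str, alt: str) -> str:
    """Transition = purine<->purine or pyrimidine<->pyrimidine; else transversion."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or not {ref, alt} <= PURINES | PYRIMIDINES:
        raise ValueError(f"not a valid SNP: {ref}>{alt}")
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def ts_tv_ratio(snps) -> float:
    """Ratio of transitions to transversions for a list of (ref, alt) pairs."""
    kinds = [classify_snp(ref, alt) for ref, alt in snps]
    return kinds.count("transition") / kinds.count("transversion")


def per_100bp(n_sites: int, total_bp: int) -> float:
    """Polymorphic sites per 100 bp of examined sequence."""
    return 100.0 * n_sites / total_bp


# Hypothetical SNP calls, not data from the study.
snps = [("A", "G"), ("C", "T"), ("A", "T"), ("G", "C")]
print(ts_tv_ratio(snps))
print(per_100bp(84, 10_000))
```

A ratio below 1, such as the 0.90 reported here, means that transversions are the more frequent class among the called SNPs.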
Bioinformatics and reanalysis of subtracted expressed sequence tags from the human ciliary body: Identification of novel biological functions.

PubMed

Escribano, Julio; Coca-Prados, Miguel

2002-08-28

The ciliary body is largely known for its major roles in the regulation of aqueous humor secretion, intraocular pressure, and accommodation of the lens. In this review article we applied bioinformatics to re-examine hundreds of expressed sequence tags (ESTs) previously isolated by subtractive hybridization from a human ciliary body library [1]. The DNA sequences of these clones have recently been added to the web site of NEIBank. DNA sequence comparisons of subtracted ESTs were performed against all entries in the last available release of the non-redundant database containing GenBank, EMBL, DDBJ and PDB sequences using the BlastN program accessed through NCBI's BLAST services on the internet (NCBI). Sequences were also compared and mapped using the Blast search program provided through the Internet by the Human Genome Project (UCSC). A total of 284 independent ESTs were classified into 17 functional groups. Analysis of their relationships allowed us to define the expression of five major groups of known genes: (i) protein synthesis, folding, secretion and degradation (20%); (ii) energy supply and biosynthesis (12%); (iii) contractility and cytoskeleton structure (6%); (iv) cellular signaling and cell cycle regulation (7%); and (v) nerve cell related tasks (2%), including neuropeptide processing and putative non-visual phototransduction and circadian rhythm control. The largest group contains unidentified sequences, a total of 105 sequences, accounting for 37% of ESTs. The unidentified sequences show similarity to genomic non-coding regions or to genes of unknown function. The most highly represented EST corresponds to myocilin, a gene involved in glaucoma. The data also confirm the secretory functions of the ciliary epithelium, its high metabolism, and the presence of a neuroendocrine peptidergic system presumably involved in the regulation of intraocular pressure and/or aqueous humor secretion. Additional genes may be related to a non-visual phototransduction cascade and/or to circadian rhythms. Overall, this initial group of subtracted ESTs can help uncover novel physiological functions of the ciliary body in health and disease, as well as novel candidate genes for ocular diseases.
E-microsatellite markers for Centella asiatica (Gotu Kola) genome: validation and cross-transferability in Apiaceae family for plant omics research and development.

PubMed

Sahu, Jagajjit; Das Talukdar, Anupam; Devi, Kamalakshi; Choudhury, Manabendra Dutta; Barooah, Madhumita; Modi, Mahendra Kumar; Sen, Priyabrata

2015-01-01

Centella asiatica (Gotu Kola) is a plant that grows in tropical swampy regions of the world and has important medicinal and culinary uses. It is often considered part of Ayurvedic medicine, traditional African medicine, and traditional Chinese medicine. The unavailability of genomic resources is significantly impeding its genetic improvement. To date, no attempt has been made to develop Expressed Sequence Tag (EST)-derived Simple Sequence Repeat (SSR) markers (eSSRs) from the Centella genome. Hence, the present study aimed to develop eSSRs, validate them experimentally, and test their cross-transferability in different genera of the Apiaceae family, to which Centella belongs. An in-house pipeline was developed for the entire analysis by combining bioinformatics tools and Perl scripts. A total of 4443 C. asiatica EST sequences from dbEST were processed, which generated 2617 nonredundant high-quality EST sequences consisting of 441 contigs and 2176 singletons. Out of 1776.5 kb of examined sequences, 417 (15.9%) ESTs containing 686 SSRs were detected, with a density of one SSR per 2.59 kb. The gene ontology study revealed 282 functional domains involved in various processes, components, and functions, out of which 64 ESTs were found to have both SSRs and functional domains. Out of 603 designed EST-SSR primers, 18 pairs of primers were selected for validation based on the optimum parameter value. Reproducible amplification was obtained for six primer pairs in C. asiatica, which were further tested for cross-transferability in nine other important genera/species of the Apiaceae family. Cross-transferability of the EST-SSR markers among the species was examined, and Centella javanica showed the highest transferability (83.3%). The study revealed six highly polymorphic EST-SSR primers with an average PIC value of 0.95. In conclusion, these EST-SSR markers hold great promise for the genomic analysis of Centella asiatica, for facilitating comparative map-based analyses across related species within the Apiaceae family, and for future marker-assisted breeding programs. To the best of our knowledge, this is the first report of the development of EST-SSRs in Centella asiatica by in silico approaches, which offers real potential for further use in plant omics research and development.
[A new variant of the simian T-lymphotropic retrovirus type I (STLV-IF) in the Sukhumi colony of hamadryas baboons].

PubMed

Chikobaeva, M G; Schatzl, H; Rose, D; Bush, U; Iakovleva, L A; Deinhardt, F; Helm, K; Lapin, B A

1993-01-01

A polymerase chain reaction (PCR) assay was developed for the detection of simian T-lymphotropic virus type 1 (STLV-1) infection in P. hamadryas and for direct sequencing, using oligonucleotide primer pairs specific for the tax and env regions of the related human T-lymphotropic virus type 1 (HTLV-1). Excellent specificity was shown in the detection of STLV-1 provirus in infected baboons by PCR using HTLV-1-derived primers. The nucleotide sequences of env 467bp and tax 159bp of the proviral genome (env position 5700-6137, tax position 7373-7498 HTLV-1, according to Seiki et al., 1983) derived from STLV-1-infected P. hamadryas were analysed using PCR and direct sequencing techniques. Two STLV-1 isolates from different sources (Sukhumi main-SuTLV-1 and forest stocks-STLV-1F) were compared. Two variants of STLV-1 among P. hamadryas with different levels of homology to HTLV-1 were found (83.8% and 95.2%, respectively). A possible role of nucleotide changes in the sequenced env and tax fragments in the oncogenicity of STLV-1 variants is discussed.
Analysis of alternative cleavage and polyadenylation by 3′ region extraction and deep sequencing

PubMed Central

Hoque, Mainul; Ji, Zhe; Zheng, Dinghai; Luo, Wenting; Li, Wencheng; You, Bei; Park, Ji Yeon; Yehia, Ghassan; Tian, Bin

2012-01-01

Alternative cleavage and polyadenylation (APA) leads to mRNA isoforms with different coding sequences (CDS) and/or 3′ untranslated regions (3′UTRs). Using 3′ Region Extraction And Deep Sequencing (3′READS), a method which addresses the internal priming and oligo(A) tail issues that commonly plague polyA site (pA) identification, we comprehensively mapped pAs in the mouse genome, thoroughly annotating 3′ ends of genes and revealing over five thousand pAs (~8% of total) flanked by A-rich sequences, which have hitherto been overlooked. About 79% of mRNA genes and 66% of long non-coding RNA (lncRNA) genes have APA; but these two gene types have distinct usage patterns for pAs in introns and upstream exons. Promoter-distal pAs become relatively more abundant during embryonic development and cell differentiation, a trend affecting pAs in both 3′-most exons and upstream regions. Upregulated isoforms generally have stronger pAs, suggesting global modulation of the 3′ end processing activity in development and differentiation. PMID:23241633
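The phrase "flanked by A-rich sequences" can be made concrete with a small check of the genomic bases just downstream of a candidate cleavage site. This is the kind of filter that oligo(dT)-primed methods often apply to guard against internal priming; it is not the 3′READS procedure. The window size, both thresholds, the function name `is_a_rich` and the example sequences are hypothetical.

```python
def is_a_rich(genome_seq: str, cleavage_pos: int,
              window: int = 10, min_a: int = 7, min_run: int = 6) -> bool:
    """Flag a candidate pA whose downstream genomic window is A-rich.

    `cleavage_pos` is the 0-based index of the last transcribed base.
    All thresholds are illustrative assumptions, not values from the paper.
    """
    downstream = genome_seq[cleavage_pos + 1: cleavage_pos + 1 + window].upper()

    # Longest uninterrupted run of A within the window.
    longest = run = 0
    for base in downstream:
        run = run + 1 if base == "A" else 0
        longest = max(longest, run)

    return downstream.count("A") >= min_a or longest >= min_run


# Made-up genomic fragments; the cleavage site is the 10th base (index 9).
a_rich_site = "GATTACAGTC" + "AAAAAAAGAA"
ordinary_site = "GATTACAGTC" + "GCTAGCTAGC"
print(is_a_rich(a_rich_site, 9))
print(is_a_rich(ordinary_site, 9))
```

A pipeline that simply discards every flagged site would lose genuine pAs in A-rich contexts, which is the class of sites the abstract reports as previously overlooked.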
Floral gene resources from basal angiosperms for comparative genomics research

PubMed Central

Albert, Victor A; Soltis, Douglas E; Carlson, John E; Farmerie, William G; Wall, P Kerr; Ilut, Daniel C; Solow, Teri M; Mueller, Lukas A; Landherr, Lena L; Hu, Yi; Buzgo, Matyas; Kim, Sangtae; Yoo, Mi-Jeong; Frohlich, Michael W; Perl-Treves, Rafael; Schlarbaum, Scott E; Bliss, Barbara J; Zhang, Xiaohong; Tanksley, Steven D; Oppenheimer, David G; Soltis, Pamela S; Ma, Hong; dePamphilis, Claude W; Leebens-Mack, James H

2005-01-01

Background The Floral Genome Project was initiated to bridge the genomic gap between the most broadly studied plant model systems. Arabidopsis and rice, although now completely sequenced and under intensive comparative genomic investigation, are separated by at least 125 million years of evolutionary time, and cannot in isolation provide a comprehensive perspective on structural and functional aspects of flowering plant genome dynamics. Here we discuss new genomic resources available to the scientific community, comprising cDNA libraries and Expressed Sequence Tag (EST) sequences for a suite of phylogenetically basal angiosperms specifically selected to bridge the evolutionary gaps between model plants and provide insights into gene content and genome structure in the earliest flowering plants. Results Random sequencing of cDNAs from representatives of phylogenetically important eudicot, non-grass monocot, and gymnosperm lineages has so far (as of 12/1/04) generated 70,514 ESTs and 48,170 assembled unigenes. Efficient sorting of EST sequences into putative gene families based on whole Arabidopsis/rice proteome comparison has permitted ready identification of cDNA clones for finished sequencing. Preliminarily, (i) proportions of functional categories among sequenced floral genes seem representative of the entire Arabidopsis transcriptome, (ii) many known floral gene homologues have been captured, and (iii) phylogenetic analyses of ESTs are providing new insights into the process of gene family evolution in relation to the origin and diversification of the angiosperms. Conclusion Initial comparisons illustrate the utility of the EST data sets toward discovery of the basic floral transcriptome. These first findings also afford the opportunity to address a number of conspicuous evolutionary genomic questions, including reproductive organ transcriptome overlap between angiosperms and gymnosperms, genome-wide duplication history, lineage-specific gene duplication and functional divergence, and analyses of adaptive molecular evolution. Since not all genes in the floral transcriptome will be associated with flowering, these EST resources will also be of interest to plant scientists working on other functions, such as photosynthesis, signal transduction, and metabolic pathways. PMID:15799777
A lower isoelectric point increases signal sequence-mediated secretion of recombinant proteins through a bacterial ABC transporter.

PubMed

Byun, Hyunjong; Park, Jiyeon; Kim, Sun Chang; Ahn, Jung Hoon

2017-12-01

Efficient protein production for industrial and academic purposes often involves engineering microorganisms to produce and secrete target proteins into the culture. Pseudomonas fluorescens has a TliDEF ATP-binding cassette transporter, a type I secretion system, which recognizes C-terminal LARD3 signal sequence of thermostable lipase TliA. Many proteins are secreted by TliDEF in vivo when recombined with LARD3, but there are still others that cannot be secreted by TliDEF even when LARD3 is attached. However, the factors that determine whether or not a recombinant protein can be secreted through TliDEF are still unknown. Here, we recombined LARD3 with several proteins and examined their secretion through TliDEF. We found that the proteins secreted via LARD3 are highly negatively charged with highly-acidic isoelectric points (pI) lower than 5.5. Attaching oligo-aspartate to lower the pI of negatively-charged recombinant proteins improved their secretion, and attaching oligo-arginine to negatively-charged proteins blocked their secretion by LARD3. In addition, negatively supercharged green fluorescent protein (GFP) showed improved secretion, whereas positively supercharged GFP did not secrete. These results disclosed that proteins' acidic pI and net negative charge are major factors that determine their secretion through TliDEF. Homology modeling for TliDEF revealed that TliD dimer forms evolutionarily-conserved positively-charged clusters in its pore and substrate entrance site, which also partially explains the pI dependence of the TliDEF-dependent secretions. In conclusion, lowering the isoelectric point improved LARD3-mediated protein secretion, both widening the range of protein targets for efficient production via secretion and signifying an important aspect of ABC transporter-mediated secretions. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
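The effect of an oligo-aspartate or oligo-arginine tag on the isoelectric point can be estimated from sequence alone with the Henderson-Hasselbalch equation. The sketch below is an illustration, not the authors' analysis: the pKa values are one approximate published-style set among several in use, and the example protein and the six-residue tags are made up.

```python
# Approximate side-chain and terminal pKa values (an assumed set; other
# sets differ by a few tenths of a pH unit and shift the computed pI).
PKA_POS = {"K": 10.8, "R": 12.5, "H": 6.5}
PKA_NEG = {"D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1}
PKA_NTERM, PKA_CTERM = 8.6, 3.6


def net_charge(seq: str, ph: float) -> float:
    """Net charge of a peptide at a given pH (Henderson-Hasselbalch)."""
    charge = 1.0 / (1.0 + 10 ** (ph - PKA_NTERM))    # N-terminal amine
    charge -= 1.0 / (1.0 + 10 ** (PKA_CTERM - ph))   # C-terminal carboxyl
    for aa in seq.upper():
        if aa in PKA_POS:
            charge += 1.0 / (1.0 + 10 ** (ph - PKA_POS[aa]))
        elif aa in PKA_NEG:
            charge -= 1.0 / (1.0 + 10 ** (PKA_NEG[aa] - ph))
    return charge


def isoelectric_point(seq: str, tol: float = 1e-4) -> float:
    """Bisect for the pH at which the net charge crosses zero."""
    lo, hi = 0.0, 14.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if net_charge(seq, mid) > 0:   # still positive: pI lies above mid
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


protein = "MKKLRSTAGHEDK"  # made-up sequence, for illustration only
print(round(isoelectric_point(protein), 2))
print(round(isoelectric_point(protein + "D" * 6), 2))  # oligo-aspartate tag
print(round(isoelectric_point(protein + "R" * 6), 2))  # oligo-arginine tag
```

Because the net charge decreases monotonically with pH, bisection is sufficient. Under these assumptions an acidic tag lowers the computed pI and a basic tag raises it, which matches the direction of the effect described in the abstract.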
Knockout of Cytidine Monophospho-N-Acetylneuraminic Acid (CMP-NeuAc) Hydroxylase From Porcine Endothelial Cells by a CRISPR System.

PubMed

Sakai, R; Esaki, Y; Hasuwa, H; Ikawa, M; Lo, P; Matsuura, R; Nakahata, K; Zenitani, M; Asada, M; Maeda, A; Eguchi, H; Okuyama, H; Miyagawa, S

2016-05-01

We attempted to knock out the expression of Hanganutziu-Deicher (H-D) antigens through the use of a CRISPR (clustered regularly interspaced short palindromic repeats)/Cas9 system for pig cytidine monophospho-N-acetylneuraminic acid hydroxylase (CMAH). Plasmids expressing hCas9 and sgRNA for pCMAH were prepared by ligating oligos into the BbsI site of pX330. The N-terminal and C-terminal EGFP coding regions overlapping 482 bp were PCR-amplified and placed under a ubiquitous CAG promoter. The approximately 400-bp genomic fragments containing the sgRNA target sequence of pCMAH were placed into the multi-cloning sites flanked by the EGFP fragments. The pCAG-EGxxFP-target was mixed with pX330 with/without the sgRNA sequences and then introduced into HEK293T cells. Four oligos and primers, gSO1, gSO3, gSO4, and gSO8, were nominated from 8 candidates. Among them, gSO1 showed the best efficiency. Pig endothelial cells (PECs) from an α-Gal knockout pig were then used to examine the changes in the expression of the H-D antigen after knockout of the CMAH gene by the pX330-gS01. Changes in the expression of the H-D antigen in the PECs with the CRISPR (gS01) were clear in comparison with those in the parental cells, on the basis of FACS analysis data. The expression of the H-D antigen can be knocked out by use of the CRISPR system for pCMAH, thus confirming that this system is a very convenient system for producing knockout pigs. Copyright © 2016 Elsevier Inc. All rights reserved.
A Raman spectroscopic analysis of the sequence-dependent structures of oligo-DNA duplexes: d(CGCG)₂, d(GCGC)₂, d(GGCC)₂, and d(CCGG)₂ in aqueous solution

NASA Astrophysics Data System (ADS)

Torigoe, Chikako; Nishimura, Yoshifumi; Tsuboi, Masamichi; Matsuzaki, Jun-ichi; Hotoda, Hitoshi; Sekine, Mitsuo; Hata, Tsujiaki

Raman spectra of four self-complementary tetradeoxyribonucleoside triphosphates containing only guanosine and cytidine residues have been examined in aqueous solutions of different ionic strengths and at different temperatures. Both in low salt (0.15 M NaCl) and in high salt (4 M NaCl) solutions (at -2°C) all of the four duplexes have different conformations, distinguishable by Raman spectroscopy from one another. Thus, the duplex conformation is sequence-dependent. On the basis of several rules proposed recently for structure–spectrum correlations, new information was provided on the local conformations of the duplexes of these oligo-DNAs. In the low-salt solution, d(CCGG)₂ is B-DNA like in its overall conformation, but in detail the backbone conformation of the CpC portion is considered to be different from that in the GpG portion. In either one of these two portions, the torsion angle (β) around the O5'–C5' bond must be somewhat higher than the usual values for B-DNA (150-170°), so that it causes an 815 cm⁻¹ Raman line instead of the usual B marker 830 cm⁻¹ line. This may be related to the peculiar circular dichroism spectrum of d(CCGG)₂. On going to the high-salt solution, about 5% of the d(CCGG)₂ molecules are converted into the A form. In the high-salt form (Z form) of d(CGCG)₂, the terminal guanosine was concluded to be in a C2' endo-syn conformation, whereas the internal one is in C3' endo-syn.
ESTimating plant phylogeny: lessons from partitioning

PubMed Central

de la Torre, Jose EB; Egan, Mary G; Katari, Manpreet S; Brenner, Eric D; Stevenson, Dennis W; Coruzzi, Gloria M; DeSalle, Rob

2006-01-01

Background While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID, generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. Results A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (i.e., chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. Conclusion Based on the amount of character support contributed by EST data, which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. 
In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products. PMID:16776834
The Completely Sequenced Plasmid pEST4011 Contains a Novel IncP1 Backbone and a Catabolic Transposon Harboring tfd Genes for 2,4-Dichlorophenoxyacetic Acid Degradation

PubMed Central

Vedler, Eve; Vahter, Merle; Heinaru, Ain

2004-01-01

The herbicide 2,4-dichlorophenoxyacetic acid (2,4-D)-degrading bacterium Achromobacter xylosoxidans subsp. denitrificans strain EST4002 contains plasmid pEST4011. This plasmid ensures its host a stable 2,4-D+ phenotype. We determined the complete 76,958-bp nucleotide sequence of pEST4011. This plasmid is a deletion and duplication derivative of pD2M4, the 95-kb highly unstable laboratory ancestor of pEST4011, and was self-generated during different laboratory manipulations performed to increase the stability of the 2,4-D+ phenotype of the original strain, strain D2M4(pD2M4). The 47,935-bp catabolic region of pEST4011 forms a transposon-like structure with identical copies of the hybrid insertion element IS1071::IS1471 at the two ends. The catabolic regions of pEST4011 and pJP4, the best-studied 2,4-D-degradative plasmid, both contain homologous, tfd-like genes for complete 2,4-D degradation, but they have little sequence similarity other than that. The backbone genes of pEST4011 are most similar to the corresponding genes of broad-host-range self-transmissible IncP1 plasmids. The backbones of the other three IncP1 catabolic plasmids that have been sequenced (the 2,4-D-degradative plasmid pJP4, the haloacetate-catabolic plasmid pUO1, and the atrazine-catabolic plasmid pADP-1) are nearly identical to the backbone of R751, the archetype plasmid of the IncP1 β subgroup. We show that despite the overall similarity in plasmid organization, the pEST4011 backbone is sufficiently different (51 to 86% amino acid sequence identity between individual backbone genes) from the backbones of members of the three IncP1 subgroups (the α, β, and γ subgroups) that it belongs to a new IncP1 subgroup, the δ subgroup. This conclusion was also supported by a phylogenetic analysis of the trfA2, korA, and traG gene products of different IncP1 plasmids. PMID:15489427
Expressed sequence tags (ESTs) from immune tissues of turbot (Scophthalmus maximus) challenged with pathogens

PubMed Central

Pardo, Belén G; Fernández, Carlos; Millán, Adrián; Bouza, Carmen; Vázquez-López, Araceli; Vera, Manuel; Alvarez-Dios, José A; Calaza, Manuel; Gómez-Tato, Antonio; Vázquez, María; Cabaleiro, Santiago; Magariños, Beatriz; Lemos, Manuel L; Leiro, José M; Martínez, Paulino

2008-01-01

Background The turbot (Scophthalmus maximus; Scophthalmidae; Pleuronectiformes) is a flatfish species of great relevance for marine aquaculture in Europe. In contrast to other cultured flatfish, very few genomic resources are available in this species. Aeromonas salmonicida and Philasterides dicentrarchi are two pathogens that affect turbot culture causing serious economic losses to the turbot industry. Little is known about the molecular mechanisms for disease resistance and host-pathogen interactions in this species. In this work, thousands of ESTs for functional genomic studies and potential markers linked to ESTs for mapping (microsatellites and single nucleotide polymorphisms (SNPs)) are provided. This information enabled us to obtain a preliminary view of regulated genes in response to these pathogens and it constitutes the basis for subsequent and more accurate microarray analysis. Results A total of 12584 cDNAs partially sequenced from three different cDNA libraries of turbot (Scophthalmus maximus) infected with Aeromonas salmonicida, Philasterides dicentrarchi and from healthy fish were analyzed. Three immune-relevant tissues (liver, spleen and head kidney) were sampled at several time points in the infection process for library construction. The sequences were processed into 9256 high-quality sequences, which constituted the source for the turbot EST database. Clustering and assembly of these sequences revealed 3482 different putative transcripts: 1073 contigs and 2409 singletons. BLAST searches with public databases detected significant similarity (e-value ≤ 1e-5) in 1766 (50.7%) sequences and 816 of them (23.4%) could be functionally annotated. Two hundred three of these genes (24.9%), encoding defence/immune-related proteins, were mostly identified for the first time in turbot. Some ESTs showed significant differences in the number of transcripts when comparing the three libraries, suggesting regulation in response to these pathogens. 
A total of 191 microsatellites, with 104 having sufficient flanking sequences for primer design, and 1158 putative SNPs were identified from these EST resources in turbot. Conclusion A collection of 9256 high-quality ESTs was generated representing 3482 unique turbot sequences. A large proportion of defence/immune-related genes were identified, many of them regulated in response to specific pathogens. Putative microsatellites and SNPs were identified. These genome resources constitute the basis to develop a microarray for functional genomics studies and marker validation for genetic linkage and QTL analysis in turbot. PMID:18817567
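As a quick consistency check, the counts and percentages reported in this abstract can be recomputed. The sketch below uses only figures stated above; the denominators (3482 putative transcripts for the BLAST and annotation percentages, 816 annotated sequences for the immune-related share) are inferred from the arithmetic rather than stated explicitly by the authors.

```python
# Figures taken from the abstract above.
contigs, singletons = 1073, 2409
putative_transcripts = contigs + singletons   # reported as 3482
blast_hits = 1766        # sequences with significant similarity
annotated = 816          # functionally annotated sequences
immune_related = 203     # defence/immune-related genes


def pct(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


print(putative_transcripts)                    # contigs + singletons
print(pct(blast_hits, putative_transcripts))   # share with BLAST similarity
print(pct(annotated, putative_transcripts))    # share functionally annotated
print(pct(immune_related, annotated))          # immune-related share of annotated
```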
Transcriptome characterization and polymorphism detection between subspecies of big sagebrush (Artemisia tridentata)

PubMed Central

2011-01-01

Background Big sagebrush (Artemisia tridentata) is one of the most widely distributed and ecologically important shrub species in western North America. This species serves as a critical habitat and food resource for many animals and invertebrates. Habitat loss due to a combination of disturbances followed by establishment of invasive plant species is a serious threat to big sagebrush ecosystem sustainability. Lack of genomic data has limited our understanding of the evolutionary history and ecological adaptation in this species. Here, we report on the sequencing of expressed sequence tags (ESTs) and detection of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers in subspecies of big sagebrush. Results cDNA of A. tridentata sspp. tridentata and vaseyana were normalized and sequenced using the 454 GS FLX Titanium pyrosequencing technology. Assembly of the reads resulted in 20,357 contig consensus sequences in ssp. tridentata and 20,250 contigs in ssp. vaseyana. A BLASTx search against the non-redundant (NR) protein database using 29,541 consensus sequences obtained from a combined assembly resulted in 21,436 sequences with significant blast alignments (≤ 1e-15). A total of 20,952 SNPs and 119 polymorphic SSRs were detected between the two subspecies. SNPs were validated through various methods including sequence capture. Validation of SNPs in different individuals uncovered a high level of nucleotide variation in EST sequences. EST sequences of a third, tetraploid subspecies (ssp. wyomingensis) obtained by Illumina sequencing were mapped to the consensus sequences of the combined 454 EST assembly. Approximately one-third of the SNPs between sspp. tridentata and vaseyana identified in the combined assembly were also polymorphic within the two geographically distant ssp. wyomingensis samples. Conclusion We have produced a large EST dataset for Artemisia tridentata, which contains a large sample of the big sagebrush leaf transcriptome. 
SNP mapping among the three subspecies suggests the origin of ssp. wyomingensis via mixed ancestry. A large number of SNP and SSR markers provide the foundation for future research to address questions in big sagebrush evolution, ecological genetics, and conservation using genomic approaches. PMID:21767398
Effect of Commercial Cyanobacteria Products on the Growth and Antagonistic Ability of Some Bioagents under Laboratory Conditions

PubMed Central

El-Mougy, Nehal S.; Abdel-Kader, Mokhtar M.

2013-01-01

Evaluation of the efficacy of blue-green algal compounds against the growth of either pathogenic or antagonistic microorganisms as well as their effect on the antagonistic ability of bioagents was studied under in vitro conditions. The present study was undertaken to explore the inhibitory effect of commercial algal compounds, Weed-Max and Oligo-Mix, against some soil-borne pathogens. In growth medium supplemented with these algal compounds, the linear growth of pathogenic fungi decreased by increasing tested concentrations of the two algal compounds. Complete reduction in pathogenic fungal growth was observed at 2% of both Weed-Max and Oligo-Mix. Gradual significant reduction in the pathogenic fungal growth was caused by the two bioagents and by increasing the concentrations of algal compounds Weed-Max and Oligo-Mix. The present work showed that commercial algal compounds, Weed-Max and Oligo-Mix, have potential for the suppression of soil-borne fungi and enhance the antagonistic ability of fungal, bacterial, and yeast bio-agents. PMID:24307948
Implications of the 2014 Androgen Excess and Polycystic Ovary Syndrome Society guidelines on polycystic ovarian morphology for polycystic ovary syndrome diagnosis.

PubMed

Christ, J P; Gunning, M N; Fauser, B C J M

2017-10-01

The Androgen Excess and Polycystic Ovary Syndrome Society (AEPCOS) has recommended an updated threshold for polycystic ovarian morphology (PCOM) of 25 follicles or more, 10 ml or more of ovarian volume, or both. We describe the effect of these guidelines on reproductive and metabolic characteristics in 404 women. These women were separated into four groups: group A: hyperandrogenism and oligo-amenorrhoea (n = 157); group B: hyperandrogenism or oligo-amenorrhoea and PCOM meeting AEPCOS 2014 criteria (n = 125); group C: hyperandrogenism or oligo-amenorrhoea and PCOM meeting Rotterdam 2003 but not AEPCOS 2014 criteria (n = 72); and group D: non-PCOS not meeting either criteria (n = 50). Groups B, C and D did not differ across any metabolic markers. The AEPCOS 2014 guidelines may have limited utility in distinguishing metabolic risk factors and result in the exclusion of a large group of oligo-anovulatory women. Copyright © 2017 Reproductive Healthcare Ltd. Published by Elsevier Ltd. All rights reserved.
Using variable rate models to identify genes under selection in sequence pairs: their validity and limitations for EST sequences.

PubMed

Church, Sheri A; Livingstone, Kevin; Lai, Zhao; Kozik, Alexander; Knapp, Steven J; Michelmore, Richard W; Rieseberg, Loren H

2007-02-01

Using likelihood-based variable selection models, we determined if positive selection was acting on 523 EST sequence pairs from two lineages of sunflower and lettuce. Variable rate models are generally not used for comparisons of sequence pairs due to the limited information and the inaccuracy of estimates of specific substitution rates. However, previous studies have shown that the likelihood ratio test (LRT) is reliable for detecting positive selection, even with low numbers of sequences. These analyses identified 56 genes that show a signature of selection, of which 75% were not identified by simpler models that average selection across codons. Subsequent mapping studies in sunflower show four of five of the positively selected genes identified by these methods mapped to domestication QTLs. We discuss the validity and limitations of using variable rate models for comparisons of sequence pairs, as well as the limitations of using ESTs for identification of positively selected genes.
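The likelihood ratio test mentioned above compares twice the log-likelihood difference between nested models against a chi-square distribution. The following is a minimal sketch, assuming the two models differ by two free parameters (so the chi-square survival function has the closed form exp(-x/2)); the abstract does not state the degrees of freedom used, and the log-likelihood values are hypothetical placeholders, not results from the study.

```python
import math


def lrt_statistic(lnl_null: float, lnl_alt: float) -> float:
    """Likelihood ratio statistic: 2 * (lnL_alternative - lnL_null)."""
    return 2.0 * (lnl_alt - lnl_null)


def chi2_sf_df2(stat: float) -> float:
    """Chi-square survival function for exactly 2 degrees of freedom.

    For df = 2 the upper-tail probability is exp(-x / 2); other degrees of
    freedom need a general chi-square routine (not shown here).
    """
    return math.exp(-stat / 2.0) if stat > 0 else 1.0


# Hypothetical log-likelihoods for one sequence pair (illustration only).
lnl_null, lnl_alt = -1234.5, -1229.0
stat = lrt_statistic(lnl_null, lnl_alt)
p_value = chi2_sf_df2(stat)
print(f"LRT = {stat:.2f}, p = {p_value:.4f}")
```

When many gene pairs are tested, as with the 523 pairs here, the per-gene p-values would normally also be corrected for multiple testing.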

RSAT: regulatory sequence analysis tools.

PubMed

Thomas-Chollier, Morgane; Sand, Olivier; Turatsinze, Jean-Valéry; Janky, Rekin's; Defrance, Matthieu; Vervisch, Eric; Brohée, Sylvain; van Helden, Jacques

2008-07-01

The regulatory sequence analysis tools (RSAT, http://rsat.ulb.ac.be/rsat/) is a software suite that integrates a wide collection of modular tools for the detection of cis-regulatory elements in genome sequences. The suite includes programs for sequence retrieval, pattern discovery, phylogenetic footprint detection, pattern matching, genome scanning and feature map drawing. Random controls can be performed with random gene selections or by generating random sequences according to a variety of background models (Bernoulli, Markov). Beyond the original word-based pattern-discovery tools (oligo-analysis and dyad-analysis), we recently added a battery of tools for matrix-based detection of cis-acting elements, with some original features (adaptive background models, Markov-chain estimation of P-values) that do not exist in other matrix-based scanning tools. The web server offers an intuitive interface, where each program can be accessed either separately or connected to the other tools. In addition, the tools are now available as web services, enabling their integration in programmatic workflows. Genomes are regularly updated from various genome repositories (NCBI and EnsEMBL) and 682 organisms are currently supported. Since 1998, the tools have been used by several hundreds of researchers from all over the world. Several predictions made with RSAT were validated experimentally and published.
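The idea behind word-based pattern discovery with a Bernoulli background can be sketched in a few lines. This is a simplified illustration, not RSAT's implementation: it counts overlapping oligonucleotides of length k and compares them with the count expected if residues were drawn independently at their observed frequencies. The function names and example sequences are hypothetical, and no significance statistic is computed.

```python
from collections import Counter
from math import prod


def count_oligos(seqs, k):
    """Count overlapping occurrences of every k-mer across all sequences."""
    counts = Counter()
    for seq in seqs:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def bernoulli_expectation(seqs, k):
    """Return a function giving the expected count of an oligo under a
    Bernoulli model (independent residues at their observed frequencies)."""
    residues = Counter("".join(seqs).upper())
    total = sum(residues.values())
    freq = {base: n / total for base, n in residues.items()}
    positions = sum(max(len(s) - k + 1, 0) for s in seqs)

    def expected(oligo):
        return positions * prod(freq.get(base, 0.0) for base in oligo.upper())

    return expected


# Hypothetical toy sequences (illustration only).
seqs = ["ACGTACGT", "TTACGTAA"]
observed = count_oligos(seqs, 4)
expected = bernoulli_expectation(seqs, 4)
for oligo, n in observed.most_common(3):
    print(oligo, n, round(expected(oligo), 4))
```

A Markov background would replace the product of single-residue frequencies with transition probabilities estimated from shorter words.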
Differential impacts of juvenile hormone, soldier head extract and alternate caste phenotypes on host and symbiont transcriptome composition in the gut of the termite Reticulitermes flavipes.

PubMed

Sen, Ruchira; Raychoudhury, Rhitoban; Cai, Yunpeng; Sun, Yijun; Lietze, Verena-Ulrike; Boucias, Drion G; Scharf, Michael E

2013-07-19

Termites are highly eusocial insects and show a division of labor whereby morphologically distinct individuals specialize in distinct tasks. In the lower termite Reticulitermes flavipes (Rhinotermitidae), non-reproducing individuals form the worker and soldier castes, which specialize in helping (e.g., brood care, cleaning, foraging) and defense behaviors, respectively. Workers are totipotent juveniles that can either undergo status quo molts or develop into soldiers or neotenic reproductives. This caste differentiation can be regulated by juvenile hormone (JH) and primer pheromones contained in soldier head extracts (SHE). Here we offered worker termites a cellulose diet treated with JH or SHE for 24-hr, or held them with live soldiers (LS) or live neotenic reproductives (LR). We then determined gene expression profiles of the host termite gut and protozoan symbionts concurrently using custom cDNA oligo-microarrays containing 10,990 individual ESTs. JH was the most influential treatment (501 total ESTs affected), followed by LS (24 ESTs), LR (12 ESTs) and SHE treatments (6 ESTs). The majority of JH up- and downregulated ESTs were of host and symbiont origin, respectively; in contrast, SHE, LR and LS treatments had more uniform impacts on host and symbiont gene expression. Repeat "follow-up" bioassays investigating combined JH + SHE impacts in relation to individual JH and SHE treatments on a subset of array-positive genes revealed (i) JH and SHE treatments had opposite impacts on gene expression and (ii) JH + SHE impacts on gene expression were generally intermediate between JH and SHE. Our results show that JH impacts hundreds of termite and symbiont genes within 24-hr, strongly suggesting a role for the termite gut in JH-dependent caste determination. Additionally, differential impacts of SHE and LS treatments were observed that are in strong agreement with previous studies that specifically investigated soldier caste regulation. 
However, it is likely that gene expression outside the gut may be of equal or greater importance than gut gene expression.
Differential impacts of juvenile hormone, soldier head extract and alternate caste phenotypes on host and symbiont transcriptome composition in the gut of the termite Reticulitermes flavipes

PubMed Central

2013-01-01

Background Termites are highly eusocial insects and show a division of labor whereby morphologically distinct individuals specialize in distinct tasks. In the lower termite Reticulitermes flavipes (Rhinotermitidae), non-reproducing individuals form the worker and soldier castes, which specialize in helping (e.g., brood care, cleaning, foraging) and defense behaviors, respectively. Workers are totipotent juveniles that can either undergo status quo molts or develop into soldiers or neotenic reproductives. This caste differentiation can be regulated by juvenile hormone (JH) and primer pheromones contained in soldier head extracts (SHE). Here we offered worker termites a cellulose diet treated with JH or SHE for 24-hr, or held them with live soldiers (LS) or live neotenic reproductives (LR). We then determined gene expression profiles of the host termite gut and protozoan symbionts concurrently using custom cDNA oligo-microarrays containing 10,990 individual ESTs. Results JH was the most influential treatment (501 total ESTs affected), followed by LS (24 ESTs), LR (12 ESTs) and SHE treatments (6 ESTs). The majority of JH up- and downregulated ESTs were of host and symbiont origin, respectively; in contrast, SHE, LR and LS treatments had more uniform impacts on host and symbiont gene expression. Repeat “follow-up” bioassays investigating combined JH + SHE impacts in relation to individual JH and SHE treatments on a subset of array-positive genes revealed (i) JH and SHE treatments had opposite impacts on gene expression and (ii) JH + SHE impacts on gene expression were generally intermediate between JH and SHE. Conclusions Our results show that JH impacts hundreds of termite and symbiont genes within 24-hr, strongly suggesting a role for the termite gut in JH-dependent caste determination. Additionally, differential impacts of SHE and LS treatments were observed that are in strong agreement with previous studies that specifically investigated soldier caste regulation. 
However, it is likely that gene expression outside the gut may be of equal or greater importance than gut gene expression. PMID:23870282
Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

PubMed Central

Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

2012-01-01

Humulus lupulus, commonly known as hops, is a member of the family Moraceae. Many ongoing projects are accumulating voluminous genomic and expressed sequence tag (EST) sequences in public databases. The genetically characterized domains in these databases are limited due to the non-availability of reliable molecular markers. A large body of EST sequence data is available for hops. In the present study, simple sequence repeat (SSR) markers extracted from EST data were used as molecular markers for genetic characterization. A total of 25,495 EST sequences were examined and assembled to get full-length sequences. Mononucleotide SSR motifs showed the maximum frequency (60.44% in contigs and 62.16% in singletons), whereas the minimum frequencies were observed for hexanucleotide SSRs in contigs (0.09%) and pentanucleotide SSRs in singletons (0.12%). The most abundant trinucleotide motifs code for glutamic acid (GAA), while AT/TA was the most frequent dinucleotide repeat. Flanking primer pairs were designed in silico for the SSR-containing sequences. Functional categorization of the SSR-containing sequences was done through gene ontology terms such as biological process, cellular component and molecular function. PMID:22368382
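Mining perfect SSRs from EST sequences, as described above, can be approximated with regular expressions. The sketch below is an assumption-laden illustration rather than the tool used in the study: the minimum repeat numbers per motif length are placeholder thresholds, the function name is hypothetical, and the test sequence is invented.

```python
import re

# Assumed thresholds: motif length -> minimum number of tandem repeats.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # A motif of unit_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit,
            # e.g. "AA" or "ATAT", so each SSR is reported at its true length.
            if any(motif == motif[:d] * (unit_len // d)
                   for d in range(1, unit_len) if unit_len % d == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Invented sequence containing an (AT)7 and a (GAA)5 repeat.
est = "GGC" + "AT" * 7 + "CCG" + "GAA" * 5 + "TTC"
for start, motif, repeats in find_ssrs(est):
    print(start, motif, repeats)
```

Frequencies per motif class, like the mononucleotide and hexanucleotide percentages reported in the abstract, follow from tallying the motif lengths over all contigs and singletons.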
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.).

PubMed

Bushakra, Jill M; Lewers, Kim S; Staton, Margaret E; Zhebentyayeva, Tetyana; Saski, Christopher A

2015-10-26

Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed sequence tags (ESTs) are a source of SSRs that can be used to develop markers to facilitate plant breeding and for more basic research across genera and higher plant orders. Leaf and meristem tissue from 'Heritage' red raspberry (Rubus idaeus) and 'Bristol' black raspberry (R. occidentalis) were utilized for RNA extraction. After conversion to cDNA and library construction, ESTs were sequenced, quality verified, assembled and scanned for SSRs. Primers flanking the SSRs were designed and a subset tested for amplification, polymorphism and transferability across species. ESTs containing SSRs were functionally annotated using the GenBank non-redundant (nr) database and further classified using the gene ontology database. To accelerate development of EST-SSRs in the genus Rubus (Rosaceae), 1149 and 2358 cDNA sequences were generated from red raspberry and black raspberry, respectively. The cDNA sequences were screened using rigorous filtering criteria which resulted in the identification of 121 and 257 SSR loci for red and black raspberry, respectively. Primers were designed from the surrounding sequences resulting in 131 and 288 primer pairs, respectively, as some sequences contained more than one SSR locus. Sequence analysis revealed that the SSR-containing genes span a diversity of functions and share more sequence identity with strawberry genes than with other Rosaceous species. This resource of Rubus-specific, gene-derived markers will facilitate the construction of linkage maps composed of transferable markers for studying and manipulating important traits in this economically important genus.
Characterization of the Kenaf (Hibiscus cannabinus) Global Transcriptome Using Illumina Paired-End Sequencing and Development of EST-SSR Markers

PubMed Central

Li, Hui; Li, Defang; Chen, Anguo; Tang, Huijuan; Li, Jianjun; Huang, Siqi

2016-01-01

Kenaf (Hibiscus cannabinus L.) is an economically important natural fiber crop grown worldwide. However, only 20 expressed sequence tags (ESTs) for kenaf are available in public databases. The aim of this study was to develop large-scale simple sequence repeat (SSR) markers to lay a solid foundation for the construction of genetic linkage maps and marker-assisted breeding in kenaf. We used Illumina paired-end sequencing technology to generate new EST-simple sequences and MISA software to mine SSR markers. We identified 71,318 unigenes with an average length of 1143 nt and annotated these unigenes using four different protein databases. Overall, 9324 complementary pairs were designated as EST-SSR markers, and their quality was validated using 100 randomly selected SSR markers. In total, 72 primer pairs reproducibly amplified target amplicons, and 61 of these primer pairs detected significant polymorphism among 28 kenaf accessions. Thus, in this study, we have developed large-scale SSR markers for kenaf, and this new resource will facilitate construction of genetic linkage maps, investigation of fiber growth and development in kenaf, and also be of value to novel gene discovery and functional genomic studies. PMID:26960153
SCARF: maximizing next-generation EST assemblies for evolutionary and population genomic analyses.

PubMed

Barker, Michael S; Dlugosch, Katrina M; Reddy, A Chaitanya C; Amyotte, Sarah N; Rieseberg, Loren H

2009-02-15

Scaffolded and Corrected Assembly of Roche 454 (SCARF) is a next-generation sequence assembly tool for evolutionary genomics that is designed especially for assembling 454 EST sequences against high-quality reference sequences from related species. The program was created to knit together 454 contigs that do not assemble during traditional de novo assembly, using a reference sequence library to orient the 454 sequences. SCARF is freely available at http://msbarker.com/software.htm, and is released under the open source GPLv3 license (http://www.opensource.org/licenses/gpl-3.0.html).
Comparative expression profiling in grape (Vitis vinifera) berries derived from frequency analysis of ESTs and MPSS signatures.

PubMed

Iandolino, Alberto; Nobuta, Kan; da Silva, Francisco Goes; Cook, Douglas R; Meyers, Blake C

2008-05-12

Vitis vinifera (V. vinifera) is the primary grape species cultivated for wine production, with an industry valued annually in the billions of dollars worldwide. In order to sustain and increase grape production, it is necessary to understand the genetic makeup of grape species. Here we performed mRNA profiling using Massively Parallel Signature Sequencing (MPSS) and combined it with available Expressed Sequence Tag (EST) data. These tag-based technologies, which do not require a priori knowledge of genomic sequence, are well-suited for transcriptional profiling. The sequence depth of MPSS allowed us to capture and quantify almost all the transcripts at a specific stage in the development of the grape berry. The number and relative abundance of transcripts from stage II grape berries was defined using Massively Parallel Signature Sequencing (MPSS). A total of 2,635,293 17-base and 2,259,286 20-base signatures were obtained, representing at least 30,737 and 26,878 distinct sequences. The average normalized abundance per signature was approximately 49 TPM (Transcripts Per Million). Comparisons of the MPSS signatures with available Vitis species' ESTs and a unigene set demonstrated that 6,430 distinct contigs and 2,190 singletons have a perfect match to at least one MPSS signature. Among the matched sequences, ESTs were identified from tissues other than berries or from berries at different developmental stages. Additional MPSS signatures not matching to known grape ESTs can extend our knowledge of the V. vinifera transcriptome, particularly when these data are used to assist in annotation of whole genome sequences from Vitis vinifera. The MPSS data presented here not only achieved a higher level of saturation than previous EST based analyses, but in doing so, expand the known set of transcripts of grape berries during the unique stage in development that immediately precedes the onset of ripening. 
The MPSS dataset also revealed evidence of antisense expression not previously reported in grapes but comparable to that reported in other plant species. Finally, we developed a novel web-based, public resource for utilization of the grape MPSS data [1].
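Two steps described in this abstract, normalizing signature abundances to transcripts per million (TPM) and finding ESTs with a perfect match to a signature, can be sketched as follows. This is an illustrative simplification with hypothetical function names and invented sequences and counts; it searches only the given strand and is not the authors' pipeline.

```python
def to_tpm(raw_counts):
    """Scale raw signature counts so that they sum to one million."""
    total = sum(raw_counts.values())
    return {sig: 1_000_000 * n / total for sig, n in raw_counts.items()}


def perfect_matches(est, signatures):
    """Return the signatures that occur exactly (same strand) in an EST."""
    est = est.upper()
    return {sig for sig in signatures if sig.upper() in est}


# Invented 17-base signatures with made-up raw counts (illustration only).
raw = {"GATCAAAAAAAAAAAAA": 30, "GATCCCCCCCCCCCCCC": 10}
tpm = to_tpm(raw)
print(tpm)

est = "TTGATCAAAAAAAAAAAAAGG"
print(perfect_matches(est, raw))
```

Signatures left without any EST match would correspond to the additional signatures the abstract describes as extending the known transcriptome.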
Composition and metabolism of fecal microbiota from normal and overweight children are differentially affected by melibiose, raffinose and raffinose-derived fructans.

PubMed

Adamberg, Kaarel; Adamberg, Signe; Ernits, Karin; Larionova, Anneli; Voor, Tiia; Jaagura, Madis; Visnapuu, Triinu; Alamäe, Tiina

2018-06-20

The aim of the study was to investigate the metabolism of non-digestible oligo- and polysaccharides by fecal microbiota, using isothermal microcalorimetry. The five tested substrates were raffinose, melibiose, a mixture of oligo- and polysaccharides produced from raffinose by levansucrase, levan synthesized from raffinose, and levan from timothy grass. Two inocula were comprised of pooled fecal samples from overweight or normal-weight children, from healthy adult volunteers and a pure culture of Bacteroides thetaiotaomicron as a reference bacterium for colon microbiota. The growth was analyzed based on the heat evolution curves, and the production of organic acids and gases. Taxonomic profiles of the microbiota were assessed by 16S rDNA sequencing. Raffinose and melibiose promoted the growth of bifidobacteria in all fecal pools. Several pool-specific substrate-related responses to raffinose and melibiose were revealed. Lactate-producing bacteria (Streptococcus and Enterococcus) became enriched in the pool of overweight children resulting in lactic acid as the major fermentation product on short saccharides. Acetic and butyric acids were prevalent at fermentation in the normal-weight pool coinciding with the enrichment of Catenibacterium. In the adult pool, the specific promotion of Bacteroides and Lachnospiraceae by levans was disclosed. In the fecal pool of normal-weight children, levans stimulated the growth of Senegalimassilia and Lachnoclostridium and this particular pool also showed the highest maximum heat production rate at levan fermentation. Levans and raffinose-derived oligosaccharides, but not raffinose and melibiose were completely fermented by a pure culture of Bacteroides thetaiotaomicron. 
The main conclusion from the study is that fecal microbiota of normal and overweight children have different compositions and they respond in specific manners to non-digestible oligo- and polysaccharides: raffinose, melibiose, raffinose-derived oligosaccharides and levans. The potential of the tested saccharides to support a healthy balance of colon microbiota requires further studies. Copyright © 2018. Published by Elsevier Ltd.
Combinatorial enzyme technology: Conversion of pectin to oligo species and its effect on microbial growth

USDA-ARS's Scientific Manuscript database

Plant cell wall polysaccharides, which consist of polymeric backbones with various types of substitution, were studied using the concept of combinatorial enzyme technology for conversion of agricultural fibers to functional products. Using citrus pectin as the starting substrate, an active oligo spe...
Maillard reaction products of rice protein hydrolysates with mono-, oligo- and polysaccharides

USDA-ARS's Scientific Manuscript database

Rice protein, a byproduct of rice syrup production, is abundant, but its lack of functionality prevents its wide use as a food ingredient. Maillard reaction products (MRPs) of hydrolysates from the limited hydrolysis of rice protein (LHRP) and various mono-, oligo- and polysaccharides were evaluat...
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries

PubMed Central

Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P

2008-01-01

Background: Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. Results: We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. Conclusion: EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects. PMID:18402700
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries.

PubMed

Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P

2008-04-10

Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.
Transcriptome Sequencing of Hevea brasiliensis for Development of Microsatellite Markers and Construction of a Genetic Linkage Map

PubMed Central

Triwitayakorn, Kanokporn; Chatkulkawin, Pornsupa; Kanjanawattanawong, Supanath; Sraphet, Supajit; Yoocha, Thippawan; Sangsrakru, Duangjai; Chanprasert, Juntima; Ngamphiw, Chumpol; Jomchai, Nukoon; Therawattanasuk, Kanikar; Tangphatsornruang, Sithichoke

2011-01-01

To obtain more information on the Hevea brasiliensis genome, we sequenced the transcriptome from the vegetative shoot apex yielding 2 311 497 reads. Clustering and assembly of the reads produced a total of 113 313 unique sequences, comprising 28 387 isotigs and 84 926 singletons. Also, 17 819 expressed sequence tag (EST)-simple sequence repeats (SSRs) were identified from the data set. To demonstrate the use of this EST resource for marker development, primers were designed for 430 of the EST-SSRs. Three hundred and twenty-three primer pairs were amplifiable in H. brasiliensis clones. Polymorphic information content values of 47 selected SSRs among 20 H. brasiliensis clones ranged from 0.13 to 0.71, with an average of 0.51. A dendrogram of genetic similarities between the 20 H. brasiliensis clones using these 47 EST-SSRs suggested two distinct groups that correlated well with clone pedigree. These novel EST-SSRs together with the published SSRs were used for the construction of an integrated parental linkage map of H. brasiliensis based on 81 lines of an F1 mapping population. The map consisted of 97 loci, comprising 37 novel EST-SSRs and 60 published SSRs, distributed on 23 linkage groups, and covered 842.9 cM with a mean interval of 11.9 cM and ∼4 loci per linkage group. Although the number of linkage groups exceeds the haploid number (18), several markers shared between homologous linkage groups and the previous map indicated that the F1 map in this study is appropriate for further study in marker-assisted selection. PMID:22086998
Utility of EST-derived SSR in cultivated peanut (Arachis hypogaea L.) and Arachis wild species

PubMed Central

Liang, Xuanqiang; Chen, Xiaoping; Hong, Yanbin; Liu, Haiyan; Zhou, Guiyuan; Li, Shaoxiong; Guo, Baozhu

2009-01-01

Background: Lack of sufficient molecular markers hinders current genetic research in peanuts (Arachis hypogaea L.). It is necessary to develop more molecular markers for potential use in peanut genetic research. With the development of peanut EST projects, a vast amount of available EST sequence data has been generated. These data offered an opportunity to identify SSR in ESTs by data mining. Results: In this study, we investigated 24,238 ESTs for the identification and development of SSR markers. In total, 881 SSRs were identified from 780 SSR-containing unique ESTs. On average, one SSR was found per 7.3 kb of EST sequence with tri-nucleotide motifs (63.9%) being the most abundant followed by di- (32.7%), tetra- (1.7%), hexa- (1.0%) and penta-nucleotide (0.7%) repeat types. The top six motifs included AG/TC (27.7%), AAG/TTC (17.4%), AAT/TTA (11.9%), ACC/TGG (7.72%), ACT/TGA (7.26%) and AT/TA (6.3%). Based on the 780 SSR-containing ESTs, a total of 290 primer pairs were successfully designed and used for validation of the amplification and assessment of the polymorphism among 22 genotypes of cultivated peanuts and 16 accessions of wild species. The results showed that 251 primer pairs yielded amplification products, of which 26 and 221 primer pairs exhibited polymorphism among the cultivated and wild species examined, respectively. Two to four alleles were found in cultivated peanuts, while 3–8 alleles were present in wild species. The apparent broad polymorphism was further confirmed by cloning and sequencing of amplified alleles. Sequence analysis of selected amplified alleles revealed that allelic diversity could be attributed mainly to differences in repeat type and length in the microsatellite regions. In addition, a few single base mutations were observed in the microsatellite flanking regions.
Conclusion: This study gives an insight into the frequency, type and distribution of peanut EST-SSRs and demonstrates successful development of EST-SSR markers in cultivated peanut. These EST-SSR markers could enrich the current resource of molecular markers for the peanut community and would be useful for qualitative and quantitative trait mapping, marker-assisted selection, and genetic diversity studies in cultivated peanut as well as related Arachis species. All of the 251 working primer pairs with names, motifs, repeat types, primer sequences, and alleles tested in cultivated and wild species are listed in Additional File 1. PMID:19309524
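The kind of in silico SSR mining described in this record can be illustrated with a short Python sketch. It is not the authors' pipeline: the function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions chosen for illustration, and only perfect (uninterrupted) repeats are reported.

```python
import re

# Minimum number of repeat units per motif length. These thresholds are
# illustrative assumptions, not the criteria used in the study.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each perfect SSR in seq."""
    seq = seq.upper()
    hits = []
    for size, min_rep in sorted(min_repeats.items()):
        # One motif of `size` bases followed by at least min_rep - 1 more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit,
            # e.g. "AGAG" is really the dinucleotide "AG".
            if any(
                motif == motif[:k] * (size // k)
                for k in range(1, size)
                if size % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // size))
    return hits


if __name__ == "__main__":
    example = "TTGC" + "AG" * 8 + "CCGT" + "AAG" * 5 + "GCTA"
    for start, motif, count in find_ssrs(example):
        print(start, motif, count)
```

Changing the thresholds changes how many SSRs are found, which is one reason SSR frequencies from different studies are hard to compare directly.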
Development and characterization of novel EST-SSR markers and their application for genetic diversity analysis of Jerusalem artichoke (Helianthus tuberosus L.).

PubMed

Mornkham, T; Wangsomnuk, P P; Mo, X C; Francisco, F O; Gao, L Z; Kurzweil, H

2016-10-24

Jerusalem artichoke (Helianthus tuberosus L.) is a perennial tuberous plant and a traditional inulin-rich crop in Thailand. It has become the most important source of inulin and has great potential for use in chemical and food industries. In this study, expressed sequence tag (EST)-based simple sequence repeat (SSR) markers were developed from 40,362 Jerusalem artichoke ESTs retrieved from the NCBI database. Among 23,691 non-redundant identified ESTs, 1949 SSR motifs harboring 2 to 6 nucleotides with varied repeat motifs were discovered from 1676 assembled sequences. Seventy-nine primer pairs were generated from EST sequences harboring SSR motifs. Our results show that 43 primers are polymorphic for the six studied populations, while the remaining 36 were either monomorphic or failed to amplify. These 43 SSR loci exhibited a high level of genetic diversity among populations, with allele numbers varying from 2 to 7, with an average of 3.95 alleles per locus. Heterozygosity ranged from 0.096 to 0.774, with an average of 0.536; polymorphic index content ranged from 0.096 to 0.854, with an average of 0.568. Principal component analysis and neighbor-joining analysis revealed that the six populations could be divided into six clusters. Our results indicate that these newly characterized EST-SSR markers may be useful in the exploration of genetic diversity and range expansion of the Jerusalem artichoke, and in cross-species application for the genus Helianthus.
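Per-locus diversity statistics of the kind reported in this record are usually computed from allele frequencies. The sketch below uses the commonly used formulas for expected heterozygosity, He = 1 - Σ p_i², and polymorphism information content, PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j². Whether the study used exactly these definitions is an assumption, and the function names are hypothetical.

```python
import math
from itertools import combinations


def _check_frequencies(freqs):
    """Raise ValueError unless the allele frequencies sum to 1."""
    if not math.isclose(sum(freqs), 1.0, abs_tol=1e-9):
        raise ValueError("allele frequencies must sum to 1")


def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2) for the allele frequencies at one locus."""
    _check_frequencies(freqs)
    return 1.0 - sum(p * p for p in freqs)


def polymorphism_information_content(freqs):
    """PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    _check_frequencies(freqs)
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term


if __name__ == "__main__":
    locus = [0.25, 0.25, 0.25, 0.25]  # hypothetical four-allele locus
    print(expected_heterozygosity(locus))
    print(polymorphism_information_content(locus))
```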
In-silico and in-vivo analyses of EST databases unveil conserved miRNAs from Carthamus tinctorius and Cynara cardunculus

PubMed Central

2012-01-01

Background: MicroRNAs (miRNAs) are small RNAs (21-24 bp) providing an RNA-based system of gene regulation highly conserved in plants and animals. In plants, miRNAs control mRNA degradation or restrain translation, affecting development and responses to stresses. Plant miRNAs show imperfect but extensive complementarity to mRNA targets, making their computational prediction possible, useful when data mining is applied on different species. In this study we used a comparative approach to identify both miRNAs and their targets, in artichoke and safflower. Results: Two complete expressed sequence tags (ESTs) datasets from artichoke (3.6 × 10^4 entries) and safflower (4.2 × 10^4), were analysed with a bioinformatic pipeline and in vitro experiments, identifying 17 potential miRNAs. For each EST, using RNAhybrid program and 953 non-redundant miRNA mature sequences, available in mirBase as reference, we searched matching putative targets. 8730 out of 42011 ESTs from safflower and 7145 of 36323 ESTs from artichoke showed at least one predicted miRNA target. BLAST analysis showed that 75% of all ESTs shared at least a common homologous region (E-value < 10^-4) and about 50% of these displayed 400 bp or longer aligned sequences as conserved homologous/orthologous (COS) regions. 960 and 890 ESTs of safflower and artichoke organized in COS shared 79 different miRNA targets, considered functionally conserved, and statistically significant when compared with random sequences (signal to noise ratio > 2 and specificity ≥ 0.85). Four highly significant miRNAs selected from in silico data were experimentally validated in globe artichoke leaves. Conclusions: Mature miRNAs and targets were predicted within EST sequences of safflower and artichoke. Most of the miRNA targets appeared highly/moderately conserved, highlighting an important and conserved function.
In this study we introduce a stringent parameter for the comparative sequence analysis, represented by the identification of the same target in the COS region. After statistical analysis 79 targets, found on the COS regions and belonging to 60 miRNA families, have a signal to noise ratio > 2, with ≥ 0.85 specificity. The putative miRNAs identified belong to 55 dicotyledon plants and to 24 families only in monocotyledon. PMID:22536958
Analysis of expressed sequence tags from Maize mosaic rhabdovirus-infected gut tissues of Peregrinus maidis reveals the presence of key components of insect innate immunity.

PubMed

Whitfield, A E; Rotenberg, D; Aritua, V; Hogenhout, S A

2011-04-01

The corn planthopper, Peregrinus maidis, causes direct feeding damage to plants and transmits Maize mosaic rhabdovirus (MMV) in a persistent-propagative manner. MMV must cross several insect tissue layers for successful transmission to occur, and the gut serves as an important barrier for rhabdovirus transmission. In order to facilitate the identification of proteins that may interact with MMV either by facilitating acquisition or responding to virus infection, we generated and analysed the gut transcriptome of P. maidis. From two normalized cDNA libraries, we generated a P. maidis gut transcriptome composed of 20,771 expressed sequence tags (ESTs). Assembly of the sequences yielded 1860 contigs and 14,032 singletons, and biological roles were assigned to 5793 (36%). Comparison of P. maidis ESTs with other insect amino acid sequences revealed that P. maidis shares greatest sequence similarity with another hemipteran, the brown planthopper Nilaparvata lugens. We identified 202 P. maidis transcripts with putative homology to proteins associated with insect innate immunity, including those implicated in the Toll, Imd, JAK/STAT, Jnk and the small-interfering RNA-mediated pathways. Sequence comparisons between our P. maidis gut EST collection and the currently available National Center for Biotechnology Information EST database collection for Ni. lugens revealed that a pathogen recognition receptor in the Imd pathway, peptidoglycan recognition protein-long class (PGRP-LC), is present in these two members of the family Delphacidae; however, these recognition receptors are lacking in the model hemipteran Acyrthosiphon pisum. In addition, we identified sequences in the P. maidis gut transcriptome that share significant amino acid sequence similarities with the rhabdovirus receptor molecule, acetylcholine receptor (AChR), found in other hosts. 
This EST analysis sheds new light on immune response pathways in hemipteran guts that will be useful for further dissecting innate defence response pathways to rhabdovirus infection. © 2011 The Authors. Insect Molecular Biology © 2011 The Royal Entomological Society.
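The 36% annotation rate quoted in this record is consistent with the other counts if the denominator is taken to be contigs plus singletons. That choice of denominator is an assumption, since the abstract does not state it. A minimal Python check, with a hypothetical helper name:

```python
def annotated_share(contigs, singletons, annotated):
    """Return (unique sequence count, annotated fraction of unique sequences)."""
    unique_sequences = contigs + singletons
    return unique_sequences, annotated / unique_sequences


if __name__ == "__main__":
    # Counts taken from the abstract above.
    total, share = annotated_share(contigs=1860, singletons=14032, annotated=5793)
    print(f"{total} unique sequences, {share:.1%} with an assigned biological role")
```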
Quantum-dot-based quantitative identification of pathogens in complex mixture

NASA Astrophysics Data System (ADS)

Lim, Sun Hee; Bestwater, Felix; Buchy, Philippe; Mardy, Sek; Yu, Alexey Dan Chin

2010-02-01

In the present study we describe sandwich design hybridization probes consisting of magnetic particles (MP) and quantum dots (QD) with target DNA, and their application in the detection of avian influenza virus (H5N1) sequences. Hybridization of 25-, 40-, and 100-mer target DNA with both probes was analyzed and quantified by flow cytometry and fluorescence microscopy on the scale of single particles. The following steps were used in the assay: (i) target selection by MP probes and (ii) target detection by QD probes. Hybridization efficiency between MP conjugated probes and target DNA hybrids was controlled by a fluorescent dye specific for nucleic acids. Fluorescence was detected by flow cytometry to distinguish differences in oligo sequences as short as 25-mer capturing in target DNA and by gel-electrophoresis in the case of QD probes. This report shows that effective manipulation and control of micro- and nanoparticles in hybridization assays is possible.
Annotated ESTs from various tissues of the brown planthopper Nilaparvata lugens: a genomic resource for studying agricultural pests.

PubMed

Noda, Hiroaki; Kawai, Sawako; Koizumi, Yoko; Matsui, Kageaki; Zhang, Qiang; Furukawa, Shigetoyo; Shimomura, Michihiko; Mita, Kazuei

2008-03-03

The brown planthopper (BPH), Nilaparvata lugens (Hemiptera, Delphacidae), is a serious insect pest of rice plants. Major means of BPH control are application of agricultural chemicals and cultivation of BPH resistant rice varieties. Nevertheless, BPH strains that are resistant to agricultural chemicals have developed, and BPH strains have appeared that are virulent against the resistant rice varieties. Expressed sequence tag (EST) analysis and related applications are useful to elucidate the mechanisms of resistance and virulence and to reveal physiological aspects of this non-model insect, with its poorly understood genetic background. More than 37,000 high-quality ESTs, excluding sequences of mitochondrial genome, microbial genomes, and rDNA, have been produced from 18 libraries of various BPH tissues and stages. About 10,200 clusters have been made from whole EST sequences, with average EST size of 627 bp. Among the top ten most abundantly expressed genes, three are unique and show no homology in BLAST searches. The actin gene was highly expressed in BPH, especially in the thorax. Tissue-specifically expressed genes were extracted based on the expression frequency among the libraries. An EST database is available at our web site. The EST library will provide useful information for transcriptional analyses, proteomic analyses, and gene functional analyses of BPH. Moreover, specific genes for hemimetabolous insects will be identified. The microarray fabricated based on the EST information will be useful for finding genes related to agricultural and biological problems related to this pest.

In silico mining and characterization of simple sequence repeats from gilthead sea bream (Sparus aurata) expressed sequence tags (EST-SSRs); PCR amplification, polymorphism evaluation and multiplexing and cross-species assays.

PubMed

Vogiatzi, Emmanouella; Lagnel, Jacques; Pakaki, Victoria; Louro, Bruno; Canario, Adelino V M; Reinhardt, Richard; Kotoulas, Georgios; Magoulas, Antonios; Tsigenopoulos, Costas S

2011-06-01

We screened for simple sequence repeats (SSRs) found in ESTs derived from an EST-database development project ('Marine Genomics Europe' Network of Excellence). Different motifs of di-, tri-, tetra-, penta- and hexanucleotide SSRs were evaluated for variation in length and position in the expressed sequences, relative abundance and distribution in gilthead sea bream (Sparus aurata). We found 899 ESTs that harbor 997 SSRs (4.94%). On average, one SSR was found per 2.95 kb of EST sequence and the dinucleotide SSRs are the most abundant accounting for 47.6% of the total number. EST-SSRs were used as template for primer design. 664 primer pairs could be successfully identified and a subset of 206 pairs of primers was synthesized, PCR-tested and visualized on ethidium bromide stained agarose gels. The main objective was to further assess the potential of EST-SSRs as informative markers and investigate their cross-species amplification in sixteen teleost fish species: seven sparid species and nine other species from different families. Approximately 78% of the primer pairs gave PCR products of expected size in gilthead sea bream, and as expected, the rate of successful amplification of sea bream EST-SSRs was higher in sparids, lower in other perciforms and even lower in species of the Clupeiform and Gadiform orders. We finally determined the polymorphism and the heterozygosity of 63 markers in a wild gilthead sea bream population; fifty-eight loci were found to be polymorphic with the expected heterozygosity and the number of alleles ranging from 0.089 to 0.946 and from 2 to 27, respectively. These tools and markers are expected to enhance the available genetic linkage map in gilthead sea bream, to assist comparative mapping and genome analyses for this species and further with other model fish species and finally to help advance genetic analysis for cultivated and wild populations and accelerate breeding programs. Copyright © 2011 Elsevier B.V. All rights reserved.
Construction and analysis of an SSH cDNA library of early heat-induced genes of Vigna aconitifolia variety RMO-40.

PubMed

Rampuria, Sakshi; Joshi, Uma; Palit, Paramita; Deokar, Amit A; Meghwal, Raju R; Mohapatra, T; Srinivasan, R; Bhatt, K V; Sharma, Ramavtar

2012-11-01

Moth bean (Vigna aconitifolia (Jacq.) Marechal) is an important grain legume crop grown in rain fed areas of hot desert regions of Thar, India, under scorching sun rays with very little supplementation of water. An SSH cDNA library was generated from leaf tissues of V. aconitifolia var. RMO-40 exposed to an elevated temperature of 42 °C for 5 min to identify early-induced genes. A total of 488 unigenes (114 contigs and 374 singletons) were derived by cluster assembly and sequence alignment of 738 ESTs; out of 206 ESTs (28%) of unknown proteins, 160 ESTs (14%) were found to be novel to moth bean. Only 578 ESTs (78%) showed significant BLASTX similarity (<1 × 10^-6) in the NCBI non-redundant database. Gene ontology functional classification terms were retrieved for 479 (65%) sequences, and 339 sequences were annotated with 165 EC codes and mapped to 68 different KEGG pathways. Four hundred and fifty-two ESTs were further annotated with InterProScan (IPS), and no IPS was assigned to 153 ESTs. In addition, the expression level of 27 ESTs in response to heat stress was evaluated through semiquantitative RT-PCR assay. Approximately 20 different signaling genes and 16 different transcription factors have been shown to be associated with heat stress in moth bean for the first time.
Analysis and functional annotation of expressed sequence tags from the fall armyworm Spodoptera frugiperda

PubMed Central

Deng, Youping; Dong, Yinghua; Thodima, Venkata; Clem, Rollie J; Passarelli, A Lorena

2006-01-01

Background: Little is known about the genome sequences of lepidopteran insects, although this group of insects has been studied extensively in the fields of endocrinology, development, immunity, and pathogen-host interactions. In addition, cell lines derived from Spodoptera frugiperda and other lepidopteran insects are routinely used for baculovirus foreign gene expression. This study reports the results of an expressed sequence tag (EST) sequencing project in cells from the lepidopteran insect S. frugiperda, the fall armyworm. Results: We have constructed an EST database using two cDNA libraries from the S. frugiperda-derived cell line, SF-21. The database consists of 2,367 ESTs which were assembled into 244 contigs and 951 singlets for a total of 1,195 unique sequences. Conclusion: S. frugiperda is an agriculturally important pest insect and genomic information will be instrumental for establishing initial transcriptional profiling and gene function studies, and for obtaining information about genes manipulated during infections by insect pathogens such as baculoviruses. PMID:17052344
Galaptin Mediates the Effect of Hypergravity on Vascular Smooth Muscle cell (SMC) Adhesion to Laminin Containing Matrices

NASA Technical Reports Server (NTRS)

Enahora, Fatisha T.; Bosah, Francis N.; Harris-Hooker, Sandra; Sanford, Gary L.

1997-01-01

Galaptin, an endogenous beta-galactoside specific lectin, has been reported to bind to laminin and subsequently decrease the binding of SMC. Cellular functions depend on cell:matrix interactions. Hypergravity (HGrav) affects a number of cellular functions, yet little is known about its effect on cell adhesion. We examined the possibility that galaptin mediates the effects of hypergravity on SMC adherence. Confluent primate aorta SMC cultures were subjected to HGrav (centrifuged at 6G) for 24 and 48 hr. Cells were non-enzymatically dispersed, pretreated with antisense (AS-oligo) or control sense (SS-oligo) oligonucleotides to galaptin mRNA (0.01 µg/ml), then seeded in uncoated or ECL-matrix coated plates. Adhesion of cells was monitored after 6 hr. HGrav increased adhesion by 100-300% compared to controls. AS-oligo decreased adhesion for both HGrav and control cells. SS-oligo did not affect adhesion for either HGrav or control cells. These studies show that HGrav affects cell adhesion and that galaptin expression is required for this effect.
Water-based oligochitosan and nanowhisker chitosan as potential food preservatives for shelf-life extension of minced pork.

PubMed

Chantarasataporn, Patomporn; Tepkasikul, Preenapha; Kingcha, Yutthana; Yoksan, Rangrong; Pichyangkura, Rath; Visessanguan, Wonnop; Chirachanchai, Suwabun

2014-09-15

Water-based chitosans in the forms of oligochitosan (OligoCS) and nanowhisker chitosan (CSWK) are proposed as novel food preservatives based on a minced pork model study. The high surface area with a positive charge over the neutral pH range (pH 5-8) of OligoCS and CSWK leads to inhibition of Gram-positive (Staphylococcus aureus, Listeria monocytogenes, and Bacillus cereus) and Gram-negative microbes (Salmonella enteritidis and Escherichia coli O157:H7). In the minced pork model, OligoCS performs effectively as a food preservative for shelf-life extension, as shown by the retardation of microbial growth, biogenic amine formation and lipid oxidation during storage. OligoCS largely protects myosin heavy chain protein from degradation, as observed by electrophoresis. The present work points out that water-based chitosan with its unique morphology not only significantly inhibits microbial activity but also maintains the meat quality with an extension of shelf-life, and thus has the potential to be used as a food preservative. Copyright © 2014 Elsevier Ltd. All rights reserved.
Difference of carboxybetaine and oligo(ethylene glycol) moieties in altering hydrophobic interactions: a molecular simulation study.

PubMed

Shao, Qing; White, Andrew D; Jiang, Shaoyi

2014-01-09

Polycarboxybetaine and poly(ethylene glycol) materials resist nonspecific protein adsorption but differ in influencing biological functions such as enzymatic activity. To investigate this difference, we studied the influence of carboxybetaine and oligo(ethylene glycol) moieties on hydrophobic interactions using molecular simulations. We employed a model system composed of two non-polar plates and studied the potential of mean force of plate-plate association in carboxybetaine, (ethylene glycol)4, and (ethylene glycol)2 solutions using well-tempered metadynamics simulations. Water, trimethylamine N-oxide, and urea solutions were used as reference systems. We analyzed the variation of the potential of mean force in various solutions to study how carboxybetaine and oligo(ethylene glycol) moieties influence the hydrophobic interactions. To study the origin of their influence, we analyzed the normalized distributions of moieties and water molecules using molecular dynamics simulations. The simulation results showed that oligo(ethylene glycol) moieties repel water molecules away from the non-polar plates and weaken the hydrophobic interactions. Carboxybetaine moieties do not repel water molecules away from the plates and therefore do not influence the hydrophobic interactions.
Adsorption of lignocelluloses of pre-hydrolysis liquor on calcium carbonate to induce functional filler.

PubMed

Fatehi, Pedram; Hamdan, Fadia C; Ni, Yonghao

2013-04-15

In this work, we aimed at adsorbing the oligo-sugars of prehydrolysis liquor (PHL) on precipitated calcium carbonate (PCC) to produce modified PCC. The results showed that the adsorptions of oligo-sugars, lignin and furfural were greater on porous PCC (PCC2) than on nano-sized PCC (PCC1) due to the larger surface area of PCC2. The adsorption reached its maximum in 5 h on PCC1, but it gradually increased on PCC2 due to the diffusion of oligo-sugars and lignin into the pores of PCC2. Also, the experimental isotherm and kinetic results were well fitted by the Langmuir and pseudo-second-order models, respectively. The adsorption was greater at a lower temperature (i.e. 40°C) and pH (i.e. 7). Alternatively, cationic polyacrylamide (CPAM) was added to the PHL/PCC system, which led to more promising results (than those of the PHL/PCC system alone) with the maximum lignocelluloses adsorption of 0.36 g/g on PCC2, among which 0.22 g/g was oligo-sugars. Copyright © 2013 Elsevier Ltd. All rights reserved.
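The two models named in this record have standard closed forms: the Langmuir isotherm, q_e = q_max K C_e / (1 + K C_e), and pseudo-second-order kinetics, q_t = k2 q_e² t / (1 + k2 q_e t). The Python sketch below evaluates both and recovers the kinetic parameters from the usual linearization t/q_t = 1/(k2 q_e²) + t/q_e. The function names, the rate constant and the synthetic data are assumptions for illustration; only the 0.36 g/g capacity comes from the abstract.

```python
def langmuir(c_eq, q_max, k_l):
    """Equilibrium uptake q_e for equilibrium concentration c_eq."""
    return q_max * k_l * c_eq / (1.0 + k_l * c_eq)


def pseudo_second_order(t, q_e, k2):
    """Uptake q_t at time t under pseudo-second-order kinetics."""
    return k2 * q_e ** 2 * t / (1.0 + k2 * q_e * t)


def linear_fit(xs, ys):
    """Ordinary least-squares slope and intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def fit_pseudo_second_order(times, uptakes):
    """Estimate (q_e, k2) from the linearized form t/q_t = 1/(k2*q_e^2) + t/q_e."""
    slope, intercept = linear_fit(times, [t / q for t, q in zip(times, uptakes)])
    q_e = 1.0 / slope
    k2 = slope ** 2 / intercept  # because intercept = 1 / (k2 * q_e^2)
    return q_e, k2


if __name__ == "__main__":
    # Synthetic data: q_e = 0.36 g/g (from the abstract), k2 = 2.0 (hypothetical).
    times = [1.0, 2.0, 3.0, 4.0, 5.0]
    uptakes = [pseudo_second_order(t, 0.36, 2.0) for t in times]
    print(fit_pseudo_second_order(times, uptakes))
```

With real measurements the linearized fit weights late time points heavily, so a nonlinear fit is often preferred; the linear form is used here only because it needs no external libraries.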
ContEst16S: an algorithm that identifies contaminated prokaryotic genomes using 16S RNA gene sequences.

PubMed

Lee, Imchang; Chalita, Mauricio; Ha, Sung-Min; Na, Seong-In; Yoon, Seok-Hwan; Chun, Jongsik

2017-06-01

Thanks to the recent advancement of DNA sequencing technology, the cost and time of prokaryotic genome sequencing have been dramatically decreased. It has repeatedly been reported that genome sequencing using high-throughput next-generation sequencing is prone to contaminations due to its high depth of sequencing coverage. Although a few bioinformatics tools are available to detect potential contaminations, these have inherent limitations as they only use protein-coding genes. Here we introduce a new algorithm, called ContEst16S, to detect potential contaminations using 16S rRNA genes from genome assemblies. We screened 69 745 prokaryotic genomes from the NCBI Assembly Database using ContEst16S and found that 594 were contaminated by bacteria, human and plants. Of the predicted contaminated genomes, 8% were not predicted by the existing protein-coding gene-based tool, implying that both methods can be complementary in the detection of contaminations. A web-based service of the algorithm is available at www.ezbiocloud.net/tools/contest16s.
Development and Use of Integrated Microarray-Based Genomic Technologies for Assessing Microbial Community Composition and Dynamics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhou, J.; Wu, L.; Gentry, T.

2006-04-05

To effectively monitor microbial populations involved in various important processes, a 50-mer-based oligonucleotide microarray was developed based on known genes and pathways involved in: biodegradation, metal resistance and reduction, denitrification, nitrification, nitrogen fixation, methane oxidation, methanogenesis, carbon polymer decomposition, and sulfate reduction. This array contains approximately 2000 unique and group-specific probes with <85% similarity to their non-target sequences. Based on artificial probes, our results showed that at hybridization conditions of 50 °C and 50% formamide, the 50-mer microarray hybridization can differentiate sequences having <88% similarity. Specificity tests with representative pure cultures indicated that the designed probes on the arrays appeared to be specific to their corresponding target genes. Detection limits were about 5-10 ng genomic DNA in the absence of background DNA, and 50-100 ng (~1.3 × 10^7 cells) in the presence of background DNA. Strong linear relationships between signal intensity and target DNA and RNA concentration were observed (r^2 = 0.95-0.99). Application of this microarray to naphthalene-amended enrichments and soil microcosms demonstrated that composition of the microflora varied depending on incubation conditions. While the naphthalene-degrading genes from Rhodococcus-type microorganisms were dominant in enrichments, the genes involved in naphthalene degradation from Gram-negative microorganisms such as Ralstonia, Comamonas, and Burkholderia were most abundant in the soil microcosms (as well as those for polyaromatic hydrocarbon and nitrotoluene degradation). Although naphthalene degradation is widely known and studied in Pseudomonas, Pseudomonas genes were not detected in either system. Real-time PCR analysis of 4 representative genes was consistent with microarray-based quantification (r^2 = 0.95).
Currently, we are also applying this microarray to the study of several different microbial communities and processes at the NABIR-FRC in Oak Ridge, TN. One project involves the monitoring of the development and dynamics of the microbial community of a fluidized bed reactor (FBR) used for reducing nitrate and the other project monitors microbial community responses to stimulation of uranium reducing populations via ethanol donor additions in situ and in a model system. Additionally, we are developing novel strategies for increasing microarray hybridization sensitivity. Finally, great improvements to our methods of probe design were made by the development of a new computer program, CommOligo. CommOligo designs unique and group-specific oligo probes for whole-genomes, metagenomes, and groups of environmental sequences and uses a new global alignment algorithm to design single or multiple probes for each gene or group. We are now using this program to design a more comprehensive functional gene array for environmental studies. Overall, our results indicate that the 50-mer-based microarray technology has potential as a specific and quantitative tool to reveal the composition of microbial communities and their dynamics important to processes within contaminated environments.
A resource of large-scale molecular markers for monitoring Agropyron cristatum chromatin introgression in wheat background based on transcriptome sequences.

PubMed

Zhang, Jinpeng; Liu, Weihua; Lu, Yuqing; Liu, Qunxing; Yang, Xinming; Li, Xiuquan; Li, Lihui

2017-09-20

Agropyron cristatum is a wild grass of the tribe Triticeae and serves as a gene donor for wheat improvement. However, very few markers can be used to monitor A. cristatum chromatin introgressions in wheat. Here, we reported a resource of large-scale molecular markers for tracking alien introgressions in wheat based on transcriptome sequences. By aligning A. cristatum unigenes with the Chinese Spring reference genome sequences, we designed 9602 A. cristatum expressed sequence tag-sequence-tagged site (EST-STS) markers for PCR amplification and experimental screening. As a result, 6063 polymorphic EST-STS markers were specific for the A. cristatum P genome in the single-receipt wheat background. A total of 4956 randomly selected polymorphic EST-STS markers were further tested in eight wheat variety backgrounds, and 3070 markers displaying stable and polymorphic amplification were validated. These markers covered more than 98% of the A. cristatum genome, and the marker distribution density was approximately 1.28 cM. An application case of all EST-STS markers was validated on the A. cristatum 6 P chromosome. These markers were successfully applied in the tracking of alien A. cristatum chromatin. Altogether, this study provided a universal method of large-scale molecular marker development to monitor wild relative chromatin in wheat.
[EST-SSR identification, markers development of Ligusticum chuanxiong based on Ligusticum chuanxiong transcriptome sequences].

PubMed

Yuan, Can; Peng, Fang; Yang, Ze-Mao; Zhong, Wen-Juan; Mou, Fang-Sheng; Gong, Yi-Yun; Ji, Pei-Cheng; Pu, De-Qiang; Huang, Hai-Yan; Yang, Xiao; Zhang, Chao

2017-09-01

Ligusticum chuanxiong is a well-known traditional Chinese medicinal plant. The study of its molecular marker development and germplasm resources is very important. In this study, we obtained 24,422 unigenes by assembling transcriptome sequencing reads of L. chuanxiong root. EST-SSRs were detected and 4,073 SSR loci were identified. EST-SSR distribution and characteristic analysis showed that mono-nucleotide repeats were the main repeat type, accounting for 41.0%. In addition, the sequences containing SSRs were functionally annotated with Gene Ontology (GO) and KEGG pathways and were assigned to 49 GO categories and 242 KEGG pathways; among them, 2,201 sequences were annotated against the Nr database. Of 235 EST-SSRs validated, 74 primer pairs ultimately proved to give high-quality amplification. Subsequently, genetic diversity analysis, UPGMA cluster analysis, PCoA analysis and population structure analysis of 34 L. chuanxiong germplasm resources were carried out with the 74 primer pairs. In both the UPGMA tree and the PCoA results, L. chuanxiong resources were clustered into two groups, which are believed to be partially related to their geographical distribution. In this study, EST-SSRs in L. chuanxiong were identified for the first time, and the newly developed molecular markers would contribute significantly to further genetic diversity study, purity detection, gene mapping, and molecular breeding. Copyright© by the Chinese Pharmaceutical Association.
Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome

PubMed Central

Kim, Gunjune

2017-01-01

Contact with poison ivy plants is widely dreaded because they produce a natural product called urushiol that is responsible for allergenic contact delayed-dermatitis symptoms lasting for weeks. For this reason, the catchphrase most associated with poison ivy is “leaves of three, let it be”, which serves the purpose of both identification and an appeal for avoidance. Ironically, despite this notoriety, there is a dearth of specific knowledge about nearly all other aspects of poison ivy physiology and ecology. As a means of gaining a more molecular-oriented understanding of poison ivy physiology and ecology, Next Generation DNA sequencing technology was used to develop poison ivy root and leaf RNA-seq transcriptome resources. De novo assembled transcriptomes were analyzed to generate a core set of high quality expressed transcripts present in poison ivy tissue. The predicted protein sequences were evaluated for similarity to SwissProt homologs and InterProScan domains, as well as assigned both GO terms and KEGG annotations. Over 23,000 simple sequence repeats were identified in the transcriptome, and corresponding oligo nucleotide primer pairs were designed. A pan-transcriptome analysis of existing Anacardiaceae transcriptomes revealed conserved and unique transcripts among these species. PMID:29125533
Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome.

PubMed

Weisberg, Alexandra J; Kim, Gunjune; Westwood, James H; Jelesko, John G

2017-11-10

Contact with poison ivy plants is widely dreaded because they produce a natural product called urushiol that is responsible for allergenic contact delayed-dermatitis symptoms lasting for weeks. For this reason, the catchphrase most associated with poison ivy is "leaves of three, let it be", which serves the purpose of both identification and an appeal for avoidance. Ironically, despite this notoriety, there is a dearth of specific knowledge about nearly all other aspects of poison ivy physiology and ecology. As a means of gaining a more molecular-oriented understanding of poison ivy physiology and ecology, Next Generation DNA sequencing technology was used to develop poison ivy root and leaf RNA-seq transcriptome resources. De novo assembled transcriptomes were analyzed to generate a core set of high quality expressed transcripts present in poison ivy tissue. The predicted protein sequences were evaluated for similarity to SwissProt homologs and InterProScan domains, as well as assigned both GO terms and KEGG annotations. Over 23,000 simple sequence repeats were identified in the transcriptome, and corresponding oligo nucleotide primer pairs were designed. A pan-transcriptome analysis of existing Anacardiaceae transcriptomes revealed conserved and unique transcripts among these species.
Identification of expressed sequences in the coffee genome potentially associated with somatic embryogenesis.

PubMed

Silva, A T; Paiva, L V; Andrade, A C; Barduche, D

2013-05-21

Brazil possesses the most modern and productive coffee growing farms in the world, but technological development is desired to cope with the increasing world demand. One way to increase Brazilian coffee growing productivity is wide scale production of clones with superior genotypes, which can be obtained with in vitro propagation technique, or from tissue culture. These procedures can generate thousands of clones. However, the methodologies for in vitro cultivation are genotype-dependent, which leads to an almost empirical development of specific protocols for each species. Therefore, molecular markers linked to the biochemical events of somatic embryogenesis would greatly facilitate the development of such protocols. In this context, sequences potentially involved in embryogenesis processes in the coffee plant were identified in silico from libraries generated by the Brazilian Coffee Genome Project. Through these in silico analyses, we identified 15 EST-contigs related to the embryogenesis process. Among these, 5 EST-contigs (3605, 9850, 13686, 17240, and 17265) could readily be associated with plant embryogenesis. Sequence analysis of EST-contig 3605, 9850, and 17265 revealed similarity to a polygalacturonase, to a cysteine-proteinase, and to an allergenine, respectively. Results also show that EST-contig 17265 sequences presented similarity to an expansin. Finally, analysis of EST-contig 17240 revealed similarity to a protein of unknown function, but it grouped in the similarity dendrogram with the WUSCHEL transcription factor. The data suggest that these EST-contigs are related to the embryogenic process and have potential as molecular markers to increase methodological efficiency in obtaining coffee plant embryogenic materials.
Construction of a full-length cDNA Library from Chinese oak silkworm pupa and identification of a KK-42-binding protein gene in relation to pupa-diapause termination.

PubMed

Li, Yu-Ping; Xia, Run-Xi; Wang, Huan; Li, Xi-Sheng; Liu, Yan-Qun; Wei, Zhao-Jun; Lu, Cheng; Xiang, Zhong-Huai

2009-06-24

In this study we successfully constructed a full-length cDNA library from Chinese oak silkworm, Antheraea pernyi, the most well-known wild silkworm used for silk production and insect food. Total RNA was extracted from a single fresh female pupa at the diapause stage. The titer of the library was 5 × 10^5 cfu/ml and the proportion of recombinant clones was approximately 95%. Expressed sequence tag (EST) analysis was used to characterize the library. A total of 175 clustered ESTs consisting of 24 contigs and 151 singlets were generated from 250 effective sequences. Of the 175 unigenes, 97 (55.4%) were known genes but only five from A. pernyi, 37 (21.2%) were known ESTs without function annotation, and 41 (23.4%) were novel ESTs. By EST sequencing, a gene coding KK-42-binding protein in A. pernyi (named as ApKK42-BP; GenBank accession no. FJ744151) was identified and characterized. Protein sequence analysis showed that ApKK42-BP was not a membrane protein but an extracellular protein with a signal peptide at position 1-18, and contained two putative conserved domains, abhydro_lipase and abhydrolase_1, suggesting it may be a member of lipase superfamily. Expression analysis based on number of ESTs showed that ApKK42-BP was an abundant gene in the period of diapause stage, suggesting it may also be involved in pupa-diapause termination.
Construction of a full-length cDNA Library from Chinese oak silkworm pupa and identification of a KK-42-binding protein gene in relation to pupa-diapause termination

PubMed Central

Li, Yu-Ping; Xia, Run-Xi; Wang, Huan; Li, Xi-Sheng; Liu, Yan-Qun; Wei, Zhao-Jun; Lu, Cheng; Xiang, Zhong-Huai

2009-01-01

In this study we successfully constructed a full-length cDNA library from Chinese oak silkworm, Antheraea pernyi, the most well-known wild silkworm used for silk production and insect food. Total RNA was extracted from a single fresh female pupa at the diapause stage. The titer of the library was 5 × 10^5 cfu/ml and the proportion of recombinant clones was approximately 95%. Expressed sequence tag (EST) analysis was used to characterize the library. A total of 175 clustered ESTs consisting of 24 contigs and 151 singlets were generated from 250 effective sequences. Of the 175 unigenes, 97 (55.4%) were known genes but only five from A. pernyi, 37 (21.2%) were known ESTs without function annotation, and 41 (23.4%) were novel ESTs. By EST sequencing, a gene coding KK-42-binding protein in A. pernyi (named as ApKK42-BP; GenBank accession no. FJ744151) was identified and characterized. Protein sequence analysis showed that ApKK42-BP was not a membrane protein but an extracellular protein with a signal peptide at position 1-18, and contained two putative conserved domains, abhydro_lipase and abhydrolase_1, suggesting it may be a member of lipase superfamily. Expression analysis based on number of ESTs showed that ApKK42-BP was an abundant gene in the period of diapause stage, suggesting it may also be involved in pupa-diapause termination. PMID:19564928
Analysis of expressed sequence tags from Actinidia: applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening

PubMed Central

Crowhurst, Ross N; Gleave, Andrew P; MacRae, Elspeth A; Ampomah-Dwamena, Charles; Atkinson, Ross G; Beuning, Lesley L; Bulley, Sean M; Chagne, David; Marsh, Ken B; Matich, Adam J; Montefiori, Mirco; Newcomb, Richard D; Schaffer, Robert J; Usadel, Björn; Allan, Andrew C; Boldingh, Helen L; Bowen, Judith H; Davy, Marcus W; Eckloff, Rheinhart; Ferguson, A Ross; Fraser, Lena G; Gera, Emma; Hellens, Roger P; Janssen, Bart J; Klages, Karin; Lo, Kim R; MacDiarmid, Robin M; Nain, Bhawana; McNeilage, Mark A; Rassam, Maysoon; Richardson, Annette C; Rikkerink, Erik HA; Ross, Gavin S; Schröder, Roswitha; Snowden, Kimberley C; Souleyre, Edwige JF; Templeton, Matt D; Walton, Eric F; Wang, Daisy; Wang, Mindy Y; Wang, Yanming Y; Wood, Marion; Wu, Rongmei; Yauk, Yar-Khing; Laing, William A

2008-01-01

Background Kiwifruit (Actinidia spp.) are a relatively new, but economically important crop grown in many different parts of the world. Commercial success is driven by the development of new cultivars with novel consumer traits including flavor, appearance, healthful components and convenience. To increase our understanding of the genetic diversity and gene-based control of these key traits in Actinidia, we have produced a collection of 132,577 expressed sequence tags (ESTs). Results The ESTs were derived mainly from four Actinidia species (A. chinensis, A. deliciosa, A. arguta and A. eriantha) and fell into 41,858 non redundant clusters (18,070 tentative consensus sequences and 23,788 EST singletons). Analysis of flavor and fragrance-related gene families (acyltransferases and carboxylesterases) and pathways (terpenoid biosynthesis) is presented in comparison with a chemical analysis of the compounds present in Actinidia including esters, acids, alcohols and terpenes. ESTs are identified for most genes in color pathways controlling chlorophyll degradation and carotenoid biosynthesis. In the health area, data are presented on the ESTs involved in ascorbic acid and quinic acid biosynthesis showing not only that genes for many of the steps in these pathways are represented in the database, but that genes encoding some critical steps are absent. In the convenience area, genes related to different stages of fruit softening are identified. Conclusion This large EST resource will allow researchers to undertake the tremendous challenge of understanding the molecular basis of genetic diversity in the Actinidia genus as well as provide an EST resource for comparative fruit genomics. The various bioinformatics analyses we have undertaken demonstrates the extent of coverage of ESTs for genes encoding different biochemical pathways in Actinidia. PMID:18655731
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

PubMed Central

2010-01-01

Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882
Identification and Evaluation of Single-Nucleotide Polymorphisms in Allotetraploid Peanut (Arachis hypogaea L.) Based on Amplicon Sequencing Combined with High Resolution Melting (HRM) Analysis.

PubMed

Hong, Yanbin; Pandey, Manish K; Liu, Ying; Chen, Xiaoping; Liu, Hong; Varshney, Rajeev K; Liang, Xuanqiang; Huang, Shangzhi

2015-01-01

The cultivated peanut (Arachis hypogaea L.) is an allotetraploid (AABB) species derived from the A-genome (Arachis duranensis) and B-genome (Arachis ipaensis) progenitors. Presence of two versions of a DNA sequence based on the two progenitor genomes poses a serious technical and analytical problem during single nucleotide polymorphism (SNP) marker identification and analysis. In this context, we have analyzed 200 amplicons derived from expressed sequence tags (ESTs) and genome survey sequences (GSS) to identify SNPs in a panel of genotypes consisting of 12 cultivated peanut varieties and two diploid progenitors representing the ancestral genomes. A total of 18 EST-SNPs and 44 genomic-SNPs were identified in 12 peanut varieties by aligning the sequence of A. hypogaea with diploid progenitors. The average frequency of sequence polymorphism was higher for genomic-SNPs than the EST-SNPs with one genomic-SNP every 1011 bp as compared to one EST-SNP every 2557 bp. In order to estimate the potential and further applicability of these identified SNPs, 96 peanut varieties were genotyped using high resolution melting (HRM) method. Polymorphism information content (PIC) values for EST-SNPs ranged between 0.021 and 0.413 with a mean of 0.172 in the set of peanut varieties, while genomic-SNPs ranged between 0.080 and 0.478 with a mean of 0.249. Total 33 SNPs were used for polymorphism detection among the parents and 10 selected lines from mapping population Y13Zh (Zhenzhuhei × Yueyou13). Of the total 33 SNPs, nine SNPs showed polymorphism in the mapping population Y13Zh, and seven SNPs were successfully mapped into five linkage groups. Our results showed that SNPs can be identified in allotetraploid peanut with high accuracy through amplicon sequencing and HRM assay. The identified SNPs were very informative and can be used for different genetic and breeding applications in peanut.
Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

PubMed Central

Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

2013-01-01

In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 10^6 pfu/mL and 1.56 × 10^9 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bp were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coding for the TAT-Ferritin fusion protein with two 6× His-tags at the N and C termini. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library met the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105

Construction of a full-length enriched cDNA library and preliminary analysis of expressed sequence tags from Bengal Tiger Panthera tigris tigris.

PubMed

Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

2013-05-24

In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 10^6 pfu/mL and 1.56 × 10^9 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bp were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coding for the TAT-Ferritin fusion protein with two 6× His-tags at the N and C termini. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library met the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.
Identification and characterization of 43 microsatellite markers derived from expressed sequence tags of the sea cucumber ( Apostichopus japonicus)

NASA Astrophysics Data System (ADS)

Jiang, Qun; Li, Qi; Yu, Hong; Kong, Lingfeng

2011-06-01

The sea cucumber Apostichopus japonicus is a commercially and ecologically important species in China. A total of 3056 potential unigenes were generated after assembling 7597 A. japonicus expressed sequence tags (ESTs) downloaded from GenBank. Two hundred and fifty microsatellite-containing ESTs (8.18%) and 299 simple sequence repeats (SSRs) were detected. The average density of SSRs was 1 per 7.403 kb of EST after redundancy elimination. Di-nucleotide repeat motifs appeared to be the most abundant type with a percentage of 69.90%. Of the 126 primer pairs designed, 90 amplified the expected products and 43 showed polymorphism in 30 individuals tested. The number of alleles per locus ranged from 2 to 26 with an average of 7.0 alleles, and the observed and expected heterozygosities varied from 0.067 to 1.000 and from 0.066 to 0.959, respectively. These new EST-derived microsatellite markers would provide sufficient polymorphism for population genetic studies and genome mapping of this sea cucumber species.
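The SSR screening step summarized in the record above can be illustrated with a minimal Python sketch. The abstract does not name the search tool or the repeat thresholds used, so the function names (`find_ssrs`, `kb_per_ssr`) and the minimum repeat counts below are hypothetical choices for illustration only, and the sketch reports perfect repeats of 2 to 6 bp motifs rather than reproducing the authors' pipeline.

```python
import re

# Hypothetical thresholds (motif length -> minimum number of tandem copies).
# The abstract does not state the criteria actually used.
DEFAULT_MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, copies) for perfect SSRs found in a sequence."""
    thresholds = DEFAULT_MIN_REPEATS if min_repeats is None else min_repeats
    seq = seq.upper()
    hits = []
    for k, n in sorted(thresholds.items()):
        # One captured motif of length k followed by at least n-1 more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT"),
            # so a dinucleotide run is not also counted as a tetranucleotide.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return hits


def kb_per_ssr(total_bp, n_ssrs):
    """Average kilobases of sequence per SSR (the density figure format)."""
    return total_bp / 1000 / n_ssrs


if __name__ == "__main__":
    toy_est = "GGC" + "AT" * 8 + "CCGT" + "CAG" * 5 + "TTGA"
    for start, motif, copies in find_ssrs(toy_est):
        print(start, motif, copies)
```

On real data, `find_ssrs` would be applied to each non-redundant unigene, and `kb_per_ssr` to the summed unigene length and total SSR count to obtain a density in the same form as the "1 per 7.403 kb" figure quoted in the abstract.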
Expressed sequence tag (EST) analysis of the pine wood nematode Bursaphelenchus xylophilus and B. mucronatus.

PubMed

Kikuchi, Taisei; Aikawa, Takuya; Kosaka, Hajime; Pritchard, Leighton; Ogura, Nobuo; Jones, John T

2007-09-01

Most Bursaphelenchus species feed on fungi that colonise dead or dying trees. However, Bursaphelenchus xylophilus is unique in that in addition to feeding on fungi it has the capacity to be a parasite of live pine trees. We present an analysis of over 13,000 expressed sequence tags (ESTs) from B. xylophilus and, by way of contrast, over 3000 ESTs from a closely related species that does not parasitise plants as readily; B. mucronatus. Four libraries from B. xylophilus, from a variety of life stages including fungal feeding nematodes, nematodes extracted from plants and dauer-like stage nematodes, and one library from B. mucronatus were constructed and used to generate ESTs. Contig analysis showed that the 13,327 B. xylophilus ESTs could be grouped into 2110 contigs and 4377 singletons giving a total of 6487 identified genes. Similarly the 3193 B. mucronatus ESTs yielded a total of 2219 identified genes from 425 contigs and 1794 singletons. A variety of proteins potentially important in the parasitic process of B. xylophilus and B. mucronatus, including plant and fungal cell wall degrading enzymes and a novel gene potentially encoding a expansin-like protein that may disrupt non-covalent bonds in the plant cell wall were identified in the libraries. Additionally several gene candidates potentially involved in dauer entry or maintenance were also identified in the EST dataset. The EST sequences from this study will provide a solid base for future research on the biology, pathogenicity and evolutionary history of this nematode group.
Annotated ESTs from various tissues of the brown planthopper Nilaparvata lugens: A genomic resource for studying agricultural pests

PubMed Central

Noda, Hiroaki; Kawai, Sawako; Koizumi, Yoko; Matsui, Kageaki; Zhang, Qiang; Furukawa, Shigetoyo; Shimomura, Michihiko; Mita, Kazuei

2008-01-01

Background The brown planthopper (BPH), Nilaparvata lugens (Hemiptera, Delphacidae), is a serious insect pest of rice plants. Major means of BPH control are application of agricultural chemicals and cultivation of BPH resistant rice varieties. Nevertheless, BPH strains that are resistant to agricultural chemicals have developed, and BPH strains have appeared that are virulent against the resistant rice varieties. Expressed sequence tag (EST) analysis and related applications are useful to elucidate the mechanisms of resistance and virulence and to reveal physiological aspects of this non-model insect, with its poorly understood genetic background. Results More than 37,000 high-quality ESTs, excluding sequences of mitochondrial genome, microbial genomes, and rDNA, have been produced from 18 libraries of various BPH tissues and stages. About 10,200 clusters have been made from whole EST sequences, with average EST size of 627 bp. Among the top ten most abundantly expressed genes, three are unique and show no homology in BLAST searches. The actin gene was highly expressed in BPH, especially in the thorax. Tissue-specifically expressed genes were extracted based on the expression frequency among the libraries. An EST database is available at our web site. Conclusion The EST library will provide useful information for transcriptional analyses, proteomic analyses, and gene functional analyses of BPH. Moreover, specific genes for hemimetabolous insects will be identified. The microarray fabricated based on the EST information will be useful for finding genes related to agricultural and biological problems related to this pest. PMID:18315884
Mass fingerprinting of the venom and transcriptome of venom gland of scorpion Centruroides tecomanus.

PubMed

Valdez-Velázquez, Laura L; Quintero-Hernández, Verónica; Romero-Gutiérrez, Maria Teresa; Coronas, Fredy I V; Possani, Lourival D

2013-01-01

Centruroides tecomanus is a Mexican scorpion endemic to the State of Colima that causes human fatalities. This communication describes a proteome analysis obtained from milked venom and a transcriptome analysis from a cDNA library constructed from two pairs of venom glands of this scorpion. High-performance liquid chromatography separation of soluble venom produced 80 fractions, from which at least 104 individual components were identified by mass spectrometry analysis, with molecular masses from 259 to 44,392 Da. Most of these components are within the expected molecular masses for Na(+)- and K(+)-channel specific toxic peptides, supporting the clinical findings of intoxication when humans are stung by this scorpion. From the cDNA library 162 clones were randomly chosen, from which 130 sequences of good quality were identified and were clustered in 28 contigs, each containing two or more expressed sequence tags (ESTs), and 49 singlets with only one EST. Deduced amino acid sequence analysis from 53% of the total ESTs showed that 81% (24 sequences) are similar to known toxic peptides that affect Na(+)-channel activity, and 19% (7 unique sequences) are similar to K(+)-channel specific toxins. Out of the 31 sequences, at least 8 peptides were confirmed by direct Edman degradation, using components isolated directly from the venom. The remaining 19%, 4%, 4%, 15% and 5% of the ESTs correspond respectively to proteins involved in cellular processes, antimicrobial peptides, venom components, proteins without defined function and sequences without similarity in databases. Among the cloned genes are those similar to metalloproteinases.
Identification, validation and high-throughput genotyping of transcribed gene SNPs in cassava.

PubMed

Ferguson, Morag E; Hearne, Sarah J; Close, Timothy J; Wanamaker, Steve; Moskal, William A; Town, Christopher D; de Young, Joe; Marri, Pradeep Reddy; Rabbi, Ismail Yusuf; de Villiers, Etienne P

2012-03-01

The availability of genomic resources can facilitate progress in plant breeding through the application of advanced molecular technologies for crop improvement. This is particularly important in the case of less researched crops such as cassava, a staple and food security crop for more than 800 million people. Here, expressed sequence tags (ESTs) were generated from five drought stressed and well-watered cassava varieties. Two cDNA libraries were developed: one from root tissue (CASR), the other from leaf, stem and stem meristem tissue (CASL). Sequencing generated 706 contigs and 3,430 singletons. These sequences were combined with those from two other EST sequencing initiatives and filtered based on the sequence quality. Quality sequences were aligned using CAP3 and embedded in a Windows browser called HarvEST:Cassava which is made available. HarvEST:Cassava consists of a Unigene set of 22,903 quality sequences. A total of 2,954 putative SNPs were identified. Of these 1,536 SNPs from 1,170 contigs and 53 cassava genotypes were selected for SNP validation using Illumina's GoldenGate assay. As a result 1,190 SNPs were validated technically and biologically. The location of validated SNPs on scaffolds of the cassava genome sequence (v.4.1) is provided. A diversity assessment of 53 cassava varieties reveals some sub-structure based on the geographical origin, greater diversity in the Americas as opposed to Africa, and similar levels of diversity in West Africa and southern, eastern and central Africa. The resources presented allow for improved genetic dissection of economically important traits and the application of modern genomics-based approaches to cassava breeding and conservation.
Update of the Diatom EST Database: a new tool for digital transcriptomics

PubMed Central

Maheswari, Uma; Mock, Thomas; Armbrust, E. Virginia; Bowler, Chris

2009-01-01

The Diatom Expressed Sequence Tag (EST) Database was constructed to provide integral access to ESTs from these ecologically and evolutionarily interesting microalgae. It has now been updated with 130 000 Phaeodactylum tricornutum ESTs from 16 cDNA libraries and 77 000 Thalassiosira pseudonana ESTs from seven libraries, derived from cells grown in different nutrient and stress regimes. The updated relational database incorporates results from statistical analyses such as log-likelihood ratios and hierarchical clustering, which help to identify differentially expressed genes under different conditions, and allow similarities in gene expression in different libraries to be investigated in a functional context. The database also incorporates links to the recently sequenced genomes of P. tricornutum and T. pseudonana, enabling an easy cross-talk between the expression pattern of diatom orthologs and the genome browsers. These improvements will facilitate exploration of diatom responses to conditions of ecological relevance and will aid gene function identification of diatom-specific genes and in silico gene prediction in this largely unexplored class of eukaryotes. The updated Diatom EST Database is available at http://www.biologie.ens.fr/diatomics/EST3. PMID:19029140
The first set of EST resource for gene discovery and marker development in pigeonpea (Cajanus cajan L.).

PubMed

Raju, Nikku L; Gnanesh, Belaghihalli N; Lekha, Pazhamala; Jayashree, Balaji; Pande, Suresh; Hiremath, Pavana J; Byregowda, Munishamappa; Singh, Nagendra K; Varshney, Rajeev K

2010-03-11

Pigeonpea (Cajanus cajan (L.) Millsp) is one of the major grain legume crops of the tropics and subtropics, but biotic stresses [Fusarium wilt (FW), sterility mosaic disease (SMD), etc.] are serious challenges for sustainable crop production. Modern genomic tools such as molecular markers and candidate genes associated with resistance to these stresses offer the possibility of facilitating pigeonpea breeding for improving biotic stress resistance. Availability of limited genomic resources, however, is a serious bottleneck to undertake molecular breeding in pigeonpea to develop superior genotypes with enhanced resistance to above mentioned biotic stresses. With an objective of enhancing genomic resources in pigeonpea, this study reports generation and analysis of comprehensive resource of FW- and SMD- responsive expressed sequence tags (ESTs). A total of 16 cDNA libraries were constructed from four pigeonpea genotypes that are resistant and susceptible to FW ('ICPL 20102' and 'ICP 2376') and SMD ('ICP 7035' and 'TTB 7') and a total of 9,888 (9,468 high quality) ESTs were generated and deposited in dbEST of GenBank under accession numbers GR463974 to GR473857 and GR958228 to GR958231. Clustering and assembly analyses of these ESTs resulted into 4,557 unique sequences (unigenes) including 697 contigs and 3,860 singletons. BLASTN analysis of 4,557 unigenes showed a significant identity with ESTs of different legumes (23.2-60.3%), rice (28.3%), Arabidopsis (33.7%) and poplar (35.4%). As expected, pigeonpea ESTs are more closely related to soybean (60.3%) and cowpea ESTs (43.6%) than other plant ESTs. Similarly, BLASTX similarity results showed that only 1,603 (35.1%) out of 4,557 total unigenes correspond to known proteins in the UniProt database (≤ 1E-08). Functional categorization of the annotated unigenes sequences showed that 153 (3.3%) genes were assigned to cellular component category, 132 (2.8%) to biological process, and 132 (2.8%) in molecular function. Further, 19 genes were identified differentially expressed between FW- responsive genotypes and 20 between SMD- responsive genotypes. Generated ESTs were compiled together with 908 ESTs available in public domain, at the time of analysis, and a set of 5,085 unigenes were defined that were used for identification of molecular markers in pigeonpea. For instance, 3,583 simple sequence repeat (SSR) motifs were identified in 1,365 unigenes and 383 primer pairs were designed. Assessment of a set of 84 primer pairs on 40 elite pigeonpea lines showed polymorphism with 15 (28.8%) markers with an average of four alleles per marker and an average polymorphic information content (PIC) value of 0.40. Similarly, in silico mining of 133 contigs with ≥ 5 sequences detected 102 single nucleotide polymorphisms (SNPs) in 37 contigs. As an example, a set of 10 contigs were used for confirming in silico predicted SNPs in a set of four genotypes using wet lab experiments. Occurrence of SNPs were confirmed for all the 6 contigs for which scorable and sequenceable amplicons were generated. PCR amplicons were not obtained in case of 4 contigs. Recognition sites for restriction enzymes were identified for 102 SNPs in 37 contigs that indicates possibility of assaying SNPs in 37 genes using cleaved amplified polymorphic sequences (CAPS) assay. The pigeonpea EST dataset generated here provides a transcriptomic resource for gene discovery and development of functional markers associated with biotic stress resistance. Sequence analyses of this dataset have showed conservation of a considerable number of pigeonpea transcripts across legume and model plant species analysed as well as some putative pigeonpea specific genes. Validation of identified biotic stress responsive genes should provide candidate genes for allele mining as well as candidate markers for molecular breeding.
The first set of EST resource for gene discovery and marker development in pigeonpea (Cajanus cajan L.)

PubMed Central

2010-01-01

Background: Pigeonpea (Cajanus cajan (L.) Millsp) is one of the major grain legume crops of the tropics and subtropics, but biotic stresses [Fusarium wilt (FW), sterility mosaic disease (SMD), etc.] are serious challenges for sustainable crop production. Modern genomic tools such as molecular markers and candidate genes associated with resistance to these stresses offer the possibility of facilitating pigeonpea breeding for improving biotic stress resistance. Availability of limited genomic resources, however, is a serious bottleneck to undertake molecular breeding in pigeonpea to develop superior genotypes with enhanced resistance to above mentioned biotic stresses. With an objective of enhancing genomic resources in pigeonpea, this study reports generation and analysis of comprehensive resource of FW- and SMD- responsive expressed sequence tags (ESTs). Results: A total of 16 cDNA libraries were constructed from four pigeonpea genotypes that are resistant and susceptible to FW ('ICPL 20102' and 'ICP 2376') and SMD ('ICP 7035' and 'TTB 7') and a total of 9,888 (9,468 high quality) ESTs were generated and deposited in dbEST of GenBank under accession numbers GR463974 to GR473857 and GR958228 to GR958231. Clustering and assembly analyses of these ESTs resulted into 4,557 unique sequences (unigenes) including 697 contigs and 3,860 singletons. BLASTN analysis of 4,557 unigenes showed a significant identity with ESTs of different legumes (23.2-60.3%), rice (28.3%), Arabidopsis (33.7%) and poplar (35.4%). As expected, pigeonpea ESTs are more closely related to soybean (60.3%) and cowpea ESTs (43.6%) than other plant ESTs. Similarly, BLASTX similarity results showed that only 1,603 (35.1%) out of 4,557 total unigenes correspond to known proteins in the UniProt database (≤ 1E-08). Functional categorization of the annotated unigenes sequences showed that 153 (3.3%) genes were assigned to cellular component category, 132 (2.8%) to biological process, and 132 (2.8%) in molecular function. Further, 19 genes were identified differentially expressed between FW- responsive genotypes and 20 between SMD- responsive genotypes. Generated ESTs were compiled together with 908 ESTs available in public domain, at the time of analysis, and a set of 5,085 unigenes were defined that were used for identification of molecular markers in pigeonpea. For instance, 3,583 simple sequence repeat (SSR) motifs were identified in 1,365 unigenes and 383 primer pairs were designed. Assessment of a set of 84 primer pairs on 40 elite pigeonpea lines showed polymorphism with 15 (28.8%) markers with an average of four alleles per marker and an average polymorphic information content (PIC) value of 0.40. Similarly, in silico mining of 133 contigs with ≥ 5 sequences detected 102 single nucleotide polymorphisms (SNPs) in 37 contigs. As an example, a set of 10 contigs were used for confirming in silico predicted SNPs in a set of four genotypes using wet lab experiments. Occurrence of SNPs were confirmed for all the 6 contigs for which scorable and sequenceable amplicons were generated. PCR amplicons were not obtained in case of 4 contigs. Recognition sites for restriction enzymes were identified for 102 SNPs in 37 contigs that indicates possibility of assaying SNPs in 37 genes using cleaved amplified polymorphic sequences (CAPS) assay. Conclusion: The pigeonpea EST dataset generated here provides a transcriptomic resource for gene discovery and development of functional markers associated with biotic stress resistance. Sequence analyses of this dataset have showed conservation of a considerable number of pigeonpea transcripts across legume and model plant species analysed as well as some putative pigeonpea specific genes. Validation of identified biotic stress responsive genes should provide candidate genes for allele mining as well as candidate markers for molecular breeding. PMID:20222972
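The CAPS idea mentioned in the abstract above (a SNP is assayable when one allele carries a restriction enzyme recognition site and the other does not) can be illustrated with a minimal Python sketch. The enzyme table, the function name `caps_candidates` and the example sequence are assumptions made for illustration only; the abstract does not say which enzymes or software the authors used. All four sites listed are palindromic, so only one strand is scanned.

```python
# Hypothetical enzyme table: a few well-known enzymes and their recognition sites.
ENZYMES = {
    "EcoRI": "GAATTC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
    "TaqI": "TCGA",
}


def caps_candidates(seq, pos, alt, enzymes=ENZYMES):
    """Return (enzyme, allele) pairs where a site overlapping the SNP at
    0-based index `pos` is present in only one of the two alleles."""
    ref_allele = seq.upper()
    alt_allele = ref_allele[:pos] + alt.upper() + ref_allele[pos + 1:]
    hits = []
    for name, site in enzymes.items():
        # Restrict the search to the window in which any site must overlap the SNP.
        lo = max(0, pos - len(site) + 1)
        hi = pos + len(site)
        in_ref = site in ref_allele[lo:hi]
        in_alt = site in alt_allele[lo:hi]
        if in_ref != in_alt:
            hits.append((name, "ref" if in_ref else "alt"))
    return hits


if __name__ == "__main__":
    # An A>G change inside GAATTC destroys the EcoRI site in the alternate allele.
    print(caps_candidates("ACGTGAATTCACGT", 6, "G"))
```

In this toy case only the reference allele would be cut by EcoRI, so digesting a PCR amplicon spanning the SNP would distinguish the two alleles on a gel.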
Optimization of the hybridization-based method for purification of thermostable tRNAs in the presence of tetraalkylammonium salts

PubMed Central

Yokogawa, Takashi; Kitamura, Yusuke; Nakamura, Daigo; Ohno, Satoshi; Nishikawa, Kazuya

2010-01-01

We found that both tetramethylammonium chloride (TMA-Cl) and tetra-ethylammonium chloride (TEA-Cl), which are used as monovalent cations for northern hybridization, drastically destabilized the tertiary structures of tRNAs and enhanced the formation of tRNA•oligoDNA hybrids. These effects are of great advantage for the hybridization-based method for purification of specific tRNAs from unfractionated tRNA mixtures through the use of an immobilized oligoDNA complementary to the target tRNA. Replacement of NaCl by TMA-Cl or TEA-Cl in the hybridization buffer greatly improved the recovery of a specific tRNA, even from unfractionated tRNAs derived from a thermophile. Since TEA-Cl destabilized tRNAs more strongly than TMA-Cl, it was necessary to lower the hybridization temperature at the sacrifice of the purity of the recovered tRNA when using TEA-Cl. Therefore, we propose two alternative protocols, depending on the desired properties of the tRNA to be purified. When the total recovery of the tRNA is important, hybridization should be carried out in the presence of TEA-Cl. However, if the purity of the recovered tRNA is important, TMA-Cl should be used for the hybridization. In principle, this procedure for tRNA purification should be applicable to any small-size RNA whose gene sequence is already known. PMID:20040572
Double-labeled donor probe can enhance the signal of fluorescence resonance energy transfer (FRET) in detection of nucleic acid hybridization

PubMed Central

Okamura, Yukio; Kondo, Satoshi; Sase, Ichiro; Suga, Takayuki; Mise, Kazuyuki; Furusawa, Iwao; Kawakami, Shigeki; Watanabe, Yuichiro

2000-01-01

A set of fluorescently-labeled DNA probes that hybridize with the target RNA and produce fluorescence resonance energy transfer (FRET) signals can be utilized for the detection of specific RNA. We have developed probe sets to detect and discriminate single-strand RNA molecules of plant viral genome, and sought a method to improve the FRET signals to handle in vivo applications. Consequently, we found that a double-labeled donor probe labeled with Bodipy dye yielded a remarkable increase in fluorescence intensity compared to a single-labeled donor probe used in an ordinary FRET. This double-labeled donor system can be easily applied to improve various FRET probes since the dependence upon sequence and label position in enhancement is not as strict. Furthermore this method could be applied to other nucleic acid substances, such as oligo RNA and phosphorothioate oligonucleotides (S-oligos) to enhance FRET signal. Although the double-labeled donor probes labeled with a variety of fluorophores had unexpected properties (strange UV-visible absorption spectra, decrease of intensity and decay of donor fluorescence) compared with single-labeled ones, they had no relation to FRET enhancement. This signal amplification mechanism cannot be explained simply based on our current results and knowledge of FRET. Yet it is possible to utilize this double-labeled donor system in various applications of FRET as a simple signal-enhancement method. PMID:11121494
Analysis of the interaction with the hepatitis C virus mRNA reveals an alternative mode of RNA recognition by the human La protein.

PubMed

Martino, Luigi; Pennell, Simon; Kelly, Geoff; Bui, Tam T T; Kotik-Kogan, Olga; Smerdon, Stephen J; Drake, Alex F; Curry, Stephen; Conte, Maria R

2012-02-01

Human La protein is an essential factor in the biology of both coding and non-coding RNAs. In the nucleus, La binds primarily to 3' oligoU containing RNAs, while in the cytoplasm La interacts with an array of different mRNAs lacking a 3' UUU(OH) trailer. An example of the latter is the binding of La to the IRES domain IV of the hepatitis C virus (HCV) RNA, which is associated with viral translation stimulation. By systematic biophysical investigations, we have found that La binds to domain IV using an RNA recognition that is quite distinct from its mode of binding to RNAs with a 3' UUU(OH) trailer: although the La motif and first RNA recognition motif (RRM1) are sufficient for high-affinity binding to 3' oligoU, recognition of HCV domain IV requires the La motif and RRM1 to work in concert with the atypical RRM2 which has not previously been shown to have a significant role in RNA binding. This new mode of binding does not appear sequence specific, but recognizes structural features of the RNA, in particular a double-stranded stem flanked by single-stranded extensions. These findings pave the way for a better understanding of the role of La in viral translation initiation.
Role of monomer sequence and backbone chemistry in polypeptoid copolymers for marine antifouling coatings

NASA Astrophysics Data System (ADS)

Patterson, Anastasia; Wenning, Brandon; Rizis, Georgios; Calabrese, David; Finlay, John; Franco, Sofia; Clare, Anthony; Kramer, Edward; Ober, Christopher; Segalman, Rachel

The design rules elucidated in this work suggest that antifouling coatings bearing pendant peptoid side chains perform better overall in marine fouling tests than those with peptide side chains, with extremely low attachment of N. incerta and high removal of U. linza. This difference in performance is likely due to the lack of a hydrogen bond donor in the peptoid backbone. Furthermore, we show that the bulk polymer material of these hierarchical coatings (based on PEO or PDMS) plays a key role in determining both surface presentation and fouling release performance. We demonstrate these trends utilizing a modular coating based on a triblock copolymer consisting of polystyrene and a vinyl-containing midblock, to which sequence-defined pendant oligomers (peptides or peptoids with sequences of oligo-PEO and fluoroalkyl groups) are attached via thiol-ene "click" chemistry. Surface presentation was analyzed with X-ray photoelectron spectroscopy and captive bubble water contact angle, and antifouling performance was evaluated with attachment and removal bioassays of the marine macroalga U. linza and diatom N. incerta. NSF GRFP and ONR PECASE.
In silico search, characterization and validation of new EST-SSR markers in the genus Prunus.

PubMed

Sorkheh, Karim; Prudencio, Angela S; Ghebinejad, Azim; Dehkordi, Mehrana Kohei; Erogul, Deniz; Rubio, Manuel; Martínez-Gómez, Pedro

2016-07-07

Simple sequence repeats (SSRs) are defined as sequence repeat units between 1 and 6 bp that occur in both coding and non-coding regions abundant in eukaryotic genomes, which may affect the expression of genes. In this study, expressed sequence tags (ESTs) of eight Prunus species were analyzed for in silico mining of EST-SSRs, protein annotation, and open reading frames (ORFs), and the identification of codon repetitions. A total of 316 SSRs were identified using MISA software. Dinucleotide SSR motifs (26.31 %) were found to be the most abundant type of repeats, followed by tri- (14.58 %), tetra- (0.53 %), and penta- (0.27 %) nucleotide motifs. An attempt was made to design primer pairs for 316 identified SSRs but these were successful for only 175 SSR sequences. The positions of SSRs with respect to ORFs were detected, and annotation of sequences containing SSRs was performed to assign function to each sequence. SSRs were also characterized (in terms of position in the reference genome and associated gene) using the two available Prunus reference genomes (mei and peach). Finally, 38 SSR markers were validated across peach, almond, plum, and apricot genotypes. This validation showed a higher transferability level of EST-SSR developed in P. mume (mei) in comparison with the rest of species analyzed. Findings will aid analysis of functionally important molecular markers and facilitate the analysis of genetic diversity.
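As a rough illustration of the in silico SSR mining described above (repeat units of 1 to 6 bp), the following Python sketch scans a sequence with regular expressions. It is not MISA itself; the minimum repeat counts in `MIN_REPEATS` and the function name `find_ssrs` are assumptions chosen for the example, since the abstract does not state the thresholds used.

```python
import re

# Assumed minimum number of repeats per unit length (unit length -> repeats).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=None):
    """Return (start, unit, repeat_count) for perfect SSRs in `seq` (0-based start)."""
    thresholds = MIN_REPEATS if min_repeats is None else min_repeats
    seq = seq.upper()
    found = []
    for unit_len, min_rep in sorted(thresholds.items()):
        # A unit of `unit_len` bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are just a shorter motif repeated (e.g. "AA", "ATAT").
            if any(
                unit == unit[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            found.append((m.start(), unit, len(m.group(0)) // unit_len))
    return found


if __name__ == "__main__":
    est = "GGC" + "AT" * 7 + "CCT" + "AAG" * 5 + "T"
    for start, unit, count in find_ssrs(est):
        print(start, unit, count)
```

Only perfect repeats are reported here; compound or interrupted SSRs, which dedicated tools can also detect, are outside the scope of this sketch.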
Genome-wide analysis of esterase-like genes in the striped rice stem borer, Chilo suppressalis.

PubMed

Wang, Baoju; Wang, Ying; Zhang, Yang; Han, Ping; Li, Fei; Han, Zhaojun

2015-06-01

The striped rice stem borer, Chilo suppressalis, a destructive pest of rice, has developed high levels of resistance to certain insecticides. Esterases are reported to be involved in insecticide resistance in several insects. Therefore, this study systematically analyzed esterase-like genes in C. suppressalis. Fifty-one esterase-like genes were identified in the draft genomic sequences of the species, and 20 cDNA sequences were derived which encoded full- or nearly full-length proteins. The putative esterase proteins derived from these full-length genes are overall highly diversified. However, key residues that are functionally important including the serine residue in the active site are conserved in 18 out of the 20 proteins. Phylogenetic analysis revealed that most of these genes have homologues in other lepidoptera insects. Genes CsuEst6, CsuEst10, CsuEst11, and CsuEst51 were induced by the insecticide triazophos, and genes CsuEst9, CsuEst11, CsuEst14, and CsuEst51 were induced by the insecticide chlorantraniliprole. Our results provide a foundation for future studies of insecticide resistance in C. suppressalis and for comparative research with esterase genes from other insect species.
A comprehensive resource of drought- and salinity- responsive ESTs for gene discovery and marker development in chickpea (Cicer arietinum L.)

PubMed Central

2009-01-01

Background: Chickpea (Cicer arietinum L.), an important grain legume crop of the world is seriously challenged by terminal drought and salinity stresses. However, very limited number of molecular markers and candidate genes are available for undertaking molecular breeding in chickpea to tackle these stresses. This study reports generation and analysis of comprehensive resource of drought- and salinity-responsive expressed sequence tags (ESTs) and gene-based markers. Results: A total of 20,162 (18,435 high quality) drought- and salinity- responsive ESTs were generated from ten different root tissue cDNA libraries of chickpea. Sequence editing, clustering and assembly analysis resulted in 6,404 unigenes (1,590 contigs and 4,814 singletons). Functional annotation of unigenes based on BLASTX analysis showed that 46.3% (2,965) had significant similarity (≤1E-05) to sequences in the non-redundant UniProt database. BLASTN analysis of unique sequences with ESTs of four legume species (Medicago, Lotus, soybean and groundnut) and three model plant species (rice, Arabidopsis and poplar) provided insights on conserved genes across legumes as well as novel transcripts for chickpea. Of 2,965 (46.3%) significant unigenes, only 2,071 (32.3%) unigenes could be functionally categorised according to Gene Ontology (GO) descriptions. A total of 2,029 sequences containing 3,728 simple sequence repeats (SSRs) were identified and 177 new EST-SSR markers were developed. Experimental validation of a set of 77 SSR markers on 24 genotypes revealed 230 alleles with an average of 4.6 alleles per marker and average polymorphism information content (PIC) value of 0.43. Besides SSR markers, 21,405 high confidence single nucleotide polymorphisms (SNPs) in 742 contigs (with ≥ 5 ESTs) were also identified. Recognition sites for restriction enzymes were identified for 7,884 SNPs in 240 contigs. Hierarchical clustering of 105 selected contigs provided clues about stress- responsive candidate genes and their expression profile showed predominance in specific stress-challenged libraries. Conclusion: Generated set of chickpea ESTs serves as a resource of high quality transcripts for gene discovery and development of functional markers associated with abiotic stress tolerance that will be helpful to facilitate chickpea breeding. Mapping of gene-based markers in chickpea will also add more anchoring points to align genomes of chickpea and other legume species. PMID:19912666
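The polymorphism information content (PIC) reported above is a per-marker statistic computed from allele frequencies. A minimal Python sketch follows, assuming the commonly used definition PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j²; the abstract does not state which formula the authors applied, and the function name `pic` and the allele counts in the example are hypothetical.

```python
from itertools import combinations


def pic(allele_counts):
    """Polymorphism information content of one marker from its allele counts."""
    total = sum(allele_counts)
    p = [c / total for c in allele_counts]
    # Sum of squared allele frequencies (expected homozygosity).
    homozygosity = sum(x * x for x in p)
    # Correction term for matings that are uninformative about allele origin.
    cross = sum(2 * (a * a) * (b * b) for a, b in combinations(p, 2))
    return 1.0 - homozygosity - cross


if __name__ == "__main__":
    # Hypothetical marker with two equally frequent alleles.
    print(pic([10, 10]))
    # Hypothetical marker with four equally frequent alleles.
    print(pic([1, 1, 1, 1]))
```

Under this definition a marker with two equally frequent alleles has a PIC of 0.375, and the value rises toward 1 as the number of evenly distributed alleles grows. An average PIC across markers is the mean of the per-marker values.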
Analysis of Expressed Sequence Tags (EST) in Date Palm.

PubMed

Al-Faifi, Sulieman A; Migdadi, Hussein M; Algamdi, Salem S; Khan, Mohammad Altaf; Al-Obeed, Rashid S; Ammar, Megahed H; Jakse, Jerenj

2017-01-01

Expressed sequence tags (ESTs) were generated from a normalized cDNA library of the date palm cv. Sukkari to understand the high quality and better field performance of this well-known commercial cultivar. A total of 6943 high-quality ESTs were generated, of which 6671 were submitted to the GenBank dbEST (LIBEST_028537). The generated ESTs were assembled into 6362 unigenes, consisting of 494 (14.4%) contigs and 5868 (84.53%) singletons. The functional annotation shows that the majority of the ESTs are associated with binding (44%), catalytic (40%), transporter (5%), and structural molecular (5%) activities. The blastx results show that 73% of unigenes are significantly similar to known plant genes and 27% are novel. The latter could be of particular interest in date palm genetic studies. Further analysis shows that some ESTs are categorized as stress/defense- and fruit development-related genes. These newly generated ESTs could significantly enhance date palm EST databases in the public domain and are available to scientists and researchers across the globe. This knowledge will facilitate the discovery of candidate genes that govern important developmental and agronomical traits in date palm. It will provide important resources for developing genetic tools and for studying comparative genomics and genome evolution among date palm cultivars.
Exploring root symbiotic programs in the model legume Medicago truncatula using EST analysis.

PubMed

Journet, Etienne-Pascal; van Tuinen, Diederik; Gouzy, Jérome; Crespeau, Hervé; Carreau, Véronique; Farmer, Mary-Jo; Niebel, Andreas; Schiex, Thomas; Jaillon, Olivier; Chatagnier, Odile; Godiard, Laurence; Micheli, Fabienne; Kahn, Daniel; Gianinazzi-Pearson, Vivienne; Gamas, Pascal

2002-12-15

We report on a large-scale expressed sequence tag (EST) sequencing and analysis program aimed at characterizing the sets of genes expressed in roots of the model legume Medicago truncatula during interactions with either of two microsymbionts, the nitrogen-fixing bacterium Sinorhizobium meliloti or the arbuscular mycorrhizal fungus Glomus intraradices. We have designed specific tools for in silico analysis of EST data, in relation to chimeric cDNA detection, EST clustering, encoded protein prediction, and detection of differential expression. Our 21 473 5'- and 3'-ESTs could be grouped into 6359 EST clusters, corresponding to distinct virtual genes, along with 52 498 other M.truncatula ESTs available in the dbEST (NCBI) database that were recruited in the process. These clusters were manually annotated, using a specifically developed annotation interface. Analysis of EST cluster distribution in various M.truncatula cDNA libraries, supported by a refined R test to evaluate statistical significance and by 'electronic northern' representation, enabled us to identify a large number of novel genes predicted to be up- or down-regulated during either symbiotic root interaction. These in silico analyses provide a first global view of the genetic programs for root symbioses in M.truncatula. A searchable database has been built and can be accessed through a public interface.
MODEST: a web-based design tool for oligonucleotide-mediated genome engineering and recombineering

PubMed Central

Bonde, Mads T.; Klausen, Michael S.; Anderson, Mads V.; Wallin, Annika I.N.; Wang, Harris H.; Sommer, Morten O.A.

2014-01-01

Recombineering and multiplex automated genome engineering (MAGE) offer the possibility to rapidly modify multiple genomic or plasmid sites at high efficiencies. This enables efficient creation of genetic variants including both single mutants with specifically targeted modifications as well as combinatorial cell libraries. Manual design of oligonucleotides for these approaches can be tedious, time-consuming, and may not be practical for larger projects targeting many genomic sites. At present, the change from a desired phenotype (e.g. altered expression of a specific protein) to a designed MAGE oligo, which confers the corresponding genetic change, is performed manually. To address these challenges, we have developed the MAGE Oligo Design Tool (MODEST). This web-based tool allows designing of MAGE oligos for (i) tuning translation rates by modifying the ribosomal binding site, (ii) generating translational gene knockouts and (iii) introducing other coding or non-coding mutations, including amino acid substitutions, insertions, deletions and point mutations. The tool automatically designs oligos based on desired genotypic or phenotypic changes defined by the user, which can be used for high efficiency recombineering and MAGE. MODEST is available for free and is open to all users at http://modest.biosustain.dtu.dk. PMID:24838561
Biobank classification in an Australian setting.

PubMed

Rush, Amanda; Christiansen, Jeffrey H; Farrell, Jake P; Goode, Susan M; Scott, Rodney J; Spring, Kevin J; Byrne, Jennifer A

2015-06-01

In 2011, Watson and Barnes proposed a schema for classifying biobanks into 3 groups (mono-, oligo-, and poly-user), primarily based upon biospecimen access policies. We used results from a recent comprehensive survey of cancer biobanks in New South Wales, Australia to assess the applicability of this biobank classification schema in an Australian setting. Cancer biobanks were identified using publicly available data, and by consulting with research managers. A comprehensive survey was developed and administered through a face-to-face setting. Data were analyzed using Microsoft Excel™ 2010 and IBM SPSS Statistics™ version 21.0. The cancer biobank cohort (n=23) represented 5 mono-user biobanks, 7 oligo-user biobanks, and 11 poly-user biobanks, and was analyzed as two groups (mono-/oligo- versus poly-user biobanks). Poly-user biobanks employed significantly more full-time equivalent staff, and were significantly more likely to have a website, share staff between biobanks, access governance support, utilize quality control measures, be aware of biobanking best practice documents, and offer staff training. Mono-/oligo-user biobanks were significantly more likely to seek advice from other biobanks. Our results further delineate a biobank classification system that is primarily based on access policy, and demonstrate its relevance in an Australian setting.

Modified bases enable high-efficiency oligonucleotide-mediated allelic replacement via mismatch repair evasion

PubMed Central

Wang, Harris H.; Xu, George; Vonner, Ashley J.; Church, George

2011-01-01

Genome engineering using single-stranded oligonucleotides is an efficient method for generating small chromosomal and episomal modifications in a variety of host organisms. The efficiency of this allelic replacement strategy is highly dependent on avoidance of the endogenous mismatch repair (MMR) machinery. However, global MMR inactivation generally results in significant accumulation of undesired background mutations. Here, we present a novel strategy using oligos containing chemically modified bases (2′-Fluoro-Uridine, 5-Methyl-deoxyCytidine, 2,6-Diaminopurine or Iso-deoxyGuanosine) in place of the standard T, C, A or G to avoid mismatch detection and repair, which we tested in Escherichia coli. This strategy increases transient allelic-replacement efficiencies by up to 20-fold, while maintaining a 100-fold lower background mutation level. We further show that the mismatched bases between the full length oligo and the chromosome are often not incorporated at the target site, probably due to nuclease activity at the 5′ and 3′ termini of the oligo. These results further elucidate the mechanism of oligo-mediated allelic replacement (OMAR) and enable improved methodologies for efficient, large-scale engineering of genomes. PMID:21609953
Enzymatic generation of galactose-rich oligosaccharides/oligomers from potato rhamnogalacturonan I pectic polysaccharides.

PubMed

Khodaei, Nastaran; Karboune, Salwa

2016-04-15

Potato pulp by-product rich in galactan-rich rhamnogalacturonan I (RG I) was investigated as a new source of oligosaccharides with potential prebiotic properties. The efficiency of selected monocomponent enzymes and multi-enzymatic preparations to generate oligosaccharides/oligomers from potato RG I was evaluated. These overall results of yield were dependent on the activity profile of the multi-enzymatic preparations. Highest oligo-RG I yield of 93.9% was achieved using multi-enzymatic preparation (Depol 670L) with higher hydrolytic activity toward side chains of RG I as compared to its backbone. Main oligo-RG I products were oligosaccharides with DP of 2-12 (79.8-100%), while the oligomers with DP of 13-70 comprised smaller proportion (0.0-20.2%). Galactose (58.9-91.2%, w/w) was the main monosaccharide of oligo-RG I, while arabinose represented 0.0-12.1%. An understanding of the relationship between the activity profile of multi-enzymatic preparations and the yield/DP of oligo-RG I was achieved. This is expected to provide the capability to generate galacto- and galacto(arabino) oligosaccharides and their corresponding oligomers from an abundant by-product. Copyright © 2015 Elsevier Ltd. All rights reserved.
Digging into the low molecular weight peptidome with the OligoNet web server.

PubMed

Liu, Youzhong; Forcisi, Sara; Lucio, Marianna; Harir, Mourad; Bahut, Florian; Deleris-Bou, Magali; Krieger-Weber, Sibylle; Gougeon, Régis D; Alexandre, Hervé; Schmitt-Kopplin, Philippe

2017-09-15

Bioactive peptides play critical roles in regulating many biological processes. Recently, natural short-peptide biomarkers have been drawing significant attention and are considered a "hidden treasure" of drug candidates. The high resolution and high mass accuracy provided by mass spectrometry (MS)-based untargeted metabolomics would enable the rapid detection and wide coverage of the low-molecular-weight peptidome. However, translating unknown masses (<1 500 Da) into putative peptides is often limited by the lack of automatic data processing tools and by the limits of peptide databases. The web server OligoNet responds to this challenge by attempting to decompose each individual mass from metabolomics datasets into a combination of amino acids. It provides an additional network-based data interpretation named "Peptide degradation network" (PDN), which unravels interesting relations between annotated peptides and generates potential functional patterns. The ab initio PDN built from yeast metabolic profiling data shows a great similarity with well-known metabolic networks, and could aid biological interpretation. OligoNet also allows an easy evaluation and interpretation of annotated peptides in systems biology, and is freely accessible at https://daniellyz200608105.shinyapps.io/OligoNet/.
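The core step described above, decomposing an observed neutral mass into a combination of amino acids, can be sketched in Python as a bounded search over residue masses. This is not OligoNet's algorithm, which the abstract does not detail; the tolerance, the length limit and the function name `decompose` are assumptions. The residue values are standard monoisotopic masses, with leucine and isoleucine merged as "L" because they are isobaric.

```python
WATER = 18.01056  # monoisotopic mass of H2O, added once per linear peptide

# Monoisotopic residue masses in Da; "L" stands for both Leu and Ile.
RESIDUES = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "N": 114.04293, "D": 115.02694, "Q": 128.05858, "K": 128.09496,
    "E": 129.04259, "M": 131.04049, "H": 137.05891, "F": 147.06841,
    "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def decompose(mass, tol=0.005, max_len=6):
    """Return amino acid compositions (unordered, as strings) whose neutral
    mass matches `mass` within `tol` Da, using at most `max_len` residues."""
    items = sorted(RESIDUES.items(), key=lambda kv: kv[1])
    results = []

    def extend(start, remaining, chosen):
        if chosen and abs(remaining) <= tol:
            results.append("".join(chosen))
            return
        if len(chosen) == max_len:
            return
        # Residues are tried in non-decreasing mass order so that each
        # composition is generated exactly once.
        for i in range(start, len(items)):
            aa, m = items[i]
            if m > remaining + tol:
                break
            extend(i, remaining - m, chosen + [aa])

    extend(0, mass - WATER, [])
    return results


if __name__ == "__main__":
    print(decompose(146.06913))
```

For a mass of 146.06913 Da the sketch returns both a Gly+Ala composition and free glutamine, which are isomeric. This is the kind of ambiguity that additional context, such as a network of related peptides, may help to resolve.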
Gambling on a shortcut to genome sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roberts, L.

1991-06-21

Almost from the start of the Human Genome Project, a debate has been raging over whether to sequence the entire human genome, all 3 billion bases, or just the genes - a mere 2% or 3% of the genome, and by far the most interesting part. In England, Sydney Brenner convinced the Medical Research Council (MRC) to start with the expressed genes, or complementary DNAs. But the US stance has been that the entire sequence is essential if we are to understand the blueprint of man. Craig Venter of the National Institute of Neurological Disorders and Stroke says that focusing on the expressed genes may be even more useful than expected. His strategy involves randomly selecting clones from cDNA libraries which theoretically contain all the genes that are switched on at a particular time in a particular tissue. Then the researchers sequence just a short stretch of each clone, about 400 to 500 bases, to create an expressed sequence tag or EST. The sequences of these ESTs are then stored in a database. Using that information, other researchers can then recreate that EST by using polymerase chain reaction techniques.
DNA sequence chromatogram browsing using JAVA and CORBA.

PubMed

Parsons, J D; Buehler, E; Hillier, L

1999-03-01

DNA sequence chromatograms (traces) are the primary data source for all large-scale genomic and expressed sequence tag (EST) sequencing projects. Access to the sequencing trace assists many later analyses, for example contig assembly and polymorphism detection, but obtaining and using traces is problematic. Traces are not collected and published centrally, they are much larger than the base calls derived from them, and viewing them requires the interactivity of a local graphical client with local data. To provide efficient global access to DNA traces, we developed a client/server system based on flexible Java components integrated into other applications including an applet for use in a WWW browser and a stand-alone trace viewer. Client/server interaction is facilitated by CORBA middleware which provides a well-defined interface, a naming service, and location independence. [The software is packaged as a Jar file available from the following URL: http://www.ebi.ac.uk/jparsons. Links to working examples of the trace viewers can be found at http://corba.ebi.ac.uk/EST. All the Washington University mouse EST traces are available for browsing at the same URL.]
AntiHunter 2.0: increased speed and sensitivity in searching BLAST output for EST antisense transcripts.

PubMed

Lavorgna, Giovanni; Triunfo, Riccardo; Santoni, Federico; Orfanelli, Ugo; Noci, Sara; Bulfone, Alessandro; Zanetti, Gianluigi; Casari, Giorgio

2005-07-01

An increasing number of eukaryotic and prokaryotic genes are being found to have natural antisense transcripts (NATs). There is also growing evidence to suggest that antisense transcription could play a key role in many human diseases. Consequently, there have been several recent attempts to set up computational procedures aimed at identifying novel NATs. Our group has developed the AntiHunter program for the identification of expressed sequence tag (EST) antisense transcripts from BLAST output. In order to perform an analysis, the program requires a genomic sequence plus an associated list of transcript names and coordinates of the genomic region. After masking the repeated regions, the program carries out a BLASTN search of this sequence in the selected EST database, reporting via email the EST entries that reveal an antisense transcript according to the user-supplied list. Here, we present the newly developed version 2.0 of the AntiHunter tool. Several improvements have been added to this version of the program in order to increase its ability to detect a larger number of antisense ESTs. As a result, AntiHunter can now detect, on average, >45% more antisense ESTs with little or no increase in the percentage of the false positives. We also raised the maximum query size to 3 Mb (previously 1 Mb). Moreover, we found that a reasonable trade-off between the program search sensitivity and the maximum allowed size of the input-query sequence could be obtained by querying the database with the MEGABLAST program, rather than by using the BLAST one. We now offer this new opportunity to users, i.e. if choosing the MEGABLAST option, users can input a query sequence up to 30 Mb long, thus considerably improving the possibility to analyze longer query regions. The AntiHunter tool is freely available at http://bioinfo.crs4.it/AH2.0.
Profiling mRNAs of Two Cuscuta Species Reveals Possible Candidate Transcripts Shared by Parasitic Plants

PubMed Central

Wijeratne, Saranga; Fraga, Martina; Meulia, Tea; Doohan, Doug; Li, Zhaohu; Qu, Feng

2013-01-01

Dodders are among the most important parasitic plants that cause serious yield losses in crop plants. In this report, we sought to unveil the genetic basis of dodder parasitism by profiling the transcriptomes of Cuscuta pentagona and C. suaveolens, two of the most common dodder species, using a next-generation RNA sequencing platform. De novo assembly of the sequence reads resulted in more than 46,000 isotigs and contigs (collectively referred to as expressed sequence tags or ESTs) for each species, with more than half of them predicted to encode proteins that share significant sequence similarities with known proteins of non-parasitic plants. Comparing our datasets with transcriptomes of 12 other fully sequenced plant species confirmed a close evolutionary relationship between dodder and tomato. Using a rigorous set of filtering parameters, we were able to identify seven pairs of ESTs that appear to be shared exclusively by parasitic plants, thus providing targets for tailored management approaches. In addition, we also discovered ESTs with sequence similarities to known plant viruses, including cryptic viruses, in the dodder sequence assemblies. Together, this study represents the first comprehensive transcriptome profiling of parasitic plants in the Cuscuta genus, and is expected to contribute to our understanding of the molecular mechanisms of parasitic plant-host plant interactions. PMID:24312295
Profiling mRNAs of two Cuscuta species reveals possible candidate transcripts shared by parasitic plants.

PubMed

Jiang, Linjian; Wijeratne, Asela J; Wijeratne, Saranga; Fraga, Martina; Meulia, Tea; Doohan, Doug; Li, Zhaohu; Qu, Feng

2013-01-01

Dodders are among the most important parasitic plants that cause serious yield losses in crop plants. In this report, we sought to unveil the genetic basis of dodder parasitism by profiling the transcriptomes of Cuscuta pentagona and C. suaveolens, two of the most common dodder species, using a next-generation RNA sequencing platform. De novo assembly of the sequence reads resulted in more than 46,000 isotigs and contigs (collectively referred to as expressed sequence tags or ESTs) for each species, with more than half of them predicted to encode proteins that share significant sequence similarities with known proteins of non-parasitic plants. Comparing our datasets with transcriptomes of 12 other fully sequenced plant species confirmed a close evolutionary relationship between dodder and tomato. Using a rigorous set of filtering parameters, we were able to identify seven pairs of ESTs that appear to be shared exclusively by parasitic plants, thus providing targets for tailored management approaches. In addition, we also discovered ESTs with sequence similarities to known plant viruses, including cryptic viruses, in the dodder sequence assemblies. Together, this study represents the first comprehensive transcriptome profiling of parasitic plants in the Cuscuta genus, and is expected to contribute to our understanding of the molecular mechanisms of parasitic plant-host plant interactions.
Exploring Redox States, Doping and Ordering of Electroactive Star-Shaped Oligo(aniline)s.

PubMed

Mills, Benjamin M; Fey, Natalie; Marszalek, Tomasz; Pisula, Wojciech; Rannou, Patrice; Faul, Charl F J

2016-11-14

We have prepared a simple star-shaped oligo(aniline) (TDPB) and characterised it in detail by MALDI-TOF MS, UV/Vis/NIR spectroscopy, time-dependent DFT, cyclic voltammetry and EPR spectroscopy. TDPB is part of an underdeveloped class of π-conjugated molecules with great potential for organic electronics, display and sensor applications. It is redox active and reacts with acids to form radical cations. Acid-doped TDPB shows behaviour similar to discotic liquid crystals, with X-ray scattering investigations revealing columnar self-assembled arrays. The combination of unpaired electrons and supramolecular stacking suggests that star-shaped oligo(aniline)s like TDPB have the potential to form conducting nanowires and organic magnetic materials. © 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.
Oligo-branched peptides for tumor targeting: from magic bullets to magic forks.

PubMed

Falciani, Chiara; Pini, Alessandro; Bracci, Luisa

2009-02-01

Selective targeting of tumor cells is the final goal of research and drug discovery for cancer diagnosis, imaging and therapy. After the invention of hybridoma technology, the concept of magic bullet was introduced into the field of oncology, referring to selective killing of tumor cells, by specific antibodies. More recently, small molecules and peptides have also been proposed as selective targeting agents. We analyze the state of the art of tumor-selective agents that are presently available and tested in clinical settings. A novel approach based on 'armed' oligo-branched peptides as tumor targeting agents, is discussed and compared with existing tumor-selective therapies mediated by antibodies, small molecules or monomeric peptides. Oligo-branched peptides could be novel drugs that combine the advantages of antibodies and small molecules.
Generation and Analysis of the Expressed Sequence Tags from the Mycelium of Ganoderma lucidum

PubMed Central

Huang, Yen-Hua; Wu, Hung-Yi; Wu, Keh-Ming; Liu, Tze-Tze; Liou, Ruey-Fen; Tsai, Shih-Feng; Shiao, Ming-Shi; Ho, Low-Tone; Tzean, Shean-Shong; Yang, Ueng-Cheng

2013-01-01

Ganoderma lucidum (G. lucidum) is a medicinal mushroom renowned in East Asia for its potential biological effects. To enable a systematic exploration of the genes associated with the various phenotypes of the fungus, the genome consortium of G. lucidum has carried out an expressed sequence tag (EST) sequencing project. Using a Sanger sequencing based approach, 47,285 ESTs were obtained from in vitro cultures of G. lucidum mycelium of various durations. These ESTs were further clustered and merged into 7,774 non-redundant expressed loci. The features of these expressed contigs were explored in terms of over-representation, alternative splicing, and natural antisense transcripts. Our results provide an invaluable information resource for exploring the G. lucidum transcriptome and its regulation. Many cases of the genes over-represented in fast-growing dikaryotic mycelium are closely related to growth, such as cell wall and bioactive compound synthesis. In addition, the EST-genome alignments containing putative cassette exons and retained introns were manually curated and then used to make inferences about the predominating splice-site recognition mechanism of G. lucidum. Moreover, a number of putative antisense transcripts have been pinpointed, from which we noticed that two cases are likely to reveal hitherto undiscovered biological pathways. To allow users to access the data and the initial analysis of the results of this project, a dedicated web site has been created at http://csb2.ym.edu.tw/est/. PMID:23658685
Expressed sequence tag based identification and expression analysis of some cold inducible elements in seabuckthorn (Hippophae rhamnoides L.).

PubMed

Ghangal, Rajesh; Raghuvanshi, Saurabh; Sharma, Prakash C

2012-02-01

A cDNA library was constructed from the mature leaves of seabuckthorn (Hippophae rhamnoides). Expressed Sequence Tags (ESTs) were generated by single pass sequencing of 4500 cDNA clones. We submitted 3412 ESTs to dbEST of NCBI. Clustering of these ESTs yielded 1665 unigenes comprising 345 contigs and 1320 singletons. Out of 1665 unigenes, 1278 unigenes were annotated by similarity search while the remaining 387 unannotated unigenes were considered as organism specific. Gene Ontology (GO) analysis of the unigene dataset showed 691 unigenes related to biological processes, 727 to molecular functions and 588 to the cellular component category. On the basis of similarity search and GO annotation, 43 unigenes were found to be responsive to biotic and abiotic stresses. To validate this observation, 13 genes that are known to be associated with cold stress tolerance from previous studies in Arabidopsis and 3 novel transcripts were examined by real-time RT-PCR to understand the change in expression pattern under cold/freeze stress. An in silico study of the occurrence of microsatellites in these ESTs revealed the presence of 62 Simple Sequence Repeats (SSRs), some of which are being explored to assess genetic diversity among seabuckthorn collections. This is the first report of generation of transcriptome data providing information about genes involved in managing plant abiotic stress in seabuckthorn, a plant known for its enormous medicinal and ecological value. Copyright © 2011 Elsevier Masson SAS. All rights reserved.
Gene Discovery in the Apicomplexa as Revealed by EST Sequencing and Assembly of a Comparative Gene Database

PubMed Central

Li, Li; Brunk, Brian P.; Kissinger, Jessica C.; Pape, Deana; Tang, Keliang; Cole, Robert H.; Martin, John; Wylie, Todd; Dante, Mike; Fogarty, Steven J.; Howe, Daniel K.; Liberator, Paul; Diaz, Carmen; Anderson, Jennifer; White, Michael; Jerome, Maria E.; Johnson, Emily A.; Radke, Jay A.; Stoeckert, Christian J.; Waterston, Robert H.; Clifton, Sandra W.; Roos, David S.; Sibley, L. David

2003-01-01

Large-scale EST sequencing projects for several important parasites within the phylum Apicomplexa were undertaken for the purpose of gene discovery. Included were several parasites of medical importance (Plasmodium falciparum, Toxoplasma gondii) and others of veterinary importance (Eimeria tenella, Sarcocystis neurona, and Neospora caninum). A total of 55,192 ESTs, deposited into dbEST/GenBank, were included in the analyses. The resulting sequences have been clustered into nonredundant gene assemblies and deposited into a relational database that supports a variety of sequence and text searches. This database has been used to compare the gene assemblies using BLAST similarity comparisons to the public protein databases to identify putative genes. Of these new entries, ∼15%–20% represent putative homologs with a conservative cutoff of p < 10^-9, thus identifying many conserved genes that are likely to share common functions with other well-studied organisms. Gene assemblies were also used to identify strain polymorphisms, examine stage-specific expression, and identify gene families. An interesting class of genes that are confined to members of this phylum and not shared by plants, animals, or fungi, was identified. These genes likely mediate the novel biological features of members of the Apicomplexa and hence offer great potential for biological investigation and as possible therapeutic targets. [The sequence data from this study have been submitted to the dbEST division of GenBank.] PMID:12618375
ExprAlign - the identification of ESTs in non-model species by alignment of cDNA microarray expression profiles

PubMed Central

2009-01-01

Background Sequence identification of ESTs from non-model species offers distinct challenges, particularly when these species have duplicated genomes and when they are phylogenetically distant from sequenced model organisms. For the common carp, an environmental model of aquacultural interest, large numbers of ESTs remained unidentified using BLAST sequence alignment. We have used the expression profiles from large-scale microarray experiments to suggest gene identities. Results Expression profiles from ~700 cDNA microarrays describing responses of 7 major tissues to multiple environmental stressors were used to define a co-expression landscape. This was based on the Pearson correlation coefficient relating each gene with all other genes, from which a network description provided clusters of highly correlated genes as 'mountains'. We show that these contain genes with known identities and genes with unknown identities, and that the correlation constitutes evidence of identity in the latter. This procedure has suggested identities for 522 of 2701 unknown carp EST sequences. We also discriminate several common carp genes and gene isoforms that were not discriminated by BLAST sequence alignment alone. Precision in identification was substantially improved by use of data from multiple tissues and treatments. Conclusion The detailed analysis of co-expression landscapes is a sensitive technique for suggesting an identity for the large number of BLAST unidentified cDNAs generated in EST projects. It is capable of detecting even subtle changes in expression profiles, and thereby of distinguishing genes with a common BLAST identity into different identities. It benefits from the use of multiple treatments or contrasts, and from the large-scale microarray data. PMID:19939286
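The correlation step in this record can be sketched as follows. This is a simplified nearest-neighbour illustration, not the ExprAlign network or 'mountain' clustering procedure: the function names, the `min_r` threshold, and the toy expression profiles are hypothetical, and profiles are assumed to have non-zero variance.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)  # assumes neither profile is constant


def suggest_identity(unknown_profile, known_profiles, min_r=0.9):
    """Return (gene_name, r) for the best-correlated known gene, or None below min_r."""
    scored = ((name, pearson(unknown_profile, prof))
              for name, prof in known_profiles.items())
    best = max(scored, key=lambda item: item[1])
    return best if best[1] >= min_r else None


if __name__ == "__main__":
    known = {"gene_up": [1.0, 2.0, 3.0, 4.0], "gene_down": [4.0, 3.0, 2.0, 1.0]}
    print(suggest_identity([2.0, 4.0, 6.0, 8.5], known))
```

A high correlation here is only a suggestion of identity; as the abstract notes, precision depends on having profiles from many tissues and treatments.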
The barley EST DNA Replication and Repair Database (bEST-DRRD) as a tool for the identification of the genes involved in DNA replication and repair.

PubMed

Gruszka, Damian; Marzec, Marek; Szarejko, Iwona

2012-06-14

The high level of conservation of genes that regulate DNA replication and repair indicates that they may serve as a source of information on the origin and evolution of the species and makes them a reliable system for the identification of cross-species homologs. Studies that had been conducted to date shed light on the processes of DNA replication and repair in bacteria, yeast and mammals. However, there is still much to be learned about the process of DNA damage repair in plants. These studies, which were conducted mainly using bioinformatics tools, enabled the list of genes that participate in various pathways of DNA repair in Arabidopsis thaliana (L.) Heynh to be outlined; however, information regarding these mechanisms in crop plants is still very limited. A similar, functional approach is particularly difficult for a species whose complete genomic sequences are still unavailable. One of the solutions is to apply ESTs (Expressed Sequence Tags) as the basis for gene identification. For the construction of the barley EST DNA Replication and Repair Database (bEST-DRRD), presented here, the Arabidopsis nucleotide and protein sequences involved in DNA replication and repair were used to browse for and retrieve the deposited sequences, derived from four barley (Hordeum vulgare L.) sequence databases, including the "Barley Genome version 0.05" database (encompassing ca. 90% of barley coding sequences) and from two databases covering the complete genomes of two monocot models: Oryza sativa L. and Brachypodium distachyon L. in order to identify homologous genes. Sequences of the categorised Arabidopsis queries are used for browsing the repositories, which are located on the ViroBLAST platform. The bEST-DRRD is currently used in our project during the identification and validation of the barley genes involved in DNA repair. 
The presented database provides information about the Arabidopsis genes involved in DNA replication and repair, their expression patterns and models of protein interactions. It was designed and established to provide an open-access tool for the identification of monocot homologs of known Arabidopsis genes that are responsible for DNA-related processes. The barley genes identified in the project are currently being analysed to validate their function.
A Bayesian nonparametric method for prediction in EST analysis

PubMed Central

Lijoi, Antonio; Mena, Ramsés H; Prünster, Igor

2007-01-01

Background Expressed sequence tags (ESTs) analyses are a fundamental tool for gene identification in organisms. Given a preliminary EST sample from a certain library, several statistical prediction problems arise. In particular, it is of interest to estimate how many new genes can be detected in a future EST sample of given size and also to determine the gene discovery rate: these estimates represent the basis for deciding whether to proceed sequencing the library and, in case of a positive decision, a guideline for selecting the size of the new sample. Such information is also useful for establishing sequencing efficiency in experimental design and for measuring the degree of redundancy of an EST library. Results In this work we propose a Bayesian nonparametric approach for tackling statistical problems related to EST surveys. In particular, we provide estimates for: a) the coverage, defined as the proportion of unique genes in the library represented in the given sample of reads; b) the number of new unique genes to be observed in a future sample; c) the discovery rate of new genes as a function of the future sample size. The Bayesian nonparametric model we adopt conveys, in a statistically rigorous way, the available information into prediction. Our proposal has appealing properties over frequentist nonparametric methods, which become unstable when prediction is required for large future samples. EST libraries, previously studied with frequentist methods, are analyzed in detail. Conclusion The Bayesian nonparametric approach we undertake yields valuable tools for gene capture and prediction in EST libraries. The estimators we obtain do not feature the kind of drawbacks associated with frequentist estimators and are reliable for any size of the additional sample. PMID:17868445
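For orientation, the quantities named in this record can be illustrated with the classical Good-Turing estimator. This is one of the frequentist approaches the abstract contrasts itself with, not the Bayesian nonparametric method the authors propose; the function name and the toy read labels are hypothetical. It estimates sample coverage as 1 - n1/n, where n is the number of reads and n1 the number of genes seen exactly once, and takes n1/n as the estimated probability that the next read reveals a new gene.

```python
from collections import Counter


def good_turing(read_gene_labels):
    """Return (coverage, discovery_probability) from a list of per-read gene labels.

    coverage  = 1 - n1/n : estimated chance the next read hits an already-seen gene
    discovery = n1/n     : estimated chance the next read reveals a new gene
    """
    counts = Counter(read_gene_labels)
    n = sum(counts.values())                          # total reads
    n1 = sum(1 for c in counts.values() if c == 1)    # genes observed exactly once
    discovery = n1 / n
    return 1.0 - discovery, discovery


if __name__ == "__main__":
    reads = ["a", "a", "b", "c", "c", "c", "d", "e", "e", "f"]
    print(good_turing(reads))
```

This estimate only describes the very next read; predicting discovery over a large future sample is the harder problem that the record addresses.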
De Novo Assembly of Auricularia polytricha Transcriptome Using Illumina Sequencing for Gene Discovery and SSR Marker Identification

PubMed Central

Zhou, Yan; Chen, Lianfu; Fan, Xiuzhi; Bian, Yinbing

2014-01-01

Auricularia polytricha (Mont.) Sacc., a type of edible black-brown mushroom with a gelatinous and modality-specific fruiting body, is in high demand in Asia due to its nutritional and medicinal properties. Illumina Solexa sequencing technology was used to generate very large transcript sequences from the mycelium and the mature fruiting body of A. polytricha for gene discovery and molecular marker development. De novo assembly generated 36,483 ESTs with an N50 length of 636 bp. A total of 28,108 ESTs demonstrated significant hits with known proteins in the nr database, and 94.03% of the annotated ESTs showed the greatest similarity to A. delicata, a related species of A. polytricha. Functional categorization of the Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways revealed the conservation of genes involved in various biological processes in A. polytricha. Gene expression profile analysis indicated that a total of 2,057 ESTs were differentially expressed, including 1,020 ESTs that were up-regulated in the mycelium and 1,037 up-regulated in the fruiting body. Functional enrichment showed that the ESTs associated with biosynthesis, metabolism and assembly of proteins were more active in fruiting body development. The expression patterns of homologous transcription factors indicated that the molecular mechanisms of fruiting body formation and development were not exactly the same as for other agarics. Interestingly, an EST encoding tyrosinase was significantly up-regulated in the fruiting body, indicating that melanins accumulated during the formation of the black-brown color of the fruiting body in A. polytricha development. In addition, a total of 1,715 potential SSRs were detected in this transcriptome. The transcriptome analysis of A. polytricha provides valuable sequence resources and numerous molecular markers to facilitate further functional genomics studies and genetic research on this fungus. PMID:24626227
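The N50 statistic quoted in this record can be computed as in the sketch below, using the common definition: the length L such that sequences of length L or longer contain at least half of all assembled bases. The function name and the toy lengths are illustrative only.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or EST lengths (in bp)."""
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half of all bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("no contig lengths supplied")


if __name__ == "__main__":
    print(n50([2, 3, 4, 5, 6, 7, 8, 9, 10]))
```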
Investigation of SnSPR1, a novel and abundant surface protein of Sarcocystis neurona merozoites.

PubMed

Zhang, Deqing; Howe, Daniel K

2008-04-15

An expressed sequence tag (EST) sequencing project has produced over 15,000 partial cDNA sequences from the equine pathogen Sarcocystis neurona. While many of the sequences are clear homologues of previously characterized genes, a significant number of the S. neurona ESTs do not exhibit similarity to anything in the extensive sequence databases that have been generated. In an effort to characterize parasite proteins that are novel to S. neurona, a seemingly unique gene was selected for further investigation based on its abundant representation in the collection of ESTs and the predicted presence of a signal peptide and glycolipid anchor addition on the encoded protein. The gene was expressed in E. coli, and monospecific polyclonal antiserum against the recombinant protein was produced by immunization of a rabbit. Characterization of the native protein in S. neurona merozoites and schizonts revealed that it is a low molecular weight surface protein that is expressed throughout intracellular development of the parasite. The protein was designated Surface Protein 1 (SPR1) to reflect its display on the outer surface of merozoites and to distinguish it from the ubiquitous SAG/SRS surface antigens of the heteroxenous Coccidia. Interestingly, infection assays in the presence of the polyclonal antiserum suggested that SnSPR1 plays some role in attachment and/or invasion of host cells by S. neurona merozoites. The work described herein represents a general template for selecting and characterizing the various unidentified gene sequences that are plentiful in the EST databases for S. neurona and other apicomplexans. Furthermore, this study illustrates the value of investigating these novel sequences since it can offer new candidates for diagnostic or vaccine development while also providing greater insight into the biology of these parasites.
Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis

PubMed Central

Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting

2013-01-01

Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus). Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region. PMID:24024187
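The EST-based SSR search described in this record can be approximated with a regular-expression scan. This is a minimal sketch, not the authors' pipeline: the minimum repeat counts in `MIN_REPEATS` (10 for mono-, 6 for di-, 5 for trinucleotide motifs) are assumed thresholds, and the function name and test sequence are hypothetical.

```python
import re

# Assumed minimum number of tandem copies per motif length (illustrative thresholds).
MIN_REPEATS = {1: 10, 2: 6, 3: 5}


def find_ssrs(seq):
    """Return sorted (start, motif, copies) tuples for perfect SSRs in a DNA sequence."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        # A k-base motif followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if k > 1 and len(set(motif)) == 1:
                continue  # e.g. "AA" is a mononucleotide run, not a dinucleotide SSR
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


if __name__ == "__main__":
    est = "TTGC" + "AG" * 7 + "CCTA" + "GCT" * 5 + "A"
    print(find_ssrs(est))
```

Dividing the total length of the scanned sequences by the number of loci found gives a density figure of the kind reported in the abstract (one SSR per so many kb).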
Rapid transcriptome characterization and parsing of sequences in a non-model host-pathogen interaction; pea-Sclerotinia sclerotiorum

PubMed Central

2012-01-01

Background White mold, caused by Sclerotinia sclerotiorum, is one of the most important diseases of pea (Pisum sativum L.), however, little is known about the genetics and biochemistry of this interaction. Identification of genes underlying resistance in the host or pathogenicity and virulence factors in the pathogen will increase our knowledge of the pea-S. sclerotiorum interaction and facilitate the introgression of new resistance genes into commercial pea varieties. Although the S. sclerotiorum genome sequence is available, no pea genome is available, due in part to its large genome size (~3500 Mb) and extensive repeated motifs. Here we present an EST data set specific to the interaction between S. sclerotiorum and pea, and a method to distinguish pathogen and host sequences without a species-specific reference genome. Results 10,158 contigs were obtained by de novo assembly of 128,720 high-quality reads generated by 454 pyrosequencing of the pea-S. sclerotiorum interactome. A method based on the tBLASTx program was modified to distinguish pea and S. sclerotiorum ESTs. To test this strategy, a mixture of known ESTs (18,490 pea and 17,198 S. sclerotiorum ESTs) from public databases were pooled and parsed; the tBLASTx method successfully separated 90.1% of the artificial EST mix with 99.9% accuracy. The tBLASTx method successfully parsed 89.4% of the 454-derived EST contigs, as validated by PCR, into pea (6,299 contigs) and S. sclerotiorum (2,780 contigs) categories. Two thousand eight hundred and forty pea ESTs and 996 S. sclerotiorum ESTs were predicted to be expressed specifically during the pea-S. sclerotiorum interaction as determined by homology search against 81,449 pea ESTs (from flowers, leaves, cotyledons, epi- and hypocotyl, and etiolated and light treated etiolated seedlings) and 57,751 S. sclerotiorum ESTs (from mycelia at neutral pH, developing apothecia and developing sclerotia). 
Among those ESTs specifically expressed, 277 (9.8%) pea ESTs were predicted to be involved in plant defense and response to biotic or abiotic stress, and 93 (9.3%) S. sclerotiorum ESTs were predicted to be involved in pathogenicity/virulence. Additionally, 142 S. sclerotiorum ESTs were identified as secretory/signal peptides of which only 21 were previously reported. Conclusions We present and characterize an EST resource specific to the pea-S. sclerotiorum interaction. Additionally, the tBLASTx method used to parse S. sclerotiorum and pea ESTs was demonstrated to be a reliable and accurate method to distinguish ESTs without a reference genome. PMID:23181755
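The parsing idea in this record, assigning each contig to host or pathogen by comparing its similarity to each organism's sequences, can be sketched on precomputed scores. This is not the authors' modified tBLASTx method, whose criteria are not given in the abstract: the function name, the use of best-hit scores, and the `min_score` and `min_ratio` thresholds are all hypothetical, and no BLAST program is invoked.

```python
def assign_origin(host_score, pathogen_score, min_score=50.0, min_ratio=1.5):
    """Classify one contig from its best hit scores against host and pathogen databases.

    Returns "host", "pathogen", "ambiguous" (scores too similar) or
    "unassigned" (no sufficiently strong hit). Use 0 when there was no hit.
    """
    best = max(host_score, pathogen_score)
    other = min(host_score, pathogen_score)
    if best < min_score:
        return "unassigned"
    if other > 0 and best / other < min_ratio:
        return "ambiguous"
    return "host" if host_score > pathogen_score else "pathogen"


if __name__ == "__main__":
    # Hypothetical scores for four contigs: (host database, pathogen database)
    for scores in [(200.0, 60.0), (0.0, 120.0), (100.0, 90.0), (40.0, 30.0)]:
        print(scores, assign_origin(*scores))
```

Leaving ambiguous and unassigned contigs unclassified is consistent with the abstract's report that only part of the EST set (about 90%) could be parsed.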

Analysis of expressed sequence tags for Frankliniella occidentalis, the western flower thrips.

PubMed

Rotenberg, D; Whitfield, A E

2010-08-01

Thrips are members of the insect order Thysanoptera and Frankliniella occidentalis (the western flower thrips) is the most economically important pest within this order. F. occidentalis is both a direct pest of crops and an efficient vector of plant viruses, including Tomato spotted wilt virus (TSWV). Despite the world-wide importance of thrips in agriculture, there is little knowledge of the F. occidentalis genome or gene functions at this time. A normalized cDNA library was constructed from first instar thrips and 13 839 expressed sequence tags (ESTs) were obtained. Our EST data assembled into 894 contigs and 11 806 singletons (12 700 nonredundant sequences). We found that 31% of these sequences had significant similarity (E ≤ 10^-10) to protein sequences in the National Center for Biotechnology Information nonredundant (nr) protein database, and 25% were functionally annotated using Blast2GO. We identified 74 sequences with putative homology to proteins associated with insect innate immunity. Sixteen sequences had significant similarity to proteins associated with small RNA-mediated gene silencing pathways (RNA interference; RNAi), including the antiviral pathway (short interfering RNA-mediated pathway). Our EST collection provides new sequence resources for characterizing gene functions in F. occidentalis and other thrips species with regard to vital biological processes, studying the mechanism of interactions with the viruses harboured and transmitted by the vector, and identifying new insect gene-centred targets for plant disease and insect control.
High polymorphism in Est-SSR loci for cellulose synthase and β-amylase of sugarcane varieties (Saccharum spp.) used by the industrial sector for ethanol production.

PubMed

Augusto, Raphael; Maranho, Rone Charles; Mangolin, Claudete Aparecida; Pires da Silva Machado, Maria de Fátima

2015-01-01

High and low polymorphisms in simple sequence repeats of expressed sequence tag (EST-SSR) for specific proteins and enzymes, such as β-amylase, cellulose synthase, xyloglucan endotransglucosylase, fructose 1,6-bisphosphate aldolase, and fructose 1,6-bisphosphatase, were used to illustrate the genetic divergence within and between varieties of sugarcane (Saccharum spp.) and to guide the technological paths to optimize ethanol production from lignocellulose biomass. The varieties RB72454, RB867515, RB92579, and SP813250 on the second stage of cutting, all grown in the state of Paraná (PR), and the varieties RB92579 and SP813250 cultured in the PR state and in Northeastern Brazil, state of Pernambuco (PE), were analyzed using five EST-SSR primers for EstC66, EstC67, EstC68, EstC69, and EstC91 loci. Genetic divergence was evident in the EstC67 and EstC69 loci for β-amylase and cellulose synthase, respectively, among the four sugarcane varieties. An extremely high level of genetic differentiation was also detected in the EstC67 locus from the RB82579 and SP813250 varieties cultured in the PR and PE states. High polymorphism in SSR of the cellulose synthase locus may explain the high variability of substrates used in pretreatment and enzymatic hydrolysis processes, which has been an obstacle to effective industrial adaptations.
Generation and Analysis of a Large-Scale Expressed Sequence Tag Database from a Full-Length Enriched cDNA Library of Developing Leaves of Gossypium hirsutum L

PubMed Central

Pang, Chaoyou; Fan, Shuli; Song, Meizhen; Yu, Shuxun

2013-01-01

Background Cotton (Gossypium hirsutum L.) is one of the world’s most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. Methodology/Principal Findings In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representing 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes are expressed preferentially in senescent leaves. Conclusions/Significance These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton.
These data will also facilitate future whole-genome sequence assembly and annotation in G. hirsutum and comparative genomics among Gossypium species. PMID:24146870
Symposium on Dissertations on Chemical Oceanography, March 5-9, 1984. Abstracts.

DTIC Science & Technology

1984-03-09

polysaccharides; to determine their chemical structures by the application of various chemical and physical methods; and, finally, to clarify the distri...conducted to determine linkage types of monosaccharide constituents of oligo- and polysaccharides from seawater samples. The following results were...coastal water. Mono-, oligo- and polysaccharides accounted for 7-9%, lb-26 , and ;1- 43% of the dissolved carbohydrates, respectively. The polysaccharide
A low fermentable oligo-di-mono-saccharides and polyols (FODMAP) diet is a balanced therapy for fibromyalgia with nutritional and symptomatic benefits

PubMed

Marum, Ana Paula; Moreira, Cátia; Tomas-Carus, Pablo; Saraiva, Fernando; Guerreiro, Catarina Sousa

2017-06-05

Fibromyalgia is a chronic rheumatic disease producing widespread pain, associated with a major comorbidity, irritable bowel syndrome. A low-FODMAP diet (low fermentable oligo-di-mono-saccharides and polyols diet) has been effective in controlling irritable bowel syndrome symptoms. Overweight is an aggravating factor for fibromyalgia. We studied the effects of low fermentable oligo-di-mono-saccharides and polyols diets on fibromyalgia symptoms and weight status. A longitudinal study was performed on 38 fibromyalgia patients using a four-week, repeated assessment as follows: M1 = first assessments/presentation of individual low fermentable oligo-di-mono-saccharides and polyols diet; M2 = second assessments/reintroduction of FODMAPs; M3 = final assessments/nutritional counselling. The assessment instruments applied were: Fibromyalgia Survey Questionnaire (FSQ); Severity Score System (IBS-SSS); visual analogue scale (VAS). Body mass-index/composition and waist circumference (WC) were also measured. Daily macro-micronutrients and FODMAP intake were quantified at each moment of the study. The studied cohort was 37% overweight, 34% obese (average body mass-index 27.4 ± 4.6; excess fat mass 39.4 ± 7%). Weight, body mass-index and waist circumference decreased significantly (p < 0.01) with low fermentable oligo-di-mono-saccharides and polyols diet, but no significant effect on body composition was observed. All fibromyalgia symptoms, including somatic pain, declined significantly post-LFD (p < 0.01), as did the severity of fibromyalgia [Fibromyalgia survey questionnaire: M1 = 21.8; M2 = 16.9; M3 = 17.0 (p < 0.01)]. The intake of essential nutrients (fiber, calcium, magnesium and vitamin D) showed no significant difference. The significant reduction in FODMAP intake (M1 = 24.4 g; M2 = 2.6 g; p < 0.01) reflected the "Diet adherence" (85%). "Satisfaction with improvement of symptoms" (76%) correlated with "diet adherence" (r = 0.65; p < 0.01).
Results are highly encouraging, showing low fermentable oligo-di-mono-saccharides and polyols diets as a nutritionally balanced approach, contributing to weight loss and reducing the severity of fibromyalgia symptoms.
Analysis of expressed sequence tags from a single wheat cultivar facilitates interpretation of tandem mass spectrometry data and discrimination of gamma gliadin proteins that may play different functional roles in flour

USDA-ARS's Scientific Manuscript database

The complement of gamma gliadin genes expressed in the wheat cultivar Butte 86 was evaluated by analyzing publicly available expressed sequence tag (EST) data. Eleven contigs were assembled from 153 Butte 86 ESTs. Nine of the contigs encoded full-length proteins and four of the proteins contained an...
Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak

PubMed Central

2010-01-01

Background The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the genus Quercus. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks are among the most important deciduous forest tree species in Europe. Despite their ecological and economic importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity. Results We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking led us to eliminate 19,941 sequences. Finally a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using the TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. Blast search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoid biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits.
Comparative orthologous sequences (COS) with other plant gene models were identified and help unravel the oak paleo-history. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html. Conclusions This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations. PMID:21092232
Ordered shotgun sequencing of a 135 kb Xq25 YAC containing ANT2 and four possible genes, including three confirmed by EST matches.

PubMed Central

Chen, C N; Su, Y; Baybayan, P; Siruno, A; Nagaraja, R; Mazzarella, R; Schlessinger, D; Chen, E

1996-01-01

Ordered shotgun sequencing (OSS) has been successfully carried out with an Xq25 YAC substrate. yWXD703 DNA was subcloned into lambda phage and sequences of insert ends of the lambda subclones were used to generate a map to select a minimum tiling path of clones to be completely sequenced. The sequence of 135 038 nt contains the entire ANT2 cDNA as well as four other candidates suggested by computer-assisted analyses. One of the putative genes is homologous to a gene implicated in Graves' disease and it, ANT2 and two others are confirmed by EST matches. The results suggest that OSS can be applied to YACs in accord with earlier simulations and further indicate that the sequence of the YAC accurately reflects the sequence of uncloned human DNA. PMID:8918809
Cloning and characterization of a pyrethroid pesticide decomposing esterase gene, Est3385, from Rhodopseudomonas palustris PSB-S.

PubMed

Luo, Xiangwen; Zhang, Deyong; Zhou, Xuguo; Du, Jiao; Zhang, Songbai; Liu, Yong

2018-05-09

The full-length open reading frame of the pyrethroid detoxification gene Est3385 contains 963 nucleotides. This gene was identified and cloned based on the genome sequence of Rhodopseudomonas palustris PSB-S available in GenBank. The predicted amino acid sequence of Est3385 shared moderate identities (30-46%) with the known homologous esterases. Phylogenetic analysis revealed that Est3385 was a member of esterase family I. Recombinant Est3385 was heterologously expressed in E. coli, purified and characterized for its substrate specificity, kinetics and stability under various conditions. The optimal temperature and pH for Est3385 were 35 °C and 6.0, respectively. This enzyme could detoxify various pyrethroid pesticides and degrade the optimal substrate fenpropathrin with Km and Vmax values of 0.734 ± 0.013 mmol·L⁻¹ and 0.918 ± 0.025 U·µg⁻¹, respectively. No cofactor was found to affect Est3385 activity but substantial reduction of enzymatic activity was observed when metal ions were applied. Taken together, a new pyrethroid degradation esterase was identified and characterized. Modification of Est3385 with protein engineering toolsets should enhance its potential for field application to reduce pesticide residues in agroecosystems.
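The reported Km and Vmax can be read through the Michaelis-Menten rate law, v = Vmax·[S] / (Km + [S]). The sketch below is illustrative only: it assumes Est3385 follows simple Michaelis-Menten kinetics with fenpropathrin, and the function name and the substrate concentrations in the loop are hypothetical, not taken from the study.

```python
# Kinetic constants as reported in the abstract (mean values only).
KM_MMOL_L = 0.734   # Km for fenpropathrin, mmol/L
VMAX_U_UG = 0.918   # Vmax, U per microgram of enzyme


def michaelis_menten_rate(substrate_mmol_l, km=KM_MMOL_L, vmax=VMAX_U_UG):
    """Return the predicted initial rate (U/ug) at a substrate concentration (mmol/L).

    Assumes simple Michaelis-Menten kinetics: v = Vmax * [S] / (Km + [S]).
    """
    if substrate_mmol_l < 0:
        raise ValueError("substrate concentration must be non-negative")
    return vmax * substrate_mmol_l / (km + substrate_mmol_l)


if __name__ == "__main__":
    # Hypothetical concentrations chosen below, at, and well above Km.
    for s in (0.1, KM_MMOL_L, 5.0):
        print(f"[S] = {s:.3f} mmol/L -> v = {michaelis_menten_rate(s):.3f} U/ug")
```

By construction the rate at [S] = Km is half of Vmax, which is a quick check that the constants have been entered in consistent units.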
Development and Application of a Salmonid EST Database and cDNA Microarray: Data Mining and Interspecific Hybridization Characteristics

PubMed Central

Rise, Matthew L.; von Schalburg, Kristian R.; Brown, Gordon D.; Mawer, Melanie A.; Devlin, Robert H.; Kuipers, Nathanael; Busby, Maura; Beetz-Sargent, Marianne; Alberto, Roberto; Gibbs, A. Ross; Hunt, Peter; Shukin, Robert; Zeznik, Jeffrey A.; Nelson, Colleen; Jones, Simon R.M.; Smailus, Duane E.; Jones, Steven J.M.; Schein, Jacqueline E.; Marra, Marco A.; Butterfield, Yaron S.N.; Stott, Jeff M.; Ng, Siemon H.S.; Davidson, William S.; Koop, Ben F.

2004-01-01

We report 80,388 ESTs from 23 Atlantic salmon (Salmo salar) cDNA libraries (61,819 ESTs), 6 rainbow trout (Oncorhynchus mykiss) cDNA libraries (14,544 ESTs), 2 chinook salmon (Oncorhynchus tshawytscha) cDNA libraries (1317 ESTs), 2 sockeye salmon (Oncorhynchus nerka) cDNA libraries (1243 ESTs), and 2 lake whitefish (Coregonus clupeaformis) cDNA libraries (1465 ESTs). The majority of these are 3′ sequences, allowing discrimination between paralogs arising from a recent genome duplication in the salmonid lineage. Sequence assembly reveals 28,710 different S. salar, 8981 O. mykiss, 1085 O. tshawytscha, 520 O. nerka, and 1176 C. clupeaformis putative transcripts. We annotate the submitted portion of our EST database by molecular function. Higher- and lower-molecular-weight fractions of libraries are shown to contain distinct gene sets, and higher rates of gene discovery are associated with higher-molecular weight libraries. Pyloric caecum library group annotations indicate this organ may function in redox control and as a barrier against systemic uptake of xenobiotics. A microarray is described, containing 7356 salmonid elements representing 3557 different cDNAs. Analyses of cross-species hybridizations to this cDNA microarray indicate that this resource may be used for studies involving all salmonids. PMID:14962987
Normal Telomere Length Maintenance in Saccharomyces cerevisiae Requires Nuclear Import of the Ever Shorter Telomeres 1 (Est1) Protein via the Importin Alpha Pathway

PubMed Central

Hawkins, Charlene

2014-01-01

The Est1 (ever shorter telomeres 1) protein is an essential component of yeast telomerase, a ribonucleoprotein complex that restores the repetitive sequences at chromosome ends (telomeres) that would otherwise be lost during DNA replication. Previous work has shown that the telomerase RNA component (TLC1) transits through the cytoplasm during telomerase biogenesis, but mechanisms of protein import have not been addressed. Here we identify three nuclear localization sequences (NLSs) in Est1p. Mutation of the most N-terminal NLS in the context of full-length Est1p reduces Est1p nuclear localization and causes telomere shortening—phenotypes that are rescued by fusion with the NLS from the simian virus 40 (SV40) large-T antigen. In contrast to that of the TLC1 RNA, Est1p nuclear import is facilitated by Srp1p, the yeast homolog of importin α. The reduction in telomere length observed at the semipermissive temperature in a srp1 mutant strain is rescued by increased Est1p expression, consistent with a defect in Est1p nuclear import. These studies suggest that at least two nuclear import pathways are required to achieve normal telomere length homeostasis in yeast. PMID:24906415
mRNA-Seq and microarray development for the Grooved carpet shell clam, Ruditapes decussatus: a functional approach to unravel host-parasite interaction

PubMed Central

2013-01-01

Background The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After sequencing the R. decussatus transcriptome, we designed an oligo microarray to provide clues about the molecular response of the clam to Perkinsiosis. Results A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between the two conditions was performed by gene set enrichment analysis. As expected, microarrays unveil genes related to stress/infectious agents such as hydrolases, proteases and others. The extensive role of the innate immune system was also analyzed and the effect of parasitosis upon expression of important molecules such as lectins reviewed. Conclusions This study represents a first attempt to characterize the Ruditapes decussatus transcriptome, an important marine resource for European aquaculture. The transcriptome sequencing and consequent annotation will increase the available tools and resources for this species, introducing the possibility of high-throughput experiments such as microarray analysis. In this specific case the microarray approach was used to unveil some important aspects of host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus-infected versus non-infected gills.
An overview of the immune-related genes in the R. decussatus transcriptome is also reported. PMID:24168212
mRNA-Seq and microarray development for the Grooved Carpet shell clam, Ruditapes decussatus: a functional approach to unravel host-parasite interaction.

PubMed

Leite, Ricardo B; Milan, Massimo; Coppe, Alessandro; Bortoluzzi, Stefania; dos Anjos, António; Reinhardt, Richard; Saavedra, Carlos; Patarnello, Tomaso; Cancela, M Leonor; Bargelloni, Luca

2013-10-29

The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After sequencing the R. decussatus transcriptome, we designed an oligo microarray to provide clues about the molecular response of the clam to Perkinsiosis. A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between the two conditions was performed by gene set enrichment analysis. As expected, microarrays unveil genes related to stress/infectious agents such as hydrolases, proteases and others. The extensive role of the innate immune system was also analyzed and the effect of parasitosis upon expression of important molecules such as lectins reviewed. This study represents a first attempt to characterize the Ruditapes decussatus transcriptome, an important marine resource for European aquaculture. The transcriptome sequencing and consequent annotation will increase the available tools and resources for this species, introducing the possibility of high-throughput experiments such as microarray analysis. In this specific case the microarray approach was used to unveil some important aspects of host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus-infected versus non-infected gills.
An overview of the immune-related genes in the R. decussatus transcriptome is also reported.
Spectrum of metabolic dysfunction in relationship with hyperandrogenemia in obese adolescent girls with polycystic ovary syndrome.

PubMed

Alemzadeh, Ramin; Kichler, Jessica; Calhoun, Mariaelena

2010-06-01

Polycystic ovary syndrome (PCOS) in adult women is associated with increased risk of metabolic syndrome (MS) and atherosclerosis. We evaluated the spectrum of metabolic dysfunction in relationship with hyperandrogenemia (HA) in adolescent girls with PCOS. Ovulatory function, acne, hirsutism (HS), body mass index (BMI), body composition, fasting lipids, glucose, insulin, free testosterone (FT), high-sensitivity C-reactive protein (hs-CRP), and HbA1c were evaluated in 103 girls. The homeostatic model assessment equations (HOMA-IR and HOMA-%B) were used for determination of insulin resistance and beta-cell function, respectively. The oligo-ovulation (Oligo)+HA+HS (n=44), Oligo+HA (n=28), and Oligo+HS (n=31) phenotypes had similar BMI. However, hyperandrogenemic phenotypes had higher prevalence of acanthosis nigricans (AN) and acne (P<0.01) and higher insulin, HOMA-IR, HOMA-%B, HbA1c, and hs-CRP levels than the Oligo+HS group (P<0.01). Serum FT was correlated with HOMA-IR (r=0.38, P<0.01), HOMA-%B (r=0.49, P<0.01), hs-CRP (r=0.42, P<0.01), AN (r=0.39, P<0.01), and HbA1c (r=0.27, P<0.01). Furthermore, 34% of girls met diagnostic criteria for MS, displaying higher BMI, FT, HOMA-%B, HOMA-IR, hs-CRP, and HbA1c than subjects without MS (P<0.01). Using combined HOMA-IR ≥ 4.0 and hs-CRP > 3.0 cut-off values, 71.4% of the MS group versus 23.5% of the non-MS group were considered at risk of diabetes and atherosclerosis (P<0.0001). Hyperandrogenemic PCOS phenotypes have the greatest degree of insulin resistance and inflammation. The use of insulin resistance and inflammatory markers may help identify adolescent girls with PCOS at risk of cardiometabolic syndrome.
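The HOMA indices and the combined cut-off described above can be sketched in a few lines. This is a hedged illustration, not the authors' code: it assumes the original HOMA1 approximations (glucose in mmol/L, insulin in µU/mL), which the abstract does not specify, and it assumes hs-CRP is in mg/L. All function names and the example values are hypothetical.

```python
def homa_ir(glucose_mmol_l, insulin_uU_ml):
    """HOMA1 insulin resistance index: glucose * insulin / 22.5."""
    return glucose_mmol_l * insulin_uU_ml / 22.5


def homa_b(glucose_mmol_l, insulin_uU_ml):
    """HOMA1 beta-cell function (%): 20 * insulin / (glucose - 3.5)."""
    if glucose_mmol_l <= 3.5:
        raise ValueError("HOMA-%B is undefined for fasting glucose <= 3.5 mmol/L")
    return 20.0 * insulin_uU_ml / (glucose_mmol_l - 3.5)


def at_cardiometabolic_risk(glucose_mmol_l, insulin_uU_ml, hs_crp_mg_l):
    """Apply the combined cut-offs quoted in the abstract:
    HOMA-IR >= 4.0 and hs-CRP > 3.0 (hs-CRP unit assumed to be mg/L)."""
    return homa_ir(glucose_mmol_l, insulin_uU_ml) >= 4.0 and hs_crp_mg_l > 3.0


if __name__ == "__main__":
    # Hypothetical fasting values, not patient data from the study.
    glucose, insulin, crp = 5.0, 18.0, 3.4
    print("HOMA-IR:", homa_ir(glucose, insulin))
    print("HOMA-%B:", homa_b(glucose, insulin))
    print("At risk:", at_cardiometabolic_risk(glucose, insulin, crp))
```

If glucose is recorded in mg/dL instead, the HOMA-IR denominator becomes 405 rather than 22.5; the sketch keeps to mmol/L to avoid mixing conventions.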
Highly Permeable Oligo(ethylene oxide)-co-poly(dimethylsiloxane) Membranes for Carbon Dioxide Separation

DOE PAGES

Hong, Tao; Lai, Sophia C.; Mahurin, Shannon Mark; ...

2017-12-27

Here, a series of cross-linked, freestanding oligo(ethylene oxide)-co-(polydimethylsiloxane-norbornene) membranes with varied composition is synthesized via in situ ring-opening metathesis polymerization. These membranes show remarkably high CO2 permeabilities (3400 Barrer) and their separation performance approaches the Robeson upper bound. The excellent permeability of these copolymer membranes provides great potential for real-world applications where enormous volumes of gases must be separated. The gas transport properties of these films are found to be directly proportional to oligo(ethylene oxide) content incorporation, which stems from the increased solubility selectivity change within the copolymer matrix. This work provides a systematic study of how gas separation performance in rubbery membranes can be enhanced by tuning the CO2-philicity of their constituent monomeric subunits.
Uneven distribution of expressed sequence tag loci on maize pachytene chromosomes

PubMed Central

Anderson, Lorinda K.; Lai, Ann; Stack, Stephen M.; Rizzon, Carene; Gaut, Brandon S.

2006-01-01

Examining the relationships among DNA sequence, meiotic recombination, and chromosome structure at a genome-wide scale has been difficult because only a few markers connect genetic linkage maps with physical maps. Here, we have positioned 1195 genetically mapped expressed sequence tag (EST) markers onto the 10 pachytene chromosomes of maize by using a newly developed resource, the RN-cM map. The RN-cM map charts the distribution of crossing over in the form of recombination nodules (RNs) along synaptonemal complexes (SCs, pachytene chromosomes) and allows genetic cM distances to be converted into physical micrometer distances on chromosomes. When this conversion is made, most of the EST markers used in the study are located distally on the chromosomes in euchromatin. ESTs are significantly clustered on chromosomes, even when only euchromatic chromosomal segments are considered. Gene density and recombination rate (as measured by EST and RN frequencies, respectively) are strongly correlated. However, crossover frequencies for telomeric intervals are much higher than was expected from their EST frequencies. For pachytene chromosomes, EST density is about fourfold higher in euchromatin compared with heterochromatin, while DNA density is 1.4 times higher in heterochromatin than in euchromatin. Based on DNA density values and the fraction of pachytene chromosome length that is euchromatic, we estimate that ∼1500 Mbp of the maize genome is in euchromatin. This overview of the organization of the maize genome will be useful in examining genome and chromosome evolution in plants. PMID:16339046
RoxB Is a Novel Type of Rubber Oxygenase That Combines Properties of Rubber Oxygenase RoxA and Latex Clearing Protein (Lcp).

PubMed

Birke, Jakob; Röther, Wolf; Jendrossek, Dieter

2017-07-15

Only two types of rubber oxygenases, rubber oxygenase A (RoxA) and latex clearing protein (Lcp), have been described so far. RoxA proteins (RoxAs) are c-type cytochromes of ≈70 kDa produced by Gram-negative rubber-degrading bacteria, and they cleave polyisoprene into 12-oxo-4,8-dimethyltrideca-4,8-diene-1-al (ODTD), a C15 oligo-isoprenoid, as the major end product. Lcps are common among Gram-positive rubber degraders and do not share amino acid sequence similarities with RoxAs. Furthermore, Lcps have much smaller molecular masses (≈40 kDa), are b-type cytochromes, and cleave polyisoprene to a mixture of C20, C25, C30, and higher oligo-isoprenoids as end products. In this article, we purified a new type of rubber oxygenase, RoxB Xsp (RoxB of Xanthomonas sp. strain 35Y). RoxB Xsp is distantly related to RoxAs and resembles RoxAs with respect to molecular mass (70.3 kDa for mature protein) and cofactor content (2 c-type hemes). However, RoxB Xsp differs from all currently known RoxAs in having a distinctive product spectrum of C20, C25, C30, and higher oligo-isoprenoids that has been observed only for Lcps so far. Purified RoxB Xsp revealed the highest specific activity of 4.5 U/mg (at 23°C) of all currently known rubber oxygenases and exerts a synergistic effect on the efficiency of polyisoprene cleavage by RoxA Xsp. RoxB homologs were identified in several other Gram-negative rubber-degrading species, pointing to a prominent function of RoxB for the biodegradation of rubber in Gram-negative bacteria. IMPORTANCE The enzymatic cleavage of rubber (polyisoprene) is of high environmental importance given that enormous amounts of rubber waste materials are permanently released (e.g., by abrasion of tires).
Research from the last decade has discovered rubber oxygenase A, RoxA, and latex clearing protein (Lcp) as being responsible for the primary enzymatic attack on the hydrophobic and water-insoluble biopolymer poly(cis-1,4-isoprene) in Gram-negative and Gram-positive rubber-degrading bacteria, respectively. Here, we provide evidence that a third type of rubber oxygenase is present in Gram-negative rubber-degrading species. Due to its characteristics, we suggest the designation RoxB for the new type of rubber oxygenase. Bioinformatic analysis of genome sequences indicates the presence of roxB homologs in other Gram-negative rubber degraders. Copyright © 2017 American Society for Microbiology.
Induced Accelerated Aging in Induced Pluripotent Stem Cell Lines from Patients with Parkinson’s Disease

DTIC Science & Technology

2014-07-01

after manipulation of the cells prohibited this approach. 2. Differentiation into oligoprecursor cells ( OPCs ) and oligodendrocytes As we have...Jing Bian, PhD and Birgitt Schuele, MD 18 Development of expandable OPCs from human iPSC derived...Neri et al., 2010) + + + + Human iPSC derived OPCs and Oligos (Wang et al., 2013) + + + + mEpsc derived OPCs and Oligos (Najm et al
Sequence-defined oligo(ortho-arylene) foldamers derived from the benzannulation of ortho(arylene ethynylene)s† (†Electronic supplementary information (ESI) available. CCDC 1483959–1483967. For ESI and crystallographic data in CIF or other electronic format see DOI: 10.1039/c6sc02520j)

PubMed Central

Lehnherr, Dan; Chen, Chen; Pedramrazi, Zahra; DeBlase, Catherine R.; Alzola, Joaquin M.; Keresztes, Ivan; Lobkovsky, Emil B.

2016-01-01

A Cu-catalyzed benzannulation reaction transforms ortho(arylene ethynylene) oligomers into ortho-arylenes. This approach circumvents iterative Suzuki cross-coupling reactions previously used to assemble hindered ortho-arylene backbones. These derivatives form helical folded structures in the solid-state and in solution, as demonstrated by X-ray crystallography and solution-state NMR analysis. DFT calculations of misfolded conformations are correlated with variable-temperature 1H and EXSY NMR to reveal that folding is cooperative and more favorable in halide-substituted naphthalenes. Helical ortho-arylene foldamers with specific aromatic sequences organize functional π-electron systems into arrangements ideal for ambipolar charge transport and show preliminary promise for the surface-mediated synthesis of structurally defined graphene nanoribbons. PMID:28567248
N6-Methylation Assessment in Escherichia coli 23S rRNA Utilizing a Bulge Loop in an RNA-DNA Hybrid.

PubMed

Yoshioka, Kyoko; Kurita, Ryoji

2018-06-07

We propose a sequence-selective assay of N6-methyl-adenosine (m6A) in RNA without PCR or reverse transcription, by employing a hybridization assay with a DNA probe designed to form a bulge loop at the position of a target modified nucleotide. The m6A in the bulge in the RNA-DNA hybrid was assumed to be sufficiently mobile to be selectively recognized by an anti-m6A antibody with a high affinity. By employing a surface-plasmon-resonance measurement or using a microtiter-plate immunoassay method, a specific m6A in the Escherichia coli 23S rRNA sequence could be detected at the nanomolar level when synthesized and purified oligo-RNA fragments were used for measurement. We have successfully achieved the first selective detection of m6A 2030 specifically in 23S rRNA from real samples of E. coli total RNA by using our immunochemical approach.

Development and characterization of microsatellite markers for the Pacific abalone (Haliotis discus) via EST database mining

NASA Astrophysics Data System (ADS)

Zhan, Aibin; Bao, Zhenmin; Wang, Mingling; Chang, Dan; Yuan, Jian; Wang, Xiaolong; Hu, Xiaoli; Liang, Chengzhu; Hu, Jingjie

2008-05-01

The EST database of the Pacific abalone (Haliotis discus) was mined for developing microsatellite markers. A total of 1476 EST sequences were registered in GenBank when data mining was performed. Fifty sequences (approximately 3.4%) were found to contain one or more microsatellites. Based on the length and GC content of the flanking regions, cluster analysis and BLASTN, 13 microsatellite-containing ESTs were selected for PCR primer design. The results showed that 10 out of 13 primer pairs could amplify scorable PCR products and showed polymorphism. The number of alleles ranged from 2 to 13 and the values of Ho and He varied from 0.1222 to 0.8611 and 0.2449 to 0.9311, respectively. No significant linkage disequilibrium (LD) between any pairs of these loci was found, and 6 of 10 loci conformed to the Hardy-Weinberg equilibrium (HWE). These EST-SSRs are therefore potential tools for studies of intraspecies variation and hybrid identification.
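The Ho and He statistics reported above can be illustrated with a small calculation. The sketch below is an assumption-laden example, not the authors' pipeline: it takes diploid genotypes at one locus as pairs of allele sizes and uses Nei's unbiased estimator for He, which the abstract does not confirm was the estimator used. The function names and the toy genotypes are hypothetical.

```python
from collections import Counter


def observed_heterozygosity(genotypes):
    """Ho: fraction of individuals whose two alleles differ."""
    return sum(1 for a, b in genotypes if a != b) / len(genotypes)


def expected_heterozygosity(genotypes):
    """He under Hardy-Weinberg expectations, with Nei's small-sample
    correction: (2n / (2n - 1)) * (1 - sum(p_i ** 2))."""
    alleles = [allele for pair in genotypes for allele in pair]
    total = len(alleles)  # 2n gene copies for n diploid individuals
    counts = Counter(alleles)
    sum_sq = sum((count / total) ** 2 for count in counts.values())
    return (total / (total - 1)) * (1.0 - sum_sq)


if __name__ == "__main__":
    # Toy genotypes (allele sizes in bp) for four individuals; not study data.
    locus = [(152, 156), (152, 152), (156, 160), (152, 156)]
    print("Ho =", round(observed_heterozygosity(locus), 4))
    print("He =", round(expected_heterozygosity(locus), 4))
```

A locus where Ho falls well below He, as at the lower end of the ranges quoted above, is the usual pattern behind a departure from Hardy-Weinberg equilibrium, which may reflect null alleles in EST-derived markers.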
Molecular characterization of the amplified carboxylesterase gene associated with organophosphorus insecticide resistance in the brown planthopper, Nilaparvata lugens.

PubMed

Small, G J; Hemingway, J

2000-12-01

Widespread resistance to organophosphorus insecticides (OPs) in Nilaparvata lugens is associated with elevation of carboxylesterase activity. A cDNA encoding a carboxylesterase, Nl-EST1, has been isolated from an OP-resistant Sri Lankan strain of N. lugens. The full-length cDNA codes for a 547-amino acid protein with high homology to other esterases/lipases. Nl-EST1 has an N-terminal hydrophobic signal peptide sequence of 24 amino acids which suggests that the mature protein is secreted from cells expressing it. The nucleotide sequence of the homologue of Nl-EST1 in an OP-susceptible, low esterase Sri Lankan strain of N. lugens is identical to Nl-EST1. Southern analysis of genomic DNA from the Sri Lankan OP-resistant and susceptible strains suggests that Nl-EST1 is amplified in the resistant strain. Therefore, resistance to OPs in the Sri Lankan strain is through amplification of a gene identical to that found in the susceptible strain.
Epidemiology of infertility and polycystic ovarian disease: endocrinological and demographic studies.

PubMed

Hull, M G

1987-09-01

The frequency of polycystic ovarian disease (PCOD) as a cause of oligo-amenorrhea and infertility was determined, first by characterizing clinically occult PCOD using endocrinological methods, and secondly by estimating the frequency of overt and occult PCOD amongst infertile women residing in a particular area. Four groups of infertile women with oligo-amenorrhea due to 'functional' disorder were compared. The results show that by contrast with the groups having hyperprolactinemia or hypothalamic disorder the group with hirsutism (and therefore presumed PCOD) was closely resembled by a non-hirsute group in terms of estrogenization, LH level, LH/FSH ratio, prolactin level, body mass and responsiveness to clomiphene. The last group was therefore concluded to have a mild occult form of PCOD. The population studies revealed, first, that overt and occult PCOD accounted for 90% of patients with oligomenorrhea and 37% with amenorrhea, or 73% with oligo- or amenorrhea. Oligo- or amenorrhea accounted for 21% of couples with infertility and the annual incidence was 247 patients per million of the general population. The annual incidence of infertility due to PCOD per million was 41 with overt PCOD and 139 with occult PCOD (total 180). Of those, 140 appeared to respond well to clomiphene (78%) but 40 (22%) failed, requiring alternative therapy.
Preparation of a New Oligolamellar Stratum Corneum Lipid Model.

PubMed

Mueller, Josefin; Schroeter, Annett; Steitz, Roland; Trapp, Marcus; Neubert, Reinhard H H

2016-05-10

In this study, we present a preparation method for a new stratum corneum (SC) model system, which is closer to natural SC than the commonly used multilayer models. The complex setup of the native SC lipid matrix was mimicked by a ternary lipid mixture of ceramide [AP], cholesterol, and stearic acid. A spin coating procedure was applied to realize oligo-layered samples. The influence of lipid concentration, rotation speed, polyethylenimine, methanol content, cholesterol fraction, and annealing on the molecular arrangement of the new SC model was investigated by X-ray reflectivity measurements. The new oligo-SC model is closer to native SC in the total number of lipid membranes found between corneocytes. The reduction in thickness provides the opportunity to study the effects of drugs and/or hydrophilic penetration enhancers on the structure of SC in full detail by X-ray or neutron reflectivity. In addition, the oligo-lamellar system allows one to infer not only the lamellar spacing but also the total thickness of the oligo-SC model, and changes in that thickness can be monitored. This improvement is most helpful for the understanding of transdermal drug administration on the nanoscale. The results are compared to the commonly used multilamellar lipid model systems, and advantages and disadvantages of both models are discussed.
Biosynthesis and Degradation of Mono-, Oligo-, and Polysaccharides: Introduction

NASA Astrophysics Data System (ADS)

Wilson, Iain B. H.

Glycomolecules, whether they be mono-, oligo-, or polysaccharides or simple glycosides, are—as any biological molecules—the products of biosynthetic processes; on the other hand, at the end of their lifespan, they are also subject to degradation. The beginning point, biochemically, is the fixation of carbon by photosynthesis; subsequent metabolism in plants and other organisms results in the generation of the various monosaccharides. These must be activated—typically as nucleotide sugars or lipid-phosphosugars—before transfer by glycosyltransferases can take place in order to produce the wide variety of oligo- and polysaccharides seen in Nature; complicated remodelling processes may take place—depending on the pathway—which result in partial trimming of a precursor by glycosidases prior to the addition of further monosaccharide units. Upon completion of the 'life' of a glycoconjugate, glycosidases will degrade the macromolecule finally into monosaccharide units which can be metabolized or salvaged for incorporation into new glycan chains. In modern glycoscience, a wide variety of methods—genetic, biochemical, analytical—are being employed in order to understand these various pathways and to place them within their biological and medical context. In this chapter, these processes and relevant concepts and methods are introduced, prior to elaboration in the subsequent more specialized chapters on biosynthesis and degradation of mono-, oligo-, and polysaccharides.
GarlicESTdb: an online database and mining tool for garlic EST sequences.

PubMed

Kim, Dae-Won; Jung, Tae-Sung; Nam, Seong-Hyeuk; Kwon, Hyuk-Ryul; Kim, Aeri; Chae, Sung-Hwa; Choi, Sang-Haeng; Kim, Dong-Wook; Kim, Ryong Nam; Park, Hong-Seog

2009-05-18

Allium sativum, commonly known as garlic, is a species in the onion genus (Allium), a large and diverse genus containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary and medicinal purposes and for its health benefits. Interest in garlic is currently increasing because of its nutritional and pharmaceutical value in relation to high blood pressure and cholesterol, atherosclerosis and cancer. Nevertheless, no comprehensive database of garlic Expressed Sequence Tags (ESTs) is available for gene discovery and future genome annotation efforts. We therefore developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in Java and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways; the AJAX framework was also partially used. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To achieve more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. 
The GarlicESTdb web application is freely available at http://garlicdb.kribb.re.kr. GarlicESTdb is the first incorporated online information database of EST sequences isolated from garlic that can be freely accessed and downloaded. It has many useful features for interactive mining of EST contigs and datasets from each library, including curation of annotated information, expression profiling, information retrieval, and summary of statistics of functional annotation. Consequently, the development of GarlicESTdb will provide a crucial contribution to biologists for data-mining and more efficient experimental studies.
From biomedicine to natural history research: EST resources for ambystomatid salamanders

PubMed Central

Putta, Srikrishna; Smith, Jeramiah J; Walker, John A; Rondet, Mathieu; Weisrock, David W; Monaghan, James; Samuels, Amy K; Kump, Kevin; King, David C; Maness, Nicholas J; Habermann, Bianca; Tanaka, Elly; Bryant, Susan V; Gardiner, David M; Parichy, David M; Voss, S Randal

2004-01-01

Background Establishing genomic resources for closely related species will provide comparative insights that are crucial for understanding diversity and variability at multiple levels of biological organization. We developed ESTs for Mexican axolotl (Ambystoma mexicanum) and Eastern tiger salamander (A. tigrinum tigrinum), species with deep and diverse research histories. Results Approximately 40,000 quality cDNA sequences were isolated for these species from various tissues, including regenerating limb and tail. These sequences and an existing set of 16,030 cDNA sequences for A. mexicanum were processed to yield 35,413 and 20,599 high quality ESTs for A. mexicanum and A. t. tigrinum, respectively. Because the A. t. tigrinum ESTs were obtained primarily from a normalized library, an approximately equal number of contigs were obtained for each species, with 21,091 unique contigs identified overall. The 10,592 contigs that showed significant similarity to sequences from the human RefSeq database reflected a diverse array of molecular functions and biological processes, with many corresponding to genes expressed during spinal cord injury in rat and fin regeneration in zebrafish. To demonstrate the utility of these EST resources, we searched databases to identify probes for regeneration research, characterized intra- and interspecific nucleotide polymorphism, saturated a human – Ambystoma synteny group with marker loci, and extended PCR primer sets designed for A. mexicanum / A. t. tigrinum orthologues to a related tiger salamander species. Conclusions Our study highlights the value of developing resources in traditional model systems where the likelihood of information transfer to multiple, closely related taxa is high, thus simultaneously enabling both laboratory and natural history research. PMID:15310388
NemaPath: online exploration of KEGG-based metabolic pathways for nematodes

PubMed Central

Wylie, Todd; Martin, John; Abubucker, Sahar; Yin, Yong; Messina, David; Wang, Zhengyuan; McCarter, James P; Mitreva, Makedonka

2008-01-01

Background Nematode.net is a web-accessible resource for investigating gene sequences from parasitic and free-living nematode genomes. Beyond the well-characterized model nematode C. elegans, over 500,000 expressed sequence tags (ESTs) and nearly 600,000 genome survey sequences (GSSs) have been generated from 36 nematode species as part of the Parasitic Nematode Genomics Program undertaken by the Genome Center at Washington University School of Medicine. However, these sequencing data are not present in most publicly available protein databases, which only include sequences in Swiss-Prot. Swiss-Prot, in turn, relies on GenBank/EMBL/DDBJ for predicted proteins from complete genomes or full-length proteins. Description Here we present the NemaPath pathway server, a web-based pathway-level visualization tool for navigating putative metabolic pathways for over 30 nematode species, including 27 parasites. The NemaPath approach consists of two parts: 1) a backend tool to align and evaluate nematode genomic sequences (curated EST contigs) against the annotated Kyoto Encyclopedia of Genes and Genomes (KEGG) protein database; 2) a web viewing application that displays annotated KEGG pathway maps based on desired confidence levels of primary sequence similarity as defined by a user. NemaPath also provides cross-referenced access to nematode genome information provided by other tools available on Nematode.net, including: detailed NemaGene EST cluster information; putative translations; GBrowse EST cluster views; links from nematode data to external databases for corresponding synonymous C. elegans counterparts, subject matches in KEGG's gene database, and also KEGG Ontology (KO) identification. Conclusion The NemaPath server hosts metabolic pathway mappings for 30 nematode species and is available on the World Wide Web at . 
The nematode source sequences used for the metabolic pathway mappings are available via FTP , as provided by the Genome Center at Washington University School of Medicine. PMID:18983679
Trace elements in the serum of malnourished and well-nourished children living in Lubumbashi and Kawama in the context of an environment polluted by mining

PubMed Central

Musimwa, Aimée Mudekereza; Kanteng, Gray Wakamb; Kitoko, Hermann Tamubango; Luboya, Oscar Numbi

2016-01-01

Introduction The role of essential trace metal elements in human nutrition can no longer be ignored. Inadequate intake, secondary deficiencies (which are often underestimated) and iatrogenic deficiencies pave the way for pathologies such as infections. Measuring these elements is therefore particularly important for assessing severity and for enabling early management or dietary improvement. The aim of this study was to determine the blood profile of trace elements (copper, selenium, zinc, iron, chromium, cobalt, etc.) in malnourished and well-nourished children in a mining environment in Lubumbashi. Methods Three hundred and eleven cases were collected, 182 malnourished and 129 well-nourished, in a descriptive cross-sectional study carried out from July 2013 to December 2014, for which exhaustive sampling was performed. Metals in serum were measured by ICP-OES (inductively coupled plasma optical emission spectrometry) at the laboratory of the Office Congolais de Contrôle in Lubumbashi. Results The essential trace elements (copper, zinc, selenium and iron) were found at very low concentrations in malnourished and well-nourished children alike. Arsenic, cadmium, magnesium and manganese were at normal concentrations relative to reference values in well-nourished children. Antimony, chromium, lead and cobalt were elevated in both malnourished and well-nourished children. Nickel was normal in both groups. Magnesium and manganese were at very low levels in malnourished children. Conclusion Both malnourished and well-nourished children show a deficiency in essential trace elements combined with the presence of metallic trace elements. This suggests that a deficiency in essential micronutrients promotes the absorption of heavy metals. PMID:27583075
Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles

PubMed Central

2011-01-01

Background Cultivated watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai var. lanatus] is an important agricultural crop worldwide. The fruit of watermelon undergoes distinct stages of development with dramatic changes in its size, color, sweetness, texture and aroma. In order to better understand the genetic and molecular basis of these changes and significantly expand the watermelon transcript catalog, we have selected four critical stages of watermelon fruit development and used Roche/454 next-generation sequencing technology to generate a large expressed sequence tag (EST) dataset and a comprehensive transcriptome profile for watermelon fruit flesh tissues. Results We performed half a Roche/454 GS-FLX run for each of the four watermelon fruit developmental stages (immature white, white-pink flesh, red flesh and over-ripe) and obtained 577,023 high quality ESTs with an average length of 302.8 bp. De novo assembly of these ESTs together with 11,786 watermelon ESTs collected from GenBank produced 75,068 unigenes with a total length of approximately 31.8 Mb. Overall 54.9% of the unigenes showed significant similarities to known sequences in the GenBank non-redundant (nr) protein database and around two-thirds of them matched proteins of cucumber, the most closely related species with a sequenced genome. The unigenes were further assigned gene ontology (GO) terms and mapped to biochemical pathways. More than 5,000 SSRs were identified from the EST collection. Furthermore we carried out digital gene expression analysis of these ESTs and identified 3,023 genes that were differentially expressed during watermelon fruit development and ripening, which provided novel insights into watermelon fruit biology and a comprehensive resource of candidate genes for future functional analysis. We then generated profiles of several interesting metabolites that are important to fruit quality including pigmentation and sweetness. 
Integrative analysis of metabolite and digital gene expression profiles helped elucidate molecular mechanisms governing these important quality-related traits during watermelon fruit development. Conclusion We have generated a large collection of watermelon ESTs, which represents a significant expansion of the current transcript catalog of watermelon and a valuable resource for future studies on the genomics of watermelon and other closely related species. Digital expression analysis of this EST collection allowed us to identify a large set of genes that were differentially expressed during watermelon fruit development and ripening, which provide a rich source of candidates for future functional analysis and represent a valuable increase in our knowledge base of watermelon fruit biology. PMID:21936920
Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles.

PubMed

Guo, Shaogui; Liu, Jingan; Zheng, Yi; Huang, Mingyun; Zhang, Haiying; Gong, Guoyi; He, Hongju; Ren, Yi; Zhong, Silin; Fei, Zhangjun; Xu, Yong

2011-09-21

Cultivated watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai var. lanatus] is an important agricultural crop worldwide. The fruit of watermelon undergoes distinct stages of development with dramatic changes in its size, color, sweetness, texture and aroma. In order to better understand the genetic and molecular basis of these changes and significantly expand the watermelon transcript catalog, we have selected four critical stages of watermelon fruit development and used Roche/454 next-generation sequencing technology to generate a large expressed sequence tag (EST) dataset and a comprehensive transcriptome profile for watermelon fruit flesh tissues. We performed half a Roche/454 GS-FLX run for each of the four watermelon fruit developmental stages (immature white, white-pink flesh, red flesh and over-ripe) and obtained 577,023 high quality ESTs with an average length of 302.8 bp. De novo assembly of these ESTs together with 11,786 watermelon ESTs collected from GenBank produced 75,068 unigenes with a total length of approximately 31.8 Mb. Overall 54.9% of the unigenes showed significant similarities to known sequences in the GenBank non-redundant (nr) protein database and around two-thirds of them matched proteins of cucumber, the most closely related species with a sequenced genome. The unigenes were further assigned gene ontology (GO) terms and mapped to biochemical pathways. More than 5,000 SSRs were identified from the EST collection. Furthermore we carried out digital gene expression analysis of these ESTs and identified 3,023 genes that were differentially expressed during watermelon fruit development and ripening, which provided novel insights into watermelon fruit biology and a comprehensive resource of candidate genes for future functional analysis. We then generated profiles of several interesting metabolites that are important to fruit quality including pigmentation and sweetness. 
Integrative analysis of metabolite and digital gene expression profiles helped elucidate molecular mechanisms governing these important quality-related traits during watermelon fruit development. We have generated a large collection of watermelon ESTs, which represents a significant expansion of the current transcript catalog of watermelon and a valuable resource for future studies on the genomics of watermelon and other closely related species. Digital expression analysis of this EST collection allowed us to identify a large set of genes that were differentially expressed during watermelon fruit development and ripening, which provide a rich source of candidates for future functional analysis and represent a valuable increase in our knowledge base of watermelon fruit biology.
Close evolutionary relatedness among functionally distantly related members of the (alpha/beta)8-barrel glycosyl hydrolases suggested by the similarity of their fifth conserved sequence region.

PubMed

Janecek, S

1995-12-11

A short conserved sequence equivalent to the fifth conserved sequence region of alpha-amylases (173_LPDLD, Aspergillus oryzae alpha-amylase) comprising the calcium-ligand aspartate, Asp-175, was identified in the amino acid sequences of several members of the family of (alpha/beta)8-barrel glycosyl hydrolases. Despite the fact that the aspartate is not invariantly conserved, the stretch can be easily recognised in all sequences to be positioned 26-28 amino acid residues in front of the well-known catalytic aspartate (Asp-206, A. oryzae alpha-amylase) located in the beta 4-strand of the barrel. The identification of this region revealed remarkable similarities between some alpha-amylases (those from Bacillus megaterium, Bacillus subtilis and Dictyoglomus thermophilum) on the one hand and several different enzyme specificities (such as oligo-1,6-glucosidase, amylomaltase and neopullulanase, respectively) on the other hand. The most interesting example was offered by B. subtilis alpha-amylase and potato amylomaltase with the regions LYDWN and LYDWK, respectively. These observations support the idea that all members of the family of glycosyl hydrolases adopting the structure of the alpha-amylase-type (alpha/beta)8-barrel are mutually closely related and the strict evolutionary borders separating the individual enzyme specificities can be hardly defined.
Accurate and rapid modeling of iron-bleomycin-induced DNA damage using tethered duplex oligonucleotides and electrospray ionization ion trap mass spectrometric analysis.

PubMed

Harsch, A; Marzilli, L A; Bunt, R C; Stubbe, J; Vouros, P

2000-05-01

Bleomycin B2 (BLM) in the presence of iron [Fe(II)] and O2 catalyzes single-stranded (ss) and double-stranded (ds) cleavage of DNA. Electrospray ionization ion trap mass spectrometry was used to monitor these cleavage processes. Two duplex oligonucleotides containing an ethylene oxide tether between both strands were used in this investigation, allowing facile monitoring of all ss and ds cleavage events. A sequence for site-specific binding and cleavage by Fe-BLM was incorporated into each analyte. One of these core sequences, GTAC, is a known hot-spot for ds cleavage, while the other sequence, GGCC, is a hot-spot for ss cleavage. Incubation of each oligonucleotide under anaerobic conditions with Fe(II)-BLM allowed detection of the non-covalent ternary Fe-BLM/oligonucleotide complex in the gas phase. Cleavage studies were then performed utilizing O2-activated Fe(II)-BLM. No work-up or separation steps were required and direct MS and MS/MS analyses of the crude reaction mixtures confirmed sequence-specific Fe-BLM-induced cleavage. Comparison of the cleavage patterns for both oligonucleotides revealed sequence-dependent preferences for ss and ds cleavages in accordance with previously established gel electrophoresis analysis of hairpin oligonucleotides. This novel methodology allowed direct, rapid and accurate determination of cleavage profiles of model duplex oligonucleotides after exposure to activated Fe-BLM.
The Oligo-/Miocene Qom Formation (Iran): evidence for an early Burdigalian restriction of the Tethyan Seaway and closure of its Iranian gateways

NASA Astrophysics Data System (ADS)

Reuter, M.; Piller, W. E.; Harzhauser, M.; Mandic, O.; Berning, B.; Rögl, F.; Kroh, A.; Aubry, M.-P.; Wielandt-Schuster, U.; Hamedani, A.

2009-04-01

In the central Iranian Esfahan-Sirjan and Qom basins sedimentation of the Oligo-/Miocene Qom Formation took place on extensive mixed carbonate-siliciclastic ramps. During this time, both basins were positioned at the Eurasian margin of the Tethyan Seaway, which connected the western and eastern regions of the Tethys Ocean at least until the late Burdigalian. During the so-called Terminal Tethyan Event the Tethyan Seaway was then closed due to the collision of the African/Arabian and Iranian/Eurasian plates. Facies analysis of the sedimentary record of both basins indicates paleoenvironments ranging from terrestrial to open marine settings, including mangrove, restricted inner shelf lagoon, seagrass meadow, reefal, and deeper offshore environments. Recognition of eight depositional sequences and elaboration of an integrated biostratigraphic framework (calcareous nannoplankton, planktic and larger benthic foraminifers, gastropods, and pectinids) allow us to construct a basin-spanning stratigraphy. The assignment of the recognized sea-level lowstands to the Ru 3 to Bur 3 lowstands of the global sea-level curve enables a comparison with time-equivalent sections from the Zagros Basin, which was part of the African/Arabian Plate on the opposing southern margin of the Tethyan Seaway. The so calibrated sections display restrictions of the Tethyan Seaway and interruption of the south Iranian gateways between the Qom Basin and the Proto-Indopacific in relation to ongoing plate collision during the early Burdigalian.
FrameD: A flexible program for quality check and gene prediction in prokaryotic genomes and noisy matured eukaryotic sequences.

PubMed

Schiex, Thomas; Gouzy, Jérôme; Moisan, Annick; de Oliveira, Yannick

2003-07-01

We describe FrameD, a program that predicts coding regions in prokaryotic and matured eukaryotic sequences. Initially targeted at gene prediction in GC-rich bacterial genomes, the gene model used in FrameD also allows genes to be predicted in the presence of frameshifts and partially undetermined sequences, which makes it well suited to gene prediction and frameshift correction in unfinished sequences such as EST and EST cluster sequences. Like recent eukaryotic gene prediction programs, FrameD can also take protein similarity information into account, both in its prediction and in its graphical output. Its performance is evaluated on different bacterial genomes. The web site (http://genopole.toulouse.inra.fr/bioinfo/FrameD/FD) allows direct prediction, sequence correction and translation, as well as the learning of new models for new organisms.
Sequence and RT-PCR expression analysis of two peroxidases from Arabidopsis thaliana belonging to a novel evolutionary branch of plant peroxidases.

PubMed

Kjaersgård, I V; Jespersen, H M; Rasmussen, S K; Welinder, K G

1997-03-01

cDNA clones encoding two new Arabidopsis thaliana peroxidases, ATP 1a and ATP 2a, have been identified by searching the Arabidopsis database of expressed sequence tags (dbEST). They represent a novel branch of hitherto uncharacterized plant peroxidases which is only 35% identical in amino acid sequence to the well characterized group of basic plant peroxidases represented by the horseradish (Armoracia rusticana) isoperoxidases HRP C, HRP E5 and the similar Arabidopsis isoperoxidases ATP Ca, ATP Cb, and ATP Ea. However, ATP 1a is 87% identical in amino acid sequence to a peroxidase encoded by an mRNA isolated from cotton (Gossypium hirsutum). As cotton and Arabidopsis belong to rather diverse families (Malvaceae and Cruciferae, respectively), in contrast with Arabidopsis and horseradish (both Cruciferae), the high degree of sequence identity indicates that this novel type of peroxidase, albeit of unknown function, is likely to be widespread in plant species. The atp 1 and atp 2 types of cDNA sequences were the most redundant among the 28 different isoperoxidases identified among about 200 peroxidase encoding ESTs. Interestingly, 8 of the 38 EST sequences coding for ATP 1 showed three identical nucleotide substitutions. This variant form is designated ATP 1b. Similarly, 6 of the 16 EST sequences coding for ATP 2 showed a number of deletions and nucleotide changes. This variant form is designated ATP 2b. The selected EST clones are full-length and contain coding regions of 993 nucleotides for atp 1a, and 984 nucleotides for atp 2a. These regions show 61% DNA sequence identity. The predicted mature proteins ATP 1a and ATP 2a are 57% identical in sequence and contain the structurally and functionally important residues characteristic of the plant peroxidase superfamily. 
However, they do show two differences of importance to peroxidase catalysis: (1) the asparagine residue linked with the active site distal histidine via hydrogen bonding is absent; (2) an N-glycosylation site is located right at the entrance to the heme channel. The reverse transcriptase polymerase chain reaction (RT-PCR) was used to identify mRNAs coding for ATP 1a/b and ATP 2a/b in germinating seeds, seedlings, roots, leaves, stems, flowers and cell suspension culture using elongation factor 1alpha (EF-1alpha) for the first time as a positive control. Both mRNAs were transcribed at levels comparable to EF-1alpha in all plant tissues investigated which were more than two days old, and in cell suspension culture. In addition, the mRNA coding for ATP 1a/b was found in two day old germinating seeds. The abundant transcription of ATP 1a/b and ATP 2a/b is in line with their many entries in dbEST, and indicates essential roles for these novel peroxidases.
Construction of new EST-SSRs for Fusarium resistant wheat breeding.

PubMed

Yumurtaci, Aysen; Sipahi, Hulya; Al-Abdallat, Ayed; Jighly, Abdulqader; Baum, Michael

2017-06-01

Surveying Fusarium resistance in wheat with easily applicable molecular markers such as simple sequence repeats (SSRs) is a prerequisite for molecular breeding. Expressed sequence tags (ESTs) are one of the main sources for development of new SSR candidates. Therefore, 18,292 publicly available wheat ESTs were mined, and genotyping with 55 newly developed EST-SSR-derived primer pairs produced clear fragments in ten wheat cultivars carrying different levels of Fusarium resistance. Among the validated markers, 23 polymorphic EST-SSRs were obtained, and the related alleles were mostly found on the B and D genomes. Based on fragment profiling and similarity analysis, a 327 bp amplicon, which was a product of Contig 1207 (chromosome 5BL), was detected only in Fusarium head blight (FHB) resistant cultivars (CM82036 and Sumai), and the amino acid sequences showed a similarity to pathogen-related proteins. Another FHB resistance related EST-SSR, Contig 556 (chromosome 1BL), produced a 151 bp fragment in Sumai and was associated with a wax2-like protein. A polymorphic 204 bp fragment, derived from Contig 578 (chromosome 1DL), was generated from root rot (FRR) resistant cultivars (2-49, Altay2000 and Sunco). A total of 98 alleles were detected, with an average of 1.8 alleles per locus, and the polymorphic information content (PIC) ranged from 0.11 to 0.78. A dendrogram with two main groups and five sub-groups showed the highest genetic relationship between the FRR resistant cultivars (2-49 and Altay2000), between the FRR sensitive cultivars (Seri82 and Scout66), and between the FHB resistant cultivars (CM82036 and Sumai). Thus, exploitation of these candidate EST-SSRs may help to genotype other wheat sources for Fusarium resistance. Copyright © 2017 Elsevier Ltd. All rights reserved.
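The polymorphic information content reported above can be illustrated with a short Python sketch. The abstract does not say which PIC formula was used, so the widely cited Botstein form, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2, is assumed here; the function name `pic` and the allele frequencies in the usage note are hypothetical.

```python
from itertools import combinations


def pic(freqs):
    """Polymorphic information content for one locus (Botstein form, assumed).

    freqs is a sequence of allele frequencies that must sum to 1.
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Probability that two randomly drawn alleles are identical.
    homozygosity = sum(p * p for p in freqs)
    # Correction for uninformative matings between identical heterozygotes.
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross
```

For a locus with two equally frequent alleles, `pic([0.5, 0.5])` gives 0.375, and a monomorphic locus gives 0. The average of 1.8 alleles per locus quoted in the abstract is consistent with 98 alleles over 55 primer pairs (98 / 55 ≈ 1.78).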
Efficient computation of optimal oligo-RNA binding.

PubMed

Hodas, Nathan O; Aalberts, Daniel P

2004-01-01

We present an algorithm that calculates the optimal binding conformation and free energy of two RNA molecules, one or both oligomeric. This algorithm has applications to modeling DNA microarrays, RNA splice-site recognitions and other antisense problems. Although other recent algorithms perform the same calculation in time proportional to the sum of the lengths cubed, O((N1 + N2)³), our oligomer binding algorithm, called bindigo, scales as the product of the sequence lengths, O(N1*N2). The algorithm performs well in practice with the aid of a heuristic for large asymmetric loops. To demonstrate its speed and utility, we use bindigo to investigate the binding proclivities of U1 snRNA to mRNA donor splice sites.
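To illustrate what O(N1*N2) scaling looks like in practice, the following sketch fills one table cell per (oligo position, target position) pair. It is not bindigo and uses no free-energy model: it only finds the longest contiguous Watson-Crick duplex between an oligo and a target, which is the longest common substring of the target and the oligo's reverse complement. The function name and the example sequences are illustrative assumptions.

```python
from typing import Tuple

# Watson-Crick complements for RNA (G-U wobble pairs are ignored in this sketch).
_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def longest_duplex(oligo: str, target: str) -> Tuple[int, int]:
    """Return (length, start index in target) of the longest perfect duplex.

    Runs in O(len(oligo) * len(target)) time: one table cell per pair of
    positions, each filled in constant time.
    """
    # The oligo binds antiparallel, so compare its reverse complement to the target.
    rc = "".join(_COMPLEMENT[base] for base in reversed(oligo))
    best_len, best_end = 0, 0
    prev = [0] * (len(target) + 1)
    for i in range(1, len(rc) + 1):
        cur = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            if rc[i - 1] == target[j - 1]:
                # Extend the duplex that ended at the previous pair of positions.
                cur[j] = prev[j - 1] + 1
                if cur[j] > best_len:
                    best_len, best_end = cur[j], j
        prev = cur
    return best_len, best_end - best_len


if __name__ == "__main__":
    # Illustrative sequences only.
    print(longest_duplex("AUACUUACCUG", "GGCAGGUAAGUCC"))  # (9, 2)
```

A thermodynamic method additionally has to score loops, bulges and stacking energies, which is where the heuristic for large asymmetric loops mentioned in the abstract comes in.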
Flexible CRISPR library construction using parallel oligonucleotide retrieval

PubMed Central

Read, Abigail; Gao, Shaojian; Batchelor, Eric

2017-01-01

CRISPR/Cas9-based gene knockout libraries have emerged as a powerful tool for functional screens. We present here a set of pre-designed human and mouse sgRNA sequences that are optimized for both high on-target potency and low off-target effect. To maximize the chance of target gene inactivation, sgRNAs were curated to target both 5′ constitutive exons and exons that encode conserved protein domains. We describe here a robust and cost-effective method to construct multiple small-sized CRISPR libraries from a single oligo pool generated by array synthesis using parallel oligonucleotide retrieval. Together, these resources provide a convenient means for individual labs to generate customized CRISPR libraries of variable size and coverage depth for functional genomics applications. PMID:28334828
Nitroxide-mediated radical ring-opening copolymerization: chain-end investigation and block copolymer synthesis.

PubMed

Delplace, Vianney; Harrisson, Simon; Tardy, Antoine; Gigmes, Didier; Guillaneuf, Yohann; Nicolas, Julien

2014-02-01

Well-defined, degradable copolymers are successfully prepared by nitroxide-mediated radical ring-opening polymerization (NMrROP) of oligo(ethylene glycol) methyl ether methacrylate (OEGMA) or methyl methacrylate (MMA), a small amount of acrylonitrile (AN) and cyclic ketene acetals (CKAs) of different structures. Phosphorus nuclear magnetic resonance allows in-depth chain-end characterization and gives crucial insights into the nature of the copolymer terminal sequences and the living chain fractions. By using a small library of P(OEGMA-co-AN-co-CKA) and P(MMA-co-AN-co-CKA) as macroinitiators, chain extensions with styrene are performed to furnish (amphiphilic) block copolymers comprising a degradable segment.

Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

PubMed Central

2011-01-01

Background Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. 
Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot plants. Codon usages of melon full-length transcripts were largely similar to those of Arabidopsis coding sequences. Conclusion The collection of melon ESTs generated from full-length enriched and standard cDNA libraries is expected to play significant roles in annotating the melon genome. The ESTs and associated analysis results will be useful resources for gene discovery, functional analysis, marker-assisted breeding of melon and closely related species, comparative genomic studies and for gaining insights into gene expression patterns. PMID:21599934
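SSR identification in an EST collection, as reported in the abstract above, is typically a pattern search for short motifs repeated in tandem. The sketch below shows one way to do this with a regular expression. The function name, the minimum repeat counts (six for dinucleotides, five for trinucleotides) and the example sequence are assumptions for illustration; the abstract does not state which tool or thresholds the authors used.

```python
import re
from typing import Dict, List, Optional, Tuple


def find_ssrs(seq: str,
              min_repeats: Optional[Dict[int, int]] = None) -> List[Tuple[int, str, int]]:
    """Return (start, motif, repeat_count) for each perfect SSR in seq.

    min_repeats maps motif length to the minimum number of tandem copies.
    The defaults are illustrative thresholds, not those of any cited study.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5}
    seq = seq.upper()
    hits = []
    for motif_len, n_min in min_repeats.items():
        # One captured motif followed by at least n_min - 1 further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, n_min - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical EST fragment with one (AG)7 and one (ATG)5 repeat.
    est = "TTGAC" + "AG" * 7 + "CC" + "ATG" * 5 + "TT"
    print(find_ssrs(est))  # [(5, 'AG', 7), (21, 'ATG', 5)]
```

Primer design for EST-SSR markers then targets the sequence flanking each reported start position.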
Isolation and characterization of novel EST-derived genic markers in Pisum sativum (Fabaceae)

PubMed Central

Jain, Shalu; McPhee, Kevin E.

2013-01-01

• Premise of the study: Novel markers were developed for pea (Pisum sativum) from pea expressed sequence tags (ESTs) having significant homology to Medicago truncatula gene sequences to investigate genetic diversity, linkage mapping, and cross-species transferability. • Methods and Results: Seventy-seven EST-derived genic markers were developed through comparative mapping between M. truncatula and P. sativum in which 75 markers produced PCR products and 33 were polymorphic among 16 pea genotypes. • Conclusions: The novel markers described here will be useful for future genetic studies of P. sativum; their amplification in lentil (Lens culinaris) demonstrates their potential for use in closely related species. PMID:25202494
Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

PubMed Central

Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

2012-01-01

Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

USDA-ARS's Scientific Manuscript database

Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
Analysis of 14-3-3 Family Member Function in Xenopus Embryos by Microinjection of Antisense Morpholino Oligos

NASA Astrophysics Data System (ADS)

Lau, Jeffrey M. C.; Muslin, Anthony J.

The 14-3-3 intracellular phosphoserine/threonine-binding proteins are adapter molecules that regulate signal transduction, cell cycle, nutrient sensing, apoptotic, and cytoskeletal pathways. There are seven 14-3-3 family members, encoded by separate genes, in vertebrate organisms. To evaluate the role of individual 14-3-3 proteins in vertebrate embryonic development, we utilized an antisense morpholino oligo microinjection technique in Xenopus laevis embryos. By use of this method, we showed that embryos lacking specific 14-3-3 proteins displayed unique phenotypic abnormalities. Specifically, embryos lacking 14-3-3 τ exhibited gastrulation and axial patterning defects, but embryos lacking 14-3-3 γ exhibited eye defects without other abnormalities, and embryos lacking 14-3-3 ζ appeared completely normal. These and other results demonstrate the power and specificity of the morpholino antisense oligo microinjection technique.
SigReannot-mart: a query environment for expression microarray probe re-annotations.

PubMed

Moreews, François; Rauffet, Gaelle; Dehais, Patrice; Klopp, Christophe

2011-01-01

Expression microarrays are commonly used to study transcriptomes. Most of the arrays are now based on oligonucleotide probes. Probe design being a tedious task, it often takes place once at the beginning of the project. The oligo set is then used for several years. During this time period, the knowledge gathered by the community on the genome and the transcriptome increases and gets more precise. Therefore re-annotating the set is essential to supply the biologists with up-to-date annotations. SigReannot-mart is a query environment populated with regularly updated annotations for different oligo sets. It stores the results of the SigReannot pipeline that has mainly been used on farm and aquaculture species. It permits easy extraction in different formats using filters. It is used to compare probe sets on different criteria, to choose the set for a given experiment, or to mix probe sets in order to create a new one.
Mixed non-covalent assemblies of ethynyl nile red and ethynyl pyrene along oligonucleotide templates.

PubMed

Ensslen, Philipp; Fritz, Yannic; Wagenknecht, Hans-Achim

2015-01-14

Ethynyl pyrene and ethynyl nile red as modifications at the 5-position of 2'-deoxyuridines self-assemble non-covalently and specifically along oligo-2'-deoxyadenosines as templates. Oligo-2'-deoxyadenosines of the lengths (dA)10-(dA)20 are able to retain nearly exactly as many ethynyl nile red units in solution as binding sites are available on these templates. In contrast, in the presence of oligo-2'-thymidines the ethynyl nile red moieties are similarly insoluble to those in the absence of any oligonucleotide and yield an aggregate. The mixed assemblies of both chromophores are highly ordered, show left-handed chirality and yield dual fluorescence. The strong excitonic coupling indicates assemblies with a high degree of order. These results show that DNA represents an important supramolecular scaffold for the templated, helical and non-covalent arrangement not only for one type of chromophore but also for mixtures of two different chromophores.
Backfilling-Free Strategy for Biopatterning on Intrinsically Dual-Functionalized Poly[2-Aminoethyl Methacrylate-co-Oligo(Ethylene Glycol) Methacrylate] Films.

PubMed

Lee, Bong Soo; Lee, Juno; Han, Gyeongyeop; Ha, EunRae; Choi, Insung S; Lee, Jungkyu K

2016-07-20

We demonstrated protein and cellular patterning with a soft lithography technique using poly[2-aminoethyl methacrylate-co-oligo(ethylene glycol) methacrylate] films on gold surfaces without employing a backfilling process. The backfilling process plays an important role in successfully generating biopatterns; however, it has potential disadvantages in several interesting research and technical applications. To overcome the issue, a copolymer system having highly reactive functional groups and bioinert properties was introduced through a surface-initiated controlled radical polymerization with 2-aminoethyl methacrylate hydrochloride (AMA) and oligo(ethylene glycol) methacrylate (OEGMA). The prepared poly(AMA-co-OEGMA) film was fully characterized, and among the films having different thicknesses, the 35 nm-thick biotinylated, poly(AMA-co-OEGMA) film exhibited an optimum performance, such as the lowest nonspecific adsorption and the highest specific binding capability toward proteins.
Development of highly polymorphic EST-SSR markers and segregation in F₁ hybrid population of Vitis vinifera L.

PubMed

Kayesh, E; Zhang, Y Y; Liu, G S; Bilkish, N; Sun, X; Leng, X P; Fang, J G

2013-09-23

The objectives of this investigation were to develop and validate the expressed sequence tag (EST)-simple sequence repeat (SSR) markers from large EST sequences, and to study the segregation and distribution of SSRs within two grapevine parental lines. In total, 94 F₁ lines crossed between "Early Rose" and "Red Globe" were studied. Approximately 2100 EST-SSR sequences of Vitis vinifera L. were searched for SSRs and analyzed for the design of polymerase chain reaction (PCR) primers amplifying the SSR-rich regions. Trinucleotide repeats were found to be the most abundant, followed by other nucleotide repeats. A total of 182 SSR primer pairs were first developed for the study on the parental polymorphism. Among the 182 SSR primers, 142 primer pairs (78%) could amplify the anticipated PCR products, among which only 52 primer pairs (36.62%) showed polymorphism between the two parents. These polymorphic bands were further surveyed among the 94 F₁ lines, and the results showed that a total of 162 bands were amplified, and 98 of them were polymorphic in both parents (60.86% polymorphism), with an average of 1.88 polymorphic DNA bands for each primer pair. After testing with the chi-square test, 33 of the clearly amplified polymorphic bands followed a 3:1 ratio, and 37 followed a 1:1 ratio. The rest showed distorted segregation ratios.
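The segregation analysis described above compares observed band counts in the F₁ population with an expected Mendelian ratio using a chi-square goodness-of-fit test. A minimal sketch follows. The function name and the band counts are hypothetical and not taken from the study; the critical value 3.841 is the standard chi-square threshold for one degree of freedom at the 5% level.

```python
from typing import Sequence

# Chi-square critical value for 1 degree of freedom at alpha = 0.05.
CRITICAL_DF1_P05 = 3.841


def chi_square_segregation(observed: Sequence[int],
                           expected_ratio: Sequence[float]) -> float:
    """Chi-square statistic for observed class counts against an expected ratio."""
    if len(observed) != len(expected_ratio):
        raise ValueError("observed and expected_ratio must have the same length")
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    # Expected count in each class under the tested ratio.
    expected = [total * r / ratio_sum for r in expected_ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


if __name__ == "__main__":
    # Hypothetical marker scored in 94 F1 plants: band present in 70, absent in 24.
    counts = (70, 24)
    for ratio in ((3, 1), (1, 1)):
        stat = chi_square_segregation(counts, ratio)
        verdict = "fits" if stat < CRITICAL_DF1_P05 else "distorted"
        print(ratio, round(stat, 3), verdict)
```

With these example counts the 3:1 hypothesis is not rejected while the 1:1 hypothesis is, which mirrors how each band in the study would be assigned to a 3:1, 1:1 or distorted class.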
Construction of an Integrated High Density Simple Sequence Repeat Linkage Map in Cultivated Strawberry (Fragaria × ananassa) and its Applicability

PubMed Central

Isobe, Sachiko N.; Hirakawa, Hideki; Sato, Shusei; Maeda, Fumi; Ishikawa, Masami; Mori, Toshiki; Yamamoto, Yuko; Shirasawa, Kenta; Kimura, Mitsuhiro; Fukami, Masanobu; Hashizume, Fujio; Tsuji, Tomoko; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Tsuruoka, Hisano; Minami, Chiharu; Takahashi, Chika; Wada, Tsuyuko; Ono, Akiko; Kawashima, Kumiko; Nakazaki, Naomi; Kishida, Yoshie; Kohara, Mitsuyo; Nakayama, Shinobu; Yamada, Manabu; Fujishiro, Tsunakazu; Watanabe, Akiko; Tabata, Satoshi

2013-01-01

The cultivated strawberry (Fragaria × ananassa) is an octoploid (2n = 8x = 56) of the Rosaceae family whose genomic architecture is still controversial. Several recent studies support the AAA′A′BBB′B′ model, but its complexity has hindered genetic and genomic analysis of this important crop. To overcome this difficulty and to assist genome-wide analysis of F. × ananassa, we constructed an integrated linkage map by organizing a total of 4474 simple sequence repeat (SSR) markers collected from published Fragaria sequences, including 3746 SSR markers [Fragaria vesca expressed sequence tag (EST)-derived SSR markers] derived from F. vesca ESTs, 603 markers (F. × ananassa EST-derived SSR markers) from F. × ananassa ESTs, and 125 markers (F. × ananassa transcriptome-derived SSR markers) from F. × ananassa transcripts. Along with the previously published SSR markers, these markers were mapped onto five parent-specific linkage maps derived from three mapping populations, which were then assembled into an integrated linkage map. The constructed map consists of 1856 loci in 28 linkage groups (LGs) that total 2364.1 cM in length. Macrosynteny at the chromosome level was observed between the LGs of F. × ananassa and the genome of F. vesca. Variety distinction on 129 F. × ananassa lines was demonstrated using 45 selected SSR markers. PMID:23248204
Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger

PubMed Central

Liu, Chang-Qing; Lu, Tao-Feng; Feng, Bao-Gang; Liu, Dan; Guan, Wei-Jun; Ma, Yue-Hui

2010-01-01

In this study we successfully constructed a full-length cDNA library from Siberian tiger, Panthera tigris altaica, the most well-known wild animal. Total RNA was extracted from Siberian tiger fibroblasts cultured in vitro. The titers of primary and amplified libraries were 1.30×10⁶ pfu/ml and 1.62×10⁹ pfu/ml, respectively. The proportion of recombinants from the unamplified library was 90.5% and the average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142 bp were then analyzed. The BLASTX scores revealed that 53.9% of the sequences were classified as strong match, 38.6% as nominal and 7.4% as weak match. Of these ESTs, 28.0% were found to be related to enzyme/catalytic protein, 20.9% to metabolism, 13.1% to transport, 12.1% to signal transducer/cell communication, 9.9% to structure protein, 3.9% to immunity protein/defense metabolism, and 3.2% to cell cycle, while 8.9% were classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library met the requirements of a standard cDNA library. This library provided a useful platform for the functional genomic research of Siberian tigers. PMID:20941376
Two EST-derived marker systems for cultivar identification in tree peony.

PubMed

Zhang, J J; Shu, Q Y; Liu, Z A; Ren, H X; Wang, L S; De Keyser, E

2012-02-01

Tree peony (Paeonia suffruticosa Andrews), a woody deciduous shrub, belongs to the section Moutan DC. in the genus of Paeonia of the Paeoniaceae family. To increase the efficiency of breeding, two EST-derived marker systems were developed based on a tree peony expressed sequence tag (EST) database. Using target region amplification polymorphism (TRAP), 19 of 39 primer pairs showed good amplification for 56 accessions with amplicons ranging from 120 to 3,000 bp long, among which 99.3% were polymorphic. In contrast, 7 of 21 primer pairs demonstrated adequate amplification with clear bands for simple sequence repeats (SSRs) developed from ESTs, and a total of 33 alleles were found in 56 accessions. The similarity matrices generated by TRAP and EST-SSR markers were compared, and the Mantel test (r = 0.57778, P = 0.0020) showed a moderate correlation between the two types of molecular markers. TRAP markers were suitable for DNA fingerprinting and EST-SSR markers were more appropriate for discriminating synonyms (the same cultivars with different names due to limited information exchanged among different geographic areas). The two sets of EST-derived markers will be used further for genetic linkage map construction and quantitative trait locus detection in tree peony.
HUNT: launch of a full-length cDNA database from the Helix Research Institute.

PubMed

Yudate, H T; Suwa, M; Irie, R; Matsui, H; Nishikawa, T; Nakamura, Y; Yamaguchi, D; Peng, Z Z; Yamamoto, T; Nagai, K; Hayashi, K; Otsuki, T; Sugiyama, T; Ota, T; Suzuki, Y; Sugano, S; Isogai, T; Masuho, Y

2001-01-01

The Helix Research Institute (HRI) in Japan is releasing 4356 HUman Novel Transcripts and related information in the newly established HUNT database. The institute is a joint research project principally funded by the Japanese Ministry of International Trade and Industry, and the clones were sequenced in the governmental New Energy and Industrial Technology Development Organization (NEDO) Human cDNA Sequencing Project. The HUNT database contains an extensive amount of annotation from advanced analysis and represents an essential bioinformatics contribution towards understanding of the gene function. The HRI human cDNA clones were obtained from full-length enriched cDNA libraries constructed with the oligo-capping method and have resulted in novel full-length cDNA sequences. A large fraction has little similarity to any proteins of known function and to obtain clues about possible function we have developed original analysis procedures. Any putative function deduced here can be validated or refuted by complementary analysis results. The user can also extract information from specific categories like PROSITE patterns, PFAM domains, PSORT localization, transmembrane helices and clones with GENIUS structure assignments. The HUNT database can be accessed at http://www.hri.co.jp/HUNT.
Structural basis for regulation of rhizobial nodulation and symbiosis gene expression by the regulatory protein NolR.

PubMed

Lee, Soon Goo; Krishnan, Hari B; Jez, Joseph M

2014-04-29

The symbiosis between rhizobial microbes and host plants involves the coordinated expression of multiple genes, which leads to nodule formation and nitrogen fixation. As part of the transcriptional machinery for nodulation and symbiosis across a range of Rhizobium, NolR serves as a global regulatory protein. Here, we present the X-ray crystal structures of NolR in the unliganded form and complexed with two different 22-base pair (bp) double-stranded operator sequences (oligos AT and AA). Structural and biochemical analysis of NolR reveals protein-DNA interactions with an asymmetric operator site and defines a mechanism for conformational switching of a key residue (Gln56) to accommodate variation in target DNA sequences from diverse rhizobial genes for nodulation and symbiosis. This conformational switching alters the energetic contributions to DNA binding without changes in affinity for the target sequence. Two possible models for the role of NolR in the regulation of different nodulation and symbiosis genes are proposed. To our knowledge, these studies provide the first structural insight on the regulation of genes involved in the agriculturally and ecologically important symbiosis of microbes and plants that leads to nodule formation and nitrogen fixation.
Sequencing the Black Aspergilli species complex

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kuo, Alan; Salamov, Asaf; Zhou, Kemin

2011-03-11

The ~15 members of the Aspergillus section Nigri species complex (the "Black Aspergilli") are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as food processing and spoilage agents and agricultural toxigens. Despite their utility and ubiquity, the morphological and metabolic distinctiveness of the complex's members, and thus their taxonomy, is poorly defined. We are using short read pyrosequencing technology (Roche/454 and Illumina/Solexa) to rapidly scale up genomic and transcriptomic analysis of this species complex. To date we predict 11197 genes in Aspergillus niger, 11624 genes in A. carbonarius, and 10845 genes in A. aculeatus. A. aculeatus is our most recent genome, and was assembled primarily from 454-sequenced reads and annotated with the aid of >2 million 454 ESTs and >300 million Solexa ESTs. To most effectively deploy these very large numbers of ESTs we developed 2 novel methods for clustering the ESTs into assemblies. We have also developed a pipeline to propose orthologies and paralogies among genes in the species complex. In the near future we will apply these methods to additional species of Black Aspergilli that are currently in our sequencing pipeline.
Poly A- transcripts expressed in HeLa cells.

PubMed

Wu, Qingfa; Kim, Yeong C; Lu, Jian; Xuan, Zhenyu; Chen, Jun; Zheng, Yonglan; Zhou, Tom; Zhang, Michael Q; Wu, Chung-I; Wang, San Ming

2008-07-30

Transcripts expressed in eukaryotes are classified as poly A+ transcripts or poly A- transcripts based on the presence or absence of the 3' poly A tail. Most transcripts identified so far are poly A+ transcripts, whereas the poly A- transcripts remain largely unknown. We developed the TRD (Total RNA Detection) system for transcript identification. The system detects the transcripts through the following steps: 1) depleting the abundant ribosomal and small-size transcripts; 2) synthesizing cDNA without regard to the status of the 3' poly A tail; 3) applying the 454 sequencing technology for massive 3' EST collection from the cDNA; and 4) determining the genome origins of the detected transcripts by mapping the sequences to the human genome reference sequences. Using this system, we characterized the cytoplasmic transcripts from HeLa cells. Of the 13,467 distinct 3' ESTs analyzed, 24% are poly A-, 36% are poly A+, and 40% are bimorphic with poly A+ features but without the 3' poly A tail. Most of the poly A- 3' ESTs do not match known transcript sequences; they have a similar distribution pattern in the genome as the poly A+ and bimorphic 3' ESTs, and their mapped intergenic regions are evolutionarily conserved. Experiments confirmed the authenticity of the detected poly A- transcripts. Our study provides the first large-scale sequence evidence for the presence of poly A- transcripts in eukaryotes. The abundance of the poly A- transcripts highlights the need for comprehensive identification of these transcripts for decoding the transcriptome, annotating the genome and studying biological relevance of the poly A- transcripts.
A SSR-based genetic linkage map of cultivated peanut (Arachis hypogaea L.)

USDA-ARS's Scientific Manuscript database

The objective of this study was to construct a molecular linkage map of cultivated tetraploid peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Three recombinant inbre...
Reconciling Structural and Thermodynamic Predictions Using All-Atom and Coarse-Grain Force Fields: The Case of Charged Oligo-Arginine Translocation into DMPC Bilayers

PubMed Central

2015-01-01

Using the translocation of short, charged cationic oligo-arginine peptides (mono-, di-, and triarginine) from bulk aqueous solution into model DMPC bilayers, we explore the question of the similarity of thermodynamic and structural predictions obtained from molecular dynamics simulations using all-atom and Martini coarse-grain force fields. Specifically, we estimate potentials of mean force associated with translocation using standard all-atom (CHARMM36 lipid) and polarizable and nonpolarizable Martini force fields, as well as a series of modified Martini-based parameter sets. We find that we are able to reproduce qualitative features of potentials of mean force of single amino acid side chain analogues into model bilayers. In particular, modifications of peptide–water and peptide–membrane interactions allow prediction of free energy minima at the bilayer–water interface as obtained with all-atom force fields. In the case of oligo-arginine peptides, the modified parameter sets predict interfacial free energy minima as well as free energy barriers in almost quantitative agreement with all-atom force field based simulations. Interfacial free energy minima predicted by a modified coarse-grained parameter set are −2.51, −4.28, and −5.42 for mono-, di-, and triarginine; corresponding values from all-atom simulations are −0.83, −3.33, and −3.29, respectively, all in units of kcal/mol. We found that a stronger interaction between oligo-arginine and the membrane components and a weaker interaction between oligo-arginine and water are crucial for producing such minima in PMFs using the polarizable CG model. The difference between bulk aqueous and bilayer center states predicted by the modified coarse-grain force field are 11.71, 14.14, and 16.53 kcal/mol, and those by the all-atom model are 6.94, 8.64, and 12.80 kcal/mol; those are of almost the same order of magnitude. 
Our simulations also demonstrate a remarkable similarity in the structural aspects of the ensemble of configurations generated using the all-atom and coarse-grain force fields. Both resolutions show that oligo-arginine peptides adopt preferential orientations as they translocate into the bilayer. The guiding theme centers on charged groups maintaining coordination with polar and charged bilayer components as well as local water. We also observe similar behaviors related with membrane deformations. PMID:25290376
Reconciling structural and thermodynamic predictions using all-atom and coarse-grain force fields: the case of charged oligo-arginine translocation into DMPC bilayers.

PubMed

Hu, Yuan; Sinha, Sudipta Kumar; Patel, Sandeep

2014-10-16

Using the translocation of short, charged cationic oligo-arginine peptides (mono-, di-, and triarginine) from bulk aqueous solution into model DMPC bilayers, we explore the question of the similarity of thermodynamic and structural predictions obtained from molecular dynamics simulations using all-atom and Martini coarse-grain force fields. Specifically, we estimate potentials of mean force associated with translocation using standard all-atom (CHARMM36 lipid) and polarizable and nonpolarizable Martini force fields, as well as a series of modified Martini-based parameter sets. We find that we are able to reproduce qualitative features of potentials of mean force of single amino acid side chain analogues into model bilayers. In particular, modifications of peptide-water and peptide-membrane interactions allow prediction of free energy minima at the bilayer-water interface as obtained with all-atom force fields. In the case of oligo-arginine peptides, the modified parameter sets predict interfacial free energy minima as well as free energy barriers in almost quantitative agreement with all-atom force field based simulations. Interfacial free energy minima predicted by a modified coarse-grained parameter set are -2.51, -4.28, and -5.42 for mono-, di-, and triarginine; corresponding values from all-atom simulations are -0.83, -3.33, and -3.29, respectively, all in units of kcal/mol. We found that a stronger interaction between oligo-arginine and the membrane components and a weaker interaction between oligo-arginine and water are crucial for producing such minima in PMFs using the polarizable CG model. The difference between bulk aqueous and bilayer center states predicted by the modified coarse-grain force field are 11.71, 14.14, and 16.53 kcal/mol, and those by the all-atom model are 6.94, 8.64, and 12.80 kcal/mol; those are of almost the same order of magnitude. 
Our simulations also demonstrate a remarkable similarity in the structural aspects of the ensemble of configurations generated using the all-atom and coarse-grain force fields. Both resolutions show that oligo-arginine peptides adopt preferential orientations as they translocate into the bilayer. The guiding theme centers on charged groups maintaining coordination with polar and charged bilayer components as well as local water. We also observe similar behaviors related with membrane deformations.
Est10: A Novel Alkaline Esterase Isolated from Bovine Rumen Belonging to the New Family XV of Lipolytic Enzymes

PubMed Central

Rodríguez, María Cecilia; Loaces, Inés; Amarelle, Vanesa; Senatore, Daniella; Iriarte, Andrés; Fabiano, Elena; Noya, Francisco

2015-01-01

A metagenomic fosmid library from bovine rumen was used to identify clones with lipolytic activity. One positive clone was isolated. The gene responsible for the observed phenotype was identified by in vitro transposon mutagenesis and sequencing and was named est10. The 367 amino acids sequence harbors a signal peptide, the conserved secondary structure arrangement of alpha/beta hydrolases, and a GHSQG pentapeptide which is characteristic of esterases and lipases. Homology based 3D-modelling confirmed the conserved spatial orientation of the serine in a nucleophilic elbow. By sequence comparison, Est10 is related to hydrolases that are grouped into the non-specific Pfam family DUF3089 and to other characterized esterases that were recently classified into the new family XV of lipolytic enzymes. Est10 was heterologously expressed in Escherichia coli as a His-tagged fusion protein, purified and biochemically characterized. Est10 showed maximum activity towards C4 aliphatic chains and undetectable activity towards C10 and longer chains which prompted its classification as an esterase. However, it was able to efficiently catalyze the hydrolysis of aryl esters such as methyl phenylacetate and phenyl acetate. The optimum pH of this enzyme is 9.0, which is uncommon for esterases, and it exhibits an optimal temperature at 40°C. The activity of Est10 was inhibited by metal ions, detergents, chelating agents and additives. We have characterized an alkaline esterase produced by a still unidentified bacterium belonging to a recently proposed new family of esterases. PMID:25973851

An annotated cDNA library of juvenile Euprymna scolopes with and without colonization by the symbiont Vibrio fischeri

PubMed Central

Chun, Carlene K; Scheetz, Todd E; Bonaldo, Maria de Fatima; Brown, Bartley; Clemens, Anik; Crookes-Goodson, Wendy J; Crouch, Keith; DeMartini, Tad; Eyestone, Mari; Goodson, Michael S; Janssens, Bernadette; Kimbell, Jennifer L; Koropatnick, Tanya A; Kucaba, Tamara; Smith, Christina; Stewart, Jennifer J; Tong, Deyan; Troll, Joshua V; Webster, Sarahrose; Winhall-Rice, Jane; Yap, Cory; Casavant, Thomas L; McFall-Ngai, Margaret J; Soares, M Bento

2006-01-01

Background: Biologists are becoming increasingly aware that the interaction of animals, including humans, with their coevolved bacterial partners is essential for health. This growing awareness has been a driving force for the development of models for the study of beneficial animal-bacterial interactions. In the squid-vibrio model, symbiotic Vibrio fischeri induce dramatic developmental changes in the light organ of host Euprymna scolopes over the first hours to days of their partnership. We report here the creation of a juvenile light-organ specific EST database. Results: We generated eleven cDNA libraries from the light organ of E. scolopes at developmentally significant time points with and without colonization by V. fischeri. Single pass 3' sequencing efforts generated 42,564 expressed sequence tags (ESTs) of which 35,421 passed our quality criteria and were then clustered via the UIcluster program into 13,962 nonredundant sequences. The cDNA clones representing these nonredundant sequences were sequenced from the 5' end of the vector and 58% of these resulting sequences overlapped significantly with the associated 3' sequence to generate 8,067 contigs with an average sequence length of 1,065 bp. All sequences were annotated with BLASTX (E-value < -03) and Gene Ontology (GO). Conclusion: Both the number of ESTs generated from each library and GO categorizations are reflective of the activity state of the light organ during these early stages of symbiosis. Future analyses of the sequences identified in these libraries promise to provide valuable information not only about pathways involved in colonization and early development of the squid light organ, but also about pathways conserved in response to bacterial colonization across the animal kingdom. PMID:16780587
Generation and analysis of expressed sequence tags from the bone marrow of Chinese Sika deer.

PubMed

Yao, Baojin; Zhao, Yu; Zhang, Mei; Li, Juan

2012-03-01

Sika deer is one of the best-known and most highly valued animals of China. Despite its economic, cultural, and biological importance, there has not been a large-scale sequencing project for Sika deer to date. With the ultimate goal of sequencing the complete genome of this organism, we first established a bone marrow cDNA library for Sika deer and generated a total of 2,025 reads. After processing the sequences, 2,017 high-quality expressed sequence tags (ESTs) were obtained. These ESTs were assembled into 1,157 unigenes, including 238 contigs and 919 singletons. Comparative analyses indicated that 888 (76.75%) of the unigenes had significant matches to sequences in the non-redundant protein database. In addition to highly expressed genes, such as stearoyl-CoA desaturase, cytochrome c oxidase, adipocyte-type fatty acid-binding protein, adiponectin and thymosin beta-4, we also obtained vascular endothelial growth factor-A and heparin-binding growth-associated molecule, both of which are of great importance for angiogenesis research. There were 244 (21.09%) unigenes with no significant match to any sequence in current protein or nucleotide databases, and these sequences may represent genes with unknown function in Sika deer. Open reading frame analysis of the sequences was performed using the getorf program. In addition, the sequences were functionally classified using the gene ontology hierarchy, clusters of orthologous groups of proteins and Kyoto encyclopedia of genes and genomes databases. Analysis of ESTs described in this paper provides an important resource for the transcriptome exploration of Sika deer, and will also facilitate further studies on functional genomics, gene discovery and genome annotation of Sika deer.
Transcriptome sequencing of lentil based on second-generation technology permits large-scale unigene assembly and SSR marker discovery.

PubMed

Kaur, Sukhjiwan; Cogan, Noel O I; Pembleton, Luke W; Shinozuka, Maiko; Savin, Keith W; Materne, Michael; Forster, John W

2011-05-25

Lentil (Lens culinaris Medik.) is a cool-season grain legume which provides a rich source of protein for human consumption. In terms of genomic resources, lentil is relatively underdeveloped, in comparison to other Fabaceae species, with limited available data. There is hence a significant need to enhance such resources in order to identify novel genes and alleles for molecular breeding to increase crop productivity and quality. Tissue-specific cDNA samples from six distinct lentil genotypes were sequenced using Roche 454 GS-FLX Titanium technology, generating c. 1.38 × 10^6 expressed sequence tags (ESTs). De novo assembly generated a total of 15,354 contigs and 68,715 singletons. The complete unigene set was sequence-analysed against genome drafts of the model legume species Medicago truncatula and Arabidopsis thaliana to identify 12,639 and 7,476 unique matches, respectively. When compared to the genome of Glycine max, a total of 20,419 unique hits were observed corresponding to c. 31% of the known gene space. A total of 25,592 lentil unigenes were subsequently annotated from GenBank. Simple sequence repeat (SSR)-containing ESTs were identified from consensus sequences and a total of 2,393 primer pairs were designed. A subset of 192 EST-SSR markers was screened for validation across a panel of 12 cultivated lentil genotypes and one wild relative species. A total of 166 primer pairs obtained successful amplification, of which 47.5% detected genetic polymorphism. A substantial collection of ESTs has been developed from sequence analysis of lentil genotypes using second-generation technology, permitting unigene definition across a broad range of functional categories. As well as providing resources for functional genomics studies, the unigene set has permitted significant enhancement of the number of publicly-available molecular genetic markers as tools for improvement of this species.
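As an illustration of the SSR-identification step mentioned in the record above, the following is a minimal sketch of how perfect microsatellites can be located in an EST or consensus sequence. The function name find_ssrs, the motif lengths, and the minimum repeat counts are assumptions chosen for illustration; the abstract does not state which tool or thresholds the authors used.

```python
import re

# Minimum number of tandem copies per motif length.
# These thresholds are assumed for illustration only.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a list of (start, motif, repeat_count) for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-base motif followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "ATAT"),
            # since those belong to a shorter motif class.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return hits

if __name__ == "__main__":
    example = "GGC" + "AG" * 7 + "TTG" + "ACC" * 5 + "GA"
    for start, motif, count in find_ssrs(example):
        print(start, motif, count)
```

Flanking sequence around each hit would then be passed to a primer-design program; that step is outside the scope of this sketch.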
Predicting oligonucleotide affinity to nucleic acid targets.

PubMed Central

Mathews, D H; Burkard, M E; Freier, S M; Wyatt, J R; Turner, D H

1999-01-01

A computer program, OligoWalk, is reported that predicts the equilibrium affinity of complementary DNA or RNA oligonucleotides to an RNA target. This program considers the predicted stability of the oligonucleotide-target helix and the competition with predicted secondary structure of both the target and the oligonucleotide. Both unimolecular and bimolecular oligonucleotide self structure are considered with a user-defined concentration. The application of OligoWalk is illustrated with three comparisons to experimental results drawn from the literature. PMID:10580474
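The relationship between a net binding free energy and an equilibrium affinity, as discussed in the record above, can be sketched as follows. This is a deliberately simplified additive approximation and not OligoWalk's actual algorithm: the function names, the additive treatment of the structure-breaking costs, and the example values are all assumptions for illustration. Only the relation K = exp(-ΔG/RT) is standard thermodynamics.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)

def net_binding_dg(dg_duplex, dg_target_unfold, dg_oligo_unfold):
    """Net free energy of binding in kcal/mol (simplified, assumed model).

    dg_duplex        : stability of the oligo-target helix (negative = favorable)
    dg_target_unfold : cost of opening target secondary structure (>= 0)
    dg_oligo_unfold  : cost of opening oligo self-structure (>= 0)
    """
    return dg_duplex + dg_target_unfold + dg_oligo_unfold

def equilibrium_constant(dg, temp_k=310.15):
    """Equilibrium constant from a free energy change: K = exp(-dG / RT)."""
    return math.exp(-dg / (R_KCAL * temp_k))

if __name__ == "__main__":
    # Hypothetical values, not taken from the paper.
    dg = net_binding_dg(dg_duplex=-20.0, dg_target_unfold=6.0, dg_oligo_unfold=2.0)
    print("net dG (kcal/mol):", dg)
    print("K at 37 C:", equilibrium_constant(dg))
```

The point of the sketch is that competing structure in either strand makes the net free energy less negative and therefore lowers the predicted affinity, even when the duplex itself is stable.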
Refinement of light-responsive transcript lists using rice oligonucleotide arrays: evaluation of gene-redundancy.

PubMed

Jung, Ki-Hong; Dardick, Christopher; Bartley, Laura E; Cao, Peijian; Phetsom, Jirapa; Canlas, Patrick; Seo, Young-Su; Shultz, Michael; Ouyang, Shu; Yuan, Qiaoping; Frank, Bryan C; Ly, Eugene; Zheng, Li; Jia, Yi; Hsia, An-Ping; An, Kyungsook; Chou, Hui-Hsien; Rocke, David; Lee, Geun Cheol; Schnable, Patrick S; An, Gynheung; Buell, C Robin; Ronald, Pamela C

2008-10-06

Studies of gene function are often hampered by gene-redundancy, especially in organisms with large genomes such as rice (Oryza sativa). We present an approach for using transcriptomics data to focus functional studies and address redundancy. To this end, we have constructed and validated an inexpensive and publicly available rice oligonucleotide near-whole genome array, called the rice NSF45K array. We generated expression profiles for light- vs. dark-grown rice leaf tissue and validated the biological significance of the data by analyzing sources of variation and confirming expression trends with reverse transcription polymerase chain reaction. We examined trends in the data by evaluating enrichment of gene ontology terms at multiple false discovery rate thresholds. To compare data generated with the NSF45K array with published results, we developed publicly available, web-based tools (www.ricearray.org). The Oligo and EST Anatomy Viewer enables visualization of EST-based expression profiling data for all genes on the array. The Rice Multi-platform Microarray Search Tool facilitates comparison of gene expression profiles across multiple rice microarray platforms. Finally, we incorporated gene expression and biochemical pathway data to reduce the number of candidate gene products putatively participating in the eight steps of the photorespiration pathway from 52 to 10, based on expression levels of putatively functionally redundant genes. We confirmed the efficacy of this method to cope with redundancy by correctly predicting participation in photorespiration of a gene with five paralogs. Applying these methods will accelerate rice functional genomics.
Clonality and distribution of clinical Ureaplasma isolates recovered from male patients and infertile couples in China

PubMed Central

Ruan, Zhi; Yang, Ting; Shi, Xinyan; Kong, Yingying; Xie, Xinyou

2017-01-01

Ureaplasma spp. have gained increasing recognition as pathogens in both adult and neonatal patients with multiple clinical presentations. However, the clonality of this organism in the male population and infertile couples in China is largely unknown. In this study, 96 (53 U. parvum and 43 U. urealyticum) of 103 Ureaplasma spp. strains recovered from genital specimens from male patients and 15 pairs of infertile couples were analyzed using multilocus sequence typing (MLST)/expanded multilocus sequence typing (eMLST) schemes. A total of 39 sequence types (STs) and 53 expanded sequence types (eSTs) were identified, with three predominant STs (ST1, ST9 and ST22) and eSTs (eST16, eST41 and eST82). Moreover, phylogenetic analysis revealed two distinct clusters that were highly congruent with the taxonomic differences between the two Ureaplasma species. We found significant differences in the distributions of both clusters and sub-groups between the male and female patients (P < 0.001). Moreover, 66.7% and 40.0% of the male and female partners of the infertile couples tested positive for Ureaplasma spp. The present study also attained excellent agreement of the identification of both Ureaplasma species between paired urine and semen specimens from the male partners (k > 0.80). However, this concordance was observed only for the detection of U. urealyticum within the infertile couples. In conclusion, the distributions of the clusters and sub-groups significantly differed between the male and female patients. U. urealyticum is more likely to transmit between infertile couples and be associated with clinical manifestations by the specific epidemic clonal lineages. PMID:28859153
Clonality and distribution of clinical Ureaplasma isolates recovered from male patients and infertile couples in China.

PubMed

Ruan, Zhi; Yang, Ting; Shi, Xinyan; Kong, Yingying; Xie, Xinyou; Zhang, Jun

2017-01-01

Ureaplasma spp. have gained increasing recognition as pathogens in both adult and neonatal patients with multiple clinical presentations. However, the clonality of this organism in the male population and infertile couples in China is largely unknown. In this study, 96 (53 U. parvum and 43 U. urealyticum) of 103 Ureaplasma spp. strains recovered from genital specimens from male patients and 15 pairs of infertile couples were analyzed using multilocus sequence typing (MLST)/expanded multilocus sequence typing (eMLST) schemes. A total of 39 sequence types (STs) and 53 expanded sequence types (eSTs) were identified, with three predominant STs (ST1, ST9 and ST22) and eSTs (eST16, eST41 and eST82). Moreover, phylogenetic analysis revealed two distinct clusters that were highly congruent with the taxonomic differences between the two Ureaplasma species. We found significant differences in the distributions of both clusters and sub-groups between the male and female patients (P < 0.001). Moreover, 66.7% and 40.0% of the male and female partners of the infertile couples tested positive for Ureaplasma spp. The present study also attained excellent agreement of the identification of both Ureaplasma species between paired urine and semen specimens from the male partners (k > 0.80). However, this concordance was observed only for the detection of U. urealyticum within the infertile couples. In conclusion, the distributions of the clusters and sub-groups significantly differed between the male and female patients. U. urealyticum is more likely to transmit between infertile couples and be associated with clinical manifestations by the specific epidemic clonal lineages.
Exploiting EST databases for the development and characterisation of 3425 gene-tagged CISP markers in biofuel crop sugarcane and their transferability in cereals and orphan tropical grasses.

PubMed

Chandra, Amaresh; Jain, Radha; Solomon, Sushil; Shrivastava, Shiksha; Roy, Ajoy K

2013-02-04

Sugarcane is an important cash crop, providing 70% of the global raw sugar as well as raw material for biofuel production. Genetic analysis is hindered in sugarcane because of its large and complex polyploid genome and lack of sufficiently informative gene-tagged markers. Modern genomics has produced large numbers of ESTs, which can be exploited to develop molecular markers based on comparative analysis with EST datasets of related crops and the whole rice genome sequence, and accentuate their cross-technical functionality in orphan crops like tropical grasses. Utilising 246,180 Saccharum officinarum EST sequences in comparative analysis with ESTs of sorghum and barley and the whole rice genome sequence, we have developed 3425 novel gene-tagged markers - namely, conserved-intron scanning primers (CISP) - using the web program GeMprospector. Rice orthologue annotation results indicated homology of 1096 sequences with expressed proteins and 491 with hypothetical proteins. The remaining 1838 were miscellaneous in nature. A total of 367 primer-pairs were tested in a diverse panel of samples. The data indicate amplification of 41% polymorphic bands leading to 0.52 PIC and 3.50 MI with a set of sugarcane varieties and Saccharum species. In addition, a moderate technical functionality of a set of such markers with orphan tropical grasses (22%) and fodder cum cereal oat (33%) is observed. Developed gene-tagged CISP markers exhibited considerable technical functionality with varieties of sugarcane and unexplored species of tropical grasses. These markers would thus be particularly useful in identifying the economical traits in sugarcane and developing conservation strategies for orphan tropical grasses.
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

Treesearch

Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

2011-01-01

Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
Analysis of expressed sequence tags (ESTs) from cocoa (Theobroma cacao L) upon infection with Phytophthora megakarya.

PubMed

Naganeeswaran, Sudalaimuthu Asari; Subbian, Elain Apshara; Ramaswamy, Manimekalai

2012-01-01

Phytophthora megakarya, the causative agent of cacao black pod disease in West African countries, causes an extensive loss of yield. In this study we have analyzed 4 libraries of ESTs derived from Phytophthora megakarya infected cocoa leaf and pod tissues. A total of 6379 redundant sequences were retrieved from the ESTtik database and EST processing was performed using the seqclean tool. Clustering and assembling using CAP3 generated 3333 non-redundant (907 contigs and 2426 singletons) sequences. The primary sequence analysis of 3333 non-redundant sequences showed that the GC percentage was 42.7 and the sequence length ranged from 101 to 2576 nucleotides. Further, functional analyses (Blast, Interproscan, Gene ontology and KEGG search) were executed and 1230 orthologous genes were annotated. A total of 272 enzymes corresponding to 114 metabolic pathways were identified. Functional annotation revealed that most of the sequences are related to molecular function, stress response and biological processes. The annotated enzymes are aldehyde dehydrogenase (E.C: 1.2.1.3), catalase (E.C: 1.11.1.6), acetyl-CoA C-acetyltransferase (E.C: 2.3.1.9), threonine ammonia-lyase (E.C: 4.3.1.19), acetolactate synthase (E.C: 2.2.1.6), O-methyltransferase (E.C: 2.1.1.68) which play an important role in amino acid biosynthesis and phenyl propanoid biosynthesis. All this information was stored in a MySQL database management system for future use in reconstructing the biotic stress response pathway in cocoa.
Gene-based SSR markers for common bean (Phaseolus vulgaris L.) derived from root and leaf tissue ESTs: an integration of the BMc series.

PubMed

Blair, Matthew W; Hurtado, Natalia; Chavarro, Carolina M; Muñoz-Torres, Monica C; Giraldo, Martha C; Pedraza, Fabio; Tomkins, Jeff; Wing, Rod

2011-03-22

Sequencing of cDNA libraries for the development of expressed sequence tags (ESTs) as well as for the discovery of simple sequence repeats (SSRs) has been a common method of developing microsatellites or SSR-based markers. In this research, our objective was to further sequence and develop common bean microsatellites from leaf and root cDNA libraries derived from the Andean gene pool accession G19833 and the Mesoamerican gene pool accession DOR364, mapping parents of a commonly used reference map. The root libraries were made from high and low phosphorus treated plants. A total of 3,123 EST sequences from leaf and root cDNA libraries were screened and used for direct simple sequence repeat discovery. From these EST sequences we found 184 microsatellites; the majority containing tri-nucleotide motifs, many of which were GC rich (ACC, AGC and AGG in particular). Di-nucleotide motif microsatellites were about half as common as the tri-nucleotide motif microsatellites but most of these were AGn microsatellites with a moderate number of ATn microsatellites in root ESTs followed by few ACn and no GCn microsatellites. Out of the 184 new SSR loci, 120 new microsatellite markers were developed in the BMc (Bean Microsatellites from cDNAs) series and these were evaluated for their capacity to distinguish bean diversity in a germplasm panel of 18 genotypes. We developed a database with images of the microsatellites and their polymorphism information content (PIC), which averaged 0.310 for polymorphic markers. The present study produced information about microsatellite frequency in root and leaf tissues of two important genotypes for common bean genomics: namely G19833, the Andean genotype selected for whole genome shotgun sequencing from race Peru, and DOR364 a race Mesoamerica subgroup 2 genotype that is a small-red seeded, released variety in Central America. 
Both race Peru and Mesoamerica subgroup 2 (small red beans) have been understudied in comparison to race Nueva Granada and Mesoamerica subgroup 1 (black beans) both with regard to gene expression and as sources of markers. However, we found few differences in SSR type and frequency between the G19833 leaf and DOR364 root tissue-derived ESTs. Overall, our work adds to the analysis of microsatellite frequency evaluation for common bean and provides a new set of 120 BMc markers which combined with the 248 previously developed BMc markers brings the total in this series to 368 markers. Once we include BMd markers, which are derived from GenBank sequences, the current total of gene-based markers from our laboratory surpasses 500 markers. These markers are fundamental for studies of the transcriptome of common bean and can form anchor points for genetic mapping studies in the future.
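For readers unfamiliar with polymorphism information content (PIC), the statistic reported for the BMc markers above, the following minimal sketch computes it from allele frequencies at one locus. It assumes the commonly used Botstein et al. formulation, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2; the abstract does not state which formula the authors applied, and the function name pic is hypothetical.

```python
from itertools import combinations

def pic(freqs):
    """Polymorphism information content for one locus.

    freqs: allele frequencies that sum to 1.
    Assumed formula: 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * p * p * q * q for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term

if __name__ == "__main__":
    # Hypothetical allele frequencies, not data from the study.
    print(pic([0.5, 0.5]))
    print(pic([0.8, 0.2]))
    print(pic([1.0]))
```

A monomorphic marker gives a PIC of zero, and the value rises as alleles become more numerous and more evenly distributed across the germplasm panel.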
Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.

PubMed

Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R

2005-09-01

We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.
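The idea of finding oligo sequences overrepresented in promoters of coregulated genes, described in the record above, can be sketched with a simple k-mer count and a hypergeometric test. This is not the MotifFinder implementation: the abstract does not describe its statistic, so the hypergeometric test, the 6-base oligo length, the significance cutoff, and all function names here are assumptions for illustration. The sketch also ignores reverse complements and multiple-testing correction.

```python
from math import comb

def kmers_present(seq, k):
    """Set of distinct k-mers occurring in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def hypergeom_tail(x, N, K, n):
    """P(X >= x) when n promoters are drawn from N, of which K contain the oligo."""
    total = comb(N, n)
    return sum(comb(K, i) * comb(N - K, n - i)
               for i in range(x, min(K, n) + 1)) / total

def enriched_oligos(coregulated, background, k=6, alpha=0.01):
    """Oligos present in more coregulated promoters than expected by chance.

    coregulated and background are disjoint lists of promoter sequences.
    Returns (oligo, hits_in_coregulated, hits_overall, p_value), sorted by p-value.
    """
    N, n = len(coregulated) + len(background), len(coregulated)
    fg_sets = [kmers_present(s, k) for s in coregulated]
    bg_sets = [kmers_present(s, k) for s in background]
    results = []
    for oligo in set().union(*fg_sets):
        x = sum(oligo in s for s in fg_sets)       # coregulated promoters with oligo
        K = x + sum(oligo in s for s in bg_sets)   # all promoters with oligo
        p = hypergeom_tail(x, N, K, n)
        if p <= alpha:
            results.append((oligo, x, K, p))
    return sorted(results, key=lambda r: r[3])

if __name__ == "__main__":
    # Toy sequences, not real promoters.
    fg = ["TTACGTGTCC", "GGACGTGTAA", "CCACGTGTTT"]
    bg = ["A" * 10, "C" * 10, "G" * 10, "T" * 10, "AT" * 5, "GC" * 5, "AC" * 5]
    for row in enriched_oligos(fg, bg):
        print(row)
```

Counting presence per promoter rather than total occurrences keeps a single repeat-rich promoter from dominating the result; a positional analysis such as the proximal 200-base concentration noted in the abstract would need the match coordinates as well.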
Whole genome sequence phylogenetic analysis of four Mexican rabies viruses isolated from cattle.

PubMed

Bárcenas-Reyes, I; Loza-Rubio, E; Cantó-Alarcón, G J; Luna-Cozar, J; Enríquez-Vázquez, A; Barrón-Rodríguez, R J; Milián-Suazo, F

2017-08-01

Phylogenetic analysis of the rabies virus in molecular epidemiology has been traditionally performed on partial sequences of the genome, such as the N, G, and P genes; however, that approach raises concerns about the discriminatory power compared to whole genome sequencing. In this study we characterized four strains of the rabies virus isolated from cattle in Querétaro, Mexico by comparing the whole genome sequence to that of strains from the American, European and Asian continents. Four cattle brain samples positive for rabies and characterized as AgV11, genotype 1, were used in the study. A cDNA sequence was generated by reverse transcription PCR (RT-PCR) using oligo dT. cDNA samples were sequenced on an Illumina NextSeq 500 platform. The phylogenetic analysis was performed with MEGA 6.0. Minimum evolution phylogenetic trees were constructed with the Neighbor-Joining method and bootstrapped with 1000 replicates. Three large and seven small clusters were formed with the 26 sequences used. The largest cluster grouped strains from different species in South America: Brazil and French Guiana. The second cluster grouped five strains from Mexico. A Mexican strain reported in a different study was highly related to our four strains, suggesting a common source of infection. The phylogenetic analysis shows that the type of host differs among regions of the American continent; rabies is more closely associated with bats. It was concluded that the rabies virus in central Mexico is genetically stable and that it is transmitted by the vampire bat Desmodus rotundus. Copyright © 2017 Elsevier Ltd. All rights reserved.
Separation and parallel sequencing of the genomes and transcriptomes of single cells using G&T-seq.

PubMed

Macaulay, Iain C; Teng, Mabel J; Haerty, Wilfried; Kumar, Parveen; Ponting, Chris P; Voet, Thierry

2016-11-01

Parallel sequencing of a single cell's genome and transcriptome provides a powerful tool for dissecting genetic variation and its relationship with gene expression. Here we present a detailed protocol for G&T-seq, a method for separation and parallel sequencing of genomic DNA and full-length polyA(+) mRNA from single cells. We provide step-by-step instructions for the isolation and lysis of single cells; the physical separation of polyA(+) mRNA from genomic DNA using a modified oligo-dT bead capture and the respective whole-transcriptome and whole-genome amplifications; and library preparation and sequence analyses of these amplification products. The method allows the detection of thousands of transcripts in parallel with the genetic variants captured by the DNA-seq data from the same single cell. G&T-seq differs from other currently available methods for parallel DNA and RNA sequencing from single cells, as it involves physical separation of the DNA and RNA and does not require bespoke microfluidics platforms. The process can be implemented manually or through automation. When performed manually, paired genome and transcriptome sequencing libraries from eight single cells can be produced in ∼3 d by researchers experienced in molecular laboratory work. For users with experience in the programming and operation of liquid-handling robots, paired DNA and RNA libraries from 96 single cells can be produced in the same time frame. Sequence analysis and integration of single-cell G&T-seq DNA and RNA data requires a high level of bioinformatics expertise and familiarity with a wide range of informatics tools.
tropiTree: An NGS-Based EST-SSR Resource for 24 Tropical Tree Species

PubMed Central

Russell, Joanne R.; Hedley, Peter E.; Cardle, Linda; Dancey, Siobhan; Morris, Jenny; Booth, Allan; Odee, David; Mwaura, Lucy; Omondi, William; Angaine, Peter; Machua, Joseph; Muchugi, Alice; Milne, Iain; Kindt, Roeland; Jamnadass, Ramni; Dawson, Ian K.

2014-01-01

The development of genetic tools for non-model organisms has been hampered by cost, but advances in next-generation sequencing (NGS) have created new opportunities. In ecological research, this raises the prospect for developing molecular markers to simultaneously study important genetic processes such as gene flow in multiple non-model plant species within complex natural and anthropogenic landscapes. Here, we report the use of bar-coded multiplexed paired-end Illumina NGS for the de novo development of expressed sequence tag-derived simple sequence repeat (EST-SSR) markers at low cost for a range of 24 tree species. Each chosen tree species is important in complex tropical agroforestry systems where little is currently known about many genetic processes. An average of more than 5,000 EST-SSRs was identified for each of the 24 sequenced species, whereas prior to analysis 20 of the species had fewer than 100 nucleotide sequence citations. To make results available to potential users in a suitable format, we have developed an open-access, interactive online database, tropiTree (http://bioinf.hutton.ac.uk/tropiTree), which has a range of visualisation and search facilities, and which is a model for the efficient presentation and application of NGS data. PMID:25025376
A sequence-based genetic map of Medicago truncatula and comparison of marker colinearity with M. sativa.

PubMed Central

Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R

2004-01-01

A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Development and use of EST-SSR markers for assessing genetic diversity in the brown planthopper (Nilaparvata lugens Stål).

PubMed

Jing, S; Liu, B; Peng, L; Peng, X; Zhu, L; Fu, Q; He, G

2012-02-01

To assess genetic diversity in populations of the brown planthopper (Nilaparvata lugens Stål) (Homoptera: Delphacidae), we have developed and applied microsatellite, or simple sequence repeat (SSR), markers from expressed sequence tags (ESTs). We found that the brown planthopper clusters of ESTs were rich in SSRs with unique frequencies and distributions of SSR motifs. Three hundred and fifty-one EST-SSR markers were developed and yielded clear bands from samples of four brown planthopper populations. High cross-species transferability of these markers was detected in the closely related planthopper N. muiri. The newly developed EST-SSR markers provided sufficient resolution to distinguish within and among biotypes. Analyses based on SSR data revealed host resistance-based genetic differentiation among different brown planthopper populations; the genetic diversity of populations feeding on susceptible rice varieties was lower than that of populations feeding on resistant rice varieties. This is the first large-scale development of brown planthopper SSR markers, which will be useful for future molecular genetics and genomics studies of this serious agricultural pest.
Characterization and comparison of EST-SSR and TRAP markers for genetic analysis of the Japanese persimmon Diospyros kaki.

PubMed

Luo, C; Zhang, F; Zhang, Q L; Guo, D Y; Luo, Z R

2013-01-09

We developed and characterized expressed sequence tag (EST)-simple sequence repeat (SSR) and targeted region amplified polymorphism (TRAP) markers to examine genetic relationships in the persimmon genus Diospyros gene pool. In total, we characterized 14 EST-SSR primer pairs and 36 TRAP primer combinations, which were amplified across 20 germplasms of 4 species in the genus Diospyros. We used various genetic parameters, including effective multiplex ratio (EMR), diversity index (DI), and marker index (MI), to test the utility of these markers. TRAP markers gave a higher EMR (24.85) but a lower DI (0.33) compared to EST-SSRs (EMR = 3.65, DI = 0.34). TRAP gave a very high MI (8.08), which was about 8 times the MI of EST-SSR (1.25). These markers were utilized for phylogenetic inference of 20 genotypes of Diospyros kaki Thunb. and allied species, with the result that all kaki genotypes clustered closely and 3 allied species formed an independent group. These markers could be further exploited for large-scale genetic relationship inference.
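The three parameters in this record are related. Under the commonly used definition, assumed here because the abstract does not give its formula, the marker index is the product of the effective multiplex ratio and the diversity index. The short sketch below applies that definition to the rounded EMR and DI values quoted above. The products come out close to, but not identical with, the reported MI values, presumably because the published inputs are rounded or averaged differently.

```python
def marker_index(emr, di):
    """Marker index as EMR x DI (assumed definition; not stated in the abstract)."""
    return emr * di

# EMR and DI values as quoted in the abstract.
reported = {"TRAP": (24.85, 0.33), "EST-SSR": (3.65, 0.34)}

if __name__ == "__main__":
    for name, (emr, di) in reported.items():
        print(f"{name}: MI = {marker_index(emr, di):.2f}")
```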
Expressed sequence tags from heat-shocked seagrass Zostera noltii (Hornemann) from its southern distribution range.

PubMed

Massa, Sónia I; Pearson, Gareth A; Aires, Tânia; Kube, Michael; Olsen, Jeanine L; Reinhardt, Richard; Serrão, Ester A; Arnaud-Haond, Sophie

2011-09-01

Predicted global climate change threatens the distributional ranges of species worldwide. We identified genes expressed in the intertidal seagrass Zostera noltii during recovery from a simulated low tide heat-shock exposure. Five Expressed Sequence Tag (EST) libraries were compared, corresponding to four recovery times following sub-lethal temperature stress, and a non-stressed control. We sequenced and analyzed 7009 sequence reads from 30 min, 2 h, 4 h and 24 h after the beginning of the heat-shock (AHS), and 1585 from the control library, for a total of 8594 sequence reads. Among 51 Tentative UniGenes (TUGs) exhibiting significantly different expression between libraries, 19 (37.3%) were identified as 'molecular chaperones' and were over-expressed following heat-shock, while 12 (23.5%) were 'photosynthesis TUGs' generally under-expressed in heat-shocked plants. A time course analysis of expression showed a rapid increase in expression of the molecular chaperone class, most of which were heat-shock proteins, which increased from 2 sequence reads in the control library to almost 230 in the 30 min AHS library, followed by a slow decrease during further recovery. In contrast, 'photosynthesis TUGs' were under-expressed 30 min AHS compared with the control library, and declined progressively with recovery time in the stress libraries, with a total of 29 sequence reads 24 h AHS, compared with 125 in the control. A total of 4734 TUGs were screened for EST-Simple Sequence Repeats (EST-SSRs) and 86 microsatellites were identified. Copyright © 2011 Elsevier B.V. All rights reserved.

Impact of antepartum diagnostic amnioinfusion on targeted ultrasound imaging of pregnancies presenting with severe oligo- and anhydramnios: An analysis of 61 cases.

PubMed

Vikraman, Seneesh Kumar; Chandra, Vipin; Balakrishnan, Bijoy; Batra, Meenu; Sethumadhavan, Sreeja; Patil, Swapneel Neelkanth; Nair, Sabila; Kannoly, Gopinathan

2017-05-01

The primary objective of our study was to assess the role of diagnostic antepartum amnioinfusion on the yield from targeted ultrasounds performed in pregnancies with severe oligo- and anhydramnios. This was a retrospective and descriptive study, conducted in the fetal medicine units of two private tertiary care referral centers in south India. The details of all the cases of diagnostic amnioinfusion performed at these two centers from January 2009 to June 2016 were collected and analyzed. Inclusion criteria were pregnancies between 17 and 26 weeks of gestational age with severe oligo- or anhydramnios. Pregnancies with obvious preterm premature rupture of membranes (PPROM) were excluded. The primary outcome measure was the improvement in diagnostic information pertaining to the cause of severe oligo- and anhydramnios, and the nature of such anomalies. A total of 61 cases were identified. The median gestational age at performance of the procedure was 22 weeks [IQR, 19.5-23]. The mean volume of normal saline infused was 314±54 ml. A significant increase in the single vertical pocket (SVP) was observed following the procedure (pre-procedure SVP=0.6±0.9 cm, post-procedure SVP=3.4±1.7 cm; paired t test, p<0.001). In 37 cases (37/61, 60.7%), there were no pre-procedure ultrasound findings. There was a significant overall increase in the detection of abnormalities post procedure (mean pre-procedure findings=0.39±0.49, mean post-procedure findings=1.59±1.24; paired t test, p<0.001). The most frequent group of anomalies/abnormalities was renal (36/61, 59%), followed by PPROM (13/61, 21.3%) and finally fetal growth restriction (11/61, 18%). Antepartum amnioinfusion is a valuable ancillary technique in prenatal diagnosis as it increases the diagnostic yield from pregnancies presenting with severe oligo- and anhydramnios. Copyright © 2017 Elsevier B.V. All rights reserved.
Lipo-chitin oligosaccharides, plant symbiosis signalling molecules that modulate mammalian angiogenesis in vitro.

PubMed

Djordjevic, Michael A; Bezos, Anna; Susanti; Marmuse, Laurence; Driguez, Hugues; Samain, Eric; Vauzeilles, Boris; Beau, Jean-Marie; Kordbacheh, Farzaneh; Rolfe, Barry G; Schwörer, Ralf; Daines, Alison M; Gresshoff, Peter M; Parish, Christopher R

2014-01-01

Lipochitin oligosaccharides (LCOs) are signaling molecules required by ecologically and agronomically important bacteria and fungi to establish symbioses with diverse land plants. In plants, oligo-chitins and LCOs can differentially interact with different lysin motif (LysM) receptors and affect innate immunity responses or symbiosis-related pathways. In animals, oligo-chitins also induce innate immunity and other physiological responses but LCO recognition has not been demonstrated. Here LCO and LCO-like compounds are shown to be biologically active in mammals in a structure dependent way through the modulation of angiogenesis, a tightly-regulated process involving the induction and growth of new blood vessels from existing vessels. The testing of 24 LCO, LCO-like or oligo-chitin compounds resulted in structure-dependent effects on angiogenesis in vitro leading to promotion, or inhibition or nil effects. Like plants, the mammalian LCO biological activity depended upon the presence and type of terminal substitutions. Un-substituted oligo-chitins of similar chain lengths were unable to modulate angiogenesis indicating that mammalian cells, like plant cells, can distinguish between LCOs and un-substituted oligo-chitins. The cellular mode-of-action of the biologically active LCOs in mammals was determined. The stimulation or inhibition of endothelial cell adhesion to vitronectin or fibronectin correlated with their pro- or anti-angiogenic activity. Importantly, novel and more easily synthesised LCO-like disaccharide molecules were also biologically active and de-acetylated chitobiose was shown to be the primary structural basis of recognition. Given this, simpler chitin disaccharides derivatives based on the structure of biologically active LCOs were synthesised and purified and these showed biological activity in mammalian cells. 
Since important chronic disease states are linked to either insufficient or excessive angiogenesis, LCO and LCO-like molecules may have the potential to be a new, carbohydrate-based class of therapeutics for modulating angiogenesis.
Construction of a Lotus japonicus late nodulin expressed sequence tag library and identification of novel nodule-specific genes.

PubMed Central

Szczyglowski, K; Hamburger, D; Kapranov, P; de Bruijn, F J

1997-01-01

A range of novel expressed sequence tags (ESTs) associated with late developmental events during nodule organogenesis in the legume Lotus japonicus were identified using mRNA differential display; 110 differentially displayed polymerase chain reaction products were cloned and analyzed. Of 88 unique cDNAs obtained, 22 shared significant homology to DNA/protein sequences in the respective databases. This group comprises, among others, a nodule-specific homolog of protein phosphatase 2C, a peptide transporter protein, and a nodule-specific form of cytochrome P450. RNA gel-blot analysis of 16 differentially displayed ESTs confirmed their nodule-specific expression pattern. The kinetics of mRNA accumulation of the majority of the ESTs analyzed were found to resemble the expression pattern observed for the L. japonicus leghemoglobin gene. These results indicate that the newly isolated molecular markers correspond to genes induced during late developmental stages of L. japonicus nodule organogenesis and provide important, novel tools for the study of nodulation. PMID:9276951
Transcriptome analysis of carnation (Dianthus caryophyllus L.) based on next-generation sequencing technology.

PubMed

Tanase, Koji; Nishitani, Chikako; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Ohmiya, Akemi; Onozaki, Takashi

2012-07-02

Carnation (Dianthus caryophyllus L.), in the family Caryophyllaceae, can be found in a wide range of colors and is a model system for studies of flower senescence. In addition, it is one of the most important flowers in the global floriculture industry. However, few genomics resources, such as sequences and markers are available for carnation or other members of the Caryophyllaceae. To increase our understanding of the genetic control of important characters in carnation, we generated an expressed sequence tag (EST) database for a carnation cultivar important in horticulture by high-throughput sequencing using 454 pyrosequencing technology. We constructed a normalized cDNA library and a 3'-UTR library of carnation, obtaining a total of 1,162,126 high-quality reads. These reads were assembled into 300,740 unigenes consisting of 37,844 contigs and 262,896 singlets. The contigs were searched against an Arabidopsis sequence database, and 61.8% (23,380) of them had at least one BLASTX hit. These contigs were also annotated with Gene Ontology (GO) and were found to cover a broad range of GO categories. Furthermore, we identified 17,362 potential simple sequence repeats (SSRs) in 14,291 of the unigenes. We focused on gene discovery in the areas of flower color and ethylene biosynthesis. Transcripts were identified for almost every gene involved in flower chlorophyll and carotenoid metabolism and in anthocyanin biosynthesis. Transcripts were also identified for every step in the ethylene biosynthesis pathway. We present the first large-scale sequence data set for carnation, generated using next-generation sequencing technology. The large EST database generated from these sequences is an informative resource for identifying genes involved in various biological processes in carnation and provides an EST resource for understanding the genetic diversity of this plant.
Transcriptome analysis of carnation (Dianthus caryophyllus L.) based on next-generation sequencing technology

PubMed Central

2012-01-01

Background Carnation (Dianthus caryophyllus L.), in the family Caryophyllaceae, can be found in a wide range of colors and is a model system for studies of flower senescence. In addition, it is one of the most important flowers in the global floriculture industry. However, few genomics resources, such as sequences and markers are available for carnation or other members of the Caryophyllaceae. To increase our understanding of the genetic control of important characters in carnation, we generated an expressed sequence tag (EST) database for a carnation cultivar important in horticulture by high-throughput sequencing using 454 pyrosequencing technology. Results We constructed a normalized cDNA library and a 3’-UTR library of carnation, obtaining a total of 1,162,126 high-quality reads. These reads were assembled into 300,740 unigenes consisting of 37,844 contigs and 262,896 singlets. The contigs were searched against an Arabidopsis sequence database, and 61.8% (23,380) of them had at least one BLASTX hit. These contigs were also annotated with Gene Ontology (GO) and were found to cover a broad range of GO categories. Furthermore, we identified 17,362 potential simple sequence repeats (SSRs) in 14,291 of the unigenes. We focused on gene discovery in the areas of flower color and ethylene biosynthesis. Transcripts were identified for almost every gene involved in flower chlorophyll and carotenoid metabolism and in anthocyanin biosynthesis. Transcripts were also identified for every step in the ethylene biosynthesis pathway. Conclusions We present the first large-scale sequence data set for carnation, generated using next-generation sequencing technology. The large EST database generated from these sequences is an informative resource for identifying genes involved in various biological processes in carnation and provides an EST resource for understanding the genetic diversity of this plant. PMID:22747974
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation.

PubMed

Rowland, Lisa J; Alkharouf, Nadim; Darwish, Omar; Ogden, Elizabeth L; Polashock, James J; Bassil, Nahla V; Main, Dorrie

2012-04-02

There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. 
One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry.
Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation

PubMed Central

2012-01-01

Background There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Results Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. 
Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. Conclusions These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry. PMID:22471859
Pyrene-Containing ortho-Oligo(phenylene)ethynylene Foldamer as a Ratiometric Probe Based on Circularly Polarized Luminescence.

PubMed

Reiné, Pablo; Justicia, Jose; Morcillo, Sara P; Abbate, Sergio; Vaz, Belen; Ribagorda, María; Orte, Ángel; Álvarez de Cienfuegos, Luis; Longhi, Giovanna; Campaña, Araceli G; Miguel, Delia; Cuerva, Juan M

2018-04-20

In this manuscript, we report the first synthesis of an organic monomolecular emitter, which behaves as a circularly polarized luminescence (CPL)-based ratiometric probe. The enantiopure helical ortho-oligo(phenylene)ethynylene ( o-OPE) core has been prepared by a new and efficient macrocyclization reaction. The combination of such o-OPE helical skeleton and a pyrene couple leads to two different CPL emission features in a single structure whose ratio linearly responds to silver(I) concentration.
Informatic selection of a neural crest-melanocyte cDNA set for microarray analysis

PubMed Central

Loftus, S. K.; Chen, Y.; Gooden, G.; Ryan, J. F.; Birznieks, G.; Hilliard, M.; Baxevanis, A. D.; Bittner, M.; Meltzer, P.; Trent, J.; Pavan, W.

1999-01-01

With cDNA microarrays, it is now possible to compare the expression of many genes simultaneously. To maximize the likelihood of finding genes whose expression is altered under the experimental conditions, it would be advantageous to be able to select clones for tissue-appropriate cDNA sets. We have taken advantage of the extensive sequence information in the dbEST expressed sequence tag (EST) database to identify a neural crest-derived melanocyte cDNA set for microarray analysis. Analysis of characterized genes with dbEST identified one library that contained ESTs representing 21 neural crest-expressed genes (library 198). The distribution of the ESTs corresponding to these genes was biased toward being derived from library 198. This is in contrast to the EST distribution profile for a set of control genes, characterized to be more ubiquitously expressed in multiple tissues (P < 1 × 10^-9). From library 198, a subset of 852 clustered ESTs were selected that have a library distribution profile similar to that of the 21 neural crest-expressed genes. Microarray analysis demonstrated the majority of the neural crest-selected 852 ESTs (Mel1 array) were differentially expressed in melanoma cell lines compared with a non-neural crest kidney epithelial cell line (P < 1 × 10^-8). This was not observed with an array of 1,238 ESTs that was selected without library origin bias (P = 0.204). This study presents an approach for selecting tissue-appropriate cDNAs that can be used to examine the expression profiles of developmental processes and diseases. PMID:10430933
Candidate gene database and transcript map for peach, a model species for fruit trees.

PubMed

Horn, Renate; Lecouls, Anne-Claire; Callahan, Ann; Dandekar, Abhaya; Garay, Lilibeth; McCord, Per; Howad, Werner; Chan, Helen; Verde, Ignazio; Main, Doreen; Jung, Sook; Georgi, Laura; Forrest, Sam; Mook, Jennifer; Zhebentyayeva, Tatyana; Yu, Yeisoo; Kim, Hye Ran; Jesudurai, Christopher; Sosinski, Bryon; Arús, Pere; Baird, Vance; Parfitt, Dan; Reighard, Gregory; Scorza, Ralph; Tomkins, Jeffrey; Wing, Rod; Abbott, Albert Glenn

2005-05-01

Peach (Prunus persica) is a model species for the Rosaceae, which includes a number of economically important fruit tree species. To develop an extensive Prunus expressed sequence tag (EST) database for identifying and cloning the genes important to fruit and tree development, we generated 9,984 high-quality ESTs from a peach cDNA library of developing fruit mesocarp. After assembly and annotation, a putative peach unigene set consisting of 3,842 ESTs was defined. Gene ontology (GO) classification was assigned based on the annotation of the single "best hit" match against the Swiss-Prot database. No significant homology could be found in the GenBank nr databases for 24.3% of the sequences. Using core markers from the general Prunus genetic map, we anchored bacterial artificial chromosome (BAC) clones on the genetic map, thereby providing a framework for the construction of a physical and transcript map. A transcript map was developed by hybridizing 1,236 ESTs from the putative peach unigene set and an additional 68 peach cDNA clones against the peach BAC library. Hybridizing ESTs to genetically anchored BACs immediately localized 11.2% of the ESTs on the genetic map. ESTs showed a clustering of expressed genes in defined regions of the linkage groups. [The data were built into a regularly updated Genome Database for Rosaceae (GDR), available at (http://www.genome.clemson.edu/gdr/).].
PAVE: program for assembling and viewing ESTs.

PubMed

Soderlund, Carol; Johnson, Eric; Bomhoff, Matthew; Descour, Anne

2009-08-26

New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information; hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. The PAVE (Program for Assembling and Viewing ESTs) assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly; however, it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.
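The idea of "burying" can be illustrated with a deliberately simplified sketch. PAVE buries similar ESTs, and the abstract does not give the similarity criterion, so the example below stands in exact sequence identity for similarity. All function names are hypothetical and this is not PAVE's implementation. The point it shows is that collapsing redundant reads shrinks the input to the assembler while the per-sequence counts preserve expression-level information.

```python
from collections import Counter

def bury_duplicates(reads):
    """Collapse identical reads into {representative_sequence: copy_count}.

    Exact identity is a stand-in (an assumption) for the similarity test
    a real assembler would apply.
    """
    return Counter(read.upper() for read in reads)

def expression_level(buried, contig_members):
    """Read support of a contig: the summed counts of its member sequences."""
    return sum(buried[seq] for seq in contig_members)

if __name__ == "__main__":
    reads = ["ACGTACGT", "acgtacgt", "ACGTACGT", "GGCCGGCC"]
    buried = bury_duplicates(reads)
    print(len(reads), "reads collapsed to", len(buried), "representatives")
    print("support:", expression_level(buried, ["ACGTACGT", "GGCCGGCC"]))
```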
Characterization of expressed sequence tags (ESTs) of pigeonpea (Cajanus cajan L.) and functional validation of selected genes for abiotic stress tolerance in Arabidopsis thaliana.

PubMed

Priyanka, B; Sekhar, K; Sunita, T; Reddy, V D; Rao, Khareedu Venkateswara

2010-03-01

Pigeonpea, a major grain legume crop with remarkable drought tolerance traits, has been used for the isolation of stress-responsive genes. Herein, we report generation of ESTs, transcript profiles of selected genes and validation of candidate genes obtained from the subtracted cDNA libraries of pigeonpea plants subjected to PEG/water-deficit stress conditions. Cluster analysis of 124 selected ESTs yielded 75 high-quality ESTs. Homology searches disclosed that 55 ESTs share significant similarity with the known/putative proteins or ESTs available in the databases. These ESTs were characterized and genes relevant to the specific physiological processes were identified. Of the 75 ESTs obtained from the cDNA libraries of drought-stressed plants, 20 ESTs proved to be unique to the pigeonpea. These sequences are envisaged to serve as a potential source of stress-inducible genes of the drought stress-response transcriptome, and hence may be used for deciphering the mechanism of drought tolerance of the pigeonpea. Expression profiles of selected genes revealed increased levels of m-RNA transcripts in pigeonpea plants subjected to different abiotic stresses. Transgenic Arabidopsis lines, expressing Cajanus cajan hybrid-proline-rich protein (CcHyPRP), C. cajan cyclophilin (CcCYP) and C. cajan cold and drought regulatory (CcCDR) genes, exhibited marked tolerance, increased plant biomass and enhanced photosynthetic rates under PEG/NaCl/cold/heat stress conditions. This study represents the first report dealing with the isolation of drought-specific ESTs, transcriptome analysis and functional validation of drought-responsive genes of the pigeonpea. These genes, as such, hold promise for engineering crop plants bestowed with tolerance to major abiotic stresses.
Toward allotetraploid cotton genome assembly: integration of a high-density molecular genetic linkage map with DNA sequence information

PubMed Central

2012-01-01

Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. 
Conclusion This study will serve as a valuable genomic resource for tetraploid cotton genome assembly, for cloning genes related to superior agronomic traits, and for further comparative genomic analyses in Gossypium. PMID:23046547
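The map statistics in this record can be checked with simple arithmetic. The sketch below assumes, although the abstract does not say so, that the average inter-locus distance was computed over the intervals between adjacent loci within each linkage group, so that a group of k loci contributes k - 1 intervals. Under that assumption the reported 1.08 cM is reproduced, whereas dividing the map length by the raw number of loci would give about 1.07 cM.

```python
def mean_interlocus_distance(total_cm, n_loci, n_groups):
    """Mean spacing between adjacent loci, assuming each linkage group
    of k loci contributes k - 1 intervals (an assumption, see lead-in)."""
    intervals = n_loci - n_groups
    if intervals <= 0:
        raise ValueError("need more loci than linkage groups")
    return total_cm / intervals

if __name__ == "__main__":
    # Figures quoted in the abstract: 2,247 earlier loci plus 1,167 new ones.
    n_loci = 2247 + 1167
    print(n_loci)  # 3414
    print(round(mean_interlocus_distance(3667.62, n_loci, 26), 2))  # 1.08
```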
Comparative Analysis of Expressed Genes from Cacao Meristems Infected by Moniliophthora perniciosa

PubMed Central

Gesteira, Abelmon S.; Micheli, Fabienne; Carels, Nicolas; Da Silva, Aline C.; Gramacho, Karina P.; Schuster, Ivan; Macêdo, Joci N.; Pereira, Gonçalo A. G.; Cascardo, Júlio C. M.

2007-01-01

Background and Aims Witches' broom disease is caused by the hemibiotrophic basidiomycete Moniliophthora perniciosa, and is one of the most important diseases of cacao in the western hemisphere. Because very little is known about the global process of such disease development, expressed sequence tags (ESTs) were used to identify genes expressed during the Theobroma cacao–Moniliophthora perniciosa interaction. Methods Two cDNA libraries corresponding to the resistant (RT) and susceptible (SP) cacao–M. perniciosa interactions were constructed from total RNA, using the DB SMART Creator cDNA library kit (Clontech). Clones were randomly selected, sequenced from the 5′ end and analysed using bioinformatics tools including in silico analysis of the differential gene expression. Key Results A total of 6884 ESTs were generated from the RT and SP cDNA libraries. These ESTs were composed of 2585 singlets and 341 contigs for a total of 2926 non-redundant sequences. The redundancy of the libraries was low and their specificity high when compared with the few other cacao libraries already published. Sequence analysis allowed the assignment of a putative functional category for 54 % of sequences, whereas approx. 22 % of sequences corresponded to unknown function and approx. 24 % of sequences did not show any significant similarity with other proteins present in the database. Despite the similar overall distribution of the sequences in functional categories between the two libraries, qualitative differences were observed. Genes involved during the defence response to pathogen infection or in programmed cell death were identified, such as pathogenesis-related proteins, trypsin inhibitor or oxalate oxidase, and some of them showed an in silico differential expression between the resistant and the susceptible interactions. Conclusions As far as is known this is the first EST resource from the cacao–M. perniciosa interaction and it is believed that it will provide a significant contribution to the understanding of the molecular mechanisms of the resistance and susceptibility of cacao to M. perniciosa, to develop strategies to control witches' broom, and as a source of polymorphism for molecular marker development and marker-assisted selection. PMID:17557832
De novo assembly and characterization of bark transcriptome using Illumina sequencing and development of EST-SSR markers in rubber tree (Hevea brasiliensis Muell. Arg.)

PubMed Central

2012-01-01

Background In rubber tree, bark is one of the important agricultural and biological organs. However, the molecular mechanism involved in bark formation and development in rubber tree remains largely unknown, which is at least partially due to the lack of bark transcriptomic and genomic information. Therefore, it is necessary to carry out high-throughput transcriptome sequencing of rubber tree bark to generate enormous transcript sequences for functional characterization and molecular marker development. Results In this study, more than 30 million sequencing reads were generated using Illumina paired-end sequencing technology. In total, 22,756 unigenes with an average length of 485 bp were obtained with de novo assembly. The similarity search indicated that 16,520 and 12,558 unigenes showed significant similarities to known proteins from NCBI non-redundant and Swissprot protein databases, respectively. Among these annotated unigenes, 6,867 and 5,559 unigenes were separately assigned to Gene Ontology (GO) and Clusters of Orthologous Group (COG). When the 22,756 unigenes were searched against the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database, 12,097 unigenes were assigned to 5 main categories including 123 KEGG pathways. Among the main KEGG categories, metabolism was the biggest category (9,043, 74.75%), suggesting active metabolic processes in rubber tree bark. In addition, a total of 39,257 EST-SSRs were identified from the 22,756 unigenes, and the characteristics of the EST-SSRs were further analyzed in rubber tree. 110 potential marker sites were randomly selected to validate the assembly quality and develop EST-SSR markers. Among 13 Hevea germplasms, the PCR success rate and polymorphism rate of the 110 markers were 96.36% and 55.45%, respectively, in this study. Conclusion By assembling and analyzing de novo transcriptome sequencing data, we reported the comprehensive functional characterization of rubber tree bark.
This research generated a substantial fraction of rubber tree transcriptome sequences, which were very useful resources for gene annotation and discovery, molecular markers development, genome assembly and annotation, and microarrays development in rubber tree. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding in rubber tree. Moreover, this study also supported that transcriptome analysis based on Illumina paired-end sequencing is a powerful tool for transcriptome characterization and molecular marker development in non-model species, especially those with large and complex genomes. PMID:22607098
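The EST-SSR identification step described in this record can be illustrated with a minimal Python sketch that scans a unigene for perfect simple sequence repeats. The function name `find_ssrs` and the minimum repeat counts per motif length are hypothetical choices for illustration; the abstract does not state which tool or thresholds the authors used.

```python
import re

def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    # Hypothetical thresholds: minimum number of repeats for each motif length.
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of motif_len bases followed by at least (min_rep - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AAA" or "ATAT").
            if any(motif == motif[:d] * (motif_len // d)
                   for d in range(1, motif_len) if motif_len % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // motif_len))
    return sorted(hits)

# Toy unigene containing an (AG)7 and a (CTT)5 repeat.
example = "GGC" + "AG" * 7 + "TTC" + "CTT" * 5 + "A"
print(find_ssrs(example))
```

Primer pairs would then be designed in the sequence flanking each reported start position; that step is outside this sketch.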
ocsESTdb: a database of oil crop seed EST sequences for comparative analysis and investigation of a global metabolic network and oil accumulation metabolism.

PubMed

Ke, Tao; Yu, Jingyin; Dong, Caihua; Mao, Han; Hua, Wei; Liu, Shengyi

2015-01-21

Oil crop seeds are important sources of fatty acids (FAs) for human and animal nutrition. Despite their importance, there is a lack of an essential bioinformatics resource on gene transcription of oil crops from a comparative perspective. In this study, we developed ocsESTdb, the first database of expressed sequence tag (EST) information on seeds of four large-scale oil crops with an emphasis on global metabolic networks and oil accumulation metabolism that target the involved unigenes. A total of 248,522 ESTs and 106,835 unigenes were collected from the cDNA libraries of rapeseed (Brassica napus), soybean (Glycine max), sesame (Sesamum indicum) and peanut (Arachis hypogaea). These unigenes were annotated by a sequence similarity search against databases including TAIR, NR protein database, Gene Ontology, COG, Swiss-Prot, TrEMBL and Kyoto Encyclopedia of Genes and Genomes (KEGG). Five genome-scale metabolic networks that contain different numbers of metabolites and gene-enzyme reaction-association entries were analysed and constructed using Cytoscape and yEd programs. Details of unigene entries, deduced amino acid sequences and putative annotation are available from our database to browse, search and download. Intuitive and graphical representations of EST/unigene sequences, functional annotations, metabolic pathways and metabolic networks are also available. ocsESTdb will be updated regularly and can be freely accessed at http://ocri-genomics.org/ocsESTdb/ . ocsESTdb may serve as a valuable and unique resource for comparative analysis of acyl lipid synthesis and metabolism in oilseed plants. It also may provide vital insights into improving oil content in seeds of oil crop species by transcriptional reconstruction of the metabolic network.
Poly A- Transcripts Expressed in HeLa Cells

PubMed Central

Lu, Jian; Xuan, Zhenyu; Chen, Jun; Zheng, Yonglan; Zhou, Tom; Zhang, Michael Q.; Wu, Chung-I; Wang, San Ming

2008-01-01

Background Transcripts expressed in eukaryotes are classified as poly A+ transcripts or poly A- transcripts based on the presence or absence of the 3′ poly A tail. Most transcripts identified so far are poly A+ transcripts, whereas the poly A- transcripts remain largely unknown. Methodology/Principal Findings We developed the TRD (Total RNA Detection) system for transcript identification. The system detects the transcripts through the following steps: 1) depleting the abundant ribosomal and small-size transcripts; 2) synthesizing cDNA without regard to the status of the 3′ poly A tail; 3) applying the 454 sequencing technology for massive 3′ EST collection from the cDNA; and 4) determining the genome origins of the detected transcripts by mapping the sequences to the human genome reference sequences. Using this system, we characterized the cytoplasmic transcripts from HeLa cells. Of the 13,467 distinct 3′ ESTs analyzed, 24% are poly A-, 36% are poly A+, and 40% are bimorphic with poly A+ features but without the 3′ poly A tail. Most of the poly A- 3′ ESTs do not match known transcript sequences; they have a similar distribution pattern in the genome as the poly A+ and bimorphic 3′ ESTs, and their mapped intergenic regions are evolutionarily conserved. Experiments confirmed the authenticity of the detected poly A- transcripts. Conclusion/Significance Our study provides the first large-scale sequence evidence for the presence of poly A- transcripts in eukaryotes. The abundance of the poly A- transcripts highlights the need for comprehensive identification of these transcripts for decoding the transcriptome, annotating the genome and studying biological relevance of the poly A- transcripts. PMID:18665230
A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

PubMed Central

Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

2009-01-01

Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. 
Conclusion The new EST collection denotes an important step towards the identification of all genes in the citrus genome. Furthermore, public availability of the cDNA clones generated in this study, and not only their sequence, enables testing of the biological function of the genes represented in the collection. Expression of the citrus SEP3 homologue, CitrSEP, in Arabidopsis results in early flowering, along with other phenotypes resembling the over-expression of the Arabidopsis SEPALLATA genes. Our findings suggest that the members of the SEP gene family play similar roles in these quite distant plant species. PMID:19747386
Directional, seamless, and restriction enzyme-free construction of random-primed complementary DNA libraries using phosphorothioate-modified primers.

PubMed

Howland, Shanshan W; Poh, Chek-Meng; Rénia, Laurent

2011-09-01

Directional cloning of complementary DNA (cDNA) primed by oligo(dT) is commonly achieved by appending a restriction site to the primer, whereas the second strand is synthesized through the combined action of RNase H and Escherichia coli DNA polymerase I (PolI). Although random primers provide more uniform and complete coverage, directional cloning with the same strategy is highly inefficient. We report that phosphorothioate linkages protect the tail sequence appended to random primers from the 5'→3' exonuclease activity of PolI. We present a simple strategy for constructing a random-primed cDNA library using the efficient, size-independent, and seamless In-Fusion cloning method instead of restriction enzymes. Copyright © 2011 Elsevier Inc. All rights reserved.
Late Oligocene-Early Miocene larger benthic foraminifera from the mixed siliciclastic-carbonate and reefal strata of Kharabeh Sanji stratigraphic section, NW Iran

NASA Astrophysics Data System (ADS)

Hosseinzadeh, R.

2012-04-01

The marine Oligo-Miocene sediments of the Qom Formation at the Kharabeh Sanji section, west of Uromieh, which consist of mixed siliciclastic-carbonates changing to reefal strata, were studied in detail to establish a high-resolution biostratigraphic zonal scheme. The continuous distribution of larger benthic foraminifera (mainly miogypsinids) allowed us to correlate the identified taxa with the shallow benthic zonation (SBZ) already introduced for European sequences and to assign a detailed age to the studied section based on the determined biozones. The identified fauna include the genera Miogypsinoides, Miogypsina, Neorotalia, Nephrolepidina, Eulepidina and Spiroclypeus. The foraminiferal assemblage resembles the fauna described from European basins that characterizes zones SBZ 23 to SBZ 25, representing a time interval from the Late Chattian to the Burdigalian.

Label-free probing of genes by time-domain terahertz sensing.

PubMed

Haring Bolivar, P; Brucherseifer, M; Nagel, M; Kurz, H; Bosserhoff, A; Büttner, R

2002-11-07

A label-free sensing approach for the characterization of genetic material with terahertz (THz) electromagnetic waves is presented. Time-resolved THz analysis of polynucleotides demonstrates a strong dependence of the complex refractive index of DNA molecules in the THz frequency range on their hybridization state. By monitoring THz signals one can thus infer the binding state (hybridized or denatured) of oligo- and polynucleotides, enabling the label-free determination of the genetic composition of unknown DNA sequences. A broadband experimental proof-of-principle in a free-space analytic configuration, as well as a higher-sensitivity approach using integrated THz sensors reaching femtomol detection levels and demonstrating the capability to detect single-base mutations, are presented. The potential application for next-generation high-throughput label-free genetic analytic systems is discussed.
Junctions between i-motif tetramers in supramolecular structures

PubMed Central

Guittet, Eric; Renciuk, Daniel; Leroy, Jean-Louis

2012-01-01

The symmetry of i-motif tetramers gives cytidine-rich oligonucleotides the capacity to associate into supramolecular structures (sms). In order to determine how the tetramers are linked together in such structures, we have measured by gel filtration chromatography and NMR the formation and dissociation kinetics of sms built by oligonucleotides containing two short C stretches separated by a non-cytidine base. We show that a stretch of only two cytidines at either the 3′- or 5′-end is long enough to link the tetramers into sms. The analysis of the properties of sms formed by oligonucleotides differing in the length of the oligo-C stretches, the sequence orientation and the nature of the non-C base provides a model of the junction connecting the tetramers in sms. PMID:22362739
Evolution, substrate specificity and subfamily classification of glycoside hydrolase family 5 (GH5).

PubMed

Aspeborg, Henrik; Coutinho, Pedro M; Wang, Yang; Brumer, Harry; Henrissat, Bernard

2012-09-20

The large Glycoside Hydrolase family 5 (GH5) groups together a wide range of enzymes acting on β-linked oligo- and polysaccharides, and glycoconjugates from a large spectrum of organisms. The long and complex evolution of this family of enzymes and its broad sequence diversity limits functional prediction. With the objective of improving the differentiation of enzyme specificities in a knowledge-based context, and to obtain new evolutionary insights, we present here a new, robust subfamily classification of family GH5. About 80% of the current sequences were assigned into 51 subfamilies in a global analysis of all publicly available GH5 sequences and associated biochemical data. Examination of subfamilies with catalytically-active members revealed that one third are monospecific (containing a single enzyme activity), although new functions may be discovered with biochemical characterization in the future. Furthermore, twenty subfamilies presently have no characterization whatsoever and many others have only limited structural and biochemical data. Mapping of functional knowledge onto the GH5 phylogenetic tree revealed that the sequence space of this historical and industrially important family is far from well dispersed, highlighting targets in need of further study. The analysis also uncovered a number of GH5 proteins which have lost their catalytic machinery, indicating evolution towards novel functions. Overall, the subfamily division of GH5 provides an actively curated resource for large-scale protein sequence annotation for glycogenomics; the subfamily assignments are openly accessible via the Carbohydrate-Active Enzyme database at http://www.cazy.org/GH5.html.
Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

PubMed

Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

2016-06-04

Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.
Fluorescent probes for nucleic Acid visualization in fixed and live cells.

PubMed

Boutorine, Alexandre S; Novopashina, Darya S; Krasheninina, Olga A; Nozeret, Karine; Venyaminova, Alya G

2013-12-11

This review analyses the literature concerning non-fluorescent and fluorescent probes for nucleic acid imaging in fixed and living cells from the point of view of their suitability for imaging intracellular native RNA and DNA. Attention is mainly paid to fluorescent probes for fluorescence microscopy imaging. Requirements for the target-binding part and the fluorophore making up the probe are formulated. In the case of native double-stranded DNA, structure-specific and sequence-specific probes are discussed. Among the latter, three classes of dsDNA-targeting molecules are described: (i) sequence-specific peptides and proteins; (ii) triplex-forming oligonucleotides and (iii) polyamide oligo(N-methylpyrrole/N-methylimidazole) minor groove binders. Polyamides seem to be the most promising targeting agents for fluorescent probe design; however, some technical problems remain to be solved, such as the relatively low sequence specificity and the high background fluorescence inside the cells. Several examples of fluorescent probe applications for DNA imaging in fixed and living cells are cited. In the case of intracellular RNA, only modified oligonucleotides can provide such sequence-specific imaging. Several approaches for designing fluorescent probes are considered: linear fluorescent probes based on modified oligonucleotide analogs, molecular beacons, binary fluorescent probes and template-directed reactions with fluorescence probe formation, FRET donor-acceptor pairs, pyrene excimers, aptamers and others. The suitability of all these methods for living cell applications is discussed.
Gene Expression Profiling Reveals Functional Specialization along the Intestinal Tract of a Carnivorous Teleostean Fish (Dicentrarchus labrax)

PubMed Central

Calduch-Giner, Josep A.; Sitjà-Bobadilla, Ariadna; Pérez-Sánchez, Jaume

2016-01-01

High-quality sequencing reads from the intestine of European sea bass were assembled, annotated by similarity against protein reference databases and combined with nucleotide sequences from public and private databases. After redundancy filtering, 24,906 non-redundant annotated sequences encoding 15,367 different gene descriptions were obtained. These annotated sequences were used to design a custom, high-density oligo-microarray (8 × 15 K) for the transcriptomic profiling of anterior (AI), middle (MI), and posterior (PI) intestinal segments. Similar molecular signatures were found for AI and MI segments, which were combined in a single group (AI-MI), whereas the PI stood out separately, with more than 1900 differentially expressed genes at a fold-change cutoff of 2. Functional analysis revealed that molecular and cellular functions related to feed digestion and nutrient absorption and transport were over-represented in AI-MI segments. By contrast, the initiation and establishment of immune defense mechanisms became especially relevant in PI, although the microarray expression profiling validated by qPCR indicated that these functional changes are gradual from anterior to posterior intestinal segments. This functional divergence occurred in association with spatial transcriptional changes in nutrient transporters and the mucosal chemosensing system via G protein-coupled receptors. These findings help to identify key indicators of gut functions and to compare different fish feeding strategies and immune defense mechanisms acquired along the evolution of teleosts. PMID:27610085
Gene Expression Profiling Reveals Functional Specialization along the Intestinal Tract of a Carnivorous Teleostean Fish (Dicentrarchus labrax).

PubMed

Calduch-Giner, Josep A; Sitjà-Bobadilla, Ariadna; Pérez-Sánchez, Jaume

2016-01-01

High-quality sequencing reads from the intestine of European sea bass were assembled, annotated by similarity against protein reference databases and combined with nucleotide sequences from public and private databases. After redundancy filtering, 24,906 non-redundant annotated sequences encoding 15,367 different gene descriptions were obtained. These annotated sequences were used to design a custom, high-density oligo-microarray (8 × 15 K) for the transcriptomic profiling of anterior (AI), middle (MI), and posterior (PI) intestinal segments. Similar molecular signatures were found for AI and MI segments, which were combined in a single group (AI-MI), whereas the PI stood out separately, with more than 1900 differentially expressed genes at a fold-change cutoff of 2. Functional analysis revealed that molecular and cellular functions related to feed digestion and nutrient absorption and transport were over-represented in AI-MI segments. By contrast, the initiation and establishment of immune defense mechanisms became especially relevant in PI, although the microarray expression profiling validated by qPCR indicated that these functional changes are gradual from anterior to posterior intestinal segments. This functional divergence occurred in association with spatial transcriptional changes in nutrient transporters and the mucosal chemosensing system via G protein-coupled receptors. These findings help to identify key indicators of gut functions and to compare different fish feeding strategies and immune defense mechanisms acquired along the evolution of teleosts.
Partial DNA sequencing of Douglas-fir cDNAs used in RFLP mapping

Treesearch

K.D. Jermstad; D.L. Bassoni; C.S. Kinlaw; D.B. Neale

1998-01-01

DNA sequences from 87 Douglas-fir (Pseudotsuga menziesii [Mirb.] Franco) cDNA RFLP probes were determined. Sequences were submitted to the GenBank dbEST database and searched for similarity against nucleotide and protein databases using the BLASTn and BLASTx programs. Twenty-one sequences (24%) were assigned putative functions; 18 of which...
Smart poly(oligo(propylene glycol) methacrylate) hydrogel prepared by gamma radiation

NASA Astrophysics Data System (ADS)

Suljovrujic, E.; Micic, M.

2015-01-01

The synthesis of poly(oligo(propylene glycol) methacrylate) (POPGMA) from functionalised oligo(propylene glycol) methacrylate (OPGMA) monomers by gamma radiation-induced radical polymerisation is reported for the first time; POPGMA homopolymeric hydrogel with oligo(propylene glycol) (OPG) pendant chains, as a non-linear PPGMA-analogue, was synthesised from a monomer-solvent (OPGMA375-water/ethanol) mixture at different irradiation doses (5, 10, 25, and 40 kGy). Determination of the gel fraction was conducted after synthesis. The swelling properties of the POPGMA hydrogel were preliminarily investigated over wide pH (2.2-9.0) and temperature (4-70 °C) ranges. Additional characterisation of structure and properties was conducted by UV-vis and Fourier transform infrared (FTIR) spectroscopy as well as by differential scanning calorimetry (DSC). In order to evaluate the potential for biomedical applications, biocompatibility (cytocompatibility and haemolytic activity) studies were performed as well. Sol-gel conversion was relatively high for all irradiation doses, indicating radiation-induced synthesis as a good method for fabricating this hydrogel. Thermoresponsiveness and variations in swelling capacity as a result of thermosensitive OPG pendant chains with a lower critical solution temperature (LCST) were mainly observed below room temperature; thus, the volume phase transition temperature (VPTT) of POPGMA homopolymeric hydrogel is about 15 °C. Furthermore, POPGMA has satisfactory biocompatibility. The results indicate that hydrogels with propylene glycol pendant chains can be easily prepared by gamma radiation and have potential for different applications as smart and biocompatible polymers.
PLANT OLIGOSACCHARIDES ENHANCE WHEAT DEFENCE RESPONSE AGAINST SEPTORIA LEAF BLOTCH.

PubMed

Somai-Jemmali, L; Siah, A; Randoux, B; Reignault, Ph; Halama, P; Rodriguez, R; Hamada, W

2015-01-01

Our work provides the first evidence for elicitation and protection effects of preventive treatments with a new oligosaccharide (20%)-based formulation (Oligos) against Mycosphaerella graminicola, a major pathogen of bread wheat (BW) and durum wheat (DW). In planta Oligos treatment led to strongly reduced hyphal growth, penetration, mesophyll colonization and fructification. During the necrotrophic phase, Oligos also drastically decreased the production of M. graminicola CWDE activities, such as xylanase and glucanase as well as protease activity in both wheat species, suggesting their correlation with disease severity. Concerning plant defence markers, PR2, Chi 4 precursor-, Per- and LOX-1-encoding genes were up-regulated, while glucanase (GLUC), catalase (CAT) and lipoxygenase (LOX) activities and total phenolic compound (PC) accumulation were induced in both non-inoculated and inoculated contexts. In the inoculated context, a localized accumulation of H2O2 and PC at fungal penetration sites and a specific induction of phenylalanine ammonia-lyase (PAL) enzymatic activity were observed. Moreover, our experiment revealed some similarities and differences between the responses of the two wheat species. GLUC and CAT activities and H2O2 accumulation were more responsive in DW leaves, while LOX and PAL activities and PC accumulation occurred earlier and to a stronger extent in BW leaves. The tested Oligos formulation showed an interesting resistance induction activity characterized by a high and stable efficiency whatever the wheat species, suggesting its integration in common control strategies against STB on both DW and BW.
A blackberry (Rubus L.) expressed sequence tag library for the development of simple sequence repeat markers

PubMed Central

Lewers, Kim S; Saski, Chris A; Cuthbertson, Brandon J; Henry, David C; Staton, Meg E; Main, Dorrie S; Dhanaraj, Anik L; Rowland, Lisa J; Tomkins, Jeff P

2008-01-01

Background The recent development of novel repeat-fruiting types of blackberry (Rubus L.) cultivars, combined with a long history of morphological marker-assisted selection for thornlessness by blackberry breeders, has given rise to increased interest in using molecular markers to facilitate blackberry breeding. Yet no genetic maps, molecular markers, or even sequences exist specifically for cultivated blackberry. The purpose of this study is to begin development of these tools by generating and annotating the first blackberry expressed sequence tag (EST) library, designing primers from the ESTs to amplify regions containing simple sequence repeats (SSR), and testing the usefulness of a subset of the EST-SSRs with two blackberry cultivars. Results A cDNA library of 18,432 clones was generated from expanding leaf tissue of the cultivar Merton Thornless, a progenitor of many thornless commercial cultivars. Among the most abundantly expressed of the 3,000 genes annotated were those involved with energy, cell structure, and defense. From individual sequences containing SSRs, 673 primer pairs were designed. Of a randomly chosen set of 33 primer pairs tested with two blackberry cultivars, 10 detected an average of 1.9 polymorphic PCR products. Conclusion This rate predicts that this library may yield as many as 940 SSR primer pairs detecting 1,786 polymorphisms. This may be sufficient to generate a genetic map that can be used to associate molecular markers with phenotypic traits, making possible molecular marker-assisted breeding to complement existing morphological marker-assisted breeding in blackberry. PMID:18570660
The Salivary Microbiome in Polycystic Ovary Syndrome (PCOS) and Its Association with Disease-Related Parameters: A Pilot Study.

PubMed

Lindheim, Lisa; Bashir, Mina; Münzker, Julia; Trummer, Christian; Zachhuber, Verena; Pieber, Thomas R; Gorkiewicz, Gregor; Obermayer-Pietsch, Barbara

2016-01-01

Polycystic ovary syndrome (PCOS) is a common female endocrine condition of unclear etiology characterized by hyperandrogenism, oligo/amenorrhoea, and polycystic ovarian morphology. PCOS is often complicated by infertility, overweight/obesity, insulin resistance, and low-grade inflammation. The gut microbiome is known to contribute to several of these conditions. Recently, an association between stool and saliva microbiome community profiles was shown, making saliva a possible convenient, non-invasive sample type for detecting gut microbiome changes in systemic disease. In this study, we describe the saliva microbiome of PCOS patients and the association of microbiome features with PCOS-related parameters. 16S rRNA gene amplicon sequencing was performed on saliva samples from 24 PCOS patients and 20 healthy controls. Data processing and microbiome analyses were conducted in mothur and QIIME. All study subjects were characterized regarding reproductive, metabolic, and inflammatory parameters. PCOS patients showed a decrease in bacteria from the phylum Actinobacteria and a borderline significant shift in bacterial community composition in unweighted UniFrac analysis. No differences between patients and controls were found in alpha diversity, weighted UniFrac analysis, or on other taxonomic levels. We found no association of saliva alpha diversity, beta diversity, or taxonomic composition with serum testosterone, oligo/amenorrhoea, overweight, insulin resistance, inflammatory markers, age, or diet. In this pilot study, patients with PCOS showed a reduced salivary relative abundance of Actinobacteria. Reproductive and metabolic components of the syndrome were not associated with saliva microbiome parameters, indicating that the majority of between-subject variation in saliva microbiome profiles remains to be explained.
Biochemical properties of Glu-SH3 as a family 13 glycoside hydrolase with remarkable substrate specificity for trehalose: Implications to sequence-based classification of CAZymes.

PubMed

Ghadikolaei, Kamran Khalili; Shojaei, Maral; Ghaderi, Armin; Hojjati, Farzaneh; Noghabi, Kambiz Akbari; Zahiri, Hossein Shahbani

2016-08-01

A novel glycoside hydrolase from Exiguobacterium sp. SH3 was characterized. The enzyme, designated as Glu-SH3, was predicted by in silico analysis to have structural similarity with members of oligo-1,6-glucosidase and trehalose-6-phosphate hydrolase subfamilies in the GH-13 family of glycoside hydrolases. The gene was expressed in Escherichia coli and the recombinant enzyme was purified as a His-tagged protein of about 60 kDa. The enzyme was shown to have remarkable substrate specificity for trehalose. The characteristic ability of Glu-SH3 to hydrolyze trehalose was ascertained by zymography, thin layer chromatography, and NMR spectroscopy. The maximum activity of Glu-SH3 was obtained at 35 °C and pH 7, but it was able to exhibit more than 90% of the activity within the pH range of 5-8. The Vmax and Km values were estimated to be 170 U and 4.5 mg ml(-1), respectively. By comparison with trehalases, Glu-SH3 with Kcat and Kcat/Km values of 1552 s(-1) and 119.4 mM(-1) s(-1) can be recognized as a very efficient trehalose-hydrolyzing glycosidase. Given the phylogeny and the substrate specificity of Glu-SH3, it may be assumed that the enzyme shares a common ancestor with oligo-1,6-glucosidases but has evolved distinctly to serve a physiological function in trehalose metabolism. Copyright © 2016 Elsevier Inc. All rights reserved.
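The kinetic constants quoted in this record can be cross-checked with a short calculation: converting the Km from mg/ml to mM and dividing the Kcat by it should roughly reproduce the reported catalytic efficiency. The sketch below assumes the substrate mass refers to anhydrous trehalose (342.3 g/mol), which the abstract does not state; the variable names are illustrative.

```python
# Assumption: Km in mg/ml refers to anhydrous trehalose (C12H22O11, 342.3 g/mol).
TREHALOSE_MW = 342.3        # g/mol

km_mg_per_ml = 4.5          # reported Km, mg/ml (equivalently g/L)
kcat_per_s = 1552.0         # reported Kcat, 1/s

# g/L divided by g/mol gives mol/L; multiply by 1000 for mM.
km_mM = km_mg_per_ml / TREHALOSE_MW * 1000.0

# Catalytic efficiency in 1/(mM*s).
efficiency = kcat_per_s / km_mM

print(f"Km = {km_mM:.1f} mM, Kcat/Km = {efficiency:.1f} mM^-1 s^-1")
```

Under this assumption the computed efficiency lands within about 1% of the reported 119.4 mM(-1) s(-1); the small gap is what rounding of the published Km would be expected to produce.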
Optimal word sizes for dissimilarity measures and estimation of the degree of dissimilarity between DNA sequences.

PubMed

Wu, Tiee-Jian; Huang, Ying-Hsueh; Li, Lung-An

2005-11-15

Several measures of DNA sequence dissimilarity have been developed. The purpose of this paper is 3-fold. Firstly, we compare the performance of several word-based or alignment-based methods. Secondly, we give a general guideline for choosing the window size and determining the optimal word sizes for several word-based measures at different window sizes. Thirdly, we use a large-scale simulation method to simulate data from the distribution of SK-LD (symmetric Kullback-Leibler discrepancy). These simulated data can be used to estimate the degree of dissimilarity beta between any pair of DNA sequences. Our study shows (1) for whole sequence similarity/dissimilarity identification the window size taken should be as large as possible, but probably not >3000, as restricted by CPU time in practice, (2) for each measure the optimal word size increases with window size, (3) when the optimal word size is used, SK-LD performance is superior in both simulation and real data analysis, (4) the estimate of beta based on SK-LD can be used to quickly filter out a large number of dissimilar sequences and speed up alignment-based database search for similar sequences and (5) beta is also applicable in local similarity comparison situations. For example, it can help in selecting oligo probes with high specificity and, therefore, has potential in probe design for microarrays. The SK-LD algorithm, the estimate of beta and the simulation software are implemented in MATLAB code, and are available at http://www.stat.ncku.edu.tw/tjwu
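To make the word-based idea concrete, here is a minimal sketch of a symmetric Kullback-Leibler discrepancy between the k-word frequency profiles of two DNA sequences. It is not the authors' MATLAB implementation: the pseudocount smoothing, the default word size, the whole-sequence comparison (no sliding window) and the function names are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
from math import log


def word_frequencies(seq, k, pseudocount=1.0):
    """Relative frequencies of all 4**k DNA words, smoothed with a pseudocount.

    ASSUMPTION: the pseudocount avoids log(0); the published method may
    handle zero counts differently.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    words = ["".join(w) for w in product("ACGT", repeat=k)]
    total = sum(counts[w] for w in words) + pseudocount * len(words)
    return {w: (counts[w] + pseudocount) / total for w in words}


def symmetric_kl(seq_a, seq_b, k=3):
    """KL(p||q) + KL(q||p) over k-word frequency profiles."""
    p = word_frequencies(seq_a, k)
    q = word_frequencies(seq_b, k)
    return sum((p[w] - q[w]) * log(p[w] / q[w]) for w in p)


a = "ACGTACGTACGTACGTACGTACGT"
b = "AAAAAAAATTTTTTTTCCCCGGGG"
print(symmetric_kl(a, a))      # identical profiles give 0.0
print(symmetric_kl(a, b) > 0)  # different compositions give a positive value
```

The measure is alignment-free, so it can serve as a quick pre-filter before a slower alignment-based search, which is the use case the abstract describes.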
Analysis of xylem formation in pine by cDNA sequencing

NASA Technical Reports Server (NTRS)

Allona, I.; Quinn, M.; Shoop, E.; Swope, K.; St Cyr, S.; Carlis, J.; Riedl, J.; Retzel, E.; Campbell, M. M.; Sederoff, R.;

1998-01-01

Secondary xylem (wood) formation is likely to involve some genes expressed rarely or not at all in herbaceous plants. Moreover, environmental and developmental stimuli influence secondary xylem differentiation, producing morphological and chemical changes in wood. To increase our understanding of xylem formation, and to provide material for comparative analysis of gymnosperm and angiosperm sequences, ESTs were obtained from immature xylem of loblolly pine (Pinus taeda L.). A total of 1,097 single-pass sequences were obtained from 5' ends of cDNAs made from gravistimulated tissue from bent trees. Cluster analysis detected 107 groups of similar sequences, ranging in size from 2 to 20 sequences. A total of 361 sequences fell into these groups, whereas 736 sequences were unique. About 55% of the pine EST sequences show similarity to previously described sequences in public databases. About 10% of the recognized genes encode factors involved in cell wall formation. Sequences similar to cell wall proteins, most known lignin biosynthetic enzymes, and several enzymes of carbohydrate metabolism were found. A number of putative regulatory proteins also are represented. Expression patterns of several of these genes were studied in various tissues and organs of pine. Sequencing novel genes expressed during xylem formation will provide a powerful means of identifying mechanisms controlling this important differentiation pathway.

The characterisation of novel secreted Ly-6 proteins from rat urine by the combined use of two-dimensional gel electrophoresis, microbore high performance liquid chromatography and expressed sequence tag data.

PubMed

Southan, Christopher; Cutler, Paul; Birrell, Helen; Connell, John; Fantom, Kenneth G M; Sims, Matthew; Shaikh, Narjis; Schneider, Klaus

2002-02-01

A proteomic study of rat urine was undertaken using two-dimensional gel electrophoresis, microbore high performance liquid chromatography, mass spectrometry and N-terminal sequencing. Five known urinary proteins were identified but two novel peptide fragments matched a large number of rat expressed sequence tags (ESTs) from a liver library. By combining protein chemical and nucleotide data, two 101-residue open reading frames with 90% amino acid identity were determined, rat urinary protein 1 (RUP-1) and RUP-2. The data established signal peptide removal and provided evidence for N-glycosylation. A third related sequence, rat spleen protein (RSP-1) was confirmed from EST searches. These three proteins have been submitted to SWISS-PROT as P81827, P81828 and Q9QXN2, respectively. A fourth novel homologue was found in porcine and bovine ESTs from embryo libraries. Alignment with known homologues showed conserved cysteine positions characteristic of a secreted subfamily of Ly-6 proteins. In two cases, antineoplastic urinary protein and caltrin, these homologues have unverified functional annotations. The RUP sequences showed high scoring matches to three unrelated rat mRNAs subsequently established to be chimeric. Two of these share extended sectional identity to RUP-1 but the third may represent another novel Ly-6 homologue. These chimeras have caused serious annotation errors in secondary databases.
Gene discovery using next-generation pyrosequencing to develop ESTs for Phalaenopsis orchids

PubMed Central

2011-01-01

Background Orchids are one of the most diversified angiosperms, but few genomic resources are available for these non-model plants. In addition to its ecological significance, Phalaenopsis is economically important to the floriculture industry worldwide. We aimed to use massively parallel 454 pyrosequencing for a global characterization of the Phalaenopsis transcriptome. Results To maximize sequence diversity, we pooled RNA from 10 samples of different tissues, various developmental stages, and biotic- or abiotic-stressed plants. We obtained 206,960 expressed sequence tags (ESTs) with an average read length of 228 bp. These reads were assembled into 8,233 contigs and 34,630 singletons. The unigenes were searched against the NCBI non-redundant (NR) protein database. Based on sequence similarity with known proteins, these analyses identified 22,234 different genes (E-value cutoff, 1e-7). Assembled sequences were annotated with Gene Ontology, Gene Family and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Among these annotations, over 780 unigenes encoding putative transcription factors were identified. Conclusion Pyrosequencing was effective in identifying a large set of unigenes from Phalaenopsis. The informative EST dataset we developed constitutes a much-needed resource for discovery of genes involved in various biological processes in Phalaenopsis and other orchid species. These transcribed sequences will narrow the gap between study of model organisms with many genomic resources and species that are important for ecological and evolutionary studies. PMID:21749684
Self-Cloning CRISPR.

PubMed

Arbab, Mandana; Sherwood, Richard I

2016-08-17

CRISPR/Cas9-gene editing has emerged as a revolutionary technology to easily modify specific genomic loci by designing complementary sgRNA sequences and introducing these into cells along with Cas9. Self-cloning CRISPR/Cas9 (scCRISPR) uses a self-cleaving palindromic sgRNA plasmid (sgPal) that recombines with short PCR-amplified site-specific sgRNA sequences within the target cell by homologous recombination to circumvent the process of sgRNA plasmid construction. Through this mechanism, scCRISPR enables gene editing within 2 hr once sgRNA oligos are available, with high efficiency equivalent to conventional sgRNA targeting: >90% gene knockout in both mouse and human embryonic stem cells and cancer cell lines. Furthermore, using PCR-based addition of short homology arms, we achieve efficient site-specific knock-in of transgenes such as GFP without traditional plasmid cloning or genome-integrated selection cassette (2% to 4% knock-in rate). The methods in this paper describe the most rapid and efficient means of CRISPR gene editing. © 2016 by John Wiley & Sons, Inc.
Capture-SELEX: Selection of DNA Aptamers for Aminoglycoside Antibiotics

PubMed Central

2012-01-01

Small organic molecules are challenging targets for aptamer selection using the SELEX technology (SELEX: Systematic Evolution of Ligands by EXponential enrichment). Often they are not suitable for immobilization on solid surfaces, which is a common procedure in known aptamer selection methods. The Capture-SELEX procedure allows the selection of DNA aptamers for solute targets. A special SELEX library was constructed with the aim of immobilizing this library on magnetic beads or other surfaces. For this purpose a docking sequence was incorporated into the random region of the library, enabling hybridization to a complementary oligo fixed on magnetic beads. Oligonucleotides of the library which exhibit high affinity to the target and a secondary structure fitting to the target are released from the beads for binding to the target during the aptamer selection process. The oligonucleotides of these binding complexes were amplified, purified, and immobilized via the docking sequence to the magnetic beads as the starting point of the following selection round. Based on this Capture-SELEX procedure, the successful DNA aptamer selection for the aminoglycoside antibiotic kanamycin A as a small molecule target is described. PMID:23326761
Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

PubMed Central

2011-01-01

Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. 
Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms. PMID:21791039

A Novel Alkaliphilic Bacillus Esterase Belongs to the 13th Bacterial Lipolytic Enzyme Family

PubMed Central

Rao, Lang; Xue, Yanfen; Zheng, Yingying; Lu, Jian R.; Ma, Yanhe

2013-01-01

Background Microbially derived lipolytic hydrolases are an important class of biocatalysts because of their huge abundance and ability to display bioactivities under extreme conditions. In spite of recent advances, our understanding of these enzymes remains rudimentary. The aim of our research is to advance our understanding by seeking more unusual lipid hydrolases and revealing their molecular structure and bioactivities. Methodology/Principal Findings Bacillus pseudofirmus OF4 is an extreme alkaliphile with tolerance of pH up to 11. In this work we successfully undertook a heterologous expression of a gene estof4 from the alkaliphilic B. pseudofirmus OF4. The recombinant protein called EstOF4 was purified into a homogeneous product by Ni-NTA affinity and gel filtration. The purified EstOF4 was active as a dimer with a molecular weight of 64 kDa. It hydrolyzed a wide range of substrates including p-nitrophenyl esters (C2–C12) and triglycerides (C2–C6). Its optimal performance occurred at pH 8.5 and 50°C towards p-nitrophenyl caproate and triacetin. Sequence alignment revealed that EstOF4 shared 71% identity to esterase Est30 from Geobacillus stearothermophilus with a typical lipase pentapeptide motif G91LS93LG95. A structural model developed from homology modeling revealed that EstOF4 possessed a typical esterase 6α/7β hydrolase fold and a cap domain. Site-directed mutagenesis and inhibition studies confirmed the putative catalytic triad Ser93, Asp190 and His220. Conclusion EstOF4 is a new bacterial esterase with a preference for short chain ester substrates. With a high sequence identity towards esterase Est30 and several others, EstOF4 was classified into the same bacterial lipolytic family, Family XIII. All the members in this family originate from the same bacterial genus, Bacillus, and display optimal activities from neutral pH to alkaline conditions with short and middle chain length substrates. 
However, with roughly 70% sequence identity, these enzymes showed hugely different thermal stabilities, indicating their diverse thermal adaptations via just changing a few amino acid residues. PMID:23577139
A Transcriptomic Analysis of Echinococcus granulosus Larval Stages: Implications for Parasite Biology and Host Adaptation

PubMed Central

Parkinson, John; Wasmuth, James D.; Salinas, Gustavo; Bizarro, Cristiano V.; Sanford, Chris; Berriman, Matthew; Ferreira, Henrique B.; Zaha, Arnaldo; Blaxter, Mark L.; Maizels, Rick M.; Fernández, Cecilia

2012-01-01

Background The cestode Echinococcus granulosus - the agent of cystic echinococcosis, a zoonosis affecting humans and domestic animals worldwide - is an excellent model for the study of host-parasite cross-talk that interfaces with two mammalian hosts. To develop the molecular analysis of these interactions, we carried out an EST survey of E. granulosus larval stages. We report the salient features of this study with a focus on genes reflecting physiological adaptations of different parasite stages. Methodology/Principal Findings We generated ∼10,000 ESTs from two sets of full-length enriched libraries (derived from oligo-capped and trans-spliced cDNAs) prepared with three parasite materials: hydatid cyst wall, larval worms (protoscoleces), and pepsin/H+-activated protoscoleces. The ESTs were clustered into 2700 distinct gene products. In the context of the biology of E. granulosus, our analyses reveal: (i) a diverse group of abundant long non-protein coding transcripts showing homology to a middle repetitive element (EgBRep) that could either be active molecular species or represent precursors of small RNAs (like piRNAs); (ii) an up-regulation of fermentative pathways in the tissue of the cyst wall; (iii) highly expressed thiol- and selenol-dependent antioxidant enzyme targets of thioredoxin glutathione reductase, the functional hub of redox metabolism in parasitic flatworms; (iv) candidate apomucins for the external layer of the tissue-dwelling hydatid cyst, a mucin-rich structure that is critical for survival in the intermediate host; (v) a set of tetraspanins, a protein family that appears to have expanded in the cestode lineage; and (vi) a set of platyhelminth-specific gene products that may offer targets for novel pan-platyhelminth drug development. Conclusions/Significance This survey has greatly increased the quality and the quantity of the molecular information on E. granulosus and constitutes a valuable resource for gene prediction on the parasite genome and for further genomic and proteomic analyses focused on cestodes and platyhelminths. PMID:23209850
A global assembly of cotton ESTs

PubMed Central

Udall, Joshua A.; Swanson, Jordan M.; Haller, Karl; Rapp, Ryan A.; Sparks, Michael E.; Hatfield, Jamie; Yu, Yeisoo; Wu, Yingru; Dowd, Caitriona; Arpat, Aladdin B.; Sickler, Brad A.; Wilkins, Thea A.; Guo, Jin Ying; Chen, Xiao Ya; Scheffler, Jodi; Taliercio, Earl; Turley, Ricky; McFadden, Helen; Payton, Paxton; Klueva, Natalya; Allen, Randell; Zhang, Deshui; Haigler, Candace; Wilkerson, Curtis; Suo, Jinfeng; Schulze, Stefan R.; Pierce, Margaret L.; Essenberg, Margaret; Kim, HyeRan; Llewellyn, Danny J.; Dennis, Elizabeth S.; Kudrna, David; Wing, Rod; Paterson, Andrew H.; Soderlund, Cari; Wendel, Jonathan F.

2006-01-01

Approximately 185,000 Gossypium EST sequences comprising >94,800,000 nucleotides were amassed from 30 cDNA libraries constructed from a variety of tissues and organs under a range of conditions, including drought stress and pathogen challenges. These libraries were derived from allopolyploid cotton (Gossypium hirsutum; AT and DT genomes) as well as its two diploid progenitors, Gossypium arboreum (A genome) and Gossypium raimondii (D genome). ESTs were assembled using the Program for Assembling and Viewing ESTs (PAVE), resulting in 22,030 contigs and 29,077 singletons (51,107 unigenes). Further comparisons among the singletons and contigs led to recognition of 33,665 exemplar sequences that represent a nonredundant set of putative Gossypium genes containing partial or full-length coding regions and usually one or two UTRs. The assembly, along with their UniProt BLASTX hits, GO annotation, and Pfam analysis results, are freely accessible as a public resource for cotton genomics. Because ESTs from diploid and allotetraploid Gossypium were combined in a single assembly, we were in many cases able to bioinformatically distinguish duplicated genes in allotetraploid cotton and assign them to either the A or D genome. The assembly and associated information provide a framework for future investigation of cotton functional and evolutionary genomics. PMID:16478941
Needles in the EST Haystack: Large-Scale Identification and Analysis of Excretory-Secretory (ES) Proteins in Parasitic Nematodes Using Expressed Sequence Tags (ESTs)

PubMed Central

Nagaraj, Shivashankar H.; Gasser, Robin B.; Ranganathan, Shoba

2008-01-01

Background Parasitic nematodes of humans, other animals and plants continue to impose a significant public health and economic burden worldwide, due to the diseases they cause. Promising antiparasitic drug and vaccine candidates have been discovered from excreted or secreted (ES) proteins released from the parasite and exposed to the immune system of the host. Mining the entire expressed sequence tag (EST) data available from parasitic nematodes represents an approach to discover such ES targets. Methods and Findings In this study, we predicted, using EST2Secretome, a novel, high-throughput, computational workflow system, 4,710 ES proteins from 452,134 ESTs derived from 39 different species of nematodes, parasitic in animals (including humans) or plants. In total, 2,632, 786, and 1,292 ES proteins were predicted for animal-, human-, and plant-parasitic nematodes. Subsequently, we systematically analysed ES proteins using computational methods. Of these 4,710 proteins, 2,490 (52.8%) had orthologues in Caenorhabditis elegans, whereas 621 (13.8%) appeared to be novel, currently having no significant match to any molecule available in public databases. Of the C. elegans homologues, 267 had strong “loss-of-function” phenotypes by RNA interference (RNAi) in this nematode. We could functionally classify 1,948 (41.3%) sequences using the Gene Ontology (GO) terms, establish pathway associations for 573 (12.2%) sequences using Kyoto Encyclopaedia of Genes and Genomes (KEGG), and identify protein interaction partners for 1,774 (37.6%) molecules. We also mapped 758 (16.1%) proteins to protein domains including the nematode-specific protein family “transthyretin-like” and “chromadorea ALT,” considered as vaccine candidates against filariasis in humans. Conclusions We report the large-scale analysis of ES proteins inferred from EST data for a range of parasitic nematodes. 
This set of ES proteins provides an inventory of known and novel members of ES proteins as a foundation for studies focused on understanding the biology of parasitic nematodes and their interactions with their hosts, as well as for the development of novel drugs or vaccines for parasite intervention and control. PMID:18820748
Analysis and functional annotation of expressed sequence tags (ESTs) from multiple tissues of oil palm (Elaeis guineensis Jacq.)

PubMed Central

Ho, Chai-Ling; Kwan, Yen-Yen; Choi, Mei-Chooi; Tee, Sue-Sean; Ng, Wai-Har; Lim, Kok-Ang; Lee, Yang-Ping; Ooi, Siew-Eng; Lee, Weng-Wah; Tee, Jin-Ming; Tan, Siang-Hee; Kulaveerasingam, Harikrishna; Alwee, Sharifah Shahrul Rabiah Syed; Abdullah, Meilina Ong

2007-01-01

Background Oil palm is the second largest source of edible oil, contributing approximately 20% of the world's production of oils and fats. In order to understand the molecular biology involved in in vitro propagation, flowering, efficient utilization of nitrogen sources and root diseases, we have initiated an expressed sequence tag (EST) analysis on oil palm. Results In this study, six cDNA libraries from oil palm zygotic embryos, suspension cells, shoot apical meristems, young flowers, mature flowers and roots, were constructed. We have generated a total of 14537 expressed sequence tags (ESTs) from these libraries, from which 6464 tentative unique contigs (TUCs) and 2129 singletons were obtained. Approximately 6008 of these tentative unique genes (TUGs) have significant matches to the non-redundant protein database, from which 2361 were assigned to one or more Gene Ontology categories. Predominant transcripts and differentially expressed genes were identified in multiple oil palm tissues. Homologues of genes involved in many aspects of flower development were also identified among the EST collection, such as CONSTANS-like, AGAMOUS-like (AGL)2, AGL20, LFY-like, SQUAMOSA, SQUAMOSA binding protein (SBP) etc. The majority of them are the first representatives in oil palm, providing opportunities to explore the cause of epigenetic homeotic flowering abnormality in oil palm, given the importance of flowering in fruit production. The transcript levels of two flowering-related genes, EgSBP and EgSEP, were analysed in the flower tissues of various developmental stages. Gene homologues for enzymes involved in oil biosynthesis, utilization of nitrogen sources, and scavenging of oxygen radicals, were also uncovered among the oil palm ESTs. 
Conclusion The EST sequences generated will allow comparative genomic studies between oil palm and other monocotyledonous and dicotyledonous plants, development of gene-targeted markers for the reference genetic map, design and fabrication of DNA array for future studies of oil palm. The outcomes of such studies will contribute to oil palm improvements through the establishment of breeding program using marker-assisted selection, development of diagnostic assays using gene targeted markers, and discovery of candidate genes related to important agronomic traits of oil palm. PMID:17953740
Refined annotation and assembly of the Tetrahymena thermophila genome sequence through EST analysis, comparative genomic hybridization, and targeted gap closure

PubMed Central

Coyne, Robert S; Thiagarajan, Mathangi; Jones, Kristie M; Wortman, Jennifer R; Tallon, Luke J; Haas, Brian J; Cassidy-Hanley, Donna M; Wiley, Emily A; Smith, Joshua J; Collins, Kathleen; Lee, Suzanne R; Couvillion, Mary T; Liu, Yifan; Garg, Jyoti; Pearlman, Ronald E; Hamilton, Eileen P; Orias, Eduardo; Eisen, Jonathan A; Methé, Barbara A

2008-01-01

Background Tetrahymena thermophila, a widely studied model for cellular and molecular biology, is a binucleated single-celled organism with a germline micronucleus (MIC) and somatic macronucleus (MAC). The recent draft MAC genome assembly revealed low sequence repetitiveness, a result of the epigenetic removal of invasive DNA elements found only in the MIC genome. Such low repetitiveness makes complete closure of the MAC genome a feasible goal, which to achieve would require standard closure methods as well as removal of minor MIC contamination of the MAC genome assembly. Highly accurate preliminary annotation of Tetrahymena's coding potential was hindered by the lack of both comparative genomic sequence information from close relatives and significant amounts of cDNA evidence, thus limiting the value of the genomic information and also leaving unanswered certain questions, such as the frequency of alternative splicing. Results We addressed the problem of MIC contamination using comparative genomic hybridization with purified MIC and MAC DNA probes against a whole genome oligonucleotide microarray, allowing the identification of 763 genome scaffolds likely to contain MIC-limited DNA sequences. We also employed standard genome closure methods to essentially finish over 60% of the MAC genome. For the improvement of annotation, we have sequenced and analyzed over 60,000 verified EST reads from a variety of cellular growth and development conditions. Using this EST evidence, a combination of automated and manual reannotation efforts led to updates that affect 16% of the current protein-coding gene models. By comparing EST abundance, many genes showing apparent differential expression between these conditions were identified. Rare instances of alternative splicing and uses of the non-standard amino acid selenocysteine were also identified. Conclusion We report here significant progress in genome closure and reannotation of Tetrahymena thermophila. 
Our experience to date suggests that complete closure of the MAC genome is attainable. Using the new EST evidence, automated and manual curation has resulted in substantial improvements to the over 24,000 gene models, which will be valuable to researchers studying this model organism as well as for comparative genomics purposes. PMID:19036158
Pharmacologic Studies on the In Vitro Bronchodilating Vasoactive Actions of Oligo-PGB (Prostaglandin B)

DTIC Science & Technology

1988-01-06

leukotriene D4 (LTD4) 1x10-9 M and carbachol 1x10-[?] M induced similar contractions. The degree of relaxation induced by O-PGB was dependent upon the...contractile agonist (36% of papaverine when the bronchi were precontracted with SJ, 25% against LTD4 and only 15% against carbachol) suggesting that the...18 or 24 hours. In protl1, we either tested HMW and oligo-PGB in the absence of a contractile agent or after exposing the tissue to carbachol (Cch
Tiny abortive initiation transcripts exert antitermination activity on an RNA hairpin-dependent intrinsic terminator.

PubMed

Lee, Sooncheol; Nguyen, Huong Minh; Kang, Changwon

2010-10-01

No biological function has been identified for tiny RNA transcripts that are abortively and repetitiously released from initiation complexes of RNA polymerase in vitro and in vivo to date. In this study, we show that abortive initiation affects termination in transcription of bacteriophage T7 gene 10. Specifically, abortive transcripts produced from promoter phi 10 exert trans-acting antitermination activity on terminator T phi both in vitro and in vivo. Following abortive initiation cycling of T7 RNA polymerase at phi 10, short G-rich and oligo(G) RNAs were produced and both specifically sequestered 5- and 6-nt C + U stretch sequences, consequently interfering with terminator hairpin formation. This antitermination activity depended on sequence-specific hybridization of abortive transcripts with the 5' but not 3' half of T phi RNA. Antitermination was abolished when T phi was mutated to lack a C + U stretch, but restored when abortive transcript sequence was additionally modified to complement the mutation in T phi, both in vitro and in vivo. Antitermination was enhanced in vivo when the abortive transcript concentration was increased via overproduction of RNA polymerase or ribonuclease deficiency. Accordingly, antitermination activity exerted on T phi by abortive transcripts should facilitate expression of T phi-downstream promoter-less genes 11 and 12 in T7 infection of Escherichia coli.
Sequencing, Analysis, and Annotation of Expressed Sequence Tags for Camelus dromedarius

PubMed Central

Al-Swailem, Abdulaziz M.; Shehata, Maher M.; Abu-Duhier, Faisel M.; Al-Yamani, Essam J.; Al-Busadah, Khalid A.; Al-Arawi, Mohammed S.; Al-Khider, Ali Y.; Al-Muhaimeed, Abdullah N.; Al-Qahtani, Fahad H.; Manee, Manee M.; Al-Shomrani, Badr M.; Al-Qhtani, Saad M.; Al-Harthi, Amer S.; Akdemir, Kadir C.; Otu, Hasan H.

2010-01-01

Despite its economic, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and ∼40% hits extending to the start codons of full length cDNAs, suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. The accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with the possibility to add sequences from the public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies, enabling a starting point for whole genome sequencing of the organism. PMID:20502665
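The "ORF>300 bp" criterion mentioned above can be illustrated with a small six-frame scan. This is a sketch under stated assumptions rather than the pipeline used in the study: it defines an ORF as an ATG followed in frame by a stop codon, ignores partial ORFs that run off the end of a contig (common in EST data), and uses hypothetical function names.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_orf_length(seq):
    """Length in bp (start and stop codons included) of the longest
    ATG...stop ORF across all six reading frames.

    ASSUMPTION: only complete ORFs count; open-ended ORFs are ignored.
    """
    seq = seq.upper()
    best = 0
    for strand in (seq, reverse_complement(seq)):
        for frame in range(3):
            start = None
            for i in range(frame, len(strand) - 2, 3):
                codon = strand[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start codon opens the ORF
                elif start is not None and codon in STOP_CODONS:
                    best = max(best, i + 3 - start)
                    start = None  # look for the next ORF in this frame
    return best


def has_long_orf(seq, min_bp=300):
    """True if the longest complete ORF exceeds min_bp."""
    return longest_orf_length(seq) > min_bp


contig = "ATG" + "GCT" * 100 + "TAA"  # toy contig with one 306 bp ORF
print(longest_orf_length(contig), has_long_orf(contig))
```

Applying `has_long_orf` to every contig and taking the fraction that returns True would give a figure comparable in spirit to the ">80% of the contigs" statistic, although the exact value depends on how partial ORFs are treated.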
Bioinformatic mining of EST-SSR loci in the Pacific oyster, Crassostrea gigas.

PubMed

Wang, Y; Ren, R; Yu, Z

2008-06-01

A set of expressed sequence tag-simple sequence repeat (EST-SSR) markers of the Pacific oyster, Crassostrea gigas, was developed through bioinformatic mining of the GenBank public database. As of June 30, 2007, a total of 5132 EST sequences from GenBank were downloaded and screened for di-, tri- and tetra-nucleotide repeats, with criteria set at a minimum of 5, 4 and 4 repeats for the three categories of SSRs respectively. Seventeen polymorphic microsatellite markers were characterized. Allele numbers ranged from 3 to 10, and the observed and expected heterozygosity values varied from 0.125 to 0.770 and from 0.113 to 0.732 respectively. Eleven loci were at Hardy-Weinberg equilibrium (HWE); the other six loci showed significant departure from HWE (P < 0.01), suggesting possible presence of null alleles. Pairwise check of linkage disequilibrium (LD) indicated that 11 of 136 pairs of loci showed significant LD (P < 0.01), likely due to HWE present in single markers. Cross-species amplification was examined for five other Crassostrea species and reasonable results were obtained, promising usefulness of these markers in oyster genetics.
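The screening criteria described above (di-, tri- and tetra-nucleotide repeats with at least 5, 4 and 4 repeat units) can be sketched with regular expressions. This is an illustrative sketch, not the tool used by the authors: the function names are hypothetical, and the filter that discards motifs which are themselves repeats of a shorter unit (for example "AA" or "ATAT") is an added assumption. The last lines also confirm that 17 loci give 136 locus pairs for the linkage disequilibrium test.

```python
import re
from math import comb

# Minimum repeat counts per motif length, as stated in the abstract.
MIN_REPEATS = {2: 5, 3: 4, 4: 4}


def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit."""
    n = len(motif)
    return not any(
        n % d == 0 and motif[:d] * (n // d) == motif for d in range(1, n)
    )


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for SSRs in seq."""
    seq = seq.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # A motif of `size` bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if is_primitive(motif):
                hits.append((match.start(), motif, len(match.group(0)) // size))
    return sorted(hits)


est = "GGG" + "CA" * 6 + "TTG" + "AGC" * 4 + "T"  # toy EST sequence
print(find_ssrs(est))

n_pairs = comb(17, 2)  # pairwise LD tests among 17 loci
print(n_pairs)
```

In practice each hit would also need enough flanking sequence for primer design before it could become a usable marker.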
Insights into the Melipona scutellaris (Hymenoptera, Apidae, Meliponini) fat body transcriptome.

PubMed

de Sousa, Cristina Soares; Serrão, José Eduardo; Bonetti, Ana Maria; Amaral, Isabel Marques Rodrigues; Kerr, Warwick Estevam; Maranhão, Andréa Queiroz; Ueira-Vieira, Carlos

2013-07-01

The insect fat body is a multifunctional organ analogous to the vertebrate liver. The fat body is involved in the metabolism of juvenile hormone, regulation of environmental stress, production of immunity regulator-like proteins in cells and protein storage. However, very little is known about the molecular mechanisms involved in fat body physiology in stingless bees. In this study, we analyzed the transcriptome of the fat body from the stingless bee Melipona scutellaris. In silico analysis of a set of cDNA library sequences yielded 1728 expressed sequence tags (ESTs) and 997 high-quality sequences that were assembled into 29 contigs and 117 singlets. The BLAST X tool showed that 86% of the ESTs shared similarity with Apis mellifera (honeybee) genes. The M. scutellaris fat body ESTs encoded proteins with roles in numerous physiological processes, including anti-oxidation, phosphorylation, metabolism, detoxification, transmembrane transport, intracellular transport, cell proliferation, protein hydrolysis and protein synthesis. This is the first report to describe a transcriptomic analysis of specific organs of M. scutellaris. Our findings provide new insights into the physiological role of the fat body in stingless bees.
Insights into the Melipona scutellaris (Hymenoptera, Apidae, Meliponini) fat body transcriptome

PubMed Central

de Sousa, Cristina Soares; Serrão, José Eduardo; Bonetti, Ana Maria; Amaral, Isabel Marques Rodrigues; Kerr, Warwick Estevam; Maranhão, Andréa Queiroz; Ueira-Vieira, Carlos

2013-01-01

The insect fat body is a multifunctional organ analogous to the vertebrate liver. The fat body is involved in the metabolism of juvenile hormone, regulation of environmental stress, production of immunity regulator-like proteins in cells and protein storage. However, very little is known about the molecular mechanisms involved in fat body physiology in stingless bees. In this study, we analyzed the transcriptome of the fat body from the stingless bee Melipona scutellaris. In silico analysis of a set of cDNA library sequences yielded 1728 expressed sequence tags (ESTs) and 997 high-quality sequences that were assembled into 29 contigs and 117 singlets. The BLAST X tool showed that 86% of the ESTs shared similarity with Apis mellifera (honeybee) genes. The M. scutellaris fat body ESTs encoded proteins with roles in numerous physiological processes, including anti-oxidation, phosphorylation, metabolism, detoxification, transmembrane transport, intracellular transport, cell proliferation, protein hydrolysis and protein synthesis. This is the first report to describe a transcriptomic analysis of specific organs of M. scutellaris. Our findings provide new insights into the physiological role of the fat body in stingless bees. PMID:23885214
Gene discovery in Boophilus microplus, the cattle tick: the transcriptomes of ovaries, salivary glands, and hemocytes.

PubMed

Santos, Isabel K F de Miranda; Valenzuela, Jesus G; Ribeiro, José Marcos C; de Castro, Marilia; Costa, Juliana Nardelli; Costa, Ana Maria; da Silva, Edson Ramiro; Neto, Olavo Bilac Rego; Rocha, Clarisse; Daffre, Sirlei; Ferreira, Beatriz R; da Silva, João Santana; Szabó, Matias Pablo; Bechara, Gervasio Henrique

2004-10-01

The quest for new control strategies for ticks can profit from high throughput genomics. In order to identify genes that are involved in oogenesis and development, in defense, and in hematophagy, the transcriptomes of ovaries, hemocytes, and salivary glands from rapidly ingurgitating females, and of salivary glands from males of Boophilus microplus were PCR amplified, and the expressed sequence tags (EST) of random clones were mass sequenced. So far, more than 1,344 EST have been generated for these tissues, with approximately 30% novelty, depending on the tissue studied. To date approximately 760 nucleotide sequences from B. microplus are deposited in the NCBI database. Mass sequencing of partial cDNAs of parasite genes can build up this scant database and rapidly generate a large quantity of useful information about potential targets for immunobiological or chemical control.
SalmonDB: a bioinformatics resource for Salmo salar and Oncorhynchus mykiss

PubMed Central

Di Génova, Alex; Aravena, Andrés; Zapata, Luis; González, Mauricio; Maass, Alejandro; Iturra, Patricia

2011-01-01

SalmonDB is a new multiorganism database containing EST sequences from Salmo salar, Oncorhynchus mykiss and the whole genome sequence of Danio rerio, Gasterosteus aculeatus, Tetraodon nigroviridis, Oryzias latipes and Takifugu rubripes, built with core components from GMOD project, GOPArc system and the BioMart project. The information provided by this resource includes Gene Ontology terms, metabolic pathways, SNP prediction, CDS prediction, orthologs prediction, several precalculated BLAST searches and domains. It also provides a BLAST server for matching user-provided sequences to any of the databases and an advanced query tool (BioMart) that allows easy browsing of EST databases with user-defined criteria. These tools make the SalmonDB database a valuable resource for researchers searching for transcripts and genomic information regarding S. salar and other salmonid species. The database is expected to grow in the near future, particularly with the S. salar genome sequencing project. Database URL: http://genomicasalmones.dim.uchile.cl/ PMID:22120661
An expressed sequence tag (EST) library for Drosophila serrata, a model system for sexual selection and climatic adaptation studies.

PubMed

Frentiu, Francesca D; Adamski, Marcin; McGraw, Elizabeth A; Blows, Mark W; Chenoweth, Stephen F

2009-01-21

The native Australian fly Drosophila serrata belongs to the highly speciose montium subgroup of the melanogaster species group. It has recently emerged as an excellent model system with which to address a number of important questions, including the evolution of traits under sexual selection and traits involved in climatic adaptation along latitudinal gradients. Understanding the molecular genetic basis of such traits has been limited by a lack of genomic resources for this species. Here, we present the first expressed sequence tag (EST) collection for D. serrata that will enable the identification of genes underlying sexually-selected phenotypes and physiological responses to environmental change and may help resolve controversial phylogenetic relationships within the montium subgroup. A normalized cDNA library was constructed from whole fly bodies at several developmental stages, including larvae and adults. Assembly of 11,616 clones sequenced from the 3' end allowed us to identify 6,607 unique contigs, of which at least 90% encoded peptides. Partial transcripts were discovered from a variety of genes of evolutionary interest by BLASTing contigs against the 12 Drosophila genomes currently sequenced. By incorporating into the cDNA library multiple individuals from populations spanning a large portion of the geographical range of D. serrata, we were able to identify 11,057 putative single nucleotide polymorphisms (SNPs), with 278 different contigs having at least one "double hit" SNP that is highly likely to be a real polymorphism. At least 394 EST-associated microsatellite markers, representing 355 different contigs, were also found, providing an additional set of genetic markers. The assembled EST library is available online at http://www.chenowethlab.org/serrata/index.cgi. We have provided the first gene collection and largest set of polymorphic genetic markers, to date, for the fly D. serrata. The EST collection will provide much needed genomic resources for this model species and facilitate comparative evolutionary studies within the montium subgroup of the D. melanogaster lineage.
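The abstract above does not define a "double hit" SNP. The sketch below assumes one common reading: an alignment column where at least two different alleles are each supported by two or more reads, so that a single sequencing error cannot produce the call. The function name, the threshold parameter and the toy alignment are all hypothetical.

```python
from collections import Counter


def double_hit_snps(aligned_reads, min_allele_count=2):
    """Return {column: Counter} for columns where at least two alleles
    are each seen in min_allele_count or more reads.

    aligned_reads: equal-length strings, with '-' marking gaps or no coverage.
    """
    snps = {}
    for col in range(len(aligned_reads[0])):
        # Count only unambiguous bases in this column.
        counts = Counter(read[col] for read in aligned_reads if read[col] in "ACGT")
        supported = [base for base, n in counts.items() if n >= min_allele_count]
        if len(supported) >= 2:
            snps[col] = counts
    return snps


# Toy contig alignment: column 1 is C/T with both alleles seen at least twice;
# column 4 has a single C read, which is treated as a possible sequencing error.
reads = ["ACGTA", "ACGTA", "ATGTA", "ATGTC", "ACGT-"]
calls = double_hit_snps(reads)
```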
Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

PubMed Central

de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

2000-01-01

Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by GENSCAN (http://genes.mit.edu/GENSCAN.html). PMID:11070084
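As a quick arithmetic check, the percentages and the count of ORESTES-only sequences quoted in the abstract above can be recomputed from its raw counts. The dictionary labels are shorthand introduced here, not terms from the paper.

```python
# Counts copied from the abstract; the labels are shorthand for this example.
counts = {
    "contigs matching chr22": (1181, 81429),
    "known genes": (162, 247),
    "related genes": (67, 150),
    "EST-predicted genes": (45, 148),
}

# Percentage of each category covered by at least one ORESTES contig.
percentages = {label: 100 * hit / total for label, (hit, total) in counts.items()}
for label, pct in percentages.items():
    print(f"{label}: {pct:.2f}%")

# 219 newly identified transcribed sequences, of which 171 were also
# supported by ESTs or full length cDNAs already in GenBank.
orestes_only = 219 - 171
```

The recomputed values agree with the abstract, except that 67/150 is about 44.67%, which the abstract reports as 44.6% (apparently truncated rather than rounded).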
Tectono-sedimentary constraints to the Oligocene-to-Miocene evolution of the Peloritani thrust belt (NE Sicily)

NASA Astrophysics Data System (ADS)

Giunta, G.; Nigro, F.

1999-12-01

The Peloritani thrust belt belongs to the southern sector of the Calabrian Arc and is formed by a set of south-verging tectonic units, including crystalline basement and sedimentary cover (from the top: Aspromonte U.; Mela U.; Mandanici U.; Fondachelli U.; Longi-Taormina U.), piled up starting from Late Oligocene. At least two main terrigenous clastic formations lie with complicated relationships on top of the previous units: the Frazzanò Fm (Oligocene) and the Stilo-Capo d'Orlando Fm (Late Oligocene?-Early Miocene), as syn-to-post-tectonic deposits. These clastic deposits have different characteristics, in space and time, representing either flysch-like sequences involved in several thrust events (Frazzanò Fm) or molassic-like sequences (Stilo-Capo d'Orlando Fm), which unconformably overlie the tectonic units. In the present paper we describe a kinematic model of the progressive foreland migration of the Peloritani thrust belt, starting from Oligocene, carrying piggy-back basins and incorporating foredeep deposits, recognised in the Frazzanò-Stilo-Capo d'Orlando terrigenous successions. In general, the facies and structural observations on the overall Oligo-Miocene clastic sequences, outcropping in the Western Peloritani Mts, indicate: (a) the distal character of the Frazzanò Fm; (b) a complex group of terrigenous facies of the Stilo-Capo d'Orlando Fm, with lateral-to-vertical organisation, characterised by a distal-to-proximal-to-distal facies trend; (c) facies analogies of the basal portions of the Stilo-Capo d'Orlando Fm with the Frazzanò Fm; (d) the involvement of the Frazzanò Fm in lowermost and more external thrusting, and of the basal (Late Oligocene?) distal Stilo-Capo d'Orlando facies in the higher and inner thrusting during the early stages of deformation; (e) the involvement of the proximal Stilo-Capo d'Orlando facies in the tectonic edifice during the Early Miocene deformation; (f) the generally unconformable stratigraphical contacts of the higher proximal-to-distal (Early Miocene) Stilo-Capo d'Orlando facies on the constructing mobile belt; and (g) the presence of various thrust-faults, distinguished in a sequential order. The collected data allow us to hypothesise that the Oligo-Miocene tectono-sedimentary history was characterised by a foredeep with a deforming internal flank, probably lying in onlap on the constructing tectonic edifice (Frazzanò-lower Stilo-Capo d'Orlando Fms), and then deformed and covered by a piggy-back like sequence (middle-upper Stilo-Capo d'Orlando Fm), which was subsequently also deformed. The tectono-sedimentary evolution of the Peloritani belt probably developed through a progressive migration towards the foreland of a foredeep-compressional front couple and the chain body. The thrust stack progressively incorporated terrigenous foredeep deposits and in turn carried piggy-back basins.
Pyrosequencing the Manduca sexta larval midgut transcriptome: messages for digestion, detoxification and defence.

PubMed

Pauchet, Y; Wilkinson, P; Vogel, H; Nelson, D R; Reynolds, S E; Heckel, D G; ffrench-Constant, R H

2010-02-01

The tobacco hornworm Manduca sexta is an important model for insect physiology but genomic and transcriptomic data are currently lacking. Following a recent pyrosequencing study generating immune related expressed sequence tags (ESTs), here we use this new technology to define the M. sexta larval midgut transcriptome. We generated over 387,000 midgut ESTs, using a combination of Sanger and 454 sequencing, and classified predicted proteins into those involved in digestion, detoxification and immunity. In many cases the depth of 454 pyrosequencing coverage allowed us to define the entire cDNA sequence of a particular gene. Many new M. sexta genes are described including up to 36 new cytochrome P450s, some of which have been implicated in the metabolism of host plant-derived nicotine. New lepidopteran gene families such as the beta-fructofuranosidases, previously thought to be restricted to Bombyx mori, are also described. An unexpectedly high number of ESTs were involved in immunity, for example 39 contigs encoding serpins, and the increasingly appreciated role of the midgut in insect immunity is discussed. Similar studies of other tissues will allow for a tissue by tissue description of the M. sexta transcriptome and will form an essential complementary step on the road to genome sequencing and annotation.
Profiling the resting venom gland of the scorpion Tityus stigmurus through a transcriptomic survey.

PubMed

Almeida, Diego D; Scortecci, Katia C; Kobashi, Leonardo S; Agnez-Lima, Lucymara F; Medeiros, Silvia R B; Silva-Junior, Arnóbio A; Junqueira-de-Azevedo, Inácio de L M; Fernandes-Pedrosa, Matheus de F

2012-08-01

The scorpion Tityus stigmurus is widely distributed in Northeastern Brazil and known to cause severe human envenoming, inducing pain, hypoesthesia, edema, erythema, paresthesia, headaches and vomiting. The present study uses a transcriptomic approach to characterize the gene expression profile from the non-stimulated venom gland of the Tityus stigmurus scorpion. A cDNA library was constructed and 540 clones were sequenced and grouped into 153 clusters, with one or more ESTs (expressed sequence tags). Forty-one percent of ESTs belong to recognized toxin-coding sequences, with transcripts encoding antimicrobial toxins (AMP-like) being the most abundant, followed by alfa KTx-like, beta KTx-like, beta NaTx-like and alfa NaTx-like. Our analysis indicated that 34% of the transcripts encode "other possible venom molecules", which correspond to anionic peptides, hypothetical secreted peptides, metalloproteinases, cysteine-rich peptides and lectins. Fifteen percent of ESTs are similar to cellular transcripts. Sequences without good matches corresponded to 11%. This investigation provides the first global view of gene expression of the venom gland from Tityus stigmurus under resting conditions. This approach enables characterization of a large number of venom gland component molecules, which belong either to known or not yet described types of venom peptides and proteins from the Buthidae family.

Rapid and efficient cDNA library screening by self-ligation of inverse PCR products (SLIP).

PubMed

Hoskins, Roger A; Stapleton, Mark; George, Reed A; Yu, Charles; Wan, Kenneth H; Carlson, Joseph W; Celniker, Susan E

2005-12-02

cDNA cloning is a central technology in molecular biology. cDNA sequences are used to determine mRNA transcript structures, including splice junctions, open reading frames (ORFs) and 5'- and 3'-untranslated regions (UTRs). cDNA clones are valuable reagents for functional studies of genes and proteins. Expressed Sequence Tag (EST) sequencing is the method of choice for recovering cDNAs representing many of the transcripts encoded in a eukaryotic genome. However, EST sequencing samples a cDNA library at random, and it recovers transcripts with low expression levels inefficiently. We describe a PCR-based method for directed screening of plasmid cDNA libraries. We demonstrate its utility in a screen of libraries used in our Drosophila EST projects for 153 transcription factor genes that were not represented by full-length cDNA clones in our Drosophila Gene Collection. We recovered high-quality, full-length cDNAs for 72 genes and variously compromised clones for an additional 32 genes. The method can be used at any scale, from the isolation of cDNA clones for a particular gene of interest, to the improvement of large gene collections in model organisms and the human. Finally, we discuss the relative merits of directed cDNA library screening and RT-PCR approaches.
Genic Microsatellite Markers in Brassica rapa: Development, Characterization, Mapping, and Their Utility in Other Cultivated and Wild Brassica Relatives

PubMed Central

Ramchiary, Nirala; Nguyen, Van Dan; Li, Xiaonan; Hong, Chang Pyo; Dhandapani, Vignesh; Choi, Su Ryun; Yu, Ge; Piao, Zhong Yun; Lim, Yong Pyo

2011-01-01

Genic microsatellite markers, also known as functional markers, are preferred over anonymous markers as they reveal the variation in transcribed genes among individuals. In this study, we developed a total of 707 expressed sequence tag-derived simple sequence repeat markers (EST-SSRs) and used them to develop a high-density integrated map using four individual mapping populations of B. rapa. This map contains a total of 1426 markers, consisting of 306 EST-SSRs, 153 intron polymorphic markers, 395 bacterial artificial chromosome-derived SSRs (BAC-SSRs), and 572 public SSRs and other markers covering a total distance of 1245.9 cM of the B. rapa genome. Analysis of allelic diversity in 24 B. rapa germplasm using 234 mapped EST-SSR markers showed amplification of 2 alleles by the majority of EST-SSRs, although amplification of alleles ranging from 2 to 8 was found. Transferability analysis of 167 EST-SSRs in 35 species belonging to cultivated and wild brassica relatives showed 42.51% (Sysimprium leteum) to 100% (B. carinata, B. juncea, and B. napus) amplification. Our newly developed EST-SSRs and high-density linkage map based on highly transferable genic markers would facilitate the molecular mapping of quantitative trait loci and the positional cloning of specific genes, in addition to marker-assisted selection and comparative genomic studies of B. rapa with other related species. PMID:21768136
Functional Intestinal Bile Acid 7α-Dehydroxylation by Clostridium scindens Associated with Protection from Clostridium difficile Infection in a Gnotobiotic Mouse Model.

PubMed

Studer, Nicolas; Desharnais, Lyne; Beutler, Markus; Brugiroux, Sandrine; Terrazos, Miguel A; Menin, Laure; Schürch, Christian M; McCoy, Kathy D; Kuehne, Sarah A; Minton, Nigel P; Stecher, Bärbel; Bernier-Latmani, Rizlan; Hapfelmeier, Siegfried

2016-01-01

Bile acids, important mediators of lipid absorption, also act as hormone-like regulators and as antimicrobial molecules. In all these functions their potency is modulated by a variety of chemical modifications catalyzed by bacteria of the healthy gut microbiota, generating a complex variety of secondary bile acids. Intestinal commensal organisms are well-adapted to normal concentrations of bile acids in the gut. In contrast, physiological concentrations of the various intestinal bile acid species play an important role in the resistance to intestinal colonization by pathogens such as Clostridium difficile. Antibiotic therapy can perturb the gut microbiota and thereby impair the production of protective secondary bile acids. The most important bile acid transformation is 7α-dehydroxylation, producing deoxycholic acid (DCA) and lithocholic acid (LCA). The enzymatic pathway carrying out 7α-dehydroxylation is restricted to a narrow phylogenetic group of commensal bacteria, the best-characterized of which is Clostridium scindens. Like many other intestinal commensal species, 7-dehydroxylating bacteria are understudied in vivo. Conventional animals contain variable and uncharacterized indigenous 7α-dehydroxylating organisms that cannot be selectively removed, making controlled colonization with a specific strain in the context of an undisturbed microbiota unfeasible. In the present study, we used a recently established, standardized gnotobiotic mouse model that is stably associated with a simplified murine 12-species "oligo-mouse microbiota" (Oligo-MM12). It is representative of the major murine intestinal bacterial phyla, but is deficient for 7α-dehydroxylation. We find that the Oligo-MM12 consortium carries out bile acid deconjugation, a prerequisite for 7α-dehydroxylation, and confers no resistance to C. difficile infection (CDI). Amendment of Oligo-MM12 with C. scindens normalized the large intestinal bile acid composition by reconstituting 7α-dehydroxylation. These changes had only minor effects on the composition of the native Oligo-MM12, but significantly decreased early large intestinal C. difficile colonization and pathogenesis. The delayed pathogenesis of C. difficile in C. scindens-colonized mice was associated with breakdown of cecal microbial bile acid transformation.
Two-year survival rates of anti-TNF-α therapy in psoriatic arthritis (PsA) patients with either polyarticular or oligoarticular PsA.

PubMed

Iannone, F; Lopriore, S; Bucci, R; Scioscia, C; Anelli, M G; Notarnicola, A; Lapadula, G

2015-05-01

To evaluate the 2-year drug survival rates of the tumour necrosis factor (TNF)-α blockers adalimumab, etanercept, and infliximab in psoriatic arthritis (PsA) patients with either oligoarticular (oligo-PsA) or polyarticular PsA (poly-PsA). We studied a prospective cohort of 328 PsA patients with peripheral arthritis (213 with poly-PsA and 115 with oligo-PsA), beginning their first ever anti-TNF-α treatment with adalimumab, etanercept, or infliximab. The aim of the study was to evaluate the drug survival rates and possible baseline predictors at 2 years. After 24 months, persistence in therapy with the first anti-TNF-α blocker was not statistically different in the oligo-PsA (70.4%) and poly-PsA (65.7%) subsets. Predictors of drug discontinuation were female sex [hazard ratio (HR) 1.63, 95% confidence interval (CI) 1.00-2.68, p = 0.04] and starting the therapy in years 2003-8 (HR 0.51, 95% CI 0.33-0.80, p = 0.003). In poly-PsA, the persistence of etanercept (68.3%) was significantly higher than that of adalimumab (51.9%, p = 0.01), whereas in oligo-PsA no significant difference was detected. In poly-PsA, the period 2003-8 was a negative predictor (HR 0.36, 95% CI 0.21-0.62, p = 0.0001) whereas in oligo-PsA female gender was a positive predictor of drug discontinuation (HR 2.08, 95% CI 1.02-4.24, p = 0.04). With regard to clinical outcomes, the best responses in terms of European League Against Rheumatism (EULAR) 'good' response or Disease Activity Score (DAS28) remission, crude or adjusted according to the LUND Efficacy indeX (LUNDEX), were seen in patients on etanercept or infliximab. Our study provides some evidence that anti-TNF-α drugs may perform differently in PsA, and that the analysis of clinical disease subsets may improve our knowledge and promote better management of PsA.
The pathogenesis of oligoarticular/polyarticular vs systemic juvenile idiopathic arthritis.

PubMed

Lin, Yu-Tsan; Wang, Chen-Ti; Gershwin, M Eric; Chiang, Bor-Luen

2011-06-01

Juvenile idiopathic arthritis (JIA) has had a long and difficult problem with classification. It is clearly a heterogeneous and multi-factorial autoimmune disease but all too often the distinctions among subtypes were unclear. In fact, there is now increasing evidence of a distinct pathogenesis of oligo/polyarticular JIA compared to systemic JIA. Oligo/polyarticular JIA is an antigen-driven lymphocyte-mediated autoimmune disease with abnormality in the adaptive immune system. Cartilage-derived auto-antigens activate autoreactive T cells including Th1 and Th17 cells with production of pro-inflammatory cytokines IFN-γ and IL-17. On the other hand, the inhibition of regulatory T (Treg) cells including natural Foxp3(+) Treg and self-heat shock protein-induced Treg cells with decreased anti-inflammatory cytokine IL-10 results in the loss of immune tolerance. Imbalance between autoreactive Th1/Th17 and Treg cells leads to the failure of T cell tolerance to self-antigens, which contributes to the synovial inflammation of oligo/polyarticular JIA. By contrast, systemic JIA is an autoinflammatory disease with abnormality in the innate immune system. A loss of control of the alternative secretory pathway leading to aberrant activation of phagocytes including monocytes, macrophages and neutrophils seems to be involved in the release of pro-inflammatory cytokines IL-1, IL-6, IL-18 and pro-inflammatory S100-proteins, which contribute to the multisystem inflammation of systemic JIA. Markedly distinct pathogenesis of oligo/polyarticular JIA and systemic JIA implies that they might need different treatment strategies. Copyright © 2011 Elsevier B.V. All rights reserved.
[The role of essential metal ions in the human organism and their oral supplementation to the human body in deficiency states].

PubMed

Lakatos, Béla; Szentmihályi, Klára; Vinkler, Péter; Balla, József; Balla, György

2004-06-20

We studied the role of the essential nutrient metal ions (Mg, Fe, Cu, Zn, Mn and Co), which are vital to the function of the human organism yet often deficient in our foodstuffs, as well as the different reasons for these deficiencies both in foods and in the human body. The most frequent nutritional disease is iron-deficiency anaemia. Inorganic salts, artificial synthetic monomer organic metal complexes of high stability or organic polymer complexes of high molecular mass are unsatisfactory for supplementation to the human body, owing to poor absorption, low availability and/or harmful side effects. In contrast, we have recently found that mixed metal complexes of oligo/polygalacturonic acids with medium molecular weight prepared from natural pectin of plant origin are efficient for oral supplementation. Sufficient absorption of essential metal ions from metal oligo/polygalacturonate mixed complexes with polynuclear inner-sphere structure is due to the high ion selectivity and medium stability values. Metal oligo/polygalacturonate mixed complexes contain all deficient essential metal ions in adequate amounts and ratios for higher bioavailability of metal ions and optimal vital function. Therefore, by oral administration of these complexes, metal ion homeostasis and optimal interactions with vitamins and hormones can be ensured. Prelatent or latent deficiency of the macroelement Mg can often be observed among clinical or ambulatory patients. Latent or manifest deficiency of the mesoelement iron is the most common; however, latent deficiencies of the microelements copper, zinc, manganese and cobalt are not rare either. Supplementation studies utilizing essential metal oligo/polygalacturonate complexes led to satisfactory outcomes without harmful side effects.
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus.

PubMed

Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalization was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obtained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
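To give a sense of how dense the two maps described above are, the mean distance between adjacent markers can be estimated from the reported marker counts and map lengths. This is a rough sketch, not a figure from the paper: it assumes every marker occupies a distinct position, so a linkage group with n markers has n - 1 intervals. Co-segregating markers would make the real intervals larger.

```python
# Marker counts and map lengths (cM) as reported in the abstract.
maps = {
    "E. urophylla": {"markers": 700, "length_cM": 1208.2},
    "E. tereticornis": {"markers": 585, "length_cM": 1241.4},
}
LINKAGE_GROUPS = 11


def mean_interval(markers: int, length_cm: float, groups: int = LINKAGE_GROUPS) -> float:
    """Average cM between adjacent markers, assuming all markers map to distinct positions."""
    # Each linkage group contributes (markers in group - 1) intervals.
    return length_cm / (markers - groups)


intervals = {
    species: mean_interval(m["markers"], m["length_cM"]) for species, m in maps.items()
}
```

Under this assumption the estimates come out at roughly 1.75 cM for E. urophylla and 2.16 cM for E. tereticornis.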
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus

PubMed Central

Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of the E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalization was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obtained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication

PubMed Central

Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

2016-01-01

Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615
Osmylated DNA, a novel concept for sequencing DNA using nanopores

NASA Astrophysics Data System (ADS)

Kanavarioti, Anastassia

2015-03-01

Sanger sequencing has led the advances in molecular biology, while faster and cheaper next generation technologies are urgently needed. A newer approach exploits nanopores, natural or solid-state, set in an electrical field, and obtains base sequence information from current variations due to the passage of a ssDNA molecule through the pore. A hurdle in this approach is the fact that the four bases are chemically comparable to each other, which leads to small differences in current obstruction. ‘Base calling’ becomes even more challenging because most nanopores sense a short sequence and not individual bases. Perhaps sequencing DNA via nanopores would be more manageable if there were only two bases, chemically very different from each other; a sequence of 1s and 0s comes to mind. Osmylated DNA comes close to such a sequence of 1s and 0s. Osmylation is the addition of osmium tetroxide bipyridine across the C5-C6 double bond of the pyrimidines. Osmylation adds almost 400% mass to the reactive base and creates a sterically and electronically notably different molecule, labeled 1, compared to the unreactive purines, labeled 0. If osmylated DNA were successfully sequenced, the result would be a sequence of osmylated pyrimidines (1) and purines (0), and not of the actual nucleobases. To solve this problem we studied the osmylation reaction with short oligos and with M13mp18, a long ssDNA, developed a UV-vis assay to measure the extent of osmylation, and designed two protocols. Protocol A uses mild conditions and yields osmylated thymidines (1), while leaving the other three bases (0) practically intact. Protocol B uses harsher conditions and effectively osmylates both pyrimidines, but not the purines. Applying these two protocols also to the complement of the target polynucleotide yields a total of four osmylated strands that collectively could define the actual base sequence of the target DNA.
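The four-strand decoding idea described in this abstract can be illustrated with a small Python sketch. It is an idealized toy model, not the author's method: it assumes error-free readouts with one 0/1 symbol per base, and it assumes the complement readouts have already been aligned position by position with the target (a real complementary strand would be read in the reverse orientation). The function names are hypothetical.

```python
# Toy model of decoding a DNA sequence from four osmylated (0/1) readouts.
# Assumptions (not from the abstract): perfect labeling, perfect reads,
# and complement readouts already aligned position-by-position to the target.

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def osmylate(seq, reactive):
    """Return a 0/1 string with 1 wherever the base is in `reactive`."""
    return "".join("1" if base in reactive else "0" for base in seq)


def complement(seq):
    """Position-aligned complement (deliberately not reversed)."""
    return "".join(COMPLEMENT[base] for base in seq)


def four_readouts(target):
    """Simulate Protocol A (T only) and Protocol B (T and C) on both strands."""
    comp = complement(target)
    return (
        osmylate(target, "T"),   # Protocol A on target: marks T
        osmylate(target, "TC"),  # Protocol B on target: marks T and C
        osmylate(comp, "T"),     # Protocol A on complement: marks A in target
        osmylate(comp, "TC"),    # Protocol B on complement: marks A and G in target
    )


def decode(a_target, b_target, a_comp, b_comp):
    """Recover the target bases from the four aligned 0/1 readouts."""
    bases = []
    for at, bt, ac, bc in zip(a_target, b_target, a_comp, b_comp):
        if at == "1":
            bases.append("T")  # reactive under mild conditions
        elif bt == "1":
            bases.append("C")  # pyrimidine, but not T
        elif ac == "1":
            bases.append("A")  # its complement is T
        elif bc == "1":
            bases.append("G")  # its complement is C
        else:
            raise ValueError("inconsistent readouts at one position")
    return "".join(bases)
```

In this model the fourth readout is redundant (it is the inverse of Protocol B on the target), so it could serve as a consistency check in the presence of read errors.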
A cricket Gene Index: a genomic resource for studying neurobiology, speciation, and molecular evolution

PubMed Central

Danley, Patrick D; Mullen, Sean P; Liu, Fenglong; Nene, Vishvanath; Quackenbush, John; Shaw, Kerry L

2007-01-01

Background As the developmental costs of genomic tools decline, genomic approaches to non-model systems are becoming more feasible. Many of these systems may lack advanced genetic tools but are extremely valuable models in other biological fields. Here we report the development of expressed sequence tags (ESTs) in an orthopteroid insect, a model for the study of neurobiology, speciation, and evolution. Results We report the sequencing of 14,502 ESTs from clones derived from a nerve cord cDNA library, and the subsequent construction of a Gene Index from these sequences, from the Hawaiian trigonidiine cricket Laupala kohalensis. The Gene Index contains 8607 unique sequences, comprising 2575 tentative consensus (TC) sequences and 6032 singletons. For each of the unique sequences, an attempt was made to assign a provisional annotation and to categorize its function using a Gene Ontology-based classification through a sequence-based comparison to known proteins. In addition, a set of unique 70 base pair oligomers that can be used for DNA microarrays was developed. All Gene Index information is posted at the DFCI Gene Indices web page. Conclusion Orthopterans are models used to understand the neurophysiological basis of complex motor patterns such as flight and stridulation. The sequences presented in the cricket Gene Index will provide neurophysiologists with many genetic tools that have been largely absent in this field. The cricket Gene Index is one of only two gene indices to be developed in an evolutionary model system. Species within the genus Laupala have speciated recently, rapidly, and extensively. Therefore, the genes identified in the cricket Gene Index can be used to study the genomics of speciation. Furthermore, this gene index represents a significant EST resource for basal insects. As such, this resource is a valuable comparative tool for the understanding of invertebrate molecular evolution. 
The sequences presented here will provide much needed genomic resources for three distinct but overlapping fields of inquiry: neurobiology, speciation, and molecular evolution. PMID:17459168
Using RNA-Seq for gene identification, polymorphism detection and transcript profiling in two alfalfa genotypes with divergent cell wall composition in stems

PubMed Central

2011-01-01

Background Alfalfa [Medicago sativa (L.) sativa], a widely grown perennial forage, has potential for development as a cellulosic ethanol feedstock. However, the genomics of alfalfa, a non-model species, is still in its infancy. The recent advent of RNA-Seq, a massively parallel sequencing method for transcriptome analysis, provides an opportunity to expand the identification of alfalfa genes and polymorphisms, and conduct in-depth transcript profiling. Results Cell walls in stems of alfalfa genotype 708 have higher cellulose and lower lignin concentrations compared to cell walls in stems of genotype 773. Using the Illumina GA-II platform, a total of 198,861,304 expressed sequence tags (ESTs, 76 bp in length) were generated from cDNA libraries derived from elongating stem (ES) and post-elongation stem (PES) internodes of 708 and 773. In addition, 341,984 ESTs were generated from ES and PES internodes of genotype 773 using the GS FLX Titanium platform. The first alfalfa (Medicago sativa) gene index (MSGI 1.0) was assembled using the Sanger ESTs available from GenBank, the GS FLX Titanium EST sequences, and the de novo assembled Illumina sequences. MSGI 1.0 contains 124,025 unique sequences including 22,729 tentative consensus sequences (TCs), 22,315 singletons and 78,981 pseudo-singletons. We identified a total of 1,294 simple sequence repeats (SSR) among the sequences in MSGI 1.0. In addition, a total of 10,826 single nucleotide polymorphisms (SNPs) were predicted between the two genotypes. Out of 55 SNPs randomly selected for experimental validation, 47 (85%) were polymorphic between the two genotypes. We also identified numerous allelic variations within each genotype. Digital gene expression analysis identified numerous candidate genes that may play a role in stem development as well as candidate genes that may contribute to the differences in cell wall composition in stems of the two genotypes. 
Conclusions Our results demonstrate that RNA-Seq can be successfully used for gene identification, polymorphism detection and transcript profiling in alfalfa, a non-model, allogamous, autotetraploid species. The alfalfa gene index assembled in this study, and the SNPs, SSRs and candidate genes identified can be used to improve alfalfa as a forage crop and cellulosic feedstock. PMID:21504589
Expressed sequence tag analysis of guinea pig (Cavia porcellus) eye tissues for NEIBank

PubMed Central

Simpanya, Mukoma F.; Wistow, Graeme; Gao, James; David, Larry L.; Giblin, Frank J.

2008-01-01

Purpose To characterize gene expression patterns in guinea pig ocular tissues and identify orthologs of human genes from NEIBank expressed sequence tags. Methods RNA was extracted from dissected eye tissues of 2.5-month-old guinea pigs to make three unamplified and unnormalized cDNA libraries in the pCMVSport-6 vector for the lens, retina, and eye minus lens and retina. Over 4,000 clones were sequenced from each library and were analyzed using GRIST for clustering and gene identification. Lens crystallin EST data were validated using two-dimensional electrophoresis (2-DE), matrix assisted laser desorption (MALDI), and electrospray ionization mass spectrometry (ESIMS). Results Combined data from the three libraries generated a total of 6,694 distinctive gene clusters, with each library having between 1,000 and 3,000 clusters. Approximately 60% of the total gene clusters were novel cDNA sequences and had significant homologies to other mammalian sequences in GenBank. Complete cDNA sequences were obtained for many guinea pig lens proteins, including αA/αAinsert-, γN-, and γS-crystallins, lengsin and GRIFIN. The ratio of αA- to αB-crystallin on 2-DE gels was 8:1 in the lens nucleus and 6.5:1 in the cortex. Analysis of ESTs, genome sequence, and proteins (by MALDI) did not reveal any evidence for the presence of γD-, γE-, and γF-crystallin in the guinea pig. Predicted masses of many guinea pig lens crystallins were confirmed by ESIMS analysis. For the retina, orthologs of human phototransduction genes were found, such as Rhodopsin, S-antigen (Sag, Arrestin), and Transducin. The guinea pig ortholog of NRL, a key rod photoreceptor-specific transcription factor, was also represented in EST data. In the ‘rest-of-eye’ library, the most abundant transcripts included decorin and keratin 12, representative of the cornea. Conclusions Genomic analysis of guinea pig eye tissues provides sequence-verified clones for future studies. 
Guinea pig orthologs of many human eye specific genes were identified. Guinea pig gene structures were similar to their human and rodent gene counterparts. Surprisingly, no orthologs of γD-, γE-, and γF-crystallin were found in EST, proteomic, or the current guinea pig genome data. PMID:19104676
Bacterial Utilization of Ether Glycols

PubMed Central

Fincher, Edward L.; Payne, W. J.

1962-01-01

A soil bacterium capable of using oligo- and polyethylene glycols and ether alcohols as sole sources of carbon for aerobic growth was isolated. The effects of substituent groups added to the ether bonds on the acceptability of the compounds as substrates were studied. Mechanisms for the incorporation of two-carbon compounds were demonstrated by the observation that acetate, glyoxylate, ethylene glycol, and a number of the tricarboxylic acid cycle intermediates served as growth substrates in minimal media. The rate of oxidation of the short-chained ethylene glycols by adapted resting cells varied directly with increasing numbers of two-carbon units in the chains from one to four. The amount of oxygen consumed per carbon atom of oligo- and polyethylene glycols was 100% of theoretical, but only 67% of theoretical for ethylene glycol. Resting cells oxidized oligo- and polyethylene glycols with 2 to 600 two-carbon units in the chains. Longer chained polyethylene glycols (up to 6,000) were oxidized at a very slow rate by these cells. Dehydrogenation of triethylene glycol by adapted cells was observed, coupling the reaction with methylene blue reduction. PMID:13945208
Comparison of the CT OligoGen kit with cobas 4800 assay for detection of Chlamydia trachomatis.

PubMed

Parra-Sánchez, Manuel; Marcuello-López, Ana; García-Rey, Silvia; Zakariya-Yousef, Ismail; Sivianes-Valdecantos, Nieves; Sierra-Atienza, Celestina; Bernal-Martínez, Samuel; Pueyo-Rodrígez, Isabel; Martín-Mazuelos, Estrella; Palomares-Folía, José Carlos

2015-12-01

A prospective study was designed to assess the performance of the new CT OligoGen kit and the cobas 4800 assay for detection of Chlamydia trachomatis. The sample set included urine samples (n=212) and endocervical (n=167), rectal (n=53), pharyngeal (n=7) and urethral (n=3) swabs. The samples were sent from a regional sexually transmitted diseases (STD) clinic in Seville, Spain, and were collected from 261 men and 181 women. Discordant results were re-analyzed, and clinical data and other tests were reviewed in order to resolve them. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and kappa value for C. trachomatis detection using the CT OligoGen kit were 98.5%, 100%, 100%, 95.4% and 0.97, respectively. The new kit showed high sensitivity, specificity, PPV and NPV for C. trachomatis; this performance profile supports the usefulness and reliability of the assay. Copyright © 2015 Elsevier España, S.L.U. y Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
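The performance measures quoted in this abstract all derive from a 2×2 table of assay result against resolved reference status. The following Python sketch shows the standard formulas, including Cohen's kappa. The abstract does not report the underlying counts, so the counts used in the example call are purely hypothetical and are not intended to reproduce the published figures.

```python
# Standard diagnostic-accuracy measures from a 2x2 table.
# tp/fp/fn/tn = true positives, false positives, false negatives, true negatives.
# The function assumes every denominator is non-zero.


def diagnostic_metrics(tp, fp, fn, tn):
    n = tp + fp + fn + tn
    sensitivity = tp / (tp + fn)   # positives correctly detected
    specificity = tn / (tn + fp)   # negatives correctly excluded
    ppv = tp / (tp + fp)           # probability a positive call is correct
    npv = tn / (tn + fn)           # probability a negative call is correct

    # Cohen's kappa: observed agreement corrected for chance agreement.
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    kappa = (observed - expected) / (1 - expected)

    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "npv": npv,
        "kappa": kappa,
    }


# Hypothetical counts, for illustration only.
example = diagnostic_metrics(tp=40, fp=5, fn=10, tn=45)
```

Note that PPV and NPV depend on the prevalence in the sampled population, so values from an STD clinic cohort would not transfer directly to a low-prevalence screening setting.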
Sensing miRNA: Signal Amplification by Cognate RISC for Intracellular Detection of miRNA in Live Cells.

PubMed

Kavishwar, Amol; Medarova, Zdravka

2016-01-01

The ability to detect miRNA expression in live cells would leave these cells available for further manipulation or culture. Here, we describe the design of a miRNA sensor oligonucleotide whose sequence mimics the target mRNA. The sensor has a fluorescent label on one end of the oligo and a quencher on the other. When inside the cell, the sensor is recognized by its cognate miRNA-RISC and gets cleaved, setting the fluorophore free from its quencher. This results in fluorescence "turn on." Since cleavage by the RISC complex is an enzymatic process, the described approach has a very high level of sensitivity (nM). The rate of nonspecific cleavage of the sensor is very slow permitting the collection of meaningful signal over a long period of time.
First genetic linkage map of Taraxacum koksaghyz Rodin based on AFLP, SSR, COS and EST-SSR markers.

PubMed

Arias, Marina; Hernandez, Monica; Remondegui, Naroa; Huvenaars, Koen; van Dijk, Peter; Ritter, Enrique

2016-08-04

Taraxacum koksaghyz Rodin (TKS) has been studied on many occasions as a possible alternative source of good-quality natural rubber and of inulin. Some tire companies are already testing TKS tire prototypes. There are also many investigations on the production of bio-fuels from inulin and on inulin applications for health improvement and in the food industry. Only limited genomic resources exist for TKS, and in particular no genetic linkage map is available for this species. We have constructed the first TKS genetic linkage map based on AFLP, COS, SSR and EST-SSR markers. The integrated linkage map with eight linkage groups (LG), representing the eight chromosomes of Russian dandelion, has 185 individual AFLP markers from parent 1, 188 individual AFLP markers from parent 2, 75 common AFLP markers and 6 COS, 1 SSR and 63 EST-SSR loci. Blasting the EST-SSR sequences against known sequences from lettuce allowed a partial alignment of our TKS map with a lettuce map. Blast searches against plant gene databases revealed some homologies with useful genes for downstream applications in the future.
Insilico profiling of microRNAs in Korean ginseng (Panax ginseng Meyer)

PubMed Central

Mathiyalagan, Ramya; Subramaniyam, Sathiyamoorthy; Natarajan, Sathishkumar; Kim, Yeon Ju; Sun, Myung Suk; Kim, Se Young; Kim, Yu-Jin; Yang, Deok Chun

2013-01-01

MicroRNAs (miRNAs) are a class of recently discovered non-coding small RNA molecules, on average approximately 21 nucleotides in length, which play numerous important roles in gene regulation in various organisms. The miRNA database (release 18) has 18,226 miRNAs, which have been deposited from different species. Although miRNAs have been identified and validated in many plant species, no studies have been reported on discovering miRNAs in Panax ginseng Meyer, a medicinal plant traditionally used in oriental medicine and also known as Korean ginseng. It has triterpene ginseng saponins called ginsenosides, which are responsible for its various pharmacological activities. Predicting conserved miRNAs by homology-based analysis with available expressed sequence tag (EST) sequences can be powerful if the species lacks whole genome sequence information. In this study, using the EST-based computational approach, 69 conserved miRNAs belonging to 44 miRNA families were identified in Korean ginseng. The digital gene expression patterns of the predicted conserved miRNAs were analyzed by deep sequencing using small RNA sequences of flower buds, leaves, and lateral roots. We found that many of the identified miRNAs showed tissue-specific expression. Using the in silico method, 346 potential targets were identified for the predicted 69 conserved miRNAs by searching the ginseng EST database, and the predicted targets were mainly involved in secondary metabolic processes, responses to biotic and abiotic stress, and transcription regulator activities, as well as a variety of other metabolic processes. PMID:23717176
MAGIC database and interfaces: an integrated package for gene discovery and expression.

PubMed

Cordonnier-Pratt, Marie-Michèle; Liang, Chun; Wang, Haiming; Kolychev, Dmitri S; Sun, Feng; Freeman, Robert; Sullivan, Robert; Pratt, Lee H

2004-01-01

The rapidly increasing rate at which biological data is being produced requires a corresponding growth in relational databases and associated tools that can help laboratories contend with that data. With this need in mind, we describe here a Modular Approach to a Genomic, Integrated and Comprehensive (MAGIC) Database. This Oracle 9i database derives from an initial focus in our laboratory on gene discovery via production and analysis of expressed sequence tags (ESTs), and subsequently on gene expression as assessed by both EST clustering and microarrays. The MAGIC Gene Discovery portion of the database focuses on information derived from DNA sequences and on its biological relevance. In addition to MAGIC SEQ-LIMS, which is designed to support activities in the laboratory, it contains several additional subschemas. The latter include MAGIC Admin for database administration, MAGIC Sequence for sequence processing as well as sequence and clone attributes, MAGIC Cluster for the results of EST clustering, MAGIC Polymorphism in support of microsatellite and single-nucleotide-polymorphism discovery, and MAGIC Annotation for electronic annotation by BLAST and BLAT. The MAGIC Microarray portion is a MIAME-compliant database with two components at present. These are MAGIC Array-LIMS, which makes possible remote entry of all information into the database, and MAGIC Array Analysis, which provides data mining and visualization. Because all aspects of interaction with the MAGIC Database are via a web browser, it is ideally suited not only for individual research laboratories but also for core facilities that serve clients at any distance.
Bacterial discrimination by means of a universal array approach mediated by LDR (ligase detection reaction)

PubMed Central

Busti, Elena; Bordoni, Roberta; Castiglioni, Bianca; Monciardini, Paolo; Sosio, Margherita; Donadio, Stefano; Consolandi, Clarissa; Rossi Bernardi, Luigi; Battaglia, Cristina; De Bellis, Gianluca

2002-01-01

Background PCR amplification of bacterial 16S rRNA genes provides the most comprehensive and flexible means of sampling bacterial communities. Sequence analysis of these cloned fragments can provide a qualitative and quantitative insight into the microbial population under scrutiny, although this approach is not suited to large-scale screenings. Other methods, such as denaturing gradient gel electrophoresis, heteroduplex or terminal restriction fragment analysis, are rapid and therefore amenable to field-scale experiments. A very recent addition to these analytical tools is represented by microarray technology. Results Here we present our results using a Universal DNA Microarray approach as an analytical tool for bacterial discrimination. The proposed procedure is based on the properties of the DNA ligation reaction and requires the design of two probes specific for each target sequence. One oligo carries a fluorescent label and the other a unique sequence (cZipCode or complementary ZipCode) which identifies a ligation product. Ligated fragments, obtained in the presence of a proper template (a PCR amplified fragment of the 16S rRNA gene) contain either the fluorescent label or the unique sequence and therefore are addressed to the location on the microarray where the ZipCode sequence has been spotted. Such an array is therefore "Universal" being unrelated to a specific molecular analysis. Here we present the design of probes specific for some groups of bacteria and their application to bacterial diagnostics. Conclusions The combined use of selective probes, ligation reaction and the Universal Array approach yielded an analytical procedure with a good power of discrimination among bacteria. PMID:12243651

Repetitive Elements May Comprise Over Two-Thirds of the Human Genome

PubMed Central

de Koning, A. P. Jason; Gu, Wanjun; Castoe, Todd A.; Batzer, Mark A.; Pollock, David D.

2011-01-01

Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo “clouds”). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%–69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed “element-specific” P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed. PMID:22144907
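The general idea of grouping high-abundance oligonucleotides that are close in sequence space can be sketched in a few lines of Python. This is a deliberately simplified illustration and not the P-clouds algorithm itself: the two count thresholds, the single-mismatch neighborhood, and all function names are assumptions chosen for the example.

```python
# Simplified illustration of an "oligo cloud": start from very abundant
# k-mers (cores) and add observed k-mers one mismatch away that are
# themselves moderately abundant. Thresholds here are arbitrary assumptions.
from collections import Counter


def count_kmers(seq, k):
    """Count every overlapping k-mer in `seq`."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def hamming1_neighbors(kmer, alphabet="ACGT"):
    """Yield all k-mers that differ from `kmer` at exactly one position."""
    for i, base in enumerate(kmer):
        for alt in alphabet:
            if alt != base:
                yield kmer[:i] + alt + kmer[i + 1:]


def build_cloud(counts, core_min, outer_min):
    """Return core k-mers plus their sufficiently abundant 1-mismatch neighbors."""
    cores = {kmer for kmer, c in counts.items() if c >= core_min}
    cloud = set(cores)
    for core in cores:
        for neighbor in hamming1_neighbors(core):
            if counts.get(neighbor, 0) >= outer_min:
                cloud.add(neighbor)
    return cloud


def mask_positions(seq, k, cloud):
    """Mark each position of `seq` covered by at least one cloud k-mer."""
    covered = [False] * len(seq)
    for i in range(len(seq) - k + 1):
        if seq[i:i + k] in cloud:
            for j in range(i, i + k):
                covered[j] = True
    return covered
```

Annotating a genome would then amount to running `mask_positions` over each sequence; the abstract's point about false positives on short fragments is why a probabilistic treatment is preferable to this hard yes/no mask.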
A Framework Phylogeny of the American Oak Clade Based on Sequenced RAD Data

PubMed Central

Hipp, Andrew L.; Eaton, Deren A. R.; Cavender-Bares, Jeannine; Fitzek, Elisabeth; Nipper, Rick; Manos, Paul S.

2014-01-01

Previous phylogenetic studies in oaks (Quercus, Fagaceae) have failed to resolve the backbone topology of the genus with strong support. Here, we utilize next-generation sequencing of restriction-site associated DNA (RAD-Seq) to resolve a framework phylogeny of a predominantly American clade of oaks whose crown age is estimated at 23–33 million years old. Using a recently developed analytical pipeline for RAD-Seq phylogenetics, we created a concatenated matrix of 1.40 E06 aligned nucleotides, constituting 27,727 sequence clusters. RAD-Seq data were readily combined across runs, with no difference in phylogenetic placement between technical replicates, which overlapped by only 43–64% in locus coverage. 17% (4,715) of the loci we analyzed could be mapped with high confidence to one or more expressed sequence tags in NCBI Genbank. A concatenated matrix of the loci that BLAST to at least one EST sequence provides approximately half as many variable or parsimony-informative characters as equal-sized datasets from the non-EST loci. The EST-associated matrix is more complete (fewer missing loci) and has slightly lower homoplasy than non-EST subsampled matrices of the same size, but there is no difference in phylogenetic support or relative attribution of base substitutions to internal versus terminal branches of the phylogeny. We introduce a partitioned RAD visualization method (implemented in the R package RADami; http://cran.r-project.org/web/packages/RADami) to investigate the possibility that suboptimal topologies supported by large numbers of loci—due, for example, to reticulate evolution or lineage sorting—are masked by the globally optimal tree. We find no evidence for strongly-supported alternative topologies in our study, suggesting that the phylogeny we recover is a robust estimate of large-scale phylogenetic patterns in the American oak clade. 
Our study is one of the first to demonstrate the utility of RAD-Seq data for inferring phylogeny in a 23–33 million year-old clade. PMID:24705617
Solid-phase synthesis of oligo-2-pyrimidinone-2'-deoxyribonucleotides and oligo-2-pyrimidinone-2'-deoxyriboside methylphosphonates.

PubMed Central

Zhou, Y; Ts'o, P O

1996-01-01

For the first time, a method was developed for the synthesis of oligodeoxyribonucleotides and oligodeoxyribonucleoside methylphosphonates composed exclusively of the fluorescent 2-pyrimidinone base. The method utilized the solid-phase 2-cyanoethylphosphoramidite and methylphosphonamidite chemistry for internucleotide couplings and a base-labile oxalyl linkage to anchor the oligomers onto the CPG support. Cleavage of the oligomers from the support was effected by a short treatment of the support with 5% ammonium hydroxide in methanol at room temperature, without any degradation of the base-sensitive 2-pyrimidinone residues or the base-sensitive methylphosphonate backbone. PMID:8758991
Comparative Analysis of Gene Expression for Convergent Evolution of Camera Eye Between Octopus and Human

PubMed Central

Ogura, Atsushi; Ikeo, Kazuho; Gojobori, Takashi

2004-01-01

Although the camera eye of the octopus is very similar to that of humans, phylogenetic and embryological analyses have suggested that their camera eyes have been acquired independently. It has been known as a typical example of convergent evolution. To study the molecular basis of convergent evolution of camera eyes, we conducted a comparative analysis of gene expression in octopus and human camera eyes. We sequenced 16,432 ESTs of the octopus eye, leading to 1052 nonredundant genes that have matches in the protein database. Comparing these 1052 genes with 13,303 already-known ESTs of the human eye, 729 (69.3%) genes were commonly expressed between the human and octopus eyes. On the contrary, when we compared octopus eye ESTs with human connective tissue ESTs, the expression similarity was quite low. To trace the evolutionary changes that are potentially responsible for camera eye formation, we also compared octopus-eye ESTs with the completed genome sequences of other organisms. We found that 1019 out of the 1052 genes had already existed at the common ancestor of bilateria, and 875 genes were conserved between humans and octopuses. It suggests that a larger number of conserved genes and their similar gene expression may be responsible for the convergent evolution of the camera eye. PMID:15289475
Chasing Migration Genes: A Brain Expressed Sequence Tag Resource for Summer and Migratory Monarch Butterflies (Danaus plexippus)

PubMed Central

Zhu, Haisun; Casselman, Amy; Reppert, Steven M.

2008-01-01

North American monarch butterflies (Danaus plexippus) undergo a spectacular fall migration. In contrast to summer butterflies, migrants are juvenile hormone (JH) deficient, which leads to reproductive diapause and increased longevity. Migrants also utilize time-compensated sun compass orientation to help them navigate to their overwintering grounds. Here, we describe a brain expressed sequence tag (EST) resource to identify genes involved in migratory behaviors. A brain EST library was constructed from summer and migrating butterflies. Of 9,484 unique sequences, 6068 had positive hits with the non-redundant protein database; the EST database likely represents ∼52% of the gene-encoding potential of the monarch genome. The brain transcriptome was cataloged using Gene Ontology and compared to Drosophila. Monarch genes were well represented, including those implicated in behavior. Three genes involved in increased JH activity (allatotropin, juvenile hormone acid methyltransferase, and takeout) were upregulated in summer butterflies, compared to migrants. The locomotion-relevant turtle gene was marginally upregulated in migrants, while the foraging and single-minded genes were not differentially regulated. Many of the genes important for the monarch circadian clock mechanism (involved in sun compass orientation) were in the EST resource, including the newly identified cryptochrome 2. The EST database also revealed a novel Na+/K+ ATPase allele predicted to be more resistant to the toxic effects of milkweed than that reported previously. Potential genetic markers were identified from 3,486 EST contigs and included 1599 double-hit single nucleotide polymorphisms (SNPs) and 98 microsatellite polymorphisms. These data provide a template of the brain transcriptome for the monarch butterfly. 
Our “snap-shot” analysis of the differential regulation of candidate genes between summer and migratory butterflies suggests that unbiased, comprehensive transcriptional profiling will inform the molecular basis of migration. The identified SNPs and microsatellite polymorphisms can be used as genetic markers to address questions of population and subspecies structure. PMID:18183285
Ontology and diversity of transcript-associated microsatellites mined from a globe artichoke EST database

PubMed Central

Scaglione, Davide; Acquadro, Alberto; Portis, Ezio; Taylor, Christopher A; Lanteri, Sergio; Knapp, Steven J

2009-01-01

Background The globe artichoke (Cynara cardunculus var. scolymus L.) is a significant crop in the Mediterranean basin. Despite its commercial importance and its both dietary and pharmaceutical value, knowledge of its genetics and genomics remains scant. Microsatellite markers have become a key tool in genetic and genomic analysis, and we have exploited recently acquired EST (expressed sequence tag) sequence data (Composite Genome Project - CGP) to develop an extensive set of microsatellite markers. Results A unigene assembly was created from over 36,000 globe artichoke EST sequences, containing 6,621 contigs and 12,434 singletons. Over 12,000 of these unigenes were functionally assigned on the basis of homology with Arabidopsis thaliana reference proteins. A total of 4,219 perfect repeats, located within 3,308 unigenes, was identified, and the gene ontology (GO) analysis highlighted enrichment of some GO terms among different classes of microsatellites with respect to their position. Sufficient flanking sequence was available to enable the design of primers to amplify 2,311 of these microsatellites, and a set of 300 was tested against a DNA panel derived from 28 C. cardunculus genotypes. Consistent amplification and polymorphism were obtained from 236 of these assays. Their polymorphic information content (PIC) ranged from 0.04 to 0.90 (mean 0.66). Between 176 and 198 of the assays were informative in at least one of the three available mapping populations. Conclusion EST-based microsatellites have provided a large set of de novo genetic markers, which show significant amounts of polymorphism both between and within the three taxa of C. cardunculus. They are thus well suited as assays for phylogenetic analysis, the construction of genetic maps, marker-assisted breeding, transcript mapping and other genomic applications in the species. PMID:19785740
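The abstract above reports polymorphic information content (PIC) values without saying how they were computed. The sketch below assumes the widely used definition PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2, where p_i is the frequency of allele i at a marker. The function name and the allele frequencies are illustrative only and are not taken from the study.

```python
from itertools import combinations


def polymorphic_information_content(allele_freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2 for one marker."""
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Probability that two randomly drawn alleles are identical.
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction for uninformative matings between identical heterozygotes.
    cross_term = sum(
        2 * (p * p) * (q * q) for p, q in combinations(allele_freqs, 2)
    )
    return 1.0 - homozygosity - cross_term


if __name__ == "__main__":
    # Hypothetical allele frequencies, not data from the abstract.
    print(polymorphic_information_content([0.5, 0.5]))
    print(polymorphic_information_content([0.25, 0.25, 0.25, 0.25]))
```

Under this definition a monomorphic marker scores 0 and the score rises as alleles become more numerous and more evenly distributed, so a mean of 0.66 points to markers with several reasonably frequent alleles.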
ASPIC: a novel method to predict the exon-intron structure of a gene that is optimally compatible to a set of transcript sequences.

PubMed

Bonizzoni, Paola; Rizzi, Raffaella; Pesole, Graziano

2005-10-05

Currently available methods to predict splice sites are mainly based on the independent and progressive alignment of transcript data (mostly ESTs) to the genomic sequence. Apart from often being computationally expensive, this approach is vulnerable to several problems--hence the need to develop novel strategies. We propose a method, based on a novel multiple genome-EST alignment algorithm, for the detection of splice sites. To avoid limitations of splice site prediction (mainly, over-predictions) due to independent single EST alignments to the genomic sequence, our approach performs a multiple alignment of transcript data to the genomic sequence based on the combined analysis of all available data. We recast the problem of predicting constitutive and alternative splicing as an optimization problem, where the optimal multiple transcript alignment minimizes the number of exons and hence of splice site observations. We have implemented a splice site predictor based on this algorithm in the software tool ASPIC (Alternative Splicing PredICtion). It is distinguished from other methods based on BLAST-like tools by the incorporation of entirely new ad hoc procedures for accurate and computationally efficient transcript alignment, and it adopts dynamic programming for the refinement of intron boundaries. ASPIC also provides the minimal set of non-mergeable transcript isoforms compatible with the detected splicing events. The ASPIC web resource is dynamically interconnected with the Ensembl and Unigene databases and also implements an upload facility. Extensive benchmarking shows that ASPIC outperforms other existing methods in the detection of novel splicing isoforms and in the minimization of over-predictions. ASPIC also requires a lower computation time for processing a single gene and an EST cluster. The ASPIC web resource is available at http://aspic.algo.disco.unimib.it/aspic-devel/.
ESTminer: a Web interface for mining EST contig and cluster databases.

PubMed

Huang, Yecheng; Pumphrey, Janie; Gingle, Alan R

2005-03-01

ESTminer is a Web application and database schema for interactive mining of expressed sequence tag (EST) contig and cluster datasets. The Web interface contains a query frame that allows the selection of contigs/clusters with specific cDNA library makeup or a threshold number of members. The results are displayed as color-coded tree nodes, where the color indicates the fractional size of each cDNA library component. The nodes are expandable, revealing library statistics as well as EST or contig members, with links to sequence data, GenBank records or user configurable links. Also, the interface allows 'queries within queries' where the result set of a query is further filtered by the subsequent query. ESTminer is implemented in Java/JSP and the package, including MySQL and Oracle schema creation scripts, is available from http://cggc.agtec.uga.edu/Data/download.asp agingle@uga.edu.
A study of alternative splicing in the pig

PubMed Central

2010-01-01

Background Since at least half of the genes in mammalian genomes are subjected to alternative splicing, alternative pre-mRNA splicing makes an important contribution to the complexity of the mammalian proteome. Expressed sequence tags (ESTs) provide evidence of a great number of possible alternative isoforms. With the EST resource for the domestic pig now containing more than one million porcine ESTs, it is possible to identify alternative splice forms of the individual transcripts in this species from the EST data with some confidence. Results The pig EST data generated by the Sino-Danish Pig Genome project has been assembled with publicly available ESTs and made available in the PigEST database. Using the Distiller package, 2,515 EST clusters with candidate alternative isoforms were identified in the EST data with high confidence. In agreement with general observations in human and mouse, we find putative splice variants in about 30% of the contigs with more than 50 ESTs. Based on the criterion that a minimum of two EST sequences confirmed each splice event, a list of 100 genes with the most distinct tissue-specific alternative splice events was generated from the list of candidates. To confirm the tissue specificity of the splice events, 10 genes with functional annotation were randomly selected, from which 16 individual splice events were chosen for experimental verification by quantitative PCR (qPCR). Six genes were shown to have tissue-specific alternatively spliced transcripts with expression patterns matching those of the EST data. The remaining four genes had tissue-restricted expression of alternatively spliced transcripts. Five out of the 16 splice events that were experimentally verified were found to be putatively pig specific. Conclusions In accordance with human and rodent studies, we estimate that approximately 30% of the porcine genes undergo alternative splicing. We found a good correlation between EST-predicted tissue specificity and experimentally validated splice events in different porcine tissues. This study indicates that a cluster size of around 50 ESTs is optimal for in silico detection of alternative splicing. Although based on a limited number of splice events, the study supports the notion that alternative splicing could have an important impact on species differentiation, since 31% of the splice events studied appear to be species specific. PMID:20444244
Spatial analysis of biomineralization associated gene expression from the mantle organ of the pearl oyster Pinctada maxima

PubMed Central

2011-01-01

Background Biomineralization is a process encompassing all mineral containing tissues produced within an organism. One of the most dynamic examples of this process is the formation of the mollusk shell, comprising a variety of crystal phases and microstructures. The organic component incorporated within the shell is said to dictate this architecture. However general understanding of how this process is achieved remains ambiguous. The mantle is a conserved organ involved in shell formation throughout molluscs. Specifically the mantle is thought to be responsible for secreting the protein component of the shell. This study employs molecular approaches to determine the spatial expression of genes within the mantle tissue to further the elucidation of the shell biomineralization. Results A microarray platform was custom generated (PmaxArray 1.0) from the pearl oyster Pinctada maxima. PmaxArray 1.0 consists of 4992 expressed sequence tags (ESTs) originating from mantle tissue. This microarray was used to analyze the spatial expression of ESTs throughout the mantle organ. The mantle was dissected into five discrete regions and analyzed for differential gene expression with PmaxArray 1.0. Over 2000 ESTs were determined to be differentially expressed among the tissue sections, identifying five major expression regions. In situ hybridization validated and further localized the expression for a subset of these ESTs. Comparative sequence similarity analysis of these ESTs revealed a number of the transcripts were novel while others showed significant sequence similarities to previously characterized shell related genes. Conclusions This investigation has mapped the spatial distribution for over 2000 ESTs present on PmaxArray 1.0 with reference to specific locations of the mantle. Expression profile clusters have indicated at least five unique functioning zones in the mantle. 
Three of these zones are likely involved in shell related activities including formation of nacre, periostracum and calcitic prismatic microstructure. A number of novel and known transcripts have been identified from these clusters. The development of PmaxArray 1.0, and the spatial map of its ESTs expression in the mantle has begun characterizing the molecular mechanisms linking the organics and inorganics of the molluscan shell. PMID:21936921
Cloning and expression of a nuclear encoded plastid specific 33 kDa ribonucleoprotein gene (33RNP) from pea that is light stimulated.

PubMed

Reddy, M K; Nair, S; Singh, B N; Mudgil, Y; Tewari, K K; Sopory, S K

2001-01-24

We report the cloning and sequencing of both cDNA and genomic DNA of a 33 kDa chloroplast ribonucleoprotein (33RNP) from pea. The analysis of the predicted amino acid sequence of the cDNA clone revealed that the encoded protein contains two RNA binding domains, including the conserved consensus ribonucleoprotein sequences CS-RNP1 and CS-RNP2, on the C-terminus half and the presence of a putative transit peptide sequence in the N-terminus region. The phylogenetic and multiple sequence alignment analysis of pea chloroplast RNP along with RNPs reported from the other plant sources revealed that the pea 33RNP is very closely related to Nicotiana sylvestris 31RNP and 28RNP and also to 31RNP and 28RNP of Arabidopsis and spinach, respectively. The pea 33RNP was expressed in Escherichia coli and purified to homogeneity. The in vitro import of precursor protein into chloroplasts confirmed that the N-terminus putative transit peptide is a bona fide transit peptide and 33RNP is localized in the chloroplast. The nucleic acid-binding properties of the recombinant protein, as revealed by South-Western analysis, showed that 33RNP has higher binding affinity for poly (U) and oligo dT than for ssDNA and dsDNA. The steady state transcript level was higher in leaves than in roots and the expression of this gene is light stimulated. Sequence analysis of the genomic clone revealed that the gene contains four exons and three introns. We have also isolated and analyzed the 5' flanking region of the pea 33RNP gene.
VitisExpDB: a database resource for grape functional genomics.

PubMed

Doddapaneni, Harshavardhan; Lin, Hong; Walker, M Andrew; Yao, Jiqiang; Civerolo, Edwin L

2008-02-28

The family Vitaceae consists of many different grape species that grow in a range of climatic conditions. In the past few years, several studies have generated functional genomic information on different Vitis species and cultivars, including the European grape vine, Vitis vinifera. Our goal is to develop a comprehensive web data source for Vitaceae. VitisExpDB is an online MySQL-PHP driven relational database that houses annotated EST and gene expression data for V. vinifera and non-vinifera grape species and varieties. Currently, the database stores approximately 320,000 EST sequences derived from 8 species/hybrids, their annotation (BLAST top match) details and Gene Ontology based structured vocabulary. Putative homologs for each EST in other species and varieties along with information on their percent nucleotide identities, phylogenetic relationship and common primers can be retrieved. The database also includes information on probe sequence and annotation features of the high density 60-mer gene expression chip consisting of approximately 20,000 non-redundant set of ESTs. Finally, the database includes 14 processed global microarray expression profile sets. Data from 12 of these expression profile sets have been mapped onto metabolic pathways. A user-friendly web interface with multiple search indices and extensively hyperlinked result features that permit efficient data retrieval has been developed. Several online bioinformatics tools that interact with the database along with other sequence analysis tools have been added. In addition, users can submit their ESTs to the database. The developed database provides genomic resource to grape community for functional analysis of genes in the collection and for the grape genome annotation and gene function identification. The VitisExpDB database is available through our website http://cropdisease.ars.usda.gov/vitis_at/main-page.htm.
VitisExpDB: A database resource for grape functional genomics

PubMed Central

Doddapaneni, Harshavardhan; Lin, Hong; Walker, M Andrew; Yao, Jiqiang; Civerolo, Edwin L

2008-01-01

Background The family Vitaceae consists of many different grape species that grow in a range of climatic conditions. In the past few years, several studies have generated functional genomic information on different Vitis species and cultivars, including the European grape vine, Vitis vinifera. Our goal is to develop a comprehensive web data source for Vitaceae. Description VitisExpDB is an online MySQL-PHP driven relational database that houses annotated EST and gene expression data for V. vinifera and non-vinifera grape species and varieties. Currently, the database stores ~320,000 EST sequences derived from 8 species/hybrids, their annotation (BLAST top match) details and Gene Ontology based structured vocabulary. Putative homologs for each EST in other species and varieties along with information on their percent nucleotide identities, phylogenetic relationship and common primers can be retrieved. The database also includes information on probe sequence and annotation features of the high density 60-mer gene expression chip consisting of ~20,000 non-redundant set of ESTs. Finally, the database includes 14 processed global microarray expression profile sets. Data from 12 of these expression profile sets have been mapped onto metabolic pathways. A user-friendly web interface with multiple search indices and extensively hyperlinked result features that permit efficient data retrieval has been developed. Several online bioinformatics tools that interact with the database along with other sequence analysis tools have been added. In addition, users can submit their ESTs to the database. Conclusion The developed database provides genomic resource to grape community for functional analysis of genes in the collection and for the grape genome annotation and gene function identification. The VitisExpDB database is available through our website. PMID:18307813
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

PubMed

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. Among the more important structures in a DNA sequence are repeat-related ones. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cerevisiae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
The binding of TIA-1 to RNA C-rich sequences is driven by its C-terminal RRM domain.

PubMed

Cruz-Gallardo, Isabel; Aroca, Ángeles; Gunzburg, Menachem J; Sivakumaran, Andrew; Yoon, Je-Hyun; Angulo, Jesús; Persson, Cecilia; Gorospe, Myriam; Karlsson, B Göran; Wilce, Jacqueline A; Díaz-Moreno, Irene

2014-01-01

T-cell intracellular antigen-1 (TIA-1) is a key DNA/RNA binding protein that regulates translation by sequestering target mRNAs in stress granules (SG) in response to stress conditions. TIA-1 possesses three RNA recognition motifs (RRM) along with a glutamine-rich domain, with the central domains (RRM2 and RRM3) acting as RNA binding platforms. While the RRM2 domain, which displays high affinity for U-rich RNA sequences, is primarily responsible for interaction with RNA, the contribution of RRM3 to bind RNA as well as the target RNA sequences that it binds preferentially are still unknown. Here we combined nuclear magnetic resonance (NMR) and surface plasmon resonance (SPR) techniques to elucidate the sequence specificity of TIA-1 RRM3. With a novel approach using saturation transfer difference NMR (STD-NMR) to quantify protein-nucleic acids interactions, we demonstrate that isolated RRM3 binds to both C- and U-rich stretches with micromolar affinity. In combination with RRM2 and in the context of full-length TIA-1, RRM3 significantly enhanced the binding to RNA, particularly to cytosine-rich RNA oligos, as assessed by biotinylated RNA pull-down analysis. Our findings provide new insight into the role of RRM3 in regulating TIA-1 binding to C-rich stretches, that are abundant at the 5' TOPs (5' terminal oligopyrimidine tracts) of mRNAs whose translation is repressed under stress situations.
The binding of TIA-1 to RNA C-rich sequences is driven by its C-terminal RRM domain

PubMed Central

Cruz-Gallardo, Isabel; Aroca, Ángeles; Gunzburg, Menachem J; Sivakumaran, Andrew; Yoon, Je-Hyun; Angulo, Jesús; Persson, Cecilia; Gorospe, Myriam; Karlsson, B Göran; Wilce, Jacqueline A; Díaz-Moreno, Irene

2014-01-01

T-cell intracellular antigen-1 (TIA-1) is a key DNA/RNA binding protein that regulates translation by sequestering target mRNAs in stress granules (SG) in response to stress conditions. TIA-1 possesses three RNA recognition motifs (RRM) along with a glutamine-rich domain, with the central domains (RRM2 and RRM3) acting as RNA binding platforms. While the RRM2 domain, which displays high affinity for U-rich RNA sequences, is primarily responsible for interaction with RNA, the contribution of RRM3 to bind RNA as well as the target RNA sequences that it binds preferentially are still unknown. Here we combined nuclear magnetic resonance (NMR) and surface plasmon resonance (SPR) techniques to elucidate the sequence specificity of TIA-1 RRM3. With a novel approach using saturation transfer difference NMR (STD-NMR) to quantify protein–nucleic acids interactions, we demonstrate that isolated RRM3 binds to both C- and U-rich stretches with micromolar affinity. In combination with RRM2 and in the context of full-length TIA-1, RRM3 significantly enhanced the binding to RNA, particularly to cytosine-rich RNA oligos, as assessed by biotinylated RNA pull-down analysis. Our findings provide new insight into the role of RRM3 in regulating TIA-1 binding to C-rich stretches, that are abundant at the 5′ TOPs (5′ terminal oligopyrimidine tracts) of mRNAs whose translation is repressed under stress situations. PMID:24824036
A novel nonsteroidal antifibrotic oligo decoy containing the TGF-beta element found in the COL1A1 gene which regulates murine schistosomiasis liver fibrosis.

PubMed

Boros, D L; Singh, K P; Gerard, H C; Hudson, A P; White, S L; Cutroneo, K R

2005-08-01

Schistosomiasis mansoni disseminated worm eggs in mice and humans induce granulomatous inflammations and cumulative fibrosis causing morbidity and possibly mortality. In this study, intrahepatic and I.V. injections of a double-stranded oligodeoxynucleotide decoy containing the TGF-beta regulatory element found in the distal promoter of the COL1A1 gene into worm-infected mice suppressed TGF-beta1, COL1A1, tissue inhibitor of metalloproteinase-1, and decreased COL3A1 mRNAs to a lesser extent. Sequence comparisons within the mouse genome found homologous sequences within the COL3A1, TGF-beta1, and TIMP-1 5' flanking regions. Cold competition gel mobility shift assays using these homologous sequences with 5' and 3' flanking regions found in the natural COL1A1 gene showed competition. Competitive gel mobility assays in a separate experiment showed no competition using a 5-base mutated or scrambled sequence. Explanted liver granulomas from saline-injected mice incorporated 10.45 +/- 1.7% (3)H-proline into newly synthesized collagen, whereas decoy-treated mice showed no collagen synthesis. Compared with the saline control schistosomiasis mice phosphorothioate double-stranded oligodeoxynucleotide treatment decreased total liver collagen content (i.e. hydroxy-4-proline) by 34%. This novel molecular approach has the potential to be employed as a novel antifibrotic treatment modality. (c) 2005 Wiley-Liss, Inc.
The Salivary Microbiome in Polycystic Ovary Syndrome (PCOS) and Its Association with Disease-Related Parameters: A Pilot Study

PubMed Central

Lindheim, Lisa; Bashir, Mina; Münzker, Julia; Trummer, Christian; Zachhuber, Verena; Pieber, Thomas R.; Gorkiewicz, Gregor; Obermayer-Pietsch, Barbara

2016-01-01

Background: Polycystic ovary syndrome (PCOS) is a common female endocrine condition of unclear etiology characterized by hyperandrogenism, oligo/amenorrhoea, and polycystic ovarian morphology. PCOS is often complicated by infertility, overweight/obesity, insulin resistance, and low-grade inflammation. The gut microbiome is known to contribute to several of these conditions. Recently, an association between stool and saliva microbiome community profiles was shown, making saliva a possible convenient, non-invasive sample type for detecting gut microbiome changes in systemic disease. In this study, we describe the saliva microbiome of PCOS patients and the association of microbiome features with PCOS-related parameters. Methods: 16S rRNA gene amplicon sequencing was performed on saliva samples from 24 PCOS patients and 20 healthy controls. Data processing and microbiome analyses were conducted in mothur and QIIME. All study subjects were characterized regarding reproductive, metabolic, and inflammatory parameters. Results: PCOS patients showed a decrease in bacteria from the phylum Actinobacteria and a borderline significant shift in bacterial community composition in unweighted UniFrac analysis. No differences between patients and controls were found in alpha diversity, weighted UniFrac analysis, or on other taxonomic levels. We found no association of saliva alpha diversity, beta diversity, or taxonomic composition with serum testosterone, oligo/amenorrhoea, overweight, insulin resistance, inflammatory markers, age, or diet. Conclusions: In this pilot study, patients with PCOS showed a reduced salivary relative abundance of Actinobacteria. Reproductive and metabolic components of the syndrome were not associated with saliva microbiome parameters, indicating that the majority of between-subject variation in saliva microbiome profiles remains to be explained. PMID:27610099
Photoinduced Electron Transfer and Hole Migration in Nanosized Helical Aromatic Oligoamide Foldamers.

PubMed

Li, Xuesong; Markandeya, Nagula; Jonusauskas, Gediminas; McClenaghan, Nathan D; Maurizot, Victor; Denisov, Sergey A; Huc, Ivan

2016-10-07

A series of photoactive triads have been synthesized and investigated in order to elucidate photoinduced electron transfer and hole migration mechanism across nanosized, rigid helical foldamers. The triads are comprised of a central helical oligoamide foldamer bridge with 9, 14, 18, 19, or 34 8-amino-2-quinolinecarboxylic acid repeat units, and of two chromophores, an N-terminal oligo(para-phenylenevinylene) electron donor and a C-terminal perylene bis-imide electron acceptor. Time-resolved fluorescence and transient absorption spectroscopic studies showed that, following photoexcitation of the electron acceptor, fast electron transfer occurs initially from the oligoquinoline bridge to the acceptor chromophore on the picosecond time scale. The oligo(para-phenylenevinylene) electron donor is oxidized after a time delay during which the hole migrates across the foldamer from the acceptor to the donor. The charge separated state that is finally generated was found to be remarkably long-lived (>80 μs). While the initial charge injection rate is largely invariant for all foldamer lengths (ca. 60 ps), the subsequent hole transfer to the donor varies from 1 × 10⁹ s⁻¹ for the longest sequence to 17 × 10⁹ s⁻¹ for the shortest. In all cases, charge transfer is very fast considering the foldamer length. Detailed analysis of the process in different media and at varying temperatures is consistent with a hopping mechanism of hole transport through the foldamer helix, with individual hops occurring on the subpicosecond time scale (k_ET = 2.5 × 10¹² s⁻¹ in CH₂Cl₂). This work demonstrates the possibility of fast long-range hole transfer over 300 Å (through bonds) across a synthetic modular bridge, an achievement that had been previously observed principally with DNA structures.
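The rate constants quoted in the abstract above are easier to compare with its stated time scales once converted to time constants. The sketch below assumes simple first-order kinetics, so the time constant is 1/k. The function name is illustrative, and the only inputs are the three rate constants given in the abstract.

```python
def time_constant_ps(rate_per_second):
    """Return the first-order time constant 1/k in picoseconds for k in s^-1."""
    if rate_per_second <= 0:
        raise ValueError("rate constant must be positive")
    return 1e12 / rate_per_second


if __name__ == "__main__":
    # Rate constants as quoted in the abstract (s^-1).
    rates = {
        "hole transfer, longest bridge": 1e9,
        "hole transfer, shortest bridge": 17e9,
        "single hop (k_ET)": 2.5e12,
    }
    for label, k in rates.items():
        print(f"{label}: {time_constant_ps(k):.2f} ps")
```

On this reading, hole transfer takes about 1 ns across the longest bridge and about 59 ps across the shortest, while a single hop at 0.4 ps matches the "subpicosecond" description.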
Universal ligation-detection-reaction microarray applied for compost microbes

PubMed Central

Hultman, Jenni; Ritari, Jarmo; Romantschuk, Martin; Paulin, Lars; Auvinen, Petri

2008-01-01

Background Composting is one of the methods utilised in recycling organic communal waste. The composting process is dependent on aerobic microbial activity and proceeds through a succession of different phases each dominated by certain microorganisms. In this study, a ligation-detection-reaction (LDR) based microarray method was adapted for species-level detection of compost microbes characteristic of each stage of the composting process. LDR utilises the specificity of the ligase enzyme to covalently join two adjacently hybridised probes. A zip-oligo is attached to the 3'-end of one probe and fluorescent label to the 5'-end of the other probe. Upon ligation, the probes are combined in the same molecule and can be detected in a specific location on a universal microarray with complementary zip-oligos enabling equivalent hybridisation conditions for all probes. The method was applied to samples from Nordic composting facilities after testing and optimisation with fungal pure cultures and environmental clones. Results Probes targeted for fungi were able to detect 0.1 fmol of target ribosomal PCR product in an artificial reaction mixture containing 100 ng competing fungal ribosomal internal transcribed spacer (ITS) area or herring sperm DNA. The detection level was therefore approximately 0.04% of total DNA. Clone libraries were constructed from eight compost samples. The LDR microarray results were in concordance with the clone library sequencing results. In addition a control probe was used to monitor the per-spot hybridisation efficiency on the array. Conclusion This study demonstrates that the LDR microarray method is capable of sensitive and accurate species-level detection from a complex microbial community. The method can detect key species from compost samples, making it a basis for a tool for compost process monitoring in industrial facilities. PMID:19116002
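The abstract above equates 0.1 fmol of target PCR product in 100 ng of competing DNA with roughly 0.04% of total DNA, but does not state the amplicon length needed to check that figure. The sketch below assumes an average of about 660 g/mol per base pair of double-stranded DNA (a common approximation) and a hypothetical 600 bp amplicon. Both values and the function names are assumptions for illustration, not figures from the study.

```python
# Common approximation for the molar mass of one base pair of dsDNA.
AVG_BP_MASS_G_PER_MOL = 660.0


def dsdna_mass_ng(amount_fmol, length_bp):
    """Approximate mass in ng of a dsDNA fragment from its amount and length."""
    grams = amount_fmol * 1e-15 * length_bp * AVG_BP_MASS_G_PER_MOL
    return grams * 1e9


def target_percent_of_total(amount_fmol, length_bp, background_ng):
    """Target mass as a percentage of total DNA (target plus background)."""
    target_ng = dsdna_mass_ng(amount_fmol, length_bp)
    return 100.0 * target_ng / (target_ng + background_ng)


if __name__ == "__main__":
    # 600 bp is a hypothetical amplicon length chosen for illustration.
    percent = target_percent_of_total(0.1, 600, 100.0)
    print(f"{percent:.4f}% of total DNA")
```

With these assumptions the target is about 0.04 ng in 100 ng of background, which reproduces the reported percentage. A different amplicon length would scale the result proportionally.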

Dry reagent dipstick test combined with 23S rRNA PCR for molecular diagnosis of bacterial infection in arthroplasty.

PubMed

Kalogianni, Despina P; Goura, Sophia; Aletras, Alexios J; Christopoulos, Theodore K; Chanos, Michalis G; Christofidou, Myrto; Skoutelis, Athanasios; Ioannou, Penelope C; Panagiotopoulos, Elias

2007-02-15

Periprosthetic joint infections present a challenging problem in orthopaedics. Conventional methods for detection of arthroplasty infections rely on bacterial culture of synovial fluid aspirates. During recent years, however, molecular tests that are based on DNA amplification by the polymerase chain reaction (PCR), followed by electrophoretic analysis of the products, have been introduced. We report a simple and inexpensive assay that allows visual detection and confirmation of the PCR-amplified sequences by hybridization within minutes. The assay is performed in a dry reagent dipstick format (strip) and does not require special instrumentation. Universal primers are used for PCR of the 23S ribosomal RNA (rRNA) gene. The biotinylated amplification product is hybridized with dA-tailed probes that are specific for six pathogens commonly involved in periprosthetic joint infections. The mixture is applied to the strip, which is then immersed in the appropriate buffer. The buffer migrates along the strip by capillary action and rehydrates gold nanoparticles with oligo(dT) strands attached to their surface. The nanoparticles bind to the target DNA through hybridization, and the hybrids are captured by immobilized streptavidin at the test zone of the strip, producing a characteristic red line. Unbound nanoparticles are captured by immobilized oligo(dT) strands at the control zone of the strip, generating a second line. The dipstick test was applied to the detection of Escherichia coli, Staphylococcus aureus, Staphylococcus epidermidis, Streptococcus pneumoniae, Enterococcus faecium, and Haemophilus influenzae. Twelve samples of synovial fluids from patients were analyzed for the detection and identification of the infection caused by the six pathogens. The results were compared with bacterial cultures.
Cloning and characterization of a novel oocyte-specific gene encoding an F-Box protein in rainbow trout (Oncorhynchus mykiss)

USDA-ARS's Scientific Manuscript database

Oocyte-specific genes play critical roles in oogenesis, folliculogenesis and early embryonic development. Through analysis of expressed sequence tags (ESTs) from a rainbow trout oocyte cDNA library, we identified a novel transcript which is represented by multiple ESTs derived only from the oocyte c...
Cloning and characterization of a novel oocyte-specific gene encoding an F-Box protein in rainbow trout (Oncorhynchus mykiss)

USDA-ARS's Scientific Manuscript database

Oocyte-specific genes play critical roles in oogenesis, folliculogenesis and early embryonic development. Through analysis of expressed sequence tags (ESTs) from a rainbow trout oocyte cDNA library, we identified a novel transcript which is represented by ESTs only from the oocyte library. The novel...
Towards the understanding of the cocoa transcriptome: Production and analysis of an exhaustive dataset of ESTs of Theobroma cacao L. generated from various tissues and under various conditions

PubMed Central

Argout, Xavier; Fouet, Olivier; Wincker, Patrick; Gramacho, Karina; Legavre, Thierry; Sabau, Xavier; Risterucci, Ange Marie; Da Silva, Corinne; Cascardo, Julio; Allegre, Mathilde; Kuhn, David; Verica, Joseph; Courtois, Brigitte; Loor, Gaston; Babin, Regis; Sounigo, Olivier; Ducamp, Michel; Guiltinan, Mark J; Ruiz, Manuel; Alemanno, Laurence; Machado, Regina; Phillips, Wilberth; Schnell, Ray; Gilmour, Martin; Rosenquist, Eric; Butler, David; Maximova, Siela; Lanaud, Claire

2008-01-01

Background Theobroma cacao L., is a tree originated from the tropical rainforest of South America. It is one of the major cash crops for many tropical countries. T. cacao is mainly produced on smallholdings, providing resources for 14 million farmers. Disease resistance and T. cacao quality improvement are two important challenges for all actors of cocoa and chocolate production. T. cacao is seriously affected by pests and fungal diseases, responsible for more than 40% yield losses and quality improvement, nutritional and organoleptic, is also important for consumers. An international collaboration was formed to develop an EST genomic resource database for cacao. Results Fifty-six cDNA libraries were constructed from different organs, different genotypes and different environmental conditions. A total of 149,650 valid EST sequences were generated corresponding to 48,594 unigenes, 12,692 contigs and 35,902 singletons. A total of 29,849 unigenes shared significant homology with public sequences from other species. Gene Ontology (GO) annotation was applied to distribute the ESTs among the main GO categories. A specific information system (ESTtik) was constructed to process, store and manage this EST collection allowing the user to query a database. To check the representativeness of our EST collection, we looked for the genes known to be involved in two different metabolic pathways extensively studied in other plant species and important for T. cacao qualities: the flavonoid and the terpene pathways. Most of the enzymes described in other crops for these two metabolic pathways were found in our EST collection. A large collection of new genetic markers was provided by this ESTs collection. Conclusion This EST collection displays a good representation of the T. cacao transcriptome, suitable for analysis of biochemical pathways based on oligonucleotide microarrays derived from these ESTs. 
It will provide numerous genetic markers that will allow the construction of a high density gene map of T. cacao. This EST collection represents a unique and important molecular resource for T. cacao study and improvement, facilitating the discovery of candidate genes for important T. cacao trait variation. PMID:18973681
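The counts quoted in this record can be cross-checked with simple arithmetic. The sketch below uses only figures stated in the abstract; the variable names are illustrative, and the derived ratios are not reported by the authors.

```python
# Figures quoted in the abstract above.
valid_ests = 149_650
unigenes = 48_594
contigs = 12_692
singletons = 35_902
unigenes_with_homology = 29_849

# The unigene set should be the sum of contigs and singletons.
assert contigs + singletons == unigenes

# Share of unigenes with significant homology to public sequences
# (derived here, not stated in the abstract).
homology_fraction = unigenes_with_homology / unigenes

# Average number of ESTs collapsed into each unigene (also derived here).
ests_per_unigene = valid_ests / unigenes

print(f"{homology_fraction:.1%} of unigenes have a significant homolog")
print(f"{ests_per_unigene:.2f} ESTs per unigene on average")
```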
Towards the understanding of the cocoa transcriptome: Production and analysis of an exhaustive dataset of ESTs of Theobroma cacao L. generated from various tissues and under various conditions.

PubMed

Argout, Xavier; Fouet, Olivier; Wincker, Patrick; Gramacho, Karina; Legavre, Thierry; Sabau, Xavier; Risterucci, Ange Marie; Da Silva, Corinne; Cascardo, Julio; Allegre, Mathilde; Kuhn, David; Verica, Joseph; Courtois, Brigitte; Loor, Gaston; Babin, Regis; Sounigo, Olivier; Ducamp, Michel; Guiltinan, Mark J; Ruiz, Manuel; Alemanno, Laurence; Machado, Regina; Phillips, Wilberth; Schnell, Ray; Gilmour, Martin; Rosenquist, Eric; Butler, David; Maximova, Siela; Lanaud, Claire

2008-10-30

Theobroma cacao L., is a tree originated from the tropical rainforest of South America. It is one of the major cash crops for many tropical countries. T. cacao is mainly produced on smallholdings, providing resources for 14 million farmers. Disease resistance and T. cacao quality improvement are two important challenges for all actors of cocoa and chocolate production. T. cacao is seriously affected by pests and fungal diseases, responsible for more than 40% yield losses and quality improvement, nutritional and organoleptic, is also important for consumers. An international collaboration was formed to develop an EST genomic resource database for cacao. Fifty-six cDNA libraries were constructed from different organs, different genotypes and different environmental conditions. A total of 149,650 valid EST sequences were generated corresponding to 48,594 unigenes, 12,692 contigs and 35,902 singletons. A total of 29,849 unigenes shared significant homology with public sequences from other species. Gene Ontology (GO) annotation was applied to distribute the ESTs among the main GO categories. A specific information system (ESTtik) was constructed to process, store and manage this EST collection allowing the user to query a database. To check the representativeness of our EST collection, we looked for the genes known to be involved in two different metabolic pathways extensively studied in other plant species and important for T. cacao qualities: the flavonoid and the terpene pathways. Most of the enzymes described in other crops for these two metabolic pathways were found in our EST collection. A large collection of new genetic markers was provided by this ESTs collection. This EST collection displays a good representation of the T. cacao transcriptome, suitable for analysis of biochemical pathways based on oligonucleotide microarrays derived from these ESTs. It will provide numerous genetic markers that will allow the construction of a high density gene map of T. cacao.
This EST collection represents a unique and important molecular resource for T. cacao study and improvement, facilitating the discovery of candidate genes for important T. cacao trait variation.
Venom proteomic and venomous glands transcriptomic analysis of the Egyptian scorpion Scorpio maurus palmatus (Arachnida: Scorpionidae).

PubMed

Abdel-Rahman, Mohamed A; Quintero-Hernandez, Veronica; Possani, Lourival D

2013-11-01

Proteomic analysis of the scorpion venom Scorpio maurus palmatus was performed using reverse-phase HPLC separation followed by mass spectrometry determination. Sixty-five components were identified with molecular masses varying from 413 to 14,009 Da. The high percentage of peptides (41.5%) was from 3 to 5 kDa which may represent linear antimicrobial peptides and KScTxs. Also, 155 expressed sequence tags (ESTs) were analyzed through construction of a cDNA library prepared from a pair of venomous glands. About 77% of the ESTs correspond to toxin-like peptides and proteins with definite open reading frames. The cDNA sequencing results also show the presence of sequences whose putative products have sequence similarity with antimicrobial peptides (24%), insecticidal toxins, β-NaScTxs, κ-KScTxs, α-KScTxs, calcines and La1-like peptides. Also, we have obtained 23 atypical types of venom molecules not recorded in other scorpion species. Moreover, 9% of the total ESTs revealed significant similarities with proteins involved in the cellular processes of these scorpion venomous glands. This is the first set of molecular masses and transcripts described from this species, in which various venom molecules have been identified. They belong to either known or unassigned types of scorpion venom peptides and proteins, and provide valuable information for evolutionary analysis and venomics. Copyright © 2013 Elsevier Ltd. All rights reserved.
Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

PubMed

Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

2012-12-01

Simple sequence repeat (SSR) markers were developed from an Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. A genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75% of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the other taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
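The per-locus haploid diversity values mentioned in this record can be illustrated with a short calculation. The abstract does not say which estimator was used; the sketch below assumes Nei's unbiased gene diversity, h = n/(n-1) * (1 - sum of squared allele frequencies), and the allele sizes are hypothetical, not data from the study.

```python
from collections import Counter


def haploid_gene_diversity(alleles):
    """Nei's unbiased gene diversity for one locus scored in haploid isolates."""
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two scored isolates")
    counts = Counter(alleles)
    # Sum of squared allele frequencies (expected homozygosity).
    sum_sq = sum((count / n) ** 2 for count in counts.values())
    # Small-sample correction n / (n - 1).
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical allele sizes (bp) at one EST-SSR locus across eight isolates.
example_locus = [150, 150, 153, 156, 156, 156, 159, 162]
diversity = haploid_gene_diversity(example_locus)
print(round(diversity, 3))
```

A locus with a single allele gives 0, and the value approaches 1 as alleles become more numerous and evenly distributed.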
Mechanism of papain-catalyzed synthesis of oligo-tyrosine peptides.

PubMed

Mitsuhashi, Jun; Nakayama, Tsutomu; Narai-Kanayama, Asako

2015-01-01

Di-, tri-, and tetra-tyrosine peptides with angiotensin I-converting enzyme inhibitory activity were synthesized by papain-catalyzed polymerization of L-tyrosine ethyl ester in aqueous media at 30 °C. Varying the reaction pH from 6.0 to 7.5 and the initial concentration of the ester substrate from 25 to 100 mM, the highest yield of oligo-tyrosine peptides (79% on a substrate basis) was produced at pH 6.5 and 75 mM, respectively. In the reaction initiated with 100 mM of the substrate, approx. 50% yield of insoluble, highly polymerized peptides accumulated. At less than 15 mM, the reaction proceeded poorly; however, from 30 mM to 120 mM a dose-dependent increase in the consumption rate of the substrate was observed with a sigmoidal curve. Meanwhile, each of the tri- and tetra-tyrosine peptides, even at approx. 5 mM, was consumed effectively by papain but was not elongated to insoluble polymers. For deacylation of the acyl-papain intermediate through which a new peptide bond is made, L-tyrosine ethyl ester, even at 5 mM, showed higher nucleophilic activity than di- and tri-tyrosine. These results indicate that the mechanism through which papain polymerizes L-tyrosine ethyl ester is as follows: the first interaction between papain and the ester substrate is a rate-limiting step; oligo-tyrosine peptides produced early in the reaction period are preferentially used as acyl donors, while the initial ester substrate strongly contributes as a nucleophile to the elongation of the peptide product; and the balance between hydrolytic fragmentation and further elongation of oligo-tyrosine peptides is dependent on the surrounding concentration of the ester substrate. Copyright © 2015 Elsevier Inc. All rights reserved.
Chemical evolution of RNA under hydrothermal conditions and the role of thermal copolymers of amino acids for the prebiotic degradation and formation of RNA

NASA Technical Reports Server (NTRS)

Kawamura, K.; Nagahama, M.; Kuranoue, K.

2005-01-01

The roles of thermal copolymers of amino acids (TCAA) were studied for the prebiotic degradation of RNA. A weak catalytic ability of TCAA consisting of Glu, L-Ala, L-Val, L-Glu, L-Asp, and optionally L-His was detected for the cleavage of the ribose phosphodiester bond of a tetranucleotide (5'-dCrCdGdG) in aqueous solution at 80 degrees C. The rate constants of the disappearance of 5'-dCrCdGdG were determined in aqueous solutions using different pH buffer and TCAA. The degradation rates were enhanced 1.3-3.0 times in the presence of TCAA at pH 7.5 and 8.0 at 80 degrees C, while the hydrolysis of oligoguanylate (oligo(G)) was accelerated about 1.6 times at pH 8.0. A weak inhibitory activity for the cleavage of oligo(G) was detected in the presence of 0.055 M TCAA-Std. On the other hand, our recent study on the influences of TCAA for the template-directed reaction of oligo(G) on a polycytidylic acid template showed that TCAA has an acceleration activity for the degradation of the activated nucleotide monomer and an acceleration activity for the formation of G5' ppG capped oligo(G). This series of studies suggests that efficient and selective catalytic or inhibitory activities for either the degradation or formation of RNA under hydrothermal conditions could have hardly emerged from the simple thermal condensation products of amino acids. A scenario is going to be deduced on the chemical evolution of enzymatic activities and RNA molecules concerning hydrothermal earth conditions. © 2005 COSPAR. Published by Elsevier Ltd. All rights reserved.
Angiotensin-II receptor 1 antagonist fetopathy--risk assessment, critical time period and vena cava thrombosis as a possible new feature.

PubMed

Oppermann, Marc; Padberg, Stephanie; Kayser, Angela; Weber-Schoendorfer, Corinna; Schaefer, Christof

2013-03-01

Angiotensin-II receptor 1 antagonists (AT₁-antagonists) may cause severe and even lethal fetopathy in late pregnancy. However, exposure still occurs in spite of warnings in package leaflets. This study aimed to assess the risk of fetopathy, the sensitive time window, and possible new symptoms in prospective as well as retrospective cases with AT₁-antagonist treatment during the second or third trimester of pregnancy. Patients were enrolled by the Berlin Institute for Clinical Teratology and Drug Risk Assessment in Pregnancy between 1999 and 2011 through risk consultation. Symptoms defined as indicative of AT₁-antagonist fetopathy were: oligo-/anhydramnios, renal insufficiency, lung hypoplasia, joint contractures, skull hypoplasia and fetal/neonatal death. In 5/29 (17%) prospectively enrolled cases with AT₁-antagonist exposure beyond the first trimester oligo-/anhydramnios was diagnosed. Two infants showed additional symptoms of fetopathy. The risk is more than 30% if treatment continues beyond the 20th week of pregnancy. Oligo-/anhydramnios was reversible after AT₁-antagonist withdrawal. Among 16 retrospective case reports, three infants presented with a thrombosis of the inferior vena cava in the vicinity of the renal veins. Four out of 13 live births did not survive. Our survey suggests that the risk increases with duration of AT₁-antagonist treatment into late pregnancy and oligo-/anhydramnios may be reversible after AT₁-antagonist discontinuation. Thrombosis of inferior vena cava may be a new feature of AT₁-antagonist fetopathy. AT₁-antagonist medication during pregnancy constitutes a considerable risk and must be discontinued immediately. In case of indicative diagnostic findings in either the fetus or newborn, previous maternal AT₁-antagonist exposure should be considered. © 2012 The Authors. British Journal of Clinical Pharmacology © 2012 The British Pharmacological Society.
Transgenic expression of Bcl-2 modulates energy metabolism, prevents cytosolic acidification during ischemia, and reduces ischemia/reperfusion injury.

PubMed

Imahashi, Kenichi; Schneider, Michael D; Steenbergen, Charles; Murphy, Elizabeth

2004-10-01

The antiapoptotic protein Bcl-2 is targeted to the mitochondria, but it is uncertain whether Bcl-2 affects only myocyte survival after ischemia, or whether it also affects metabolic functions of mitochondria during ischemia. Hearts from mice overexpressing human Bcl-2 and from their wild-type littermates (WT) were subjected to 24 minutes of global ischemia followed by reperfusion. During ischemia, the decrease in pH(i) and the initial rate of decline in ATP were significantly reduced in Bcl-2 hearts compared with WT hearts (P<0.05). The reduced acidification during ischemia was dependent on the activity of mitochondrial F1F0-ATPase. In the presence of oligomycin (Oligo), an F1F0-ATPase inhibitor, the decrease in pH(i) was attenuated in WT hearts, but in Bcl-2 hearts, Oligo had no additional effect on pH(i) during ischemia. Likewise, addition of Oligo to WT hearts slowed the rate of decline in ATP during ischemia to a level similar to that observed in Bcl-2 hearts, but addition of Oligo had no significant effect on the rate of decline in ATP in Bcl-2 hearts during ischemia. These data are consistent with Bcl-2-mediated inhibition of consumption of glycolytic ATP. Furthermore, mitochondria from Bcl-2 hearts have a reduced rate of consumption of ATP on uncoupler addition. This could be accomplished by limiting ATP entry into the mitochondria through the voltage-dependent anion channel, and/or the adenine nucleotide transporter, or by direct inhibition of the F1F0-ATPase. Immunoprecipitation showed greater interaction between Bcl-2 and voltage-dependent anion channel during ischemia. These data indicate that Bcl-2 modulation of metabolism contributes to cardioprotection.
A proposed OB-fold with a protein-interaction surface in Candida albicans telomerase protein Est3

PubMed Central

Yu, Eun Young; Wang, Feng; Lei, Ming; Lue, Neal F

2008-01-01

Ever shorter telomeres 3 (Est3) is an essential telomerase regulatory subunit thought to be unique to budding yeasts. Here we use multiple sequence alignment and hidden Markov model–hidden Markov model (HMM-HMM) comparison to uncover potential similarities between Est3 and the mammalian telomeric protein Tpp1. Analysis of site-specific mutants of Candida albicans Est3 revealed functional distinctions between residues that are conserved between Est3 and Tpp1 and those that are unique to Est3. Although both types of residues are important for telomere maintenance in vivo, only the former contributes to telomerase activity in vitro and facilitates the association of Est3 with telomerase core components. Consistent with a function in protein-protein interaction, the residues common to Est3 and Tpp1 map to one face of an OB-fold model structure, away from the canonical nucleic acid binding surface. We propose that Est3 and the OB-fold domain of Tpp1 mediate a conserved function in telomerase regulation. PMID:19172753
Transcriptome analysis of Bupleurum chinense focusing on genes involved in the biosynthesis of saikosaponins

PubMed Central

2011-01-01

Abstract Background Bupleurum chinense DC. is a widely used traditional Chinese medicinal plant. Saikosaponins are the major bioactive constituents of B. chinense, but relatively little is known about saikosaponin biosynthesis. The 454 pyrosequencing technology provides a promising opportunity for finding novel genes that participate in plant metabolism. Consequently, this technology may help to identify the candidate genes involved in the saikosaponin biosynthetic pathway. Results One-quarter of the 454 pyrosequencing runs produced a total of 195,088 high-quality reads, with an average read length of 356 bases (NCBI SRA accession SRA039388). A de novo assembly generated 24,037 unique sequences (22,748 contigs and 1,289 singletons), 12,649 (52.6%) of which were annotated against three public protein databases using a basic local alignment search tool (E-value ≤1e-10). All unique sequences were compared with NCBI expressed sequence tags (ESTs) (237) and encoding sequences (44) from the Bupleurum genus, and with a Sanger-sequenced EST dataset (3,111). The 23,173 (96.4%) unique sequences obtained in the present study represent novel Bupleurum genes. The ESTs of genes related to saikosaponin biosynthesis were found to encode known enzymes that catalyze the formation of the saikosaponin backbone; 246 cytochrome P450 (P450s) and 102 glycosyltransferases (GTs) unique sequences were also found in the 454 dataset. Full length cDNAs of 7 P450s and 7 uridine diphosphate GTs (UGTs) were verified by reverse transcriptase polymerase chain reaction or by cloning using 5' and/or 3' rapid amplification of cDNA ends. Two P450s and three UGTs were identified as the most likely candidates involved in saikosaponin biosynthesis. This finding was based on the coordinate up-regulation of their expression with β-AS in methyl jasmonate-treated adventitious roots and on their similar expression patterns with β-AS in various B. chinense tissues.
Conclusions A collection of high-quality ESTs for B. chinense obtained by 454 pyrosequencing is provided here for the first time. These data should aid further research on the functional genomics of B. chinense and other Bupleurum species. The candidate genes for enzymes involved in saikosaponin biosynthesis, especially the P450s and UGTs, that were revealed provide a substantial foundation for follow-up research on the metabolism and regulation of the saikosaponins. PMID:22047182
Application of Cydia pomonella expressed sequence tags: identification and expression of three general odorant binding proteins in codling moth

USDA-ARS's Scientific Manuscript database

The codling moth, Cydia pomonella, is one of the most important pests of pome fruits in the world, yet the molecular genetics and physiology of this insect remains poorly understood. A combined assembly of 8340 expressed sequence tags (ESTs) was generated from Roche 454 GS-FLX sequencing of 8 tissu...
Genotyping variability of computationally categorized peach microsatellite markers

USDA-ARS's Scientific Manuscript database

Numerous expressed sequence tag (EST) simple sequence repeat (SSR) primers can be easily mined out. The obstacle to develop them into usable markers is how to optimally select downsized subsets of the primers for genotyping, which accordingly reduces amplification failure and monomorphism often occu...
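The record above is truncated, so the authors' selection procedure is not shown here. As general background, the first step of mining SSRs from EST sequences can be sketched with a regular expression that finds perfect tandem repeats. The function name, the motif length of 3, and the minimum of 5 repeats are assumptions chosen for illustration, not parameters from the study.

```python
import re


def find_ssrs(seq, motif_len=3, min_repeats=5):
    """Return (start, motif, repeat_count) for perfect tandem repeats in seq."""
    # One motif of motif_len bases followed by at least min_repeats - 1 copies.
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        # Skip homopolymer runs (e.g. AAA repeated), which are mononucleotide repeats.
        if len(set(motif)) == 1:
            continue
        hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits


# Hypothetical EST fragment containing six CAG repeats.
est = "GGATCC" + "CAG" * 6 + "TTGACA"
print(find_ssrs(est))
```

Real pipelines usually also handle imperfect repeats and then design primers in the flanking sequence; this sketch covers only repeat detection.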
Transcriptome analysis of blueberry using 454 EST sequencing

USDA-ARS's Scientific Manuscript database

Blueberry (Vaccinium corymbosum) is a major berry crop in the United States, and one that has great nutritional and economical value. Next generation sequencing methodologies, such as 454, have been demonstrated to be successful and efficient in producing a snap-shot of transcriptional activities du...
Identification of novel serine proteinase gene transcripts in the midguts of two tropical insect pests, Scirpophaga incertulas (Wk.) and Helicoverpa armigera (Hb.).

PubMed

Mazumdar-Leighton, S; Babu, C R; Bennett, J

2000-01-01

We have used RT-PCR and 3'RACE to identify diverse serine proteinase genes expressed in the midguts of the rice yellow stem borer (Scirpophaga incertulas) and Asian corn borer (Helicoverpa armigera). The RT-PCR primers encoded the conserved regions around the active site histidine57 and serine195 of Drosophila melanogaster alpha trypsin, including aspartate189 of the specificity pocket. These primers amplified three transcripts (SiP1-3) from midguts of S. incertulas, and two transcripts (HaP1-2) from midguts of H. armigera. The five RT-PCR products were sequenced to permit design of gene-specific forward primers for use with anchored oligo dT primers in 3'RACE. Sequencing of the 3'RACE products indicated that SiP1, SiP2 and HaP1 encoded trypsin-like serine proteinases, while HaP2 encoded a chymotrypsin-like serine proteinase. The SiP3 transcript proved to be an abundant 960 nt mRNA encoding a trypsin-like protein in which the active site serine195 was replaced by aspartate. The possible functions of this unusual protein are discussed.
Development and Characterization of a Psathyrostachys huashanica Keng 7Ns Chromosome Addition Line with Leaf Rust Resistance

PubMed Central

Du, Wanli; Wang, Jing; Wang, Liangming; Zhang, Jun; Chen, Xinhong; Zhao, Jixin; Yang, Qunhui; Wu, Jun

2013-01-01

The aim of this study was to characterize a Triticum aestivum-Psathyrostachys huashanica Keng (2n = 2x = 14, NsNs) disomic addition line 2-1-6-3. Individual line 2-1-6-3 plants were analyzed using cytological, genomic in situ hybridization (GISH), EST-SSR, and EST-STS techniques. The alien addition line 2-1-6-3 was shown to have two P. huashanica chromosomes, with a meiotic configuration of 2n = 44 = 22 II. We tested 55 EST-SSR and 336 EST-STS primer pairs that mapped onto seven different wheat chromosomes using DNA from parents and the P. huashanica addition line. One EST-SSR and nine EST-STS primer pairs indicated that the additional chromosome of P. huashanica belonged to homoeologous group 7. The diagnostic fragments of five EST-STS markers (BE404955, BE591127, BE637663, BF482781 and CD452422) were cloned, sequenced and compared. The results showed that the amplified polymorphic bands of P. huashanica and disomic addition line 2-1-6-3 shared 100% sequence identity, and the line was therefore designated as the 7Ns disomic addition line. Disomic addition line 2-1-6-3 was evaluated to test the leaf rust resistance of adult stages in the field. We found that one pair of the 7Ns genome chromosomes carried new leaf rust resistance gene(s). Moreover, wheat line 2-1-6-3 had superior numbers of florets and grains per spike, which were associated with the introgression of the paired P. huashanica chromosomes. These high levels of disease resistance and stable, excellent agronomic traits suggest that this line could be utilized as a novel donor in wheat breeding programs. PMID:23976963
3' rapid amplification of cDNA ends (RACE) walking for rapid structural analysis of large transcripts.

PubMed

Ozawa, Tatsuhiko; Kondo, Masato; Isobe, Masaharu

2004-01-01

The 3' rapid amplification of cDNA ends (3' RACE) is widely used to isolate the cDNA of unknown 3' flanking sequences. However, the conventional 3' RACE often fails to amplify cDNA from a large transcript if there is a long distance between the 5' gene-specific primer and poly(A) stretch, since the conventional 3' RACE utilizes 3' oligo-dT-containing primer complementary to the poly(A) tail of mRNA at the first strand cDNA synthesis. To overcome this problem, we have developed an improved 3' RACE method suitable for the isolation of cDNA derived from very large transcripts. By using the oligonucleotide-containing random 9mer together with the GC-rich sequence for the suppression PCR technology at the first strand of cDNA synthesis, we have been able to amplify the cDNA from a very large transcript, such as the microtubule-actin crosslinking factor 1 (MACF1) gene, which codes a transcript of 20 kb in size. When there is no splicing variant, our highly specific amplification allows us to perform the direct sequencing of 3' RACE products without requiring cloning in bacterial hosts. Thus, this stepwise 3' RACE walking will help rapid characterization of the 3' structure of a gene, even when it encodes a very large transcript.
Construction and photophysical properties of organic-inorganic nanonetworks based on oligo(phenylenevinylene) and functionalized gold nanoparticles.

PubMed

Yang, Jien; Liu, Xiaofeng; Huang, Changshui; Zhou, Chunjie; Li, Yuliang; Zhu, Daoben

2010-02-22

Novel organic-inorganic nanonetworks of oligo(phenylenevinylene) (OPV) and gold nanoparticles (GNPs) have been synthesized by the amine-based epoxide ring-opening reaction. The resulting OPV-GNPs nanocomposites exhibit homogeneous and well-defined interfaces between the organic ligands and the inorganic nanoparticles, thereby promoting efficient electronic interfacial interaction between the two constituents. The functionalized gold nanoparticles serve as chemical reagents for the construction of nanohybrids, while the epoxide-terminated OPV acts as linkage between gold nanoparticles. The new architecture provides a facile methodology for fabrication of novel organic-inorganic nanohybrids under relatively mild conditions, which facilitates further applications of hybrid materials.
