reference-based gene model: Topics by Science.gov

Sample records for reference-based gene model

Reference gene selection for quantitative gene expression studies during biological invasions: A test on multiple genes and tissues in a model ascidian Ciona savignyi.

PubMed

Huang, Xuena; Gao, Yangchun; Jiang, Bei; Zhou, Zunchun; Zhan, Aibin

2016-01-15

As invasive species have successfully colonized a wide range of dramatically different local environments, they offer a good opportunity to study interactions between species and rapidly changing environments. Gene expression represents one of the primary and crucial mechanisms for rapid adaptation to local environments. Here, we aim to select reference genes for quantitative gene expression analysis based on quantitative Real-Time PCR (qRT-PCR) for a model invasive ascidian, Ciona savignyi. We analyzed the stability of ten candidate reference genes in three tissues (siphon, pharynx and intestine) under two key environmental stresses (temperature and salinity) in the marine realm based on three programs (geNorm, NormFinder and delta Ct method). Our results demonstrated only minor difference for stability rankings among the three methods. The use of different single reference gene might influence the data interpretation, while multiple reference genes could minimize possible errors. Therefore, reference gene combinations were recommended for different tissues - the optimal reference gene combination for siphon was RPS15 and RPL17 under temperature stress, and RPL17, UBQ and TubA under salinity treatment; for pharynx, TubB, TubA and RPL17 were the most stable genes under temperature stress, while TubB, TubA and UBQ were the best under salinity stress; for intestine, UBQ, RPS15 and RPL17 were the most reliable reference genes under both treatments. Our results suggest that the necessity of selection and test of reference genes for different tissues under varying environmental stresses. The results obtained here are expected to reveal mechanisms of gene expression-mediated invasion success using C. savignyi as a model species. Copyright © 2015 Elsevier B.V. All rights reserved.
In silico selection of expression reference genes with demonstrated stability in barley among a diverse set of tissues and cultivars

USDA-ARS?s Scientific Manuscript database

Premise of the study: Reference genes are selected based on the assumption of temporal and spatial expression stability and on their widespread use in model species. They are often used in new target species without validation, presumed as stable. For barley, reference gene validation is lacking, bu...
Selection of low-variance expressed Malus x domestica (apple) genes for use as quantitative PCR reference genes (housekeepers)

USDA-ARS?s Scientific Manuscript database

To accurately measure gene expression using PCR-based approaches, there is the need for reference genes that have low variance in expression (housekeeping genes) to normalise the data for RNA quantity and quality. For non-model species such as Malus x domestica (apples), previously, the selection of...
Validation of reference genes for quantitative gene expression analysis in experimental epilepsy.

PubMed

Sadangi, Chinmaya; Rosenow, Felix; Norwood, Braxton A

2017-12-01

To grasp the molecular mechanisms and pathophysiology underlying epilepsy development (epileptogenesis) and epilepsy itself, it is important to understand the gene expression changes that occur during these phases. Quantitative real-time polymerase chain reaction (qPCR) is a technique that rapidly and accurately determines gene expression changes. It is crucial, however, that stable reference genes are selected for each experimental condition to ensure that accurate values are obtained for genes of interest. If reference genes are unstably expressed, this can lead to inaccurate data and erroneous conclusions. To date, epilepsy studies have used mostly single, nonvalidated reference genes. This is the first study to systematically evaluate reference genes in male Sprague-Dawley rat models of epilepsy. We assessed 15 potential reference genes in hippocampal tissue obtained from 2 different models during epileptogenesis, 1 model during chronic epilepsy, and a model of noninjurious seizures. Reference gene ranking varied between models and also differed between epileptogenesis and chronic epilepsy time points. There was also some variance between the four mathematical models used to rank reference genes. Notably, we found novel reference genes to be more stably expressed than those most often used in experimental epilepsy studies. The consequence of these findings is that reference genes suitable for one epilepsy model may not be appropriate for others and that reference genes can change over time. It is, therefore, critically important to validate potential reference genes before using them as normalizing factors in expression analysis in order to ensure accurate, valid results. © 2017 Wiley Periodicals, Inc.
A PLSPM-Based Test Statistic for Detecting Gene-Gene Co-Association in Genome-Wide Association Study with Case-Control Design

PubMed Central

Zhang, Xiaoshuai; Yang, Xiaowei; Yuan, Zhongshang; Liu, Yanxun; Li, Fangyu; Peng, Bin; Zhu, Dianwen; Zhao, Jinghua; Xue, Fuzhong

2013-01-01

For genome-wide association data analysis, two genes in any pathway, two SNPs in the two linked gene regions respectively or in the two linked exons respectively within one gene are often correlated with each other. We therefore proposed the concept of gene-gene co-association, which refers to the effects not only due to the traditional interaction under nearly independent condition but the correlation between two genes. Furthermore, we constructed a novel statistic for detecting gene-gene co-association based on Partial Least Squares Path Modeling (PLSPM). Through simulation, the relationship between traditional interaction and co-association was highlighted under three different types of co-association. Both simulation and real data analysis demonstrated that the proposed PLSPM-based statistic has better performance than single SNP-based logistic model, PCA-based logistic model, and other gene-based methods. PMID:23620809
A PLSPM-based test statistic for detecting gene-gene co-association in genome-wide association study with case-control design.

PubMed

Zhang, Xiaoshuai; Yang, Xiaowei; Yuan, Zhongshang; Liu, Yanxun; Li, Fangyu; Peng, Bin; Zhu, Dianwen; Zhao, Jinghua; Xue, Fuzhong

2013-01-01

For genome-wide association data analysis, two genes in any pathway, two SNPs in the two linked gene regions respectively or in the two linked exons respectively within one gene are often correlated with each other. We therefore proposed the concept of gene-gene co-association, which refers to the effects not only due to the traditional interaction under nearly independent condition but the correlation between two genes. Furthermore, we constructed a novel statistic for detecting gene-gene co-association based on Partial Least Squares Path Modeling (PLSPM). Through simulation, the relationship between traditional interaction and co-association was highlighted under three different types of co-association. Both simulation and real data analysis demonstrated that the proposed PLSPM-based statistic has better performance than single SNP-based logistic model, PCA-based logistic model, and other gene-based methods.
Integrative structural annotation of de novo RNA-Seq provides an accurate reference gene set of the enormous genome of the onion (Allium cepa L.)

PubMed Central

Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil

2015-01-01

The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. PMID:25362073
Selection and evaluation of reference genes for expression studies with quantitative PCR in the model fungus Neurospora crassa under different environmental conditions in continuous culture.

PubMed

Cusick, Kathleen D; Fitzgerald, Lisa A; Pirlo, Russell K; Cockrell, Allison L; Petersen, Emily R; Biffinger, Justin C

2014-01-01

Neurospora crassa has served as a model organism for studying circadian pathways and more recently has gained attention in the biofuel industry due to its enhanced capacity for cellulase production. However, in order to optimize N. crassa for biotechnological applications, metabolic pathways during growth under different environmental conditions must be addressed. Reverse-transcription quantitative PCR (RT-qPCR) is a technique that provides a high-throughput platform from which to measure the expression of a large set of genes over time. The selection of a suitable reference gene is critical for gene expression studies using relative quantification, as this strategy is based on normalization of target gene expression to a reference gene whose expression is stable under the experimental conditions. This study evaluated twelve candidate reference genes for use with N. crassa when grown in continuous culture bioreactors under different light and temperature conditions. Based on combined stability values from NormFinder and Best Keeper software packages, the following are the most appropriate reference genes under conditions of: (1) light/dark cycling: btl, asl, and vma1; (2) all-dark growth: btl, tbp, vma1, and vma2; (3) temperature flux: btl, vma1, act, and asl; (4) all conditions combined: vma1, vma2, tbp, and btl. Since N. crassa exists as different cell types (uni- or multi-nucleated), expression changes in a subset of the candidate genes was further assessed using absolute quantification. A strong negative correlation was found to exist between ratio and threshold cycle (CT) values, demonstrating that CT changes serve as a reliable reflection of transcript, and not gene copy number, fluctuations. The results of this study identified genes that are appropriate for use as reference genes in RT-qPCR studies with N. crassa and demonstrated that even with the presence of different cell types, relative quantification is an acceptable method for measuring gene expression changes during growth in bioreactors.
Integrative structural annotation of de novo RNA-Seq provides an accurate reference gene set of the enormous genome of the onion (Allium cepa L.).

PubMed

Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil

2015-02-01

The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Transcriptome assembly and digital gene expression atlas of the rainbow trout

USDA-ARS?s Scientific Manuscript database

Background: Transcriptome analysis is a preferred method for gene discovery, marker development and gene expression profiling in non-model organisms. Previously, we sequenced a transcriptome reference using Sanger-based and 454-pyrosequencing, however, a transcriptome assembly is still incomplete an...
Selection and Validation of Reference Genes for qRT-PCR Expression Analysis of Candidate Genes Involved in Olfactory Communication in the Butterfly Bicyclus anynana

PubMed Central

Arun, Alok; Baumlé, Véronique; Amelot, Gaël; Nieberding, Caroline M.

2015-01-01

Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae), two developmental stages (pupal and adult) and two sexes (male and female), all of which were subjected to two food treatments (food stress and control feeding ad libitum). The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the expression profile of the target candidate genes. PMID:25793735
Selection and validation of reference genes for qRT-PCR expression analysis of candidate genes involved in olfactory communication in the butterfly Bicyclus anynana.

PubMed

Arun, Alok; Baumlé, Véronique; Amelot, Gaël; Nieberding, Caroline M

2015-01-01

Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae), two developmental stages (pupal and adult) and two sexes (male and female), all of which were subjected to two food treatments (food stress and control feeding ad libitum). The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the expression profile of the target candidate genes.
Validation of reference genes for normalization of qPCR mRNA expression levels in Staphylococcus aureus exposed to osmotic and lactic acid stress conditions encountered during food production and preservation.

PubMed

Sihto, Henna-Maria; Tasara, Taurai; Stephan, Roger; Johler, Sophia

2014-07-01

Staphylococcus aureus represents the most prevalent cause of food-borne intoxications worldwide. While being repressed by competing bacteria in most matrices, this pathogen exhibits crucial competitive advantages during growth at high salt concentrations or low pH, conditions frequently encountered in food production and preservation. We aimed to identify reference genes that could be used to normalize qPCR mRNA expression levels during growth of S. aureus in food-related osmotic (NaCl) and acidic (lactic acid) stress adaptation models. Expression stability of nine housekeeping genes was evaluated in full (LB) and nutrient-deficient (CYGP w/o glucose) medium under conditions of osmotic (4.5% NaCl) and acidic stress (lactic acid, pH 6.0) after 2-h exposure. Among the set of candidate reference genes investigated, rplD, rpoB,gyrB, and rho were most stably expressed in LB and thus represent the most suitable reference genes for normalization of qPCR data in osmotic or lactic acid stress models in a rich medium. Under nutrient-deficient conditions, expression of rho and rpoB was highly stable across all tested conditions. The presented comprehensive data on changes in expression of various S. aureus housekeeping genes under conditions of osmotic and lactic acid stress facilitate selection of reference genes for qPCR-based stress response models. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
GeneImp: Fast Imputation to Large Reference Panels Using Genotype Likelihoods from Ultralow Coverage Sequencing

PubMed Central

Spiliopoulou, Athina; Colombo, Marco; Orchard, Peter; Agakov, Felix; McKeigue, Paul

2017-01-01

We address the task of genotype imputation to a dense reference panel given genotype likelihoods computed from ultralow coverage sequencing as inputs. In this setting, the data have a high-level of missingness or uncertainty, and are thus more amenable to a probabilistic representation. Most existing imputation algorithms are not well suited for this situation, as they rely on prephasing for computational efficiency, and, without definite genotype calls, the prephasing task becomes computationally expensive. We describe GeneImp, a program for genotype imputation that does not require prephasing and is computationally tractable for whole-genome imputation. GeneImp does not explicitly model recombination, instead it capitalizes on the existence of large reference panels—comprising thousands of reference haplotypes—and assumes that the reference haplotypes can adequately represent the target haplotypes over short regions unaltered. We validate GeneImp based on data from ultralow coverage sequencing (0.5×), and compare its performance to the most recent version of BEAGLE that can perform this task. We show that GeneImp achieves imputation quality very close to that of BEAGLE, using one to two orders of magnitude less time, without an increase in memory complexity. Therefore, GeneImp is the first practical choice for whole-genome imputation to a dense reference panel when prephasing cannot be applied, for instance, in datasets produced via ultralow coverage sequencing. A related future application for GeneImp is whole-genome imputation based on the off-target reads from deep whole-exome sequencing. PMID:28348060
Evaluation of Reference Genes for RT qPCR Analyses of Structure-Specific and Hormone Regulated Gene Expression in Physcomitrella patens Gametophytes

PubMed Central

Le Bail, Aude; Scholz, Sebastian; Kost, Benedikt

2013-01-01

The use of the moss Physcomitrella patens as a model system to study plant development and physiology is rapidly expanding. The strategic position of P. patens within the green lineage between algae and vascular plants, the high efficiency with which transgenes are incorporated by homologous recombination, advantages associated with the haploid gametophyte representing the dominant phase of the P. patens life cycle, the simple structure of protonemata, leafy shoots and rhizoids that constitute the haploid gametophyte, as well as a readily accessible high-quality genome sequence make this moss a very attractive experimental system. The investigation of the genetic and hormonal control of P. patens development heavily depends on the analysis of gene expression patterns by real time quantitative PCR (RT qPCR). This technique requires well characterized sets of reference genes, which display minimal expression level variations under all analyzed conditions, for data normalization. Sets of suitable reference genes have been described for most widely used model systems including e.g. Arabidopsis thaliana, but not for P. patens. Here, we present a RT qPCR based comparison of transcript levels of 12 selected candidate reference genes in a range of gametophytic P. patens structures at different developmental stages, and in P. patens protonemata treated with hormones or hormone transport inhibitors. Analysis of these RT qPCR data using GeNorm and NormFinder software resulted in the identification of sets of P. patens reference genes suitable for gene expression analysis under all tested conditions, and suggested that the two best reference genes are sufficient for effective data normalization under each of these conditions. PMID:23951063
Reference genes for reverse transcription quantitative PCR in canine brain tissue.

PubMed

Stassen, Quirine E M; Riemers, Frank M; Reijmerink, Hannah; Leegwater, Peter A J; Penning, Louis C

2015-12-09

In the last decade canine models have been used extensively to study genetic causes of neurological disorders such as epilepsy and Alzheimer's disease and unravel their pathophysiological pathways. Reverse transcription quantitative polymerase chain reaction is a sensitive and inexpensive method to study expression levels of genes involved in disease processes. Accurate normalisation with stably expressed so-called reference genes is crucial for reliable expression analysis. Following the minimum information for publication of quantitative real-time PCR experiments precise guidelines, the expression of ten frequently used reference genes, namely YWHAZ, HMBS, B2M, SDHA, GAPDH, HPRT, RPL13A, RPS5, RPS19 and GUSB was evaluated in seven brain regions (frontal lobe, parietal lobe, occipital lobe, temporal lobe, thalamus, hippocampus and cerebellum) and whole brain of healthy dogs. The stability of expression varied between different brain areas. Using the GeNorm and Normfinder software HMBS, GAPDH and HPRT were the most reliable reference genes for whole brain. Furthermore based on GeNorm calculations it was concluded that as little as two to three reference genes are sufficient to obtain reliable normalisation, irrespective the brain area. Our results amend/extend the limited previously published data on canine brain reference genes. Despite the excellent expression stability of HMBS, GAPDH and HRPT, the evaluation of expression stability of reference genes must be a standard and integral part of experimental design and subsequent data analysis.
Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows.

PubMed

Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.
Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

PubMed Central

Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing. PMID:21829563
Aligning a New Reference Genetic Map of Lupinus angustifolius with the Genome Sequence of the Model Legume, Lotus japonicus

PubMed Central

Nelson, Matthew N.; Moolhuijzen, Paula M.; Boersma, Jeffrey G.; Chudy, Magdalena; Lesniewska, Karolina; Bellgard, Matthew; Oliver, Richard P.; Święcicki, Wojciech; Wolko, Bogdan; Cowling, Wallace A.; Ellwood, Simon R.

2010-01-01

We have developed a dense reference genetic map of Lupinus angustifolius (2n = 40) based on a set of 106 publicly available recombinant inbred lines derived from a cross between domesticated and wild parental lines. The map comprised 1090 loci in 20 linkage groups and three small clusters, drawing together data from several previous mapping publications plus almost 200 new markers, of which 63 were gene-based markers. A total of 171 mainly gene-based, sequence-tagged site loci served as bridging points for comparing the Lu. angustifolius genome with the genome sequence of the model legume, Lotus japonicus via BLASTn homology searching. Comparative analysis indicated that the genomes of Lu. angustifolius and Lo. japonicus are highly diverged structurally but with significant regions of conserved synteny including the region of the Lu. angustifolius genome containing the pod-shatter resistance gene, lentus. We discuss the potential of synteny analysis for identifying candidate genes for domestication traits in Lu. angustifolius and in improving our understanding of Fabaceae genome evolution. PMID:20133394
Reverse transcription quantitative real-time polymerase chain reaction reference genes in the spared nerve injury model of neuropathic pain: validation and literature search.

PubMed

Piller, Nicolas; Decosterd, Isabelle; Suter, Marc R

2013-07-10

The reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR) is a widely used, highly sensitive laboratory technique to rapidly and easily detect, identify and quantify gene expression. Reliable RT-qPCR data necessitates accurate normalization with validated control genes (reference genes) whose expression is constant in all studied conditions. This stability has to be demonstrated.We performed a literature search for studies using quantitative or semi-quantitative PCR in the rat spared nerve injury (SNI) model of neuropathic pain to verify whether any reference genes had previously been validated. We then analyzed the stability over time of 7 commonly used reference genes in the nervous system - specifically in the spinal cord dorsal horn and the dorsal root ganglion (DRG). These were: Actin beta (Actb), Glyceraldehyde-3-phosphate dehydrogenase (GAPDH), ribosomal proteins 18S (18S), L13a (RPL13a) and L29 (RPL29), hypoxanthine phosphoribosyltransferase 1 (HPRT1) and hydroxymethylbilane synthase (HMBS). We compared the candidate genes and established a stability ranking using the geNorm algorithm. Finally, we assessed the number of reference genes necessary for accurate normalization in this neuropathic pain model. We found GAPDH, HMBS, Actb, HPRT1 and 18S cited as reference genes in literature on studies using the SNI model. Only HPRT1 and 18S had been once previously demonstrated as stable in RT-qPCR arrays. All the genes tested in this study, using the geNorm algorithm, presented gene stability values (M-value) acceptable enough for them to qualify as potential reference genes in both DRG and spinal cord. Using the coefficient of variation, 18S failed the 50% cut-off with a value of 61% in the DRG. The two most stable genes in the dorsal horn were RPL29 and RPL13a; in the DRG they were HPRT1 and Actb. Using a 0.15 cut-off for pairwise variations we found that any pair of stable reference gene was sufficient for the normalization process. In the rat SNI model, we validated and ranked Actb, RPL29, RPL13a, HMBS, GAPDH, HPRT1 and 18S as good reference genes in the spinal cord. In the DRG, 18S did not fulfill stability criteria. The combination of any two stable reference genes was sufficient to provide an accurate normalization.

Array-based comparative genomic hybridization-guided identification of reference genes for normalization of real-time quantitative polymerase chain reaction assay data for lymphomas, histiocytic sarcomas, and osteosarcomas of dogs.

PubMed

Tsai, Pei-Chien; Breen, Matthew

2012-09-01

To identify suitable reference genes for normalization of real-time quantitative PCR (RT-qPCR) assay data for common tumors of dogs. Malignant lymph node (n = 8), appendicular osteosarcoma (9), and histiocytic sarcoma (12) samples and control samples of various nonneoplastic canine tissues. Array-based comparative genomic hybridization (aCGH) data were used to guide selection of 9 candidate reference genes. Expression stability of candidate reference genes and 4 commonly used reference genes was determined for tumor samples with RT-qPCR assays and 3 software programs. LOC611555 was the candidate reference gene with the highest expression stability among the 3 tumor types. Of the commonly used reference genes, expression stability of HPRT was high in histiocytic sarcoma samples, and expression stability of Ubi and RPL32 was high in osteosarcoma samples. Some of the candidate reference genes had higher expression stability than did the commonly used reference genes. Data for constitutively expressed genes with high expression stability are required for normalization of RT-qPCR assay results. Without such data, accurate quantification of gene expression in tumor tissue samples is difficult. Results of the present study indicated LOC611555 may be a useful RT-qPCR assay reference gene for multiple tissue types. Some commonly used reference genes may be suitable for normalization of gene expression data for tumors of dogs, such as lymphomas, osteosarcomas, or histiocytic sarcomas.
Selection and validation of reference genes for qRT-PCR analysis during biological invasions: The thermal adaptability of Bemisia tabaci MED.

PubMed

Dai, Tian-Mei; Lü, Zhi-Chuang; Liu, Wan-Xue; Wan, Fang-Hao

2017-01-01

The Bemisia tabaci Mediterranean (MED) cryptic species has been rapidly invading to most parts of the world owing to its strong ecological adaptability, which is considered as a model insect for stress tolerance studies under rapidly changing environments. Selection of a suitable reference gene for quantitative stress-responsive gene expression analysis based on qRT-PCR is critical for elaborating the molecular mechanisms of thermotolerance. To obtain accurate and reliable normalization data in MED, eight candidate reference genes (β-act, GAPDH, β-tub, EF1-α, GST, 18S, RPL13A and α-tub) were examined under various thermal stresses for varied time periods by using geNorm, NormFinder and BestKeeper algorithms, respectively. Our results revealed that β-tub and EF1-α were the best reference genes across all sample sets. On the other hand, 18S and GADPH showed the least stability for all the samples studied. β-act was proved to be highly stable only in case of short-term thermal stresses. To our knowledge this was the first comprehensive report on validation of reference genes under varying temperature stresses in MED. The study could expedite particular discovery of thermotolerance genes in MED. Further, the present results can form the basis of further research on suitable reference genes in this invasive insect and will facilitate transcript profiling in other invasive insects.
Selection and Validation of Reference Genes for Quantitative Real-Time PCR in Buckwheat (Fagopyrum esculentum) Based on Transcriptome Sequence Data

PubMed Central

Demidenko, Natalia V.; Logacheva, Maria D.; Penin, Aleksey A.

2011-01-01

Quantitative reverse transcription PCR (qRT-PCR) is one of the most precise and widely used methods of gene expression analysis. A necessary prerequisite of exact and reliable data is the accurate choice of reference genes. We studied the expression stability of potential reference genes in common buckwheat (Fagopyrum esculentum) in order to find the optimal reference for gene expression analysis in this economically important crop. Recently sequenced buckwheat floral transcriptome was used as source of sequence information. Expression stability of eight candidate reference genes was assessed in different plant structures (leaves and inflorescences at two stages of development and fruits). These genes are the orthologs of Arabidopsis genes identified as stable in a genome-wide survey gene of expression stability and a traditionally used housekeeping gene GAPDH. Three software applications – geNorm, NormFinder and BestKeeper - were used to estimate expression stability and provided congruent results. The orthologs of AT4G33380 (expressed protein of unknown function, Expressed1), AT2G28390 (SAND family protein, SAND) and AT5G46630 (clathrin adapter complex subunit family protein, CACS) are revealed as the most stable. We recommend using the combination of Expressed1, SAND and CACS for the normalization of gene expression data in studies on buckwheat using qRT-PCR. These genes are listed among five the most stably expressed in Arabidopsis that emphasizes utility of the studies on model plants as a framework for other species. PMID:21589908
Identification of Reference Genes for Quantitative Gene Expression Studies in a Non-Model Tree Pistachio (Pistacia vera L.)

PubMed Central

Moazzam Jazi, Maryam; Ghadirzadeh Khorzoghi, Effat; Botanga, Christopher; Seyedi, Seyed Mahdi

2016-01-01

The tree species, Pistacia vera (P. vera) is an important commercial product that is salt-tolerant and long-lived, with a possible lifespan of over one thousand years. Gene expression analysis is an efficient method to explore the possible regulatory mechanisms underlying these characteristics. Therefore, having the most suitable set of reference genes is required for transcript level normalization under different conditions in P. vera. In the present study, we selected eight widely used reference genes, ACT, EF1α, α-TUB, β-TUB, GAPDH, CYP2, UBQ10, and 18S rRNA. Using qRT-PCR their expression was assessed in 54 different samples of three cultivars of P. vera. The samples were collected from different organs under various abiotic treatments (cold, drought, and salt) across three time points. Several statistical programs (geNorm, NormFinder, and BestKeeper) were applied to estimate the expression stability of candidate reference genes. Results obtained from the statistical analysis were then exposed to Rank aggregation package to generate a consensus gene rank. Based on our results, EF1α was found to be the superior reference gene in all samples under all abiotic treatments. In addition to EF1α, ACT and β-TUB were the second best reference genes for gene expression analysis in leaf and root. We recommended β-TUB as the second most stable gene for samples under the cold and drought treatments, while ACT holds the same position in samples analyzed under salt treatment. This report will benefit future research on the expression profiling of P. vera and other members of the Anacardiaceae family. PMID:27308855
Identification of Reference Genes for Quantitative Gene Expression Studies in a Non-Model Tree Pistachio (Pistacia vera L.).

PubMed

Moazzam Jazi, Maryam; Ghadirzadeh Khorzoghi, Effat; Botanga, Christopher; Seyedi, Seyed Mahdi

2016-01-01

The tree species, Pistacia vera (P. vera) is an important commercial product that is salt-tolerant and long-lived, with a possible lifespan of over one thousand years. Gene expression analysis is an efficient method to explore the possible regulatory mechanisms underlying these characteristics. Therefore, having the most suitable set of reference genes is required for transcript level normalization under different conditions in P. vera. In the present study, we selected eight widely used reference genes, ACT, EF1α, α-TUB, β-TUB, GAPDH, CYP2, UBQ10, and 18S rRNA. Using qRT-PCR their expression was assessed in 54 different samples of three cultivars of P. vera. The samples were collected from different organs under various abiotic treatments (cold, drought, and salt) across three time points. Several statistical programs (geNorm, NormFinder, and BestKeeper) were applied to estimate the expression stability of candidate reference genes. Results obtained from the statistical analysis were then exposed to Rank aggregation package to generate a consensus gene rank. Based on our results, EF1α was found to be the superior reference gene in all samples under all abiotic treatments. In addition to EF1α, ACT and β-TUB were the second best reference genes for gene expression analysis in leaf and root. We recommended β-TUB as the second most stable gene for samples under the cold and drought treatments, while ACT holds the same position in samples analyzed under salt treatment. This report will benefit future research on the expression profiling of P. vera and other members of the Anacardiaceae family.
Selection of appropriate reference genes for RT-qPCR analysis in a streptozotocin-induced Alzheimer's disease model of cynomolgus monkeys (Macaca fascicularis).

PubMed

Park, Sang-Je; Kim, Young-Hyun; Lee, Youngjeon; Kim, Kyoung-Min; Kim, Heui-Soo; Lee, Sang-Rae; Kim, Sun-Uk; Kim, Sang-Hyun; Kim, Ji-Su; Jeong, Kang-Jin; Lee, Kyoung-Min; Huh, Jae-Won; Chang, Kyu-Tae

2013-01-01

Reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR) has been widely used to quantify relative gene expression because of the specificity, sensitivity, and accuracy of this technique. In order to obtain reliable gene expression data from RT-qPCR experiments, it is important to utilize optimal reference genes for the normalization of target gene expression under varied experimental conditions. Previously, we developed and validated a novel icv-STZ cynomolgus monkey model for Alzheimer's disease (AD) research. However, in order to enhance the reliability of this disease model, appropriate reference genes must be selected to allow meaningful analysis of the gene expression levels in the icv-STZ cynomolgus monkey brain. In this study, we assessed the expression stability of 9 candidate reference genes in 2 matched-pair brain samples (5 regions) of control cynomolgus monkeys and those who had received intracerebroventricular injection of streptozotocin (icv-STZ). Three well-known analytical programs geNorm, NormFinder, and BestKeeper were used to choose the suitable reference genes from the total sample group, control group, and icv-STZ group. Combination analysis of the 3 different programs clearly indicated that the ideal reference genes are RPS19 and YWHAZ in the total sample group, GAPDH and RPS19 in the control group, and ACTB and GAPDH in the icv-STZ group. Additionally, we validated the normalization accuracy of the most appropriate reference genes (RPS19 and YWHAZ) by comparison with the least stable gene (TBP) using quantification of the APP and MAPT genes in the total sample group. To the best of our knowledge, this research is the first study to identify and validate the appropriate reference genes in cynomolgus monkey brains. These findings provide useful information for future studies involving the expression of target genes in the cynomolgus monkey.
With Reference to Reference Genes: A Systematic Review of Endogenous Controls in Gene Expression Studies.

PubMed

Chapman, Joanne R; Waldenström, Jonas

2015-01-01

The choice of reference genes that are stably expressed amongst treatment groups is a crucial step in real-time quantitative PCR gene expression studies. Recent guidelines have specified that a minimum of two validated reference genes should be used for normalisation. However, a quantitative review of the literature showed that the average number of reference genes used across all studies was 1.2. Thus, the vast majority of studies continue to use a single gene, with β-actin (ACTB) and/or glyceraldehyde 3-phosphate dehydrogenase (GAPDH) being commonly selected in studies of vertebrate gene expression. Few studies (15%) tested a panel of potential reference genes for stability of expression before using them to normalise data. Amongst studies specifically testing reference gene stability, few found ACTB or GAPDH to be optimal, whereby these genes were significantly less likely to be chosen when larger panels of potential reference genes were screened. Fewer reference genes were tested for stability in non-model organisms, presumably owing to a dearth of available primers in less well characterised species. Furthermore, the experimental conditions under which real-time quantitative PCR analyses were conducted had a large influence on the choice of reference genes, whereby different studies of rat brain tissue showed different reference genes to be the most stable. These results highlight the importance of validating the choice of normalising reference genes before conducting gene expression studies.
Selection and evaluation of novel reference genes for quantitative reverse transcription PCR (qRT-PCR) based on genome and transcriptome data in Brassica napus L.

PubMed

Yang, Hongli; Liu, Jing; Huang, Shunmou; Guo, Tingting; Deng, Linbin; Hua, Wei

2014-03-15

Selection of reference genes in Brassica napus, a tetraploid (4×) species, is a very difficult task without information on genome and transcriptome. By now, only several traditional reference genes which show significant expression differentiation under different conditions are used in B. napus. In the present study, based on genome and transcriptome data of the rapeseed Zhongshuang-11 cultivar, 14 candidate reference genes were screened for investigation in different tissues, cultivars, and treated conditions of B. napus. These genes were as follows: ELF5, ENTH, F-BOX7, F-BOX2, FYPP1, GDI1, GYF, MCP2d, OTP80, PPR, SPOC, Unknown1, Unknown2 and UBA. Among them, excluding GYF and FYPP1, another 12 genes, were identified to perform better than traditional reference genes ACTIN7 and GAPDH. To further validate the accuracy of the newly developed reference genes in normalization, expression levels of BnCAT1 (B. napus catalase 1) in different rapeseed tissues and seedlings under stress conditions were normalized by the three most stable reference genes PPR, GDI1, and ENTH and little difference existed in normalization results. To the best of our knowledge, this is the first time B. napus reference genes have been provided with the help of complete genome and transcriptome information. The new reference genes provided in this study are more accurate than previously reported reference genes in quantifying expression levels of B. napus genes. Crown Copyright © 2014. Published by Elsevier B.V. All rights reserved.
Validation of predictive models for germline mutations in DNA mismatch repair genes in colorectal cancer.

PubMed

Monzon, Jose G; Cremin, Carol; Armstrong, Linlea; Nuk, Jennifer; Young, Sean; Horsman, Doug E; Garbutt, Kristy; Bajdik, Chris D; Gill, Sharlene

2010-02-15

Lynch syndrome is defined by the presence of germline mutations in mismatch repair (MMR) genes. Several models have been recently devised that predict mutation carrier status (Myriad Genetics, Wijnen, Barnetson, PREMM and MMRpro models). Families at moderate-high risk for harboring a Lynch-associated mutation, referred to the BC Cancer Agency (BCCA) Hereditary Cancer Program (HCP), underwent mutation analysis, immunohistochemistry and/or microsatellite testing. Seventy-two tested cases were included. Twenty-five patients were mutation positive (34.7%) and 47 were mutation negative (65.3%). Nineteen of 43 patients who were both microsatellite stable and normal on immunohistochemistry for MLH1 and MSH2 were also genotyped for mutations in these genes; all 19 were negative for MMR gene mutations. Model-derived probabilities of harboring a MMR gene mutation in the proband were calculated and compared to observed results. The area under the ROC curves were 0.75 (95%CI; 0.63-0.87), 0.86 (0.7-0.96), 0.89 (0.82-0.97), 0.89 (0.81-0.98) and 0.93 (0.86-0.99) for the Myriad, Barnetson, Wijnen, MMRpro and PREMM models, respectively. The Amsterdam II criteria had a sensitivity and specificity of 0.76 and 0.74, respectively, in this cohort. The PREMM model demonstrated the best performance for predicting carrier status based on the positive likelihood ratios at the >10%, >20% and >30% probability thresholds. In this referred cohort, the PREMM model had the most favorable concordance index and predictive performance for carrier status based on the positive LR. These prediction models (PREMM, MMRPro and Wijnen) may soon replace the Amsterdam II and revised Bethesda criteria as a prescreening tool for Lynch mutations.
A powerful score-based test statistic for detecting gene-gene co-association.

PubMed

Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

2016-01-29

The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.
Statistical tools for transgene copy number estimation based on real-time PCR.

PubMed

Yuan, Joshua S; Burris, Jason; Stewart, Nathan R; Mentewab, Ayalew; Stewart, C Neal

2007-11-01

As compared with traditional transgene copy number detection technologies such as Southern blot analysis, real-time PCR provides a fast, inexpensive and high-throughput alternative. However, the real-time PCR based transgene copy number estimation tends to be ambiguous and subjective stemming from the lack of proper statistical analysis and data quality control to render a reliable estimation of copy number with a prediction value. Despite the recent progresses in statistical analysis of real-time PCR, few publications have integrated these advancements in real-time PCR based transgene copy number determination. Three experimental designs and four data quality control integrated statistical models are presented. For the first method, external calibration curves are established for the transgene based on serially-diluted templates. The Ct number from a control transgenic event and putative transgenic event are compared to derive the transgene copy number or zygosity estimation. Simple linear regression and two group T-test procedures were combined to model the data from this design. For the second experimental design, standard curves were generated for both an internal reference gene and the transgene, and the copy number of transgene was compared with that of internal reference gene. Multiple regression models and ANOVA models can be employed to analyze the data and perform quality control for this approach. In the third experimental design, transgene copy number is compared with reference gene without a standard curve, but rather, is based directly on fluorescence data. Two different multiple regression models were proposed to analyze the data based on two different approaches of amplification efficiency integration. Our results highlight the importance of proper statistical treatment and quality control integration in real-time PCR-based transgene copy number determination. These statistical methods allow the real-time PCR-based transgene copy number estimation to be more reliable and precise with a proper statistical estimation. Proper confidence intervals are necessary for unambiguous prediction of trangene copy number. The four different statistical methods are compared for their advantages and disadvantages. Moreover, the statistical methods can also be applied for other real-time PCR-based quantification assays including transfection efficiency analysis and pathogen quantification.
Evaluation of normalization reference genes for RT-qPCR analysis of spo0A and four sporulation sigma factor genes in Clostridium botulinum Group I strain ATCC 3502.

PubMed

Kirk, David G; Palonen, Eveliina; Korkeala, Hannu; Lindström, Miia

2014-04-01

Heat-resistant spores of Clostridium botulinum can withstand the pasteurization processes in modern food processing. This poses a risk to food safety as spores may germinate into botulinum neurotoxin-producing vegetative cells. Sporulation in Bacillus subtilis, the model organism for sporulation, is regulated by the transcription factor Spo0A and four alternative sigma factors, SigF, SigE, SigG, and SigK. While the corresponding regulators are found in available genomes of C. botulinum, little is known about their expression. To accurately measure the expression of these genes using quantitative reverse-transcriptase PCR (RT-qPCR) during the exponential and stationary growth phases, a suitable normalization reference gene is required. 16S rrn, adK, alaS, era, gluD, gyrA, rpoC, and rpsJ were selected as the candidate reference genes. The most stable candidate reference gene was 16S ribosomal RNA gene (rrn), based on its low coefficient of variation (1.81%) measured during the 18-h study time. Using 16S rrn as the normalization reference gene, the relative expression levels of spo0A, sigF, sigE, sigG, and sigK were measured over 18h. The pattern of expression showed spo0A expression during the logarithmic growth phase, followed by a drop in expression upon entry to the stationary phase. Expression levels of sigF, sigE, and sigG peaked simultaneously at the end of the exponential growth phase. Peak expression of sigK occurred at 18h, however low levels of expression were detected during the exponential phase. These findings suggest these sigma factors play a role in C. botulinum sporulation that is similar, but not equal, to their role in the B. subtilis model. Copyright © 2013 Elsevier Ltd. All rights reserved.
Mining gene link information for survival pathway hunting.

PubMed

Jing, Gao-Jian; Zhang, Zirui; Wang, Hong-Qiang; Zheng, Hong-Mei

2015-08-01

This study proposes a gene link-based method for survival time-related pathway hunting. In this method, the authors incorporate gene link information to estimate how a pathway is associated with cancer patient's survival time. Specifically, a gene link-based Cox proportional hazard model (Link-Cox) is established, in which two linked genes are considered together to represent a link variable and the association of the link with survival time is assessed using Cox proportional hazard model. On the basis of the Link-Cox model, the authors formulate a new statistic for measuring the association of a pathway with survival time of cancer patients, referred to as pathway survival score (PSS), by summarising survival significance over all the gene links in the pathway, and devise a permutation test to test the significance of an observed PSS. To evaluate the proposed method, the authors applied it to simulation data and two publicly available real-world gene expression data sets. Extensive comparisons with previous methods show the effectiveness and efficiency of the proposed method for survival pathway hunting.
Mathematical Modeling of RNA-Based Architectures for Closed Loop Control of Gene Expression.

PubMed

Agrawal, Deepak K; Tang, Xun; Westbrook, Alexandra; Marshall, Ryan; Maxwell, Colin S; Lucks, Julius; Noireaux, Vincent; Beisel, Chase L; Dunlop, Mary J; Franco, Elisa

2018-05-08

Feedback allows biological systems to control gene expression precisely and reliably, even in the presence of uncertainty, by sensing and processing environmental changes. Taking inspiration from natural architectures, synthetic biologists have engineered feedback loops to tune the dynamics and improve the robustness and predictability of gene expression. However, experimental implementations of biomolecular control systems are still far from satisfying performance specifications typically achieved by electrical or mechanical control systems. To address this gap, we present mathematical models of biomolecular controllers that enable reference tracking, disturbance rejection, and tuning of the temporal response of gene expression. These controllers employ RNA transcriptional regulators to achieve closed loop control where feedback is introduced via molecular sequestration. Sensitivity analysis of the models allows us to identify which parameters influence the transient and steady state response of a target gene expression process, as well as which biologically plausible parameter values enable perfect reference tracking. We quantify performance using typical control theory metrics to characterize response properties and provide clear selection guidelines for practical applications. Our results indicate that RNA regulators are well-suited for building robust and precise feedback controllers for gene expression. Additionally, our approach illustrates several quantitative methods useful for assessing the performance of biomolecular feedback control systems.
A comprehensive approach to identify reliable reference gene candidates to investigate the link between alcoholism and endocrinology in Sprague-Dawley rats.

PubMed

Taki, Faten A; Abdel-Rahman, Abdel A; Zhang, Baohong

2014-01-01

Gender and hormonal differences are often correlated with alcohol dependence and related complications like addiction and breast cancer. Estrogen (E2) is an important sex hormone because it serves as a key protein involved in organism level signaling pathways. Alcoholism has been reported to affect estrogen receptor signaling; however, identifying the players involved in such multi-faceted syndrome is complex and requires an interdisciplinary approach. In many situations, preliminary investigations included a straight forward, yet informative biotechniques such as gene expression analyses using quantitative real time PCR (qRT-PCR). The validity of qRT-PCR-based conclusions is affected by the choice of reliable internal controls. With this in mind, we compiled a list of 15 commonly used housekeeping genes (HKGs) as potential reference gene candidates in rat biological models. A comprehensive comparison among 5 statistical approaches (geNorm, dCt method, NormFinder, BestKeeper, and RefFinder) was performed to identify the minimal number as well the most stable reference genes required for reliable normalization in experimental rat groups that comprised sham operated (SO), ovariectomized rats in the absence (OVX) or presence of E2 (OVXE2). These rat groups were subdivided into subgroups that received alcohol in liquid diet or isocalroic control liquid diet for 12 weeks. Our results showed that U87, 5S rRNA, GAPDH, and U5a were the most reliable gene candidates for reference genes in heart and brain tissue. However, different gene stability ranking was specific for each tissue input combination. The present preliminary findings highlight the variability in reference gene rankings across different experimental conditions and analytic methods and constitute a fundamental step for gene expression assays.
Selection and validation of reference genes for miRNA expression studies during porcine pregnancy.

PubMed

Wessels, Jocelyn M; Edwards, Andrew K; Zettler, Candace; Tayade, Chandrakant

2011-01-01

MicroRNAs comprise a family of small non-coding RNAs that modulate several developmental and physiological processes including pregnancy. Their ubiquitous presence is confirmed in mammals, worms, flies and plants. Although rapid advances have been made in microRNA research, information on stable reference genes for validation of microRNA expression is still lacking. Real time PCR is a widely used tool to quantify gene transcripts. An appropriate reference gene must be chosen to minimize experimental error in this system. A small difference in miRNA levels between experimental samples can be biologically meaningful as these entities can affect multiple targets in a pathway. This study examined the suitability of six commercially available reference genes (RNU1A, RNU5A, RNU6B, SNORD25, SCARNA17, and SNORA73A) in maternal-fetal tissues from healthy and spontaneously arresting/dying conceptuses from sows were separately analyzed at gestation day 20. Comparisons were also made with non-pregnant endometrial tissues from sows. Spontaneous fetal loss is a prime concern to the commercial pork industry. Our laboratory has previously identified deficits in vasculature development at maternal-fetal interface as one of the major participating causes of fetal loss. Using this well-established model, we have extended our studies to identify suitable microRNA reference genes. A methodical approach to assessing suitability was adopted using standard curve and melting curve analysis, PCR product sequencing, real time PCR expression in a panel of gestational tissues, and geNorm and NormFinder analysis. Our quantitative real time PCR analysis confirmed expression of all 6 reference genes in maternal and fetal tissues. All genes were uniformly expressed in tissues from healthy and spontaneously arresting conceptus attachment sites. Comparisons between tissue types (maternal/fetal/non-pregnant) revealed significant differences for RNU5A, RNU6B, SCARNA17, and SNORA73A expression. Based on our methodical assessment of all 6 reference genes, results suggest that RNU1A is the most stable reference gene for porcine pregnancy studies.
[Stability analysis of reference gene based on real-time PCR in Artemisia annua under cadmium treatment].

PubMed

Zhou, Liang-Yun; Mo, Ge; Wang, Sheng; Tang, Jin-Fu; Yue, Hong; Huang, Lu-Qi; Shao, Ai-Juan; Guo, Lan-Ping

2014-03-01

In this study, Actin, 18S rRNA, PAL, GAPDH and CPR of Artemisia annua were selected as candidate reference genes, and their gene-specific primers for real-time PCR were designed, then geNorm, NormFinder, BestKeeper, Delta CT and RefFinder were used to evaluate their expression stability in the leaves of A. annua under treatment of different concentrations of Cd, with the purpose of finding a reliable reference gene to ensure the reliability of gene-expression analysis. The results showed that there were some significant differences among the candidate reference genes under different treatments and the order of expression stability of candidate reference gene was Actin > 18S rRNA > PAL > GAPDH > CPR. These results suggested that Actin, 18S rRNA and PAL could be used as ideal reference genes of gene expression analysis in A. annua and multiple internal control genes were adopted for results calibration. In addition, differences in expression stability of candidate reference genes in the leaves of A. annua under the same concentrations of Cd were observed, which suggested that the screening of candidate reference genes was needed even under the same treatment. To our best knowledge, this study for the first time provided the ideal reference genes under Cd treatment in the leaves of A. annua and offered reference for the gene expression analysis of A. annua under other conditions.
Identification of reference genes for quantitative expression analysis using large-scale RNA-seq data of Arabidopsis thaliana and model crop plants.

PubMed

Kudo, Toru; Sasaki, Yohei; Terashima, Shin; Matsuda-Imai, Noriko; Takano, Tomoyuki; Saito, Misa; Kanno, Maasa; Ozaki, Soichi; Suwabe, Keita; Suzuki, Go; Watanabe, Masao; Matsuoka, Makoto; Takayama, Seiji; Yano, Kentaro

2016-10-13

In quantitative gene expression analysis, normalization using a reference gene as an internal control is frequently performed for appropriate interpretation of the results. Efforts have been devoted to exploring superior novel reference genes using microarray transcriptomic data and to evaluating commonly used reference genes by targeting analysis. However, because the number of specifically detectable genes is totally dependent on probe design in the microarray analysis, exploration using microarray data may miss some of the best choices for the reference genes. Recently emerging RNA sequencing (RNA-seq) provides an ideal resource for comprehensive exploration of reference genes since this method is capable of detecting all expressed genes, in principle including even unknown genes. We report the results of a comprehensive exploration of reference genes using public RNA-seq data from plants such as Arabidopsis thaliana (Arabidopsis), Glycine max (soybean), Solanum lycopersicum (tomato) and Oryza sativa (rice). To select reference genes suitable for the broadest experimental conditions possible, candidates were surveyed by the following four steps: (1) evaluation of the basal expression level of each gene in each experiment; (2) evaluation of the expression stability of each gene in each experiment; (3) evaluation of the expression stability of each gene across the experiments; and (4) selection of top-ranked genes, after ranking according to the number of experiments in which the gene was expressed stably. Employing this procedure, 13, 10, 12 and 21 top candidates for reference genes were proposed in Arabidopsis, soybean, tomato and rice, respectively. Microarray expression data confirmed that the expression of the proposed reference genes under broad experimental conditions was more stable than that of commonly used reference genes. These novel reference genes will be useful for analyzing gene expression profiles across experiments carried out under various experimental conditions.
Identification of reference genes for RT-qPCR analysis in peach genotypes with contrasting chilling requirements.

PubMed

Marini, N; Bevilacqua, C B; Büttow, M V; Raseira, M C B; Bonow, S

2017-05-25

Selecting and validating reference genes are the first steps in studying gene expression by reverse transcriptase-quantitative polymerase chain reaction (RT-qPCR). The present study aimed to evaluate the stability of five reference genes for the purpose of normalization when studying gene expression in various cultivars of Prunus persica with different chilling requirements. Flower bud tissues of nine peach genotypes from Embrapa's peach breeding program with different chilling requirements were used, and five candidate reference genes based on the RT-qPCR that were useful for studying the relative quantitative gene expression and stability were evaluated using geNorm, NormFinder, and bestKeeper software packages. The results indicated that among the genes tested, the most stable genes to be used as reference genes are Act and UBQ10. This study is the first survey of the stability of reference genes in peaches under chilling stress and provides guidelines for more accurate RT-qPCR results.
Endogenous Reference Genes and Their Quantitative Real-Time PCR Assays for Genetically Modified Bread Wheat (Triticum aestivum L.) Detection.

PubMed

Yang, Litao; Quan, Sheng; Zhang, Dabing

2017-01-01

Endogenous reference genes (ERG) and their derivate analytical methods are standard requirements for analysis of genetically modified organisms (GMOs). Development and validation of suitable ERGs is the primary step for establishing assays that monitoring the genetically modified (GM) contents in food/feed samples. Herein, we give a review of the ERGs currently used for GM wheat analysis, such as ACC1, PKABA1, ALMT1, and Waxy-D1, as well as their performances in GM wheat analysis. Also, we discussed one model for developing and validating one ideal RG for one plant species based on our previous research work.

Identification and comprehensive evaluation of reference genes for RT-qPCR analysis of host gene-expression in Brassica juncea-aphid interaction using microarray data.

PubMed

Ram, Chet; Koramutla, Murali Krishna; Bhattacharya, Ramcharan

2017-07-01

Brassica juncea is a chief oil yielding crop in many parts of the world including India. With advancement of molecular techniques, RT-qPCR based study of gene-expression has become an integral part of experimentations in crop breeding. In RT-qPCR, use of appropriate reference gene(s) is pivotal. The virtue of the reference genes, being constant in expression throughout the experimental treatments, needs to be validated case by case. Appropriate reference gene(s) for normalization of gene-expression data in B. juncea during the biotic stress of aphid infestation is not known. In the present investigation, 11 reference genes identified from microarray database of Arabidopsis-aphid interaction at a cut off FDR ≤0.1, along with two known reference genes of B. juncea, were analyzed for their expression stability upon aphid infestation. These included 6 frequently used and 5 newly identified reference genes. Ranking orders of the reference genes in terms of expression stability were calculated using advanced statistical approaches such as geNorm, NormFinder, delta Ct and BestKeeper. The analysis suggested CAC, TUA and DUF179 as the most suitable reference genes. Further, normalization of the gene-expression data of STP4 and PR1 by the most and the least stable reference gene, respectively has demonstrated importance and applicability of the recommended reference genes in aphid infested samples of B. juncea. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
A stochastic model for optimizing composite predictors based on gene expression profiles.

PubMed

Ramanathan, Murali

2003-07-01

This project was done to develop a mathematical model for optimizing composite predictors based on gene expression profiles from DNA arrays and proteomics. The problem was amenable to a formulation and solution analogous to the portfolio optimization problem in mathematical finance: it requires the optimization of a quadratic function subject to linear constraints. The performance of the approach was compared to that of neighborhood analysis using a data set containing cDNA array-derived gene expression profiles from 14 multiple sclerosis patients receiving intramuscular inteferon-beta1a. The Markowitz portfolio model predicts that the covariance between genes can be exploited to construct an efficient composite. The model predicts that a composite is not needed for maximizing the mean value of a treatment effect: only a single gene is needed, but the usefulness of the effect measure may be compromised by high variability. The model optimized the composite to yield the highest mean for a given level of variability or the least variability for a given mean level. The choices that meet this optimization criteria lie on a curve of composite mean vs. composite variability plot referred to as the "efficient frontier." When a composite is constructed using the model, it outperforms the composite constructed using the neighborhood analysis method. The Markowitz portfolio model may find potential applications in constructing composite biomarkers and in the pharmacogenomic modeling of treatment effects derived from gene expression endpoints.
Combining evidence, biomedical literature and statistical dependence: new insights for functional annotation of gene sets

PubMed Central

Aubry, Marc; Monnier, Annabelle; Chicault, Celine; de Tayrac, Marie; Galibert, Marie-Dominique; Burgun, Anita; Mosser, Jean

2006-01-01

Background Large-scale genomic studies based on transcriptome technologies provide clusters of genes that need to be functionally annotated. The Gene Ontology (GO) implements a controlled vocabulary organised into three hierarchies: cellular components, molecular functions and biological processes. This terminology allows a coherent and consistent description of the knowledge about gene functions. The GO terms related to genes come primarily from semi-automatic annotations made by trained biologists (annotation based on evidence) or text-mining of the published scientific literature (literature profiling). Results We report an original functional annotation method based on a combination of evidence and literature that overcomes the weaknesses and the limitations of each approach. It relies on the Gene Ontology Annotation database (GOA Human) and the PubGene biomedical literature index. We support these annotations with statistically associated GO terms and retrieve associative relations across the three GO hierarchies to emphasise the major pathways involved by a gene cluster. Both annotation methods and associative relations were quantitatively evaluated with a reference set of 7397 genes and a multi-cluster study of 14 clusters. We also validated the biological appropriateness of our hybrid method with the annotation of a single gene (cdc2) and that of a down-regulated cluster of 37 genes identified by a transcriptome study of an in vitro enterocyte differentiation model (CaCo-2 cells). Conclusion The combination of both approaches is more informative than either separate approach: literature mining can enrich an annotation based only on evidence. Text-mining of the literature can also find valuable associated MEDLINE references that confirm the relevance of the annotation. Eventually, GO terms networks can be built with associative relations in order to highlight cooperative and competitive pathways and their connected molecular functions. PMID:16674810
Selection of suitable reference genes for normalization of genes of interest in canine soft tissue sarcomas using quantitative real-time polymerase chain reaction.

PubMed

Zornhagen, K W; Kristensen, A T; Hansen, A E; Oxboel, J; Kjaer, A

2015-12-01

Quantitative real-time reverse transcription polymerase chain reaction (RT-qPCR) is a sensitive technique for quantifying gene expression. Stably expressed reference genes are necessary for normalization of RT-qPCR data. Only a few articles have been published on reference genes in canine tumours. The objective of this study was to demonstrate how to identify suitable reference genes for normalization of genes of interest in canine soft tissue sarcomas using RT-qPCR. Primer pairs for 17 potential reference genes were designed and tested in archival tumour biopsies from six dogs. The geNorm algorithm was used to analyse the most suitable reference genes. Eight potential reference genes were excluded from this final analysis because of their dissociation curves. β-Glucuronidase (GUSB) and proteasome subunit, beta type, 6 (PSMB6) were most stably expressed with an M value of 0.154 and a CV of 0.053 describing their average stability. We suggest that choice of reference genes should be based on specific testing in every new experimental set-up. © 2014 John Wiley & Sons Ltd.
Coordinates and intervals in graph-based reference genomes.

PubMed

Rand, Knut D; Grytten, Ivar; Nederbragt, Alexander J; Storvik, Geir O; Glad, Ingrid K; Sandve, Geir K

2017-05-18

It has been proposed that future reference genomes should be graph structures in order to better represent the sequence diversity present in a species. However, there is currently no standard method to represent genomic intervals, such as the positions of genes or transcription factor binding sites, on graph-based reference genomes. We formalize offset-based coordinate systems on graph-based reference genomes and introduce methods for representing intervals on these reference structures. We show the advantage of our methods by representing genes on a graph-based representation of the newest assembly of the human genome (GRCh38) and its alternative loci for regions that are highly variable. More complex reference genomes, containing alternative loci, require methods to represent genomic data on these structures. Our proposed notation for genomic intervals makes it possible to fully utilize the alternative loci of the GRCh38 assembly and potential future graph-based reference genomes. We have made a Python package for representing such intervals on offset-based coordinate systems, available at https://github.com/uio-cels/offsetbasedgraph . An interactive web-tool using this Python package to visualize genes on a graph created from GRCh38 is available at https://github.com/uio-cels/genomicgraphcoords .
Tissue-specific selection of stable reference genes for real-time PCR normalization in an obese rat model.

PubMed

Cabiati, Manuela; Raucci, Serena; Caselli, Chiara; Guzzardi, Maria Angela; D'Amico, Andrea; Prescimone, Tommaso; Giannessi, Daniela; Del Ry, Silvia

2012-06-01

Obesity is a complex pathology with interacting and confounding causes due to the environment, hormonal signaling patterns, and genetic predisposition. At present, the Zucker rat is an eligible genetic model for research on obesity and metabolic syndrome, allowing scrutiny of gene expression profiles. Real-time PCR is the benchmark method for measuring mRNA expressions, but the accuracy and reproducibility of its data greatly depend on appropriate normalization strategies. In the Zucker rat model, no specific reference genes have been identified in myocardium, kidney, and lung, the main organs involved in this syndrome. The aim of this study was to select among ten candidates (Actb, Gapdh, Polr2a, Ywhag, Rpl13a, Sdha, Ppia, Tbp, Hprt1 and Tfrc) a set of reference genes that can be used for the normalization of mRNA expression data obtained by real-time PCR in obese and lean Zucker rats both at fasting and during acute hyperglycemia. The most stable genes in the heart were Sdha, Tbp, and Hprt1; in kidney, Tbp, Actb, and Gapdh were chosen, while Actb, Ywhag, and Sdha were selected as the most stably expressed set for pulmonary tissue. The normalization strategy was used to analyze mRNA expression of tumor necrosis factor α, the main inflammatory mediator in obesity, whose variations were more significant when normalized with the appropriately selected reference genes. The findings obtained in this study underline the importance of having three stably expressed reference gene sets for use in the cardiac, renal, and pulmonary tissues of an experimental model of obese and hyperglycemic Zucker rats.
Plant Reactome: a resource for plant pathways and comparative analysis

PubMed Central

Naithani, Sushma; Preece, Justin; D'Eustachio, Peter; Gupta, Parul; Amarasinghe, Vindhya; Dharmawardhana, Palitha D.; Wu, Guanming; Fabregat, Antonio; Elser, Justin L.; Weiser, Joel; Keays, Maria; Fuentes, Alfonso Munoz-Pomer; Petryszak, Robert; Stein, Lincoln D.; Ware, Doreen; Jaiswal, Pankaj

2017-01-01

Plant Reactome (http://plantreactome.gramene.org/) is a free, open-source, curated plant pathway database portal, provided as part of the Gramene project. The database provides intuitive bioinformatics tools for the visualization, analysis and interpretation of pathway knowledge to support genome annotation, genome analysis, modeling, systems biology, basic research and education. Plant Reactome employs the structural framework of a plant cell to show metabolic, transport, genetic, developmental and signaling pathways. We manually curate molecular details of pathways in these domains for reference species Oryza sativa (rice) supported by published literature and annotation of well-characterized genes. Two hundred twenty-two rice pathways, 1025 reactions associated with 1173 proteins, 907 small molecules and 256 literature references have been curated to date. These reference annotations were used to project pathways for 62 model, crop and evolutionarily significant plant species based on gene homology. Database users can search and browse various components of the database, visualize curated baseline expression of pathway-associated genes provided by the Expression Atlas and upload and analyze their Omics datasets. The database also offers data access via Application Programming Interfaces (APIs) and in various standardized pathway formats, such as SBML and BioPAX. PMID:27799469
Evaluation and selection of reliable reference genes for gene expression under abiotic stress in cotton (Gossypium hirsutum L.).

PubMed

Wang, Min; Wang, Qinglian; Zhang, Baohong

2013-11-01

Reference genes are critical for normalization of the gene expression level of target genes. The widely used housekeeping genes may change their expression levels at different tissue under different treatment or stress conditions. Therefore, systematical evaluation on the housekeeping genes is required for gene expression analysis. Up to date, no work was performed to evaluate the housekeeping genes in cotton under stress treatment. In this study, we chose 10 housekeeping genes to systematically assess their expression levels at two different tissues (leaves and roots) under two different abiotic stresses (salt and drought) with three different concentrations. Our results show that there is no best reference gene for all tissues at all stress conditions. The reliable reference gene should be selected based on a specific condition. For example, under salt stress, UBQ7, GAPDH and EF1A8 are better reference genes in leaves; TUA10, UBQ7, CYP1, GAPDH and EF1A8 were better in roots. Under drought stress, UBQ7, EF1A8, TUA10, and GAPDH showed less variety of expression level in leaves and roots. Thus, it is better to identify reliable reference genes first before performing any gene expression analysis. However, using a combination of housekeeping genes as reference gene may provide a new strategy for normalization of gene expression. In this study, we found that combination of four housekeeping genes worked well as reference genes under all the stress conditions. © 2013.
Exploring Valid Reference Genes for Quantitative Real-time PCR Analysis in Plutella xylostella (Lepidoptera: Plutellidae)

PubMed Central

Fu, Wei; Xie, Wen; Zhang, Zhuo; Wang, Shaoli; Wu, Qingjun; Liu, Yong; Zhou, Xiaomao; Zhou, Xuguo; Zhang, Youjun

2013-01-01

Abstract: Quantitative real-time PCR (qRT-PCR), a primary tool in gene expression analysis, requires an appropriate normalization strategy to control for variation among samples. The best option is to compare the mRNA level of a target gene with that of reference gene(s) whose expression level is stable across various experimental conditions. In this study, expression profiles of eight candidate reference genes from the diamondback moth, Plutella xylostella, were evaluated under diverse experimental conditions. RefFinder, a web-based analysis tool, integrates four major computational programs including geNorm, Normfinder, BestKeeper, and the comparative ΔCt method to comprehensively rank the tested candidate genes. Elongation factor 1 (EF1) was the most suited reference gene for the biotic factors (development stage, tissue, and strain). In contrast, although appropriate reference gene(s) do exist for several abiotic factors (temperature, photoperiod, insecticide, and mechanical injury), we were not able to identify a single universal reference gene. Nevertheless, a suite of candidate reference genes were specifically recommended for selected experimental conditions. Our finding is the first step toward establishing a standardized qRT-PCR analysis of this agriculturally important insect pest. PMID:23983612
Analysis of gene network robustness based on saturated fixed point attractors

PubMed Central

2014-01-01

The analysis of gene network robustness to noise and mutation is important for fundamental and practical reasons. Robustness refers to the stability of the equilibrium expression state of a gene network to variations of the initial expression state and network topology. Numerical simulation of these variations is commonly used for the assessment of robustness. Since there exists a great number of possible gene network topologies and initial states, even millions of simulations may be still too small to give reliable results. When the initial and equilibrium expression states are restricted to being saturated (i.e., their elements can only take values 1 or −1 corresponding to maximum activation and maximum repression of genes), an analytical gene network robustness assessment is possible. We present this analytical treatment based on determination of the saturated fixed point attractors for sigmoidal function models. The analysis can determine (a) for a given network, which and how many saturated equilibrium states exist and which and how many saturated initial states converge to each of these saturated equilibrium states and (b) for a given saturated equilibrium state or a given pair of saturated equilibrium and initial states, which and how many gene networks, referred to as viable, share this saturated equilibrium state or the pair of saturated equilibrium and initial states. We also show that the viable networks sharing a given saturated equilibrium state must follow certain patterns. These capabilities of the analytical treatment make it possible to properly define and accurately determine robustness to noise and mutation for gene networks. Previous network research conclusions drawn from performing millions of simulations follow directly from the results of our analytical treatment. Furthermore, the analytical results provide criteria for the identification of model validity and suggest modified models of gene network dynamics. The yeast cell-cycle network is used as an illustration of the practical application of this analytical treatment. PMID:24650364
Identification and validation of reference genes for quantitative real-time PCR normalization and its applications in lycium.

PubMed

Zeng, Shaohua; Liu, Yongliang; Wu, Min; Liu, Xiaomin; Shen, Xiaofei; Liu, Chunzhao; Wang, Ying

2014-01-01

Lycium barbarum and L. ruthenicum are extensively used as traditional Chinese medicinal plants. Next generation sequencing technology provides a powerful tool for analyzing transcriptomic profiles of gene expression in non-model species. Such gene expression can then be confirmed with quantitative real-time polymerase chain reaction (qRT-PCR). Therefore, use of systematically identified suitable reference genes is a prerequisite for obtaining reliable gene expression data. Here, we calculated the expression stability of 18 candidate reference genes across samples from different tissues and grown under salt stress using geNorm and NormFinder procedures. The geNorm-determined rank of reference genes was similar to those defined by NormFinder with some differences. Both procedures confirmed that the single most stable reference gene was ACNTIN1 for L. barbarum fruits, H2B1 for L. barbarum roots, and EF1α for L. ruthenicum fruits. PGK3, H2B2, and PGK3 were identified as the best stable reference genes for salt-treated L. ruthenicum leaves, roots, and stems, respectively. H2B1 and GAPDH1+PGK1 for L. ruthenicum and SAMDC2+H2B1 for L. barbarum were the best single and/or combined reference genes across all samples. Finally, expression of salt-responsive gene NAC, fruit ripening candidate gene LrPG, and anthocyanin genes were investigated to confirm the validity of the selected reference genes. Suitable reference genes identified in this study provide a foundation for accurately assessing gene expression and further better understanding of novel gene function to elucidate molecular mechanisms behind particular biological/physiological processes in Lycium.
Selecting and validating reference genes for quantitative real-time PCR in Plutella xylostella (L.).

PubMed

You, Yanchun; Xie, Miao; Vasseur, Liette; You, Minsheng

2018-05-01

Gene expression analysis provides important clues regarding gene functions, and quantitative real-time PCR (qRT-PCR) is a widely used method in gene expression studies. Reference genes are essential for normalizing and accurately assessing gene expression. In the present study, 16 candidate reference genes (ACTB, CyPA, EF1-α, GAPDH, HSP90, NDPk, RPL13a, RPL18, RPL19, RPL32, RPL4, RPL8, RPS13, RPS4, α-TUB, and β-TUB) from Plutella xylostella were selected to evaluate gene expression stability across different experimental conditions using five statistical algorithms (geNorm, NormFinder, Delta Ct, BestKeeper, and RefFinder). The results suggest that different reference genes or combinations of reference genes are suitable for normalization in gene expression studies of P. xylostella according to the different developmental stages, strains, tissues, and insecticide treatments. Based on the given experimental sets, the most stable reference genes were RPS4 across different developmental stages, RPL8 across different strains and tissues, and EF1-α across different insecticide treatments. A comprehensive and systematic assessment of potential reference genes for gene expression normalization is essential for post-genomic functional research in P. xylostella, a notorious pest with worldwide distribution and a high capacity to adapt and develop resistance to insecticides.
Genome-Wide Identification and Testing of Superior Reference Genes for Transcript Normalization in Arabidopsis1[w

PubMed Central

Czechowski, Tomasz; Stitt, Mark; Altmann, Thomas; Udvardi, Michael K.; Scheible, Wolf-Rüdiger

2005-01-01

Gene transcripts with invariant abundance during development and in the face of environmental stimuli are essential reference points for accurate gene expression analyses, such as RNA gel-blot analysis or quantitative reverse transcription-polymerase chain reaction (PCR). An exceptionally large set of data from Affymetrix ATH1 whole-genome GeneChip studies provided the means to identify a new generation of reference genes with very stable expression levels in the model plant species Arabidopsis (Arabidopsis thaliana). Hundreds of Arabidopsis genes were found that outperform traditional reference genes in terms of expression stability throughout development and under a range of environmental conditions. Most of these were expressed at much lower levels than traditional reference genes, making them very suitable for normalization of gene expression over a wide range of transcript levels. Specific and efficient primers were developed for 22 genes and tested on a diverse set of 20 cDNA samples. Quantitative reverse transcription-PCR confirmed superior expression stability and lower absolute expression levels for many of these genes, including genes encoding a protein phosphatase 2A subunit, a coatomer subunit, and an ubiquitin-conjugating enzyme. The developed PCR primers or hybridization probes for the novel reference genes will enable better normalization and quantification of transcript levels in Arabidopsis in the future. PMID:16166256
Using RNA-seq data to select reference genes for normalizing gene expression in apple roots.

PubMed

Zhou, Zhe; Cong, Peihua; Tian, Yi; Zhu, Yanmin

2017-01-01

Gene expression in apple roots in response to various stress conditions is a less-explored research subject. Reliable reference genes for normalizing quantitative gene expression data have not been carefully investigated. In this study, the suitability of a set of 15 apple genes were evaluated for their potential use as reliable reference genes. These genes were selected based on their low variance of gene expression in apple root tissues from a recent RNA-seq data set, and a few previously reported apple reference genes for other tissue types. Four methods, Delta Ct, geNorm, NormFinder and BestKeeper, were used to evaluate their stability in apple root tissues of various genotypes and under different experimental conditions. A small panel of stably expressed genes, MDP0000095375, MDP0000147424, MDP0000233640, MDP0000326399 and MDP0000173025 were recommended for normalizing quantitative gene expression data in apple roots under various abiotic or biotic stresses. When the most stable and least stable reference genes were used for data normalization, significant differences were observed on the expression patterns of two target genes, MdLecRLK5 (MDP0000228426, a gene encoding a lectin receptor like kinase) and MdMAPK3 (MDP0000187103, a gene encoding a mitogen-activated protein kinase). Our data also indicated that for those carefully validated reference genes, a single reference gene is sufficient for reliable normalization of the quantitative gene expression. Depending on the experimental conditions, the most suitable reference genes can be specific to the sample of interest for more reliable RT-qPCR data normalization.
Using RNA-seq data to select reference genes for normalizing gene expression in apple roots

PubMed Central

Zhou, Zhe; Cong, Peihua; Tian, Yi

2017-01-01

Gene expression in apple roots in response to various stress conditions is a less-explored research subject. Reliable reference genes for normalizing quantitative gene expression data have not been carefully investigated. In this study, the suitability of a set of 15 apple genes were evaluated for their potential use as reliable reference genes. These genes were selected based on their low variance of gene expression in apple root tissues from a recent RNA-seq data set, and a few previously reported apple reference genes for other tissue types. Four methods, Delta Ct, geNorm, NormFinder and BestKeeper, were used to evaluate their stability in apple root tissues of various genotypes and under different experimental conditions. A small panel of stably expressed genes, MDP0000095375, MDP0000147424, MDP0000233640, MDP0000326399 and MDP0000173025 were recommended for normalizing quantitative gene expression data in apple roots under various abiotic or biotic stresses. When the most stable and least stable reference genes were used for data normalization, significant differences were observed on the expression patterns of two target genes, MdLecRLK5 (MDP0000228426, a gene encoding a lectin receptor like kinase) and MdMAPK3 (MDP0000187103, a gene encoding a mitogen-activated protein kinase). Our data also indicated that for those carefully validated reference genes, a single reference gene is sufficient for reliable normalization of the quantitative gene expression. Depending on the experimental conditions, the most suitable reference genes can be specific to the sample of interest for more reliable RT-qPCR data normalization. PMID:28934340
Identification and Validation of Reference Genes for RT-qPCR Analysis in Non-Heading Chinese Cabbage Flowers

PubMed Central

Wang, Cheng; Cui, Hong-Mi; Huang, Tian-Hong; Liu, Tong-Kun; Hou, Xi-Lin; Li, Ying

2016-01-01

Non-heading Chinese cabbage (Brassica rapa ssp. chinensis Makino) is an important vegetable member of Brassica rapa crops. It exhibits a typical sporophytic self-incompatibility (SI) system and is an ideal model plant to explore the mechanism of SI. Gene expression research are frequently used to unravel the complex genetic mechanism and in such studies appropriate reference selection is vital. Validation of reference genes have neither been conducted in Brassica rapa flowers nor in SI trait. In this study, 13 candidate reference genes were selected and examined systematically in 96 non-heading Chinese cabbage flower samples that represent four strategic groups in compatible and self-incompatible lines of non-heading Chinese cabbage. Two RT-qPCR analysis software, geNorm and NormFinder, were used to evaluate the expression stability of these genes systematically. Results revealed that best-ranked references genes should be selected according to specific sample subsets. DNAJ, UKN1, and PP2A were identified as the most stable reference genes among all samples. Moreover, our research further revealed that the widely used reference genes, CYP and ACP, were the least suitable reference genes in most non-heading Chinese cabbage flower sample sets. To further validate the suitability of the reference genes identified in this study, the expression level of SRK and Exo70A1 genes which play important roles in regulating interaction between pollen and stigma were studied. Our study presented the first systematic study of reference gene(s) selection for SI study and provided guidelines to obtain more accurate RT-qPCR results in non-heading Chinese cabbage. PMID:27375663
Comprehensive Annotation of the Parastagonospora nodorum Reference Genome Using Next-Generation Genomics, Transcriptomics and Proteogenomics

PubMed Central

Dodhia, Kejal; Stoll, Thomas; Hastie, Marcus; Furuki, Eiko; Ellwood, Simon R.; Williams, Angela H.; Tan, Yew-Foon; Testa, Alison C.; Gorman, Jeffrey J.; Oliver, Richard P.

2016-01-01

Parastagonospora nodorum, the causal agent of Septoria nodorum blotch (SNB), is an economically important pathogen of wheat (Triticum spp.), and a model for the study of necrotrophic pathology and genome evolution. The reference P. nodorum strain SN15 was the first Dothideomycete with a published genome sequence, and has been used as the basis for comparison within and between species. Here we present an updated reference genome assembly with corrections of SNP and indel errors in the underlying genome assembly from deep resequencing data as well as extensive manual annotation of gene models using transcriptomic and proteomic sources of evidence (https://github.com/robsyme/Parastagonospora_nodorum_SN15). The updated assembly and annotation includes 8,366 genes with modified protein sequence and 866 new genes. This study shows the benefits of using a wide variety of experimental methods allied to expert curation to generate a reliable set of gene models. PMID:26840125
Reconstruction of metabolic pathways by combining probabilistic graphical model-based and knowledge-based methods

PubMed Central

2014-01-01

Automatic reconstruction of metabolic pathways for an organism from genomics and transcriptomics data has been a challenging and important problem in bioinformatics. Traditionally, known reference pathways can be mapped into an organism-specific ones based on its genome annotation and protein homology. However, this simple knowledge-based mapping method might produce incomplete pathways and generally cannot predict unknown new relations and reactions. In contrast, ab initio metabolic network construction methods can predict novel reactions and interactions, but its accuracy tends to be low leading to a lot of false positives. Here we combine existing pathway knowledge and a new ab initio Bayesian probabilistic graphical model together in a novel fashion to improve automatic reconstruction of metabolic networks. Specifically, we built a knowledge database containing known, individual gene / protein interactions and metabolic reactions extracted from existing reference pathways. Known reactions and interactions were then used as constraints for Bayesian network learning methods to predict metabolic pathways. Using individual reactions and interactions extracted from different pathways of many organisms to guide pathway construction is new and improves both the coverage and accuracy of metabolic pathway construction. We applied this probabilistic knowledge-based approach to construct the metabolic networks from yeast gene expression data and compared its results with 62 known metabolic networks in the KEGG database. The experiment showed that the method improved the coverage of metabolic network construction over the traditional reference pathway mapping method and was more accurate than pure ab initio methods. PMID:25374614
A high resolution atlas of gene expression in the domestic sheep (Ovis aries)

PubMed Central

Farquhar, Iseabail L.; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G.; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C. Bruce; Freeman, Tom C.; Archibald, Alan L.; Hume, David A.

2017-01-01

Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of ‘guilt by association’ was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages. PMID:28915238
A high resolution atlas of gene expression in the domestic sheep (Ovis aries).

PubMed

Clark, Emily L; Bush, Stephen J; McCulloch, Mary E B; Farquhar, Iseabail L; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G; Wu, Chunlei; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C Bruce; Freeman, Tom C; Summers, Kim M; Archibald, Alan L; Hume, David A

2017-09-01

Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of 'guilt by association' was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages.

Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.

PubMed

Liu, Xuejun; Shi, Xinxin; Chen, Chunlin; Zhang, Li

2015-10-16

The high-throughput sequencing technology, RNA-Seq, has been widely used to quantify gene and isoform expression in the study of transcriptome in recent years. Accurate expression measurement from the millions or billions of short generated reads is obstructed by difficulties. One is ambiguous mapping of reads to reference transcriptome caused by alternative splicing. This increases the uncertainty in estimating isoform expression. The other is non-uniformity of read distribution along the reference transcriptome due to positional, sequencing, mappability and other undiscovered sources of biases. This violates the uniform assumption of read distribution for many expression calculation approaches, such as the direct RPKM calculation and Poisson-based models. Many methods have been proposed to address these difficulties. Some approaches employ latent variable models to discover the underlying pattern of read sequencing. However, most of these methods make bias correction based on surrounding sequence contents and share the bias models by all genes. They therefore cannot estimate gene- and isoform-specific biases as revealed by recent studies. We propose a latent variable model, NLDMseq, to estimate gene and isoform expression. Our method adopts latent variables to model the unknown isoforms, from which reads originate, and the underlying percentage of multiple spliced variants. The isoform- and exon-specific read sequencing biases are modeled to account for the non-uniformity of read distribution, and are identified by utilizing the replicate information of multiple lanes of a single library run. We employ simulation and real data to verify the performance of our method in terms of accuracy in the calculation of gene and isoform expression. Results show that NLDMseq obtains competitive gene and isoform expression compared to popular alternatives. Finally, the proposed method is applied to the detection of differential expression (DE) to show its usefulness in the downstream analysis. The proposed NLDMseq method provides an approach to accurately estimate gene and isoform expression from RNA-Seq data by modeling the isoform- and exon-specific read sequencing biases. It makes use of a latent variable model to discover the hidden pattern of read sequencing. We have shown that it works well in both simulations and real datasets, and has competitive performance compared to popular methods. The method has been implemented as a freely available software which can be found at https://github.com/PUGEA/NLDMseq.
GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

PubMed Central

Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H.

2011-01-01

GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. PMID:21998647
Reference Genes for Accurate Transcript Normalization in Citrus Genotypes under Different Experimental Conditions

PubMed Central

Mafra, Valéria; Kubo, Karen S.; Alves-Ferreira, Marcio; Ribeiro-Alves, Marcelo; Stuart, Rodrigo M.; Boava, Leonardo P.; Rodrigues, Carolina M.; Machado, Marcos A.

2012-01-01

Real-time reverse transcription PCR (RT-qPCR) has emerged as an accurate and widely used technique for expression profiling of selected genes. However, obtaining reliable measurements depends on the selection of appropriate reference genes for gene expression normalization. The aim of this work was to assess the expression stability of 15 candidate genes to determine which set of reference genes is best suited for transcript normalization in citrus in different tissues and organs and leaves challenged with five pathogens (Alternaria alternata, Phytophthora parasitica, Xylella fastidiosa and Candidatus Liberibacter asiaticus). We tested traditional genes used for transcript normalization in citrus and orthologs of Arabidopsis thaliana genes described as superior reference genes based on transcriptome data. geNorm and NormFinder algorithms were used to find the best reference genes to normalize all samples and conditions tested. Additionally, each biotic stress was individually analyzed by geNorm. In general, FBOX (encoding a member of the F-box family) and GAPC2 (GAPDH) was the most stable candidate gene set assessed under the different conditions and subsets tested, while CYP (cyclophilin), TUB (tubulin) and CtP (cathepsin) were the least stably expressed genes found. Validation of the best suitable reference genes for normalizing the expression level of the WRKY70 transcription factor in leaves infected with Candidatus Liberibacter asiaticus showed that arbitrary use of reference genes without previous testing could lead to misinterpretation of data. Our results revealed FBOX, SAND (a SAND family protein), GAPC2 and UPL7 (ubiquitin protein ligase 7) to be superior reference genes, and we recommend their use in studies of gene expression in citrus species and relatives. This work constitutes the first systematic analysis for the selection of superior reference genes for transcript normalization in different citrus organs and under biotic stress. PMID:22347455
Validation of endogenous reference genes for qRT-PCR analysis of human visceral adipose samples

PubMed Central

2010-01-01

Background Given the epidemic proportions of obesity worldwide and the concurrent prevalence of metabolic syndrome, there is an urgent need for better understanding the underlying mechanisms of metabolic syndrome, in particular, the gene expression differences which may participate in obesity, insulin resistance and the associated series of chronic liver conditions. Real-time PCR (qRT-PCR) is the standard method for studying changes in relative gene expression in different tissues and experimental conditions. However, variations in amount of starting material, enzymatic efficiency and presence of inhibitors can lead to quantification errors. Hence the need for accurate data normalization is vital. Among several known strategies for data normalization, the use of reference genes as an internal control is the most common approach. Recent studies have shown that both obesity and presence of insulin resistance influence an expression of commonly used reference genes in omental fat. In this study we validated candidate reference genes suitable for qRT-PCR profiling experiments using visceral adipose samples from obese and lean individuals. Results Cross-validation of expression stability of eight selected reference genes using three popular algorithms, GeNorm, NormFinder and BestKeeper found ACTB and RPII as most stable reference genes. Conclusions We recommend ACTB and RPII as stable reference genes most suitable for gene expression studies of human visceral adipose tissue. The use of these genes as a reference pair may further enhance the robustness of qRT-PCR in this model system. PMID:20492695
Validation of endogenous reference genes for qRT-PCR analysis of human visceral adipose samples.

PubMed

Mehta, Rohini; Birerdinc, Aybike; Hossain, Noreen; Afendy, Arian; Chandhoke, Vikas; Younossi, Zobair; Baranova, Ancha

2010-05-21

Given the epidemic proportions of obesity worldwide and the concurrent prevalence of metabolic syndrome, there is an urgent need for better understanding the underlying mechanisms of metabolic syndrome, in particular, the gene expression differences which may participate in obesity, insulin resistance and the associated series of chronic liver conditions. Real-time PCR (qRT-PCR) is the standard method for studying changes in relative gene expression in different tissues and experimental conditions. However, variations in amount of starting material, enzymatic efficiency and presence of inhibitors can lead to quantification errors. Hence the need for accurate data normalization is vital. Among several known strategies for data normalization, the use of reference genes as an internal control is the most common approach. Recent studies have shown that both obesity and presence of insulin resistance influence an expression of commonly used reference genes in omental fat. In this study we validated candidate reference genes suitable for qRT-PCR profiling experiments using visceral adipose samples from obese and lean individuals. Cross-validation of expression stability of eight selected reference genes using three popular algorithms, GeNorm, NormFinder and BestKeeper found ACTB and RPII as most stable reference genes. We recommend ACTB and RPII as stable reference genes most suitable for gene expression studies of human visceral adipose tissue. The use of these genes as a reference pair may further enhance the robustness of qRT-PCR in this model system.
Evaluation of Reference Genes for Quantitative Real-Time PCR in Songbirds

PubMed Central

Zinzow-Kramer, Wendy M.; Horton, Brent M.; Maney, Donna L.

2014-01-01

Quantitative real-time PCR (qPCR) is becoming a popular tool for the quantification of gene expression in the brain and endocrine tissues of songbirds. Accurate analysis of qPCR data relies on the selection of appropriate reference genes for normalization, yet few papers on songbirds contain evidence of reference gene validation. Here, we evaluated the expression of ten potential reference genes (18S, ACTB, GAPDH, HMBS, HPRT, PPIA, RPL4, RPL32, TFRC, and UBC) in brain, pituitary, ovary, and testis in two species of songbird: zebra finch and white-throated sparrow. We used two algorithms, geNorm and NormFinder, to assess the stability of these reference genes in our samples. We found that the suitability of some of the most popular reference genes for target gene normalization in mammals, such as 18S, depended highly on tissue type. Thus, they are not the best choices for brain and gonad in these songbirds. In contrast, we identified alternative genes, such as HPRT, RPL4 and PPIA, that were highly stable in brain, pituitary, and gonad in these species. Our results suggest that the validation of reference genes in mammals does not necessarily extrapolate to other taxonomic groups. For researchers wishing to identify and evaluate suitable reference genes for qPCR songbirds, our results should serve as a starting point and should help increase the power and utility of songbird models in behavioral neuroendocrinology. PMID:24780145
dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts

PubMed Central

Vincent, Jonathan; Dai, Zhanwu; Ravel, Catherine; Choulet, Frédéric; Mouzeyar, Said; Bouzidi, M. Fouad; Agier, Marie; Martre, Pierre

2013-01-01

The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/ PMID:23660284
Evaluation of reference genes for gene expression studies in radish (Raphanus sativus L.) using quantitative real-time PCR.

PubMed

Xu, Yuanyuan; Zhu, Xianwen; Gong, Yiqin; Xu, Liang; Wang, Yan; Liu, Liwang

2012-08-03

Real-time quantitative reverse transcription PCR (RT-qPCR) is a rapid and reliable method for gene expression studies. Normalization based on reference genes can increase the reliability of this technique; however, recent studies have shown that almost no single reference gene is universal for all possible experimental conditions. In this study, eight frequently used reference genes were investigated, including Glyceraldehyde-3-phosphate dehydrogenase (GAPDH), Actin2/7 (ACT), Tubulin alpha-5 (TUA), Tubulin beta-1 (TUB), 18S ribosomal RNA (18SrRNA), RNA polymerase-II transcription factor (RPII), Elongation factor 1-b (EF-1b) and Translation elongation factor 2 (TEF2). Expression stability of candidate reference genes was examined across 27 radish samples, representing a range of tissue types, cultivars, photoperiodic and vernalization treatments, and developmental stages. The eight genes in these sample pools displayed a wide range of Ct values and were variably expressed. Two statistical software packages, geNorm and NormFinder showed that TEF2, RPII and ACT appeared to be relatively stable and therefore the most suitable for use as reference genes. These results facilitate selection of desirable reference genes for accurate gene expression studies in radish. Copyright © 2012 Elsevier Inc. All rights reserved.
Reference gene identification for reliable normalisation of quantitative RT-PCR data in Setaria viridis.

PubMed

Nguyen, Duc Quan; Eamens, Andrew L; Grof, Christopher P L

2018-01-01

Quantitative real-time polymerase chain reaction (RT-qPCR) is the key platform for the quantitative analysis of gene expression in a wide range of experimental systems and conditions. However, the accuracy and reproducibility of gene expression quantification via RT-qPCR is entirely dependent on the identification of reliable reference genes for data normalisation. Green foxtail ( Setaria viridis ) has recently been proposed as a potential experimental model for the study of C 4 photosynthesis and is closely related to many economically important crop species of the Panicoideae subfamily of grasses, including Zea mays (maize), Sorghum bicolor (sorghum) and Sacchurum officinarum (sugarcane). Setaria viridis (Accession 10) possesses a number of key traits as an experimental model, namely; (i) a small sized, sequenced and well annotated genome; (ii) short stature and generation time; (iii) prolific seed production, and; (iv) is amendable to Agrobacterium tumefaciens -mediated transformation. There is currently however, a lack of reference gene expression information for Setaria viridis ( S. viridis ). We therefore aimed to identify a cohort of suitable S. viridis reference genes for accurate and reliable normalisation of S. viridis RT-qPCR expression data. Eleven putative candidate reference genes were identified and examined across thirteen different S. viridis tissues. Of these, the geNorm and NormFinder analysis software identified SERINE / THERONINE - PROTEIN PHOSPHATASE 2A ( PP2A ), 5 '- ADENYLYLSULFATE REDUCTASE 6 ( ASPR6 ) and DUAL SPECIFICITY PHOSPHATASE ( DUSP ) as the most suitable combination of reference genes for the accurate and reliable normalisation of S. viridis RT-qPCR expression data. To demonstrate the suitability of the three selected reference genes, PP2A , ASPR6 and DUSP , were used to normalise the expression of CINNAMYL ALCOHOL DEHYDROGENASE ( CAD ) genes across the same tissues. This approach readily demonstrated the suitably of the three selected reference genes for the accurate and reliable normalisation of S. viridis RT-qPCR expression data. Further, the work reported here forms a highly useful platform for future gene expression quantification in S. viridis and can also be potentially directly translatable to other closely related and agronomically important C 4 crop species.
Evaluation of reference genes for quantitative RT-PCR in Lolium temulentum under abiotic stress

USDA-ARS?s Scientific Manuscript database

Lolium temulentum is a valuable model grass species for the study of stress in forage and turf grasses. Gene expression analysis by quantitative real time RT-PCR relies on the use of proper internal standards. The aim of this study was to identify and evaluate reference genes for use in real-time q...
Integration of gene normalization stages and co-reference resolution using a Markov logic network.

PubMed

Dai, Hong-Jie; Chang, Yen-Ching; Tsai, Richard Tzong-Han; Hsu, Wen-Lian

2011-09-15

Gene normalization (GN) is the task of normalizing a textual gene mention to a unique gene database ID. Traditional top performing GN systems usually need to consider several constraints to make decisions in the normalization process, including filtering out false positives, or disambiguating an ambiguous gene mention, to improve system performance. However, these constraints are usually executed in several separate stages and cannot use each other's input/output interactively. In this article, we propose a novel approach that employs a Markov logic network (MLN) to model the constraints used in the GN task. Firstly, we show how various constraints can be formulated and combined in an MLN. Secondly, we are the first to apply the two main concepts of co-reference resolution-discourse salience in centering theory and transitivity-to GN models. Furthermore, to make our results more relevant to developers of information extraction applications, we adopt the instance-based precision/recall/F-measure (PRF) in addition to the article-wide PRF to assess system performance. Experimental results show that our system outperforms baseline and state-of-the-art systems under two evaluation schemes. Through further analysis, we have found several unexplored challenges in the GN task. hongjie@iis.sinica.edu.tw Supplementary data are available at Bioinformatics online.
Plant Reactome: a resource for plant pathways and comparative analysis.

PubMed

Naithani, Sushma; Preece, Justin; D'Eustachio, Peter; Gupta, Parul; Amarasinghe, Vindhya; Dharmawardhana, Palitha D; Wu, Guanming; Fabregat, Antonio; Elser, Justin L; Weiser, Joel; Keays, Maria; Fuentes, Alfonso Munoz-Pomer; Petryszak, Robert; Stein, Lincoln D; Ware, Doreen; Jaiswal, Pankaj

2017-01-04

Plant Reactome (http://plantreactome.gramene.org/) is a free, open-source, curated plant pathway database portal, provided as part of the Gramene project. The database provides intuitive bioinformatics tools for the visualization, analysis and interpretation of pathway knowledge to support genome annotation, genome analysis, modeling, systems biology, basic research and education. Plant Reactome employs the structural framework of a plant cell to show metabolic, transport, genetic, developmental and signaling pathways. We manually curate molecular details of pathways in these domains for reference species Oryza sativa (rice) supported by published literature and annotation of well-characterized genes. Two hundred twenty-two rice pathways, 1025 reactions associated with 1173 proteins, 907 small molecules and 256 literature references have been curated to date. These reference annotations were used to project pathways for 62 model, crop and evolutionarily significant plant species based on gene homology. Database users can search and browse various components of the database, visualize curated baseline expression of pathway-associated genes provided by the Expression Atlas and upload and analyze their Omics datasets. The database also offers data access via Application Programming Interfaces (APIs) and in various standardized pathway formats, such as SBML and BioPAX. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
RnaSeqSampleSize: real data based sample size estimation for RNA sequencing.

PubMed

Zhao, Shilin; Li, Chung-I; Guo, Yan; Sheng, Quanhu; Shyr, Yu

2018-05-30

One of the most important and often neglected components of a successful RNA sequencing (RNA-Seq) experiment is sample size estimation. A few negative binomial model-based methods have been developed to estimate sample size based on the parameters of a single gene. However, thousands of genes are quantified and tested for differential expression simultaneously in RNA-Seq experiments. Thus, additional issues should be carefully addressed, including the false discovery rate for multiple statistic tests, widely distributed read counts and dispersions for different genes. To solve these issues, we developed a sample size and power estimation method named RnaSeqSampleSize, based on the distributions of gene average read counts and dispersions estimated from real RNA-seq data. Datasets from previous, similar experiments such as the Cancer Genome Atlas (TCGA) can be used as a point of reference. Read counts and their dispersions were estimated from the reference's distribution; using that information, we estimated and summarized the power and sample size. RnaSeqSampleSize is implemented in R language and can be installed from Bioconductor website. A user friendly web graphic interface is provided at http://cqs.mc.vanderbilt.edu/shiny/RnaSeqSampleSize/ . RnaSeqSampleSize provides a convenient and powerful way for power and sample size estimation for an RNAseq experiment. It is also equipped with several unique features, including estimation for interested genes or pathway, power curve visualization, and parameter optimization.
Reference Gene Validation for RT-qPCR, a Note on Different Available Software Packages

PubMed Central

De Spiegelaere, Ward; Dern-Wieloch, Jutta; Weigel, Roswitha; Schumacher, Valérie; Schorle, Hubert; Nettersheim, Daniel; Bergmann, Martin; Brehm, Ralph; Kliesch, Sabine; Vandekerckhove, Linos; Fink, Cornelia

2015-01-01

Background An appropriate normalization strategy is crucial for data analysis from real time reverse transcription polymerase chain reactions (RT-qPCR). It is widely supported to identify and validate stable reference genes, since no single biological gene is stably expressed between cell types or within cells under different conditions. Different algorithms exist to validate optimal reference genes for normalization. Applying human cells, we here compare the three main methods to the online available RefFinder tool that integrates these algorithms along with R-based software packages which include the NormFinder and GeNorm algorithms. Results 14 candidate reference genes were assessed by RT-qPCR in two sample sets, i.e. a set of samples of human testicular tissue containing carcinoma in situ (CIS), and a set of samples from the human adult Sertoli cell line (FS1) either cultured alone or in co-culture with the seminoma like cell line (TCam-2) or with equine bone marrow derived mesenchymal stem cells (eBM-MSC). Expression stabilities of the reference genes were evaluated using geNorm, NormFinder, and BestKeeper. Similar results were obtained by the three approaches for the most and least stably expressed genes. The R-based packages NormqPCR, SLqPCR and the NormFinder for R script gave identical gene rankings. Interestingly, different outputs were obtained between the original software packages and the RefFinder tool, which is based on raw Cq values for input. When the raw data were reanalysed assuming 100% efficiency for all genes, then the outputs of the original software packages were similar to the RefFinder software, indicating that RefFinder outputs may be biased because PCR efficiencies are not taken into account. Conclusions This report shows that assay efficiency is an important parameter for reference gene validation. New software tools that incorporate these algorithms should be carefully validated prior to use. PMID:25825906
Reference gene validation for RT-qPCR, a note on different available software packages.

PubMed

De Spiegelaere, Ward; Dern-Wieloch, Jutta; Weigel, Roswitha; Schumacher, Valérie; Schorle, Hubert; Nettersheim, Daniel; Bergmann, Martin; Brehm, Ralph; Kliesch, Sabine; Vandekerckhove, Linos; Fink, Cornelia

2015-01-01

An appropriate normalization strategy is crucial for data analysis from real time reverse transcription polymerase chain reactions (RT-qPCR). It is widely supported to identify and validate stable reference genes, since no single biological gene is stably expressed between cell types or within cells under different conditions. Different algorithms exist to validate optimal reference genes for normalization. Applying human cells, we here compare the three main methods to the online available RefFinder tool that integrates these algorithms along with R-based software packages which include the NormFinder and GeNorm algorithms. 14 candidate reference genes were assessed by RT-qPCR in two sample sets, i.e. a set of samples of human testicular tissue containing carcinoma in situ (CIS), and a set of samples from the human adult Sertoli cell line (FS1) either cultured alone or in co-culture with the seminoma like cell line (TCam-2) or with equine bone marrow derived mesenchymal stem cells (eBM-MSC). Expression stabilities of the reference genes were evaluated using geNorm, NormFinder, and BestKeeper. Similar results were obtained by the three approaches for the most and least stably expressed genes. The R-based packages NormqPCR, SLqPCR and the NormFinder for R script gave identical gene rankings. Interestingly, different outputs were obtained between the original software packages and the RefFinder tool, which is based on raw Cq values for input. When the raw data were reanalysed assuming 100% efficiency for all genes, then the outputs of the original software packages were similar to the RefFinder software, indicating that RefFinder outputs may be biased because PCR efficiencies are not taken into account. This report shows that assay efficiency is an important parameter for reference gene validation. New software tools that incorporate these algorithms should be carefully validated prior to use.
Validation of endogenous internal real-time PCR controls in renal tissues.

PubMed

Cui, Xiangqin; Zhou, Juling; Qiu, Jing; Johnson, Martin R; Mrug, Michal

2009-01-01

Endogenous internal controls ('reference' or 'housekeeping' genes) are widely used in real-time PCR (RT-PCR) analyses. Their use relies on the premise of consistently stable expression across studied experimental conditions. Unfortunately, none of these controls fulfills this premise across a wide range of experimental conditions; consequently, none of them can be recommended for universal use. To determine which endogenous RT-PCR controls are suitable for analyses of renal tissues altered by kidney disease, we studied the expression of 16 commonly used 'reference genes' in 7 mildly and 7 severely affected whole kidney tissues from a well-characterized cystic kidney disease model. Expression levels of these 16 genes, determined by TaqMan RT-PCR analyses and Affymetrix GeneChip arrays, were normalized and tested for overall variance and equivalence of the means. Both statistical approaches and both TaqMan- and GeneChip-based methods converged on 3 out of the 4 top-ranked genes (Ppia, Gapdh and Pgk1) that had the most constant expression levels across the studied phenotypes. A combination of the top-ranked genes will provide a suitable endogenous internal control for similar studies of kidney tissues across a wide range of disease severity. Copyright 2009 S. Karger AG, Basel.
sigReannot: an oligo-set re-annotation pipeline based on similarities with the Ensembl transcripts and Unigene clusters.

PubMed

Casel, Pierrot; Moreews, François; Lagarrigue, Sandrine; Klopp, Christophe

2009-07-16

Microarray is a powerful technology enabling to monitor tens of thousands of genes in a single experiment. Most microarrays are now using oligo-sets. The design of the oligo-nucleotides is time consuming and error prone. Genome wide microarray oligo-sets are designed using as large a set of transcripts as possible in order to monitor as many genes as possible. Depending on the genome sequencing state and on the assembly state the knowledge of the existing transcripts can be very different. This knowledge evolves with the different genome builds and gene builds. Once the design is done the microarrays are often used for several years. The biologists working in EADGENE expressed the need of up-to-dated annotation files for the oligo-sets they share including information about the orthologous genes of model species, the Gene Ontology, the corresponding pathways and the chromosomal location. The results of SigReannot on a chicken micro-array used in the EADGENE project compared to the initial annotations show that 23% of the oligo-nucleotide gene annotations were not confirmed, 2% were modified and 1% were added. The interest of this up-to-date annotation procedure is demonstrated through the analysis of real data previously published. SigReannot uses the oligo-nucleotide design procedure criteria to validate the probe-gene link and the Ensembl transcripts as reference for annotation. It therefore produces a high quality annotation based on reference gene sets.
Selection and validation of reference genes for gene expression analysis in apomictic and sexual Cenchrus ciliaris

PubMed Central

2013-01-01

Background Apomixis is a naturally occurring asexual mode of seed reproduction resulting in offspring genetically identical to the maternal plant. Identifying differential gene expression patterns between apomictic and sexual plants is valuable to help deconstruct the trait. Quantitative RT-PCR (qRT-PCR) is a popular method for analyzing gene expression. Normalizing gene expression data using proper reference genes which show stable expression under investigated conditions is critical in qRT-PCR analysis. We used qRT-PCR to validate expression and stability of six potential reference genes (EF1alpha, EIF4A, UBCE, GAPDH, ACT2 and TUBA) in vegetative and reproductive tissues of B-2S and B-12-9 accessions of C. ciliaris. Findings Among tissue types evaluated, EF1alpha showed the highest level of expression while TUBA showed the lowest. When all tissue types were evaluated and compared between genotypes, EIF4A was the most stable reference gene. Gene expression stability for specific ovary stages of B-2S and B-12-9 was also determined. Except for TUBA, all other tested reference genes could be used for any stage-specific ovary tissue normalization, irrespective of the mode of reproduction. Conclusion Our gene expression stability assay using six reference genes, in sexual and apomictic accessions of C. ciliaris, suggests that EIF4A is the most stable gene across all tissue types analyzed. All other tested reference genes, with the exception of TUBA, could be used for gene expression comparison studies between sexual and apomictic ovaries over multiple developmental stages. This reference gene validation data in C. ciliaris will serve as an important base for future apomixis-related transcriptome data validation. PMID:24083672
Bayesian estimation of differential transcript usage from RNA-seq data.

PubMed

Papastamoulis, Panagiotis; Rattray, Magnus

2017-11-27

Next generation sequencing allows the identification of genes consisting of differentially expressed transcripts, a term which usually refers to changes in the overall expression level. A specific type of differential expression is differential transcript usage (DTU) and targets changes in the relative within gene expression of a transcript. The contribution of this paper is to: (a) extend the use of cjBitSeq to the DTU context, a previously introduced Bayesian model which is originally designed for identifying changes in overall expression levels and (b) propose a Bayesian version of DRIMSeq, a frequentist model for inferring DTU. cjBitSeq is a read based model and performs fully Bayesian inference by MCMC sampling on the space of latent state of each transcript per gene. BayesDRIMSeq is a count based model and estimates the Bayes Factor of a DTU model against a null model using Laplace's approximation. The proposed models are benchmarked against the existing ones using a recent independent simulation study as well as a real RNA-seq dataset. Our results suggest that the Bayesian methods exhibit similar performance with DRIMSeq in terms of precision/recall but offer better calibration of False Discovery Rate.
Validation of the β-amy1 transcription profiling assay and selection of reference genes suited for a RT-qPCR assay in developing barley caryopsis.

PubMed

Ovesná, Jaroslava; Kučera, Ladislav; Vaculová, Kateřina; Štrymplová, Kamila; Svobodová, Ilona; Milella, Luigi

2012-01-01

Reverse transcription coupled with real-time quantitative PCR (RT-qPCR) is a frequently used method for gene expression profiling. Reference genes (RGs) are commonly employed to normalize gene expression data. A limited information exist on the gene expression and profiling in developing barley caryopsis. Expression stability was assessed by measuring the cycle threshold (Ct) range and applying both the GeNorm (pair-wise comparison of geometric means) and Normfinder (model-based approach) principles for the calculation. Here, we have identified a set of four RGs suitable for studying gene expression in the developing barley caryopsis. These encode the proteins GAPDH, HSP90, HSP70 and ubiquitin. We found a correlation between the frequency of occurrence of a transcript in silico and its suitability as an RG. This set of RGs was tested by comparing the normalized level of β-amylase (β-amy1) transcript with directly measured quantities of the BMY1 gene product in the developing barley caryopsis. This panel of genes could be used for other gene expression studies, as well as to optimize β-amy1 analysis for study of the impact of β-amy1 expression upon barley end-use quality.

Production of a reference transcriptome and transcriptomic database (EdwardsiellaBase) for the lined sea anemone, Edwardsiella lineata, a parasitic cnidarian

PubMed Central

2014-01-01

Background The lined sea anemone Edwardsiella lineata is an informative model system for evolutionary-developmental studies of parasitism. In this species, it is possible to compare alternate developmental pathways leading from a larva to either a free-living polyp or a vermiform parasite that inhabits the mesoglea of a ctenophore host. Additionally, E. lineata is confamilial with the model cnidarian Nematostella vectensis, providing an opportunity for comparative genomic, molecular and organismal studies. Description We generated a reference transcriptome for E. lineata via high-throughput sequencing of RNA isolated from five developmental stages (parasite; parasite-to-larva transition; larva; larva-to-adult transition; adult). The transcriptome comprises 90,440 contigs assembled from >15 billion nucleotides of DNA sequence. Using a molecular clock approach, we estimated the divergence between E. lineata and N. vectensis at 215–364 million years ago. Based on gene ontology and metabolic pathway analyses and gene family surveys (bHLH-PAS, deiodinases, Fox genes, LIM homeodomains, minicollagens, nuclear receptors, Sox genes, and Wnts), the transcriptome of E. lineata is comparable in depth and completeness to N. vectensis. Analyses of protein motifs and revealed extensive conservation between the proteins of these two edwardsiid anemones, although we show the NF-κB protein of E. lineata reflects the ancestral structure, while the NF-κB protein of N. vectensis has undergone a split that separates the DNA-binding domain from the inhibitory domain. All contigs have been deposited in a public database (EdwardsiellaBase), where they may be searched according to contig ID, gene ontology, protein family motif (Pfam), enzyme commission number, and BLAST. The alignment of the raw reads to the contigs can also be visualized via JBrowse. Conclusions The transcriptomic data and database described here provide a platform for studying the evolutionary developmental genomics of a derived parasitic life cycle. In addition, these data from E. lineata will aid in the interpretation of evolutionary novelties in gene sequence or structure that have been reported for the model cnidarian N. vectensis (e.g., the split NF-κB locus). Finally, we include custom computational tools to facilitate the annotation of a transcriptome based on high-throughput sequencing data obtained from a “non-model system.” PMID:24467778
Production of a reference transcriptome and transcriptomic database (EdwardsiellaBase) for the lined sea anemone, Edwardsiella lineata, a parasitic cnidarian.

PubMed

Stefanik, Derek J; Lubinski, Tristan J; Granger, Brian R; Byrd, Allyson L; Reitzel, Adam M; DeFilippo, Lukas; Lorenc, Allison; Finnerty, John R

2014-01-28

The lined sea anemone Edwardsiella lineata is an informative model system for evolutionary-developmental studies of parasitism. In this species, it is possible to compare alternate developmental pathways leading from a larva to either a free-living polyp or a vermiform parasite that inhabits the mesoglea of a ctenophore host. Additionally, E. lineata is confamilial with the model cnidarian Nematostella vectensis, providing an opportunity for comparative genomic, molecular and organismal studies. We generated a reference transcriptome for E. lineata via high-throughput sequencing of RNA isolated from five developmental stages (parasite; parasite-to-larva transition; larva; larva-to-adult transition; adult). The transcriptome comprises 90,440 contigs assembled from >15 billion nucleotides of DNA sequence. Using a molecular clock approach, we estimated the divergence between E. lineata and N. vectensis at 215-364 million years ago. Based on gene ontology and metabolic pathway analyses and gene family surveys (bHLH-PAS, deiodinases, Fox genes, LIM homeodomains, minicollagens, nuclear receptors, Sox genes, and Wnts), the transcriptome of E. lineata is comparable in depth and completeness to N. vectensis. Analyses of protein motifs and revealed extensive conservation between the proteins of these two edwardsiid anemones, although we show the NF-κB protein of E. lineata reflects the ancestral structure, while the NF-κB protein of N. vectensis has undergone a split that separates the DNA-binding domain from the inhibitory domain. All contigs have been deposited in a public database (EdwardsiellaBase), where they may be searched according to contig ID, gene ontology, protein family motif (Pfam), enzyme commission number, and BLAST. The alignment of the raw reads to the contigs can also be visualized via JBrowse. The transcriptomic data and database described here provide a platform for studying the evolutionary developmental genomics of a derived parasitic life cycle. In addition, these data from E. lineata will aid in the interpretation of evolutionary novelties in gene sequence or structure that have been reported for the model cnidarian N. vectensis (e.g., the split NF-κB locus). Finally, we include custom computational tools to facilitate the annotation of a transcriptome based on high-throughput sequencing data obtained from a "non-model system."
CARD 2017: expansion and model-centric curation of the comprehensive antibiotic resistance database

PubMed Central

Jia, Baofeng; Raphenya, Amogelang R.; Alcock, Brian; Waglechner, Nicholas; Guo, Peiyao; Tsang, Kara K.; Lago, Briony A.; Dave, Biren M.; Pereira, Sheldon; Sharma, Arjun N.; Doshi, Sachin; Courtot, Mélanie; Lo, Raymond; Williams, Laura E.; Frye, Jonathan G.; Elsayegh, Tariq; Sardar, Daim; Westman, Erin L.; Pawlowski, Andrew C.; Johnson, Timothy A.; Brinkman, Fiona S.L.; Wright, Gerard D.; McArthur, Andrew G.

2017-01-01

The Comprehensive Antibiotic Resistance Database (CARD; http://arpcard.mcmaster.ca) is a manually curated resource containing high quality reference data on the molecular basis of antimicrobial resistance (AMR), with an emphasis on the genes, proteins and mutations involved in AMR. CARD is ontologically structured, model centric, and spans the breadth of AMR drug classes and resistance mechanisms, including intrinsic, mutation-driven and acquired resistance. It is built upon the Antibiotic Resistance Ontology (ARO), a custom built, interconnected and hierarchical controlled vocabulary allowing advanced data sharing and organization. Its design allows the development of novel genome analysis tools, such as the Resistance Gene Identifier (RGI) for resistome prediction from raw genome sequence. Recent improvements include extensive curation of additional reference sequences and mutations, development of a unique Model Ontology and accompanying AMR detection models to power sequence analysis, new visualization tools, and expansion of the RGI for detection of emergent AMR threats. CARD curation is updated monthly based on an interplay of manual literature curation, computational text mining, and genome analysis. PMID:27789705
Methods and approaches in the topology-based analysis of biological pathways

PubMed Central

Mitrea, Cristina; Taghavi, Zeinab; Bokanizad, Behzad; Hanoudi, Samer; Tagett, Rebecca; Donato, Michele; Voichiţa, Călin; Drăghici, Sorin

2013-01-01

The goal of pathway analysis is to identify the pathways significantly impacted in a given phenotype. Many current methods are based on algorithms that consider pathways as simple gene lists, dramatically under-utilizing the knowledge that such pathways are meant to capture. During the past few years, a plethora of methods claiming to incorporate various aspects of the pathway topology have been proposed. These topology-based methods, sometimes referred to as “third generation,” have the potential to better model the phenomena described by pathways. Although there is now a large variety of approaches used for this purpose, no review is currently available to offer guidance for potential users and developers. This review covers 22 such topology-based pathway analysis methods published in the last decade. We compare these methods based on: type of pathways analyzed (e.g., signaling or metabolic), input (subset of genes, all genes, fold changes, gene p-values, etc.), mathematical models, pathway scoring approaches, output (one or more pathway scores, p-values, etc.) and implementation (web-based, standalone, etc.). We identify and discuss challenges, arising both in methodology and in pathway representation, including inconsistent terminology, different data formats, lack of meaningful benchmarks, and the lack of tissue and condition specificity. PMID:24133454
Identification of Reference Genes and Analysis of Heat Shock Protein Gene Expression in Lingzhi or Reishi Medicinal Mushroom, Ganoderma lucidum, after Exposure to Heat Stress.

PubMed

Liu, Yong-Nan; Lu, Xiao-Xiao; Ren, Ang; Shi, Liang; Jiang, Ai-Liang; Yu, Han-Shou; Zhao, Ming-Wen

2017-01-01

Ganoderma lucidum has been considered an emerging model species for studying how environmental factors regulate the growth, development, and secondary metabolism of Basidiomycetes. Heat stress, which is one of the most important environmental abiotic stresses, seriously affects the growth, development, and yield of microorganisms. Understanding the response to heat stress has gradually become a hotspot in microorganism research. But suitable reference genes for expression analysis under heat stress have not been reported in G. lucidum. In this study, we systematically identified 11 candidate reference genes that were measured using reverse transcriptase quantitative polymerase chain reaction, and the gene expression stability was analyzed under heat stress conditions using geNorm and NormFinder. The results show that 5 reference genes-CYP and TIF, followed by UCE2, ACTIN, and UBQ1-are the most stable genes under our experimental conditions. Moreover, the relative expression levels of 3 heat stress response genes (hsp17.4, hsp70, and hsp90) were analyzed under heat stress conditions with different normalization strategies. The results show that use of a gene with unstable expression (SAND) as the reference gene leads to biased data and misinterpretations of the target gene expression level under heat stress.
Identification of Reference Genes for Real-Time Quantitative PCR Experiments in the Liverwort Marchantia polymorpha

PubMed Central

Dolan, Liam; Langdale, Jane A.

2015-01-01

Real-time quantitative polymerase chain reaction (qPCR) has become widely used as a method to compare gene transcript levels across different conditions. However, selection of suitable reference genes to normalize qPCR data is required for accurate transcript level analysis. Recently, Marchantia polymorpha has been adopted as a model for the study of liverwort development and land plant evolution. Identification of appropriate reference genes has therefore become a necessity for gene expression studies. In this study, transcript levels of eleven candidate reference genes have been analyzed across a range of biological contexts that encompass abiotic stress, hormone treatment and different developmental stages. The consistency of transcript levels was assessed using both geNorm and NormFinder algorithms, and a consensus ranking of the different candidate genes was then obtained. MpAPT and MpACT showed relatively constant transcript levels across all conditions tested whereas the transcript levels of other candidate genes were clearly influenced by experimental conditions. By analyzing transcript levels of phosphate and nitrate starvation reporter genes, we confirmed that MpAPT and MpACT are suitable reference genes in M. polymorpha and also demonstrated that normalization with an inappropriate gene can lead to erroneous analysis of qPCR data. PMID:25798897
Evolutionary interrogation of human biology in well-annotated genomic framework of rhesus macaque.

PubMed

Zhang, Shi-Jian; Liu, Chu-Jun; Yu, Peng; Zhong, Xiaoming; Chen, Jia-Yu; Yang, Xinzhuang; Peng, Jiguang; Yan, Shouyu; Wang, Chenqu; Zhu, Xiaotong; Xiong, Jingwei; Zhang, Yong E; Tan, Bertrand Chin-Ming; Li, Chuan-Yun

2014-05-01

With genome sequence and composition highly analogous to human, rhesus macaque represents a unique reference for evolutionary studies of human biology. Here, we developed a comprehensive genomic framework of rhesus macaque, the RhesusBase2, for evolutionary interrogation of human genes and the associated regulations. A total of 1,667 next-generation sequencing (NGS) data sets were processed, integrated, and evaluated, generating 51.2 million new functional annotation records. With extensive NGS annotations, RhesusBase2 refined the fine-scale structures in 30% of the macaque Ensembl transcripts, reporting an accurate, up-to-date set of macaque gene models. On the basis of these annotations and accurate macaque gene models, we further developed an NGS-oriented Molecular Evolution Gateway to access and visualize macaque annotations in reference to human orthologous genes and associated regulations (www.rhesusbase.org/molEvo). We highlighted the application of this well-annotated genomic framework in generating hypothetical link of human-biased regulations to human-specific traits, by using mechanistic characterization of the DIEXF gene as an example that provides novel clues to the understanding of digestive system reduction in human evolution. On a global scale, we also identified a catalog of 9,295 human-biased regulatory events, which may represent novel elements that have a substantial impact on shaping human transcriptome and possibly underpin recent human phenotypic evolution. Taken together, we provide an NGS data-driven, information-rich framework that will broadly benefit genomics research in general and serves as an important resource for in-depth evolutionary studies of human biology.
A gene co-expression network model identifies yield-related vicinity networks in Jatropha curcas shoot system.

PubMed

Govender, Nisha; Senan, Siju; Mohamed-Hussein, Zeti-Azura; Wickneswari, Ratnam

2018-06-15

The plant shoot system consists of reproductive organs such as inflorescences, buds and fruits, and the vegetative leaves and stems. In this study, the reproductive part of the Jatropha curcas shoot system, which includes the aerial shoots, shoots bearing the inflorescence and inflorescence were investigated in regard to gene-to-gene interactions underpinning yield-related biological processes. An RNA-seq based sequencing of shoot tissues performed on an Illumina HiSeq. 2500 platform generated 18 transcriptomes. Using the reference genome-based mapping approach, a total of 64 361 genes was identified in all samples and the data was annotated against the non-redundant database by the BLAST2GO Pro. Suite. After removing the outlier genes and samples, a total of 12 734 genes across 17 samples were subjected to gene co-expression network construction using petal, an R library. A gene co-expression network model built with scale-free and small-world properties extracted four vicinity networks (VNs) with putative involvement in yield-related biological processes as follow; heat stress tolerance, floral and shoot meristem differentiation, biosynthesis of chlorophyll molecules and laticifers, cell wall metabolism and epigenetic regulations. Our VNs revealed putative key players that could be adapted in breeding strategies for J. curcas shoot system improvements.
Evaluation and Validation of Reference Genes for qRT-PCR Normalization in Frankliniella occidentalis (Thysanoptera:Thripidae)

PubMed Central

Zheng, Yu-Tao; Li, Hong-Bo; Lu, Ming-Xing; Du, Yu-Zhou

2014-01-01

Quantitative real time PCR (qRT-PCR) has emerged as a reliable and reproducible technique for studying gene expression analysis. For accurate results, the normalization of data with reference genes is particularly essential. Once the transcriptome sequencing of Frankliniella occidentalis was completed, numerous unigenes were identified and annotated. Unfortunately, there are no studies on the stability of reference genes used in F. occidentalis. In this work, seven candidate reference genes, including actin, 18S rRNA, H3, tubulin, GAPDH, EF-1 and RPL32, were evaluated for their suitability as normalization genes under different experimental conditions using the statistical software programs BestKeeper, geNorm, Normfinder and the comparative ΔCt method. Because the rankings of the reference genes provided by each of the four programs were different, we chose a user-friendly web-based comprehensive tool RefFinder to get the final ranking. The result demonstrated that EF-1 and RPL32 displayed the most stable expression in different developmental stages; RPL32 and GAPDH showed the most stable expression at high temperatures, while 18S and EF-1 exhibited the most stable expression at low temperatures. In this study, we validated the suitable reference genes in F. occidentalis for gene expression profiling under different experimental conditions. The choice of internal standard is very important in the normalization of the target gene expression levels, thus validating and selecting the best genes will help improve the quality of gene expression data of F. occidentalis. What is more, these validated reference genes could serve as the basis for the selection of candidate reference genes in other insects. PMID:25356721
Evaluation and validation of reference genes for qRT-PCR normalization in Frankliniella occidentalis (Thysanoptera: Thripidae).

PubMed

Zheng, Yu-Tao; Li, Hong-Bo; Lu, Ming-Xing; Du, Yu-Zhou

2014-01-01

Quantitative real time PCR (qRT-PCR) has emerged as a reliable and reproducible technique for studying gene expression analysis. For accurate results, the normalization of data with reference genes is particularly essential. Once the transcriptome sequencing of Frankliniella occidentalis was completed, numerous unigenes were identified and annotated. Unfortunately, there are no studies on the stability of reference genes used in F. occidentalis. In this work, seven candidate reference genes, including actin, 18S rRNA, H3, tubulin, GAPDH, EF-1 and RPL32, were evaluated for their suitability as normalization genes under different experimental conditions using the statistical software programs BestKeeper, geNorm, Normfinder and the comparative ΔCt method. Because the rankings of the reference genes provided by each of the four programs were different, we chose a user-friendly web-based comprehensive tool RefFinder to get the final ranking. The result demonstrated that EF-1 and RPL32 displayed the most stable expression in different developmental stages; RPL32 and GAPDH showed the most stable expression at high temperatures, while 18S and EF-1 exhibited the most stable expression at low temperatures. In this study, we validated the suitable reference genes in F. occidentalis for gene expression profiling under different experimental conditions. The choice of internal standard is very important in the normalization of the target gene expression levels, thus validating and selecting the best genes will help improve the quality of gene expression data of F. occidentalis. What is more, these validated reference genes could serve as the basis for the selection of candidate reference genes in other insects.
Biomine: predicting links between biological entities using network models of heterogeneous databases.

PubMed

Eronen, Lauri; Toivonen, Hannu

2012-06-06

Biological databases contain large amounts of data concerning the functions and associations of genes and proteins. Integration of data from several such databases into a single repository can aid the discovery of previously unknown connections spanning multiple types of relationships and databases. Biomine is a system that integrates cross-references from several biological databases into a graph model with multiple types of edges, such as protein interactions, gene-disease associations and gene ontology annotations. Edges are weighted based on their type, reliability, and informativeness. We present Biomine and evaluate its performance in link prediction, where the goal is to predict pairs of nodes that will be connected in the future, based on current data. In particular, we formulate protein interaction prediction and disease gene prioritization tasks as instances of link prediction. The predictions are based on a proximity measure computed on the integrated graph. We consider and experiment with several such measures, and perform a parameter optimization procedure where different edge types are weighted to optimize link prediction accuracy. We also propose a novel method for disease-gene prioritization, defined as finding a subset of candidate genes that cluster together in the graph. We experimentally evaluate Biomine by predicting future annotations in the source databases and prioritizing lists of putative disease genes. The experimental results show that Biomine has strong potential for predicting links when a set of selected candidate links is available. The predictions obtained using the entire Biomine dataset are shown to clearly outperform ones obtained using any single source of data alone, when different types of links are suitably weighted. In the gene prioritization task, an established reference set of disease-associated genes is useful, but the results show that under favorable conditions, Biomine can also perform well when no such information is available.The Biomine system is a proof of concept. Its current version contains 1.1 million entities and 8.1 million relations between them, with focus on human genetics. Some of its functionalities are available in a public query interface at http://biomine.cs.helsinki.fi, allowing searching for and visualizing connections between given biological entities.
Selection of Valid Reference Genes for Reverse Transcription Quantitative PCR Analysis in Heliconius numata (Lepidoptera: Nymphalidae)

PubMed Central

Chouteau, Mathieu; Whibley, Annabel; Joron, Mathieu; Llaurens, Violaine

2016-01-01

Identifying the genetic basis of adaptive variation is challenging in non-model organisms and quantitative real time PCR. is a useful tool for validating predictions regarding the expression of candidate genes. However, comparing expression levels in different conditions requires rigorous experimental design and statistical analyses. Here, we focused on the neotropical passion-vine butterflies Heliconius, non-model species studied in evolutionary biology for their adaptive variation in wing color patterns involved in mimicry and in the signaling of their toxicity to predators. We aimed at selecting stable reference genes to be used for normalization of gene expression data in RT-qPCR analyses from developing wing discs according to the minimal guidelines described in Minimum Information for publication of Quantitative Real-Time PCR Experiments (MIQE). To design internal RT-qPCR controls, we studied the stability of expression of nine candidate reference genes (actin, annexin, eF1α, FK506BP, PolyABP, PolyUBQ, RpL3, RPS3A, and tubulin) at two developmental stages (prepupal and pupal) using three widely used programs (GeNorm, NormFinder and BestKeeper). Results showed that, despite differences in statistical methods, genes RpL3, eF1α, polyABP, and annexin were stably expressed in wing discs in late larval and pupal stages of Heliconius numata. This combination of genes may be used as a reference for a reliable study of differential expression in wings for instance for genes involved in important phenotypic variation, such as wing color pattern variation. Through this example, we provide general useful technical recommendations as well as relevant statistical strategies for evolutionary biologists aiming to identify candidate-genes involved adaptive variation in non-model organisms. PMID:27271971
SUPERFAMILY 1.75 including a domain-centric gene ontology method.

PubMed

de Lima Morais, David A; Fang, Hai; Rackham, Owen J L; Wilson, Derek; Pethica, Ralph; Chothia, Cyrus; Gough, Julian

2011-01-01

The SUPERFAMILY resource provides protein domain assignments at the structural classification of protein (SCOP) superfamily level for over 1400 completely sequenced genomes, over 120 metagenomes and other gene collections such as UniProt. All models and assignments are available to browse and download at http://supfam.org. A new hidden Markov model library based on SCOP 1.75 has been created and a previously ignored class of SCOP, coiled coils, is now included. Our scoring component now uses HMMER3, which is in orders of magnitude faster and produces superior results. A cloud-based pipeline was implemented and is publicly available at Amazon web services elastic computer cloud. The SUPERFAMILY reference tree of life has been improved allowing the user to highlight a chosen superfamily, family or domain architecture on the tree of life. The most significant advance in SUPERFAMILY is that now it contains a domain-based gene ontology (GO) at the superfamily and family levels. A new methodology was developed to ensure a high quality GO annotation. The new methodology is general purpose and has been used to produce domain-based phenotypic ontologies in addition to GO.
Elasmobranch qPCR reference genes: a case study of hypoxia preconditioned epaulette sharks

PubMed Central

2010-01-01

Background Elasmobranch fishes are an ancient group of vertebrates which have high potential as model species for research into evolutionary physiology and genomics. However, no comparative studies have established suitable reference genes for quantitative PCR (qPCR) in elasmobranchs for any physiological conditions. Oxygen availability has been a major force shaping the physiological evolution of vertebrates, especially fishes. Here we examined the suitability of 9 reference candidates from various functional categories after a single hypoxic insult or after hypoxia preconditioning in epaulette shark (Hemiscyllium ocellatum). Results Epaulette sharks were caught and exposed to hypoxia. Tissues were collected from 10 controls, 10 individuals with single hypoxic insult and 10 individuals with hypoxia preconditioning (8 hypoxic insults, 12 hours apart). We produced sequence information for reference gene candidates and monitored mRNA expression levels in four tissues: cerebellum, heart, gill and eye. The stability of the genes was examined with analysis of variance, geNorm and NormFinder. The best ranking genes in our study were eukaryotic translation elongation factor 1 beta (eef1b), ubiquitin (ubq) and polymerase (RNA) II (DNA directed) polypeptide F (polr2f). The performance of the ribosomal protein L6 (rpl6) was tissue-dependent. Notably, in one tissue the analysis of variance indicated statistically significant differences between treatments for genes that were ranked as the most stable candidates by reference gene software. Conclusions Our results indicate that eef1b and ubq are generally the most suitable reference genes for the conditions and tissues in the present epaulette shark studies. These genes could also be potential reference gene candidates for other physiological studies examining stress in elasmobranchs. The results emphasise the importance of inter-group variation in reference gene evaluation. PMID:20416043
Selection of reference genes for quantitative real time RT-PCR during dimorphism in the zygomycete Mucor circinelloides.

PubMed

Valle-Maldonado, Marco I; Jácome-Galarza, Irvin E; Gutiérrez-Corona, Félix; Ramírez-Díaz, Martha I; Campos-García, Jesús; Meza-Carmen, Víctor

2015-03-01

Mucor circinelloides is a dimorphic fungal model for studying several biological processes including cell differentiation (yeast-mold transitions) as well as biodiesel and carotene production. The recent release of the first draft sequence of the M. circinelloides genome, combined with the availability of analytical methods to determine patterns of gene expression, such as quantitative Reverse transcription-Polymerase chain reaction (qRT-PCR), and the development of molecular genetic tools for the manipulation of the fungus, may help identify M. circinelloides gene products and analyze their relevance in different biological processes. However, no information is available on M. circinelloides genes of stable expression that could serve as internal references in qRT-PCR analyses. One approach to solve this problem consists in the use of housekeeping genes as internal references. However, validation of the usability of these reference genes is a fundamental step prior to initiating qRT-PCR assays. This work evaluates expression of several constitutive genes by qRT-PCR throughout the morphological differentiation stages of M. circinelloides; our results indicate that tfc-1 and ef-1 are the most stable genes for qRT-PCR assays during differentiation studies and they are proposed as reference genes to carry out gene expression studies in this fungus.
RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes.

PubMed

Ono, Hiromasa; Ogasawara, Osamu; Okubo, Kosaku; Bono, Hidemasa

2017-08-29

Gene expression data are exponentially accumulating; thus, the functional annotation of such sequence data from metadata is urgently required. However, life scientists have difficulty utilizing the available data due to its sheer magnitude and complicated access. We have developed a web tool for browsing reference gene expression pattern of mammalian tissues and cell lines measured using different methods, which should facilitate the reuse of the precious data archived in several public databases. The web tool is called Reference Expression dataset (RefEx), and RefEx allows users to search by the gene name, various types of IDs, chromosomal regions in genetic maps, gene family based on InterPro, gene expression patterns, or biological categories based on Gene Ontology. RefEx also provides information about genes with tissue-specific expression, and the relative gene expression values are shown as choropleth maps on 3D human body images from BodyParts3D. Combined with the newly incorporated Functional Annotation of Mammals (FANTOM) dataset, RefEx provides insight regarding the functional interpretation of unfamiliar genes. RefEx is publicly available at http://refex.dbcls.jp/.
RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes

PubMed Central

Ono, Hiromasa; Ogasawara, Osamu; Okubo, Kosaku; Bono, Hidemasa

2017-01-01

Gene expression data are exponentially accumulating; thus, the functional annotation of such sequence data from metadata is urgently required. However, life scientists have difficulty utilizing the available data due to its sheer magnitude and complicated access. We have developed a web tool for browsing reference gene expression pattern of mammalian tissues and cell lines measured using different methods, which should facilitate the reuse of the precious data archived in several public databases. The web tool is called Reference Expression dataset (RefEx), and RefEx allows users to search by the gene name, various types of IDs, chromosomal regions in genetic maps, gene family based on InterPro, gene expression patterns, or biological categories based on Gene Ontology. RefEx also provides information about genes with tissue-specific expression, and the relative gene expression values are shown as choropleth maps on 3D human body images from BodyParts3D. Combined with the newly incorporated Functional Annotation of Mammals (FANTOM) dataset, RefEx provides insight regarding the functional interpretation of unfamiliar genes. RefEx is publicly available at http://refex.dbcls.jp/. PMID:28850115
Identification and Validation of Reference Genes and Their Impact on Normalized Gene Expression Studies across Cultivated and Wild Cicer Species

PubMed Central

Reddy, Palakolanu Sudhakar; Sri Cindhuri, Katamreddy; Sivaji Ganesh, Adusumalli; Sharma, Kiran Kumar

2016-01-01

Quantitative Real-Time PCR (qPCR) is a preferred and reliable method for accurate quantification of gene expression to understand precise gene functions. A total of 25 candidate reference genes including traditional and new generation reference genes were selected and evaluated in a diverse set of chickpea samples. The samples used in this study included nine chickpea genotypes (Cicer spp.) comprising of cultivated and wild species, six abiotic stress treatments (drought, salinity, high vapor pressure deficit, abscisic acid, cold and heat shock), and five diverse tissues (leaf, root, flower, seedlings and seed). The geNorm, NormFinder and RefFinder algorithms used to identify stably expressed genes in four sample sets revealed stable expression of UCP and G6PD genes across genotypes, while TIP41 and CAC were highly stable under abiotic stress conditions. While PP2A and ABCT genes were ranked as best for different tissues, ABCT, UCP and CAC were most stable across all samples. This study demonstrated the usefulness of new generation reference genes for more accurate qPCR based gene expression quantification in cultivated as well as wild chickpea species. Validation of the best reference genes was carried out by studying their impact on normalization of aquaporin genes PIP1;4 and TIP3;1, in three contrasting chickpea genotypes under high vapor pressure deficit (VPD) treatment. The chickpea TIP3;1 gene got significantly up regulated under high VPD conditions with higher relative expression in the drought susceptible genotype, confirming the suitability of the selected reference genes for expression analysis. This is the first comprehensive study on the stability of the new generation reference genes for qPCR studies in chickpea across species, different tissues and abiotic stresses. PMID:26863232
Identification and Validation of Reference Genes and Their Impact on Normalized Gene Expression Studies across Cultivated and Wild Cicer Species.

PubMed

Reddy, Dumbala Srinivas; Bhatnagar-Mathur, Pooja; Reddy, Palakolanu Sudhakar; Sri Cindhuri, Katamreddy; Sivaji Ganesh, Adusumalli; Sharma, Kiran Kumar

2016-01-01

Quantitative Real-Time PCR (qPCR) is a preferred and reliable method for accurate quantification of gene expression to understand precise gene functions. A total of 25 candidate reference genes including traditional and new generation reference genes were selected and evaluated in a diverse set of chickpea samples. The samples used in this study included nine chickpea genotypes (Cicer spp.) comprising of cultivated and wild species, six abiotic stress treatments (drought, salinity, high vapor pressure deficit, abscisic acid, cold and heat shock), and five diverse tissues (leaf, root, flower, seedlings and seed). The geNorm, NormFinder and RefFinder algorithms used to identify stably expressed genes in four sample sets revealed stable expression of UCP and G6PD genes across genotypes, while TIP41 and CAC were highly stable under abiotic stress conditions. While PP2A and ABCT genes were ranked as best for different tissues, ABCT, UCP and CAC were most stable across all samples. This study demonstrated the usefulness of new generation reference genes for more accurate qPCR based gene expression quantification in cultivated as well as wild chickpea species. Validation of the best reference genes was carried out by studying their impact on normalization of aquaporin genes PIP1;4 and TIP3;1, in three contrasting chickpea genotypes under high vapor pressure deficit (VPD) treatment. The chickpea TIP3;1 gene got significantly up regulated under high VPD conditions with higher relative expression in the drought susceptible genotype, confirming the suitability of the selected reference genes for expression analysis. This is the first comprehensive study on the stability of the new generation reference genes for qPCR studies in chickpea across species, different tissues and abiotic stresses.
Selection of reference genes for quantitative real-time PCR normalization in Panax ginseng at different stages of growth and in different organs.

PubMed

Liu, Jing; Wang, Qun; Sun, Minying; Zhu, Linlin; Yang, Michael; Zhao, Yu

2014-01-01

Quantitative real-time reverse transcription PCR (qRT-PCR) has become a widely used method for gene expression analysis; however, its data interpretation largely depends on the stability of reference genes. The transcriptomics of Panax ginseng, one of the most popular and traditional ingredients used in Chinese medicines, is increasingly being studied. Furthermore, it is vital to establish a series of reliable reference genes when qRT-PCR is used to assess the gene expression profile of ginseng. In this study, we screened out candidate reference genes for ginseng using gene expression data generated by a high-throughput sequencing platform. Based on the statistical tests, 20 reference genes (10 traditional housekeeping genes and 10 novel genes) were selected. These genes were tested for the normalization of expression levels in five growth stages and three distinct plant organs of ginseng by qPCR. These genes were subsequently ranked and compared according to the stability of their expressions using geNorm, NormFinder, and BestKeeper computational programs. Although the best reference genes were found to vary across different samples, CYP and EF-1α were the most stable genes amongst all samples. GAPDH/30S RPS20, CYP/60S RPL13 and CYP/QCR were the optimum pair of reference genes in the roots, stems, and leaves. CYP/60S RPL13, CYP/eIF-5A, aTUB/V-ATP, eIF-5A/SAR1, and aTUB/pol IIa were the most stably expressed combinations in each of the five developmental stages. Our study serves as a foundation for developing an accurate method of qRT-PCR and will benefit future studies on gene expression profiles of Panax Ginseng.

PhytoPath: an integrative resource for plant pathogen genomics.

PubMed

Pedro, Helder; Maheswari, Uma; Urban, Martin; Irvine, Alistair George; Cuzick, Alayne; McDowall, Mark D; Staines, Daniel M; Kulesha, Eugene; Hammond-Kosack, Kim Elizabeth; Kersey, Paul Julian

2016-01-04

PhytoPath (www.phytopathdb.org) is a resource for genomic and phenotypic data from plant pathogen species, that integrates phenotypic data for genes from PHI-base, an expertly curated catalog of genes with experimentally verified pathogenicity, with the Ensembl tools for data visualization and analysis. The resource is focused on fungi, protists (oomycetes) and bacterial plant pathogens that have genomes that have been sequenced and annotated. Genes with associated PHI-base data can be easily identified across all plant pathogen species using a BioMart-based query tool and visualized in their genomic context on the Ensembl genome browser. The PhytoPath resource contains data for 135 genomic sequences from 87 plant pathogen species, and 1364 genes curated for their role in pathogenicity and as targets for chemical intervention. Support for community annotation of gene models is provided using the WebApollo online gene editor, and we are working with interested communities to improve reference annotation for selected species. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Selection and Validation of Appropriate Reference Genes for qRT-PCR Analysis in Isatis indigotica Fort.

PubMed Central

Li, Tao; Wang, Jing; Lu, Miao; Zhang, Tianyi; Qu, Xinyun; Wang, Zhezhi

2017-01-01

Due to its sensitivity and specificity, real-time quantitative PCR (qRT-PCR) is a popular technique for investigating gene expression levels in plants. Based on the Minimum Information for Publication of Real-Time Quantitative PCR Experiments (MIQE) guidelines, it is necessary to select and validate putative appropriate reference genes for qRT-PCR normalization. In the current study, three algorithms, geNorm, NormFinder, and BestKeeper, were applied to assess the expression stability of 10 candidate reference genes across five different tissues and three different abiotic stresses in Isatis indigotica Fort. Additionally, the IiYUC6 gene associated with IAA biosynthesis was applied to validate the candidate reference genes. The analysis results of the geNorm, NormFinder, and BestKeeper algorithms indicated certain differences for the different sample sets and different experiment conditions. Considering all of the algorithms, PP2A-4 and TUB4 were recommended as the most stable reference genes for total and different tissue samples, respectively. Moreover, RPL15 and PP2A-4 were considered to be the most suitable reference genes for abiotic stress treatments. The obtained experimental results might contribute to improved accuracy and credibility for the expression levels of target genes by qRT-PCR normalization in I. indigotica. PMID:28702046
High-throughput comparison, functional annotation, and metabolic modeling of plant genomes using the PlantSEED resource

PubMed Central

Seaver, Samuel M. D.; Gerdes, Svetlana; Frelin, Océane; Lerma-Ortiz, Claudia; Bradbury, Louis M. T.; Zallot, Rémi; Hasnain, Ghulam; Niehaus, Thomas D.; El Yacoubi, Basma; Pasternak, Shiran; Olson, Robert; Pusch, Gordon; Overbeek, Ross; Stevens, Rick; de Crécy-Lagard, Valérie; Ware, Doreen; Hanson, Andrew D.; Henry, Christopher S.

2014-01-01

The increasing number of sequenced plant genomes is placing new demands on the methods applied to analyze, annotate, and model these genomes. Today’s annotation pipelines result in inconsistent gene assignments that complicate comparative analyses and prevent efficient construction of metabolic models. To overcome these problems, we have developed the PlantSEED, an integrated, metabolism-centric database to support subsystems-based annotation and metabolic model reconstruction for plant genomes. PlantSEED combines SEED subsystems technology, first developed for microbial genomes, with refined protein families and biochemical data to assign fully consistent functional annotations to orthologous genes, particularly those encoding primary metabolic pathways. Seamless integration with its parent, the prokaryotic SEED database, makes PlantSEED a unique environment for cross-kingdom comparative analysis of plant and bacterial genomes. The consistent annotations imposed by PlantSEED permit rapid reconstruction and modeling of primary metabolism for all plant genomes in the database. This feature opens the unique possibility of model-based assessment of the completeness and accuracy of gene annotation and thus allows computational identification of genes and pathways that are restricted to certain genomes or need better curation. We demonstrate the PlantSEED system by producing consistent annotations for 10 reference genomes. We also produce a functioning metabolic model for each genome, gapfilling to identify missing annotations and proposing gene candidates for missing annotations. Models are built around an extended biomass composition representing the most comprehensive published to date. To our knowledge, our models are the first to be published for seven of the genomes analyzed. PMID:24927599
High-throughput comparison, functional annotation, and metabolic modeling of plant genomes using the PlantSEED resource.

PubMed

Seaver, Samuel M D; Gerdes, Svetlana; Frelin, Océane; Lerma-Ortiz, Claudia; Bradbury, Louis M T; Zallot, Rémi; Hasnain, Ghulam; Niehaus, Thomas D; El Yacoubi, Basma; Pasternak, Shiran; Olson, Robert; Pusch, Gordon; Overbeek, Ross; Stevens, Rick; de Crécy-Lagard, Valérie; Ware, Doreen; Hanson, Andrew D; Henry, Christopher S

2014-07-01

The increasing number of sequenced plant genomes is placing new demands on the methods applied to analyze, annotate, and model these genomes. Today's annotation pipelines result in inconsistent gene assignments that complicate comparative analyses and prevent efficient construction of metabolic models. To overcome these problems, we have developed the PlantSEED, an integrated, metabolism-centric database to support subsystems-based annotation and metabolic model reconstruction for plant genomes. PlantSEED combines SEED subsystems technology, first developed for microbial genomes, with refined protein families and biochemical data to assign fully consistent functional annotations to orthologous genes, particularly those encoding primary metabolic pathways. Seamless integration with its parent, the prokaryotic SEED database, makes PlantSEED a unique environment for cross-kingdom comparative analysis of plant and bacterial genomes. The consistent annotations imposed by PlantSEED permit rapid reconstruction and modeling of primary metabolism for all plant genomes in the database. This feature opens the unique possibility of model-based assessment of the completeness and accuracy of gene annotation and thus allows computational identification of genes and pathways that are restricted to certain genomes or need better curation. We demonstrate the PlantSEED system by producing consistent annotations for 10 reference genomes. We also produce a functioning metabolic model for each genome, gapfilling to identify missing annotations and proposing gene candidates for missing annotations. Models are built around an extended biomass composition representing the most comprehensive published to date. To our knowledge, our models are the first to be published for seven of the genomes analyzed.
Reference genes for real-time PCR quantification of messenger RNAs and microRNAs in mouse model of obesity.

PubMed

Matoušková, Petra; Bártíková, Hana; Boušová, Iva; Hanušová, Veronika; Szotáková, Barbora; Skálová, Lenka

2014-01-01

Obesity and metabolic syndrome is increasing health problem worldwide. Among other ways, nutritional intervention using phytochemicals is important method for treatment and prevention of this disease. Recent studies have shown that certain phytochemicals could alter the expression of specific genes and microRNAs (miRNAs) that play a fundamental role in the pathogenesis of obesity. For study of the obesity and its treatment, monosodium glutamate (MSG)-injected mice with developed central obesity, insulin resistance and liver lipid accumulation are frequently used animal models. To understand the mechanism of phytochemicals action in obese animals, the study of selected genes expression together with miRNA quantification is extremely important. For this purpose, real-time quantitative PCR is a sensitive and reproducible method, but it depends on proper normalization entirely. The aim of present study was to identify the appropriate reference genes for mRNA and miRNA quantification in MSG mice treated with green tea catechins, potential anti-obesity phytochemicals. Two sets of reference genes were tested: first set contained seven commonly used genes for normalization of messenger RNA, the second set of candidate reference genes included ten small RNAs for normalization of miRNA. The expression stability of these reference genes were tested upon treatment of mice with catechins using geNorm, NormFinder and BestKeeper algorithms. Selected normalizers for mRNA quantification were tested and validated on expression of quinone oxidoreductase, biotransformation enzyme known to be modified by catechins. The effect of selected normalizers for miRNA quantification was tested on two obesity- and diabetes- related miRNAs, miR-221 and miR-29b, respectively. Finally, the combinations of B2M/18S/HPRT1 and miR-16/sno234 were validated as optimal reference genes for mRNA and miRNA quantification in liver and 18S/RPlP0/HPRT1 and sno234/miR-186 in small intestine of MSG mice. These reference genes will be used for mRNA and miRNA normalization in further study of green tea catechins action in obese mice.
Identification and Evaluation of Reliable Reference Genes in the Medicinal Fungus Shiraia bambusicola.

PubMed

Song, Liang; Li, Tong; Fan, Li; Shen, Xiao-Ye; Hou, Cheng-Lin

2016-04-01

The stability of reference genes plays a vital role in real-time quantitative reverse transcription polymerase chain reaction (qRT-PCR) analysis, which is generally regarded as a convenient and sensitive tool for the analysis of gene expression. A well-known medicinal fungus, Shiraia bambusicola, has great potential in the pharmaceutical, agricultural and food industries, but its suitable reference genes have not yet been determined. In the present study, 11 candidate reference genes in S. bambusicola were first evaluated and validated comprehensively. To identify the suitable reference genes for qRT-PCR analysis, three software-based algorithms, geNorm, NormFinder and Best Keeper, were applied to rank the tested genes. RNA samples were collected from seven fermentation stages using different media (potato dextrose or Czapek medium) and under different light conditions (12-h light/12-h dark and all-dark). The three most appropriate reference genes, ubi, tfc and ags, were able to normalize the qRT-PCR results under the culturing conditions of 12-h light/12-h dark, whereas the other three genes, vac, gke and acyl, performed better in the culturing conditions of all-dark growth. Therefore, under different light conditions, at least two reference genes (ubi and vac) could be employed to assure the reliability of qRT-PCR results. For both the natural culture medium (the most appropriate genes of this group: ubi, tfc and ags) and the chemically defined synthetic medium (the most stable genes of this group: tfc, vac and ef), the tfc gene remained the best gene used for normalizing the gene expression found with qRT-PCR. It is anticipated that these results would improve the selection of suitable reference genes for qRT-PCR assays and lay the foundation for an accurate analysis of gene expression in S. bambusicola.
Numerical Modelling Of The V-J Combinations Of The T Cell Receptor TRA/TRD Locus

PubMed Central

Dariz, Aurélie; Baum, Thierry Pascal; Hierle, Vivien; Demongeot, Jacques; Marche, Patrice Noël; Jouvin-Marche, Evelyne

2010-01-01

T-Cell antigen Receptor (TR) repertoire is generated through rearrangements of V and J genes encoding α and β chains. The quantification and frequency for every V-J combination during ontogeny and development of the immune system remain to be precisely established. We have addressed this issue by building a model able to account for Vα-Jα gene rearrangements during thymus development of mice. So we developed a numerical model on the whole TRA/TRD locus, based on experimental data, to estimate how Vα and Jα genes become accessible to rearrangements. The progressive opening of the locus to V-J gene recombinations is modeled through windows of accessibility of different sizes and with different speeds of progression. Furthermore, the possibility of successive secondary V-J rearrangements was included in the modelling. The model points out some unbalanced V-J associations resulting from a preferential access to gene rearrangements and from a non-uniform partition of the accessibility of the J genes, depending on their location in the locus. The model shows that 3 to 4 successive rearrangements are sufficient to explain the use of all the V and J genes of the locus. Finally, the model provides information on both the kinetics of rearrangements and frequencies of each V-J associations. The model accounts for the essential features of the observed rearrangements on the TRA/TRD locus and may provide a reference for the repertoire of the V-J combinatorial diversity. PMID:20174554
Stability of Reference Gene Expression After Porcine Sapelovirus Infection in Porcine Intestinal Epithelial Cells.

PubMed

Huang, Yong; Chen, Yabing; Sun, Huan; Lan, Daoliang

2016-01-01

Intestinal epithelial cells, which serve as the first physical barrier to protect intestinal tract from external antigens, have an important role in the local innate immunity. Screening of reference genes that have stable expression levels after viral infection in porcine intestinal epithelial cells is critical for ensuring the reliability of the expression analysis on anti-infection genes in porcine intestinal epithelial cells. In this study, nine common reference genes in pigs, including ACTB, B2M, GAPDH, HMBS, SDHA, HPRT1, TBP, YWHAZ, and RPL32, were chosen as the candidate reference genes. Porcine sapelovirus (PSV) was used as a model virus to infect porcine intestinal epithelial cell line (IPEC-J2). The expression stability of the nine genes was assessed by the geNorm, NormFinder, and BestKeeper software. Moreover, RefFinder program was used to evaluate the analytical results of above three softwares, and a relative expression experiment of selected target gene was used to verify the analysis results. The comprehensive results indicated that the gene combination of TBP and RPL32 has the most stable expression, which could be considered as an appropriate reference gene for research on gene expression after PSV infection in IPEC-J2cells. The results provided essential data for expression analysis of anti-infection genes in porcine intestinal epithelial cells.
Reliable reference genes for normalization of gene expression data in tea plants (Camellia sinensis) exposed to metal stresses.

PubMed

Wang, Ming-Le; Li, Qing-Hui; Xin, Hua-Hong; Chen, Xuan; Zhu, Xu-Jun; Li, Xing-Hui

2017-01-01

Tea plants [Camellia sinensis (L.) O. Kuntze] are an important leaf-type crop that are widely used for the production of non-alcoholic beverages in the world. Exposure to excessive amounts of heavy metals adversely affects the quality and yield of tea leaves. To analyze the molecular responses of tea plants to heavy metals, a reliable quantification of gene expression is important and of major importance herein is the normalization of the measured expression levels for the target genes. Ideally, stably expressed reference genes should be evaluated in all experimental systems. In this study, 12 candidate reference genes (i.e., 18S rRNA, Actin, CYP, EF-1α, eIF-4α, GAPDH, MON1, PP2AA3, TBP, TIP41, TUA, and UBC) were cloned from tea plants, and the stability of their expression was examined systematically in 60 samples exposed to diverse heavy metals (i.e., manganese, aluminum, copper, iron, and zinc). Three Excel-based algorithms (geNorm, NormFinder, and BestKeeper) were used to evaluate the expression stability of these genes. PP2AA3 and 18S rRNA were the most stably expressed genes, even though their expression profiles exhibited some variability. Moreover, commonly used reference genes (i.e., GAPDH and TBP) were the least appropriate reference genes for most samples. To further validate the suitability of the analyzed reference genes, the expression level of a phytochelatin synthase gene (i.e., CsPCS1) was determined using the putative reference genes for data normalizations. Our results may be beneficial for future studies involving the quantification of relative gene expression levels in tea plants.
Reliable reference genes for normalization of gene expression data in tea plants (Camellia sinensis) exposed to metal stresses

PubMed Central

Wang, Ming-Le; Li, Qing-Hui; Xin, Hua-Hong; Chen, Xuan; Zhu, Xu-Jun

2017-01-01

Tea plants [Camellia sinensis (L.) O. Kuntze] are an important leaf-type crop that are widely used for the production of non-alcoholic beverages in the world. Exposure to excessive amounts of heavy metals adversely affects the quality and yield of tea leaves. To analyze the molecular responses of tea plants to heavy metals, a reliable quantification of gene expression is important and of major importance herein is the normalization of the measured expression levels for the target genes. Ideally, stably expressed reference genes should be evaluated in all experimental systems. In this study, 12 candidate reference genes (i.e., 18S rRNA, Actin, CYP, EF-1α, eIF-4α, GAPDH, MON1, PP2AA3, TBP, TIP41, TUA, and UBC) were cloned from tea plants, and the stability of their expression was examined systematically in 60 samples exposed to diverse heavy metals (i.e., manganese, aluminum, copper, iron, and zinc). Three Excel-based algorithms (geNorm, NormFinder, and BestKeeper) were used to evaluate the expression stability of these genes. PP2AA3 and 18S rRNA were the most stably expressed genes, even though their expression profiles exhibited some variability. Moreover, commonly used reference genes (i.e., GAPDH and TBP) were the least appropriate reference genes for most samples. To further validate the suitability of the analyzed reference genes, the expression level of a phytochelatin synthase gene (i.e., CsPCS1) was determined using the putative reference genes for data normalizations. Our results may be beneficial for future studies involving the quantification of relative gene expression levels in tea plants. PMID:28453515
Innovative design method of automobile profile based on Fourier descriptor

NASA Astrophysics Data System (ADS)

Gao, Shuyong; Fu, Chaoxing; Xia, Fan; Shen, Wei

2017-10-01

Aiming at the innovation of the contours of automobile side, this paper presents an innovative design method of vehicle side profile based on Fourier descriptor. The design flow of this design method is: pre-processing, coordinate extraction, standardization, discrete Fourier transform, simplified Fourier descriptor, exchange descriptor innovation, inverse Fourier transform to get the outline of innovative design. Innovative concepts of the innovative methods of gene exchange among species and the innovative methods of gene exchange among different species are presented, and the contours of the innovative design are obtained separately. A three-dimensional model of a car is obtained by referring to the profile curve which is obtained by exchanging xenogeneic genes. The feasibility of the method proposed in this paper is verified by various aspects.
A de novo transcriptome and valid reference genes for quantitative real-time PCR in Colaphellus bowringi.

PubMed

Tan, Qian-Qian; Zhu, Li; Li, Yi; Liu, Wen; Ma, Wei-Hua; Lei, Chao-Liang; Wang, Xiao-Ping

2015-01-01

The cabbage beetle Colaphellus bowringi Baly is a serious insect pest of crucifers and undergoes reproductive diapause in soil. An understanding of the molecular mechanisms of diapause regulation, insecticide resistance, and other physiological processes is helpful for developing new management strategies for this beetle. However, the lack of genomic information and valid reference genes limits knowledge on the molecular bases of these physiological processes in this species. Using Illumina sequencing, we obtained more than 57 million sequence reads derived from C. bowringi, which were assembled into 39,390 unique sequences. A Clusters of Orthologous Groups classification was obtained for 9,048 of these sequences, covering 25 categories, and 16,951 were assigned to 255 Kyoto Encyclopedia of Genes and Genomes pathways. Eleven candidate reference gene sequences from the transcriptome were then identified through reverse transcriptase polymerase chain reaction. Among these candidate genes, EF1α, ACT1, and RPL19 proved to be the most stable reference genes for different reverse transcriptase quantitative polymerase chain reaction experiments in C. bowringi. Conversely, aTUB and GAPDH were the least stable reference genes. The abundant putative C. bowringi transcript sequences reported enrich the genomic resources of this beetle. Importantly, the larger number of gene sequences and valid reference genes provide a valuable platform for future gene expression studies, especially with regard to exploring the molecular mechanisms of different physiological processes in this species.
Reference gene stability of a synanthropic fly, Chrysomya megacephala.

PubMed

Wang, Xiaoyun; Xiong, Mei; Wang, Jialu; Lei, Chaoliang; Zhu, Fen

2015-10-29

Stable reference genes are essential for accurate normalization in gene expression studies with reverse transcription quantitative polymerase chain reaction (qPCR). A synanthropic fly, Chrysomya megacephala, is a well known medical vector and forensic indicator. Unfortunately, previous studies did not look at the stability of reference genes used in C. megacephala. In this study, the expression level of Actin, ribosomal protein L8 (Rpl8), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), elongation factor 1α (EF1), α-tubulin (α-TUB), β-tubulin (β-TUB), TATA binding box (TBP), 18S rRNA (18S) and ribosomal protein S7 (Rps7) were evaluated for their stability using online software RefFinder, which combines the normal software of the ΔCt method, BestKeeper, Normfinder, and geNorm. Moreover the number of suitable reference gene pairs was also suggested by Excel-based geNorm. The expression levels of these reference genes were evaluated under different experimental conditions with special perspectives of forensic applications: developmental stages (eggs, first, second and third instar larvae, pupae and adults); food sources of larvae (pork, fish and chicken); feeding larvae with drugs (untreated control, Estazolam and Marvelon); feeding larvae with heavy metals (untreated control, cadmium and zinc); tissues of adults (head, thorax, abdomen, legs and wings). According to RefFinder, EF1 was the most suitable reference gene of developmental stages, food and tissues; 18S and GAPDH were the most suitable reference genes for drugs and heavy metals, respectively, which could be widely used for quantification of target gene expression with qPCR in C. megacephala. Suitable reference gene pairs were also suggested by geNorm. This fundamental but vital work should facilitate the gene studies of related biological processes and deepen the understanding in physiology, toxicology, and especially medical and forensic entomology of C. megacephala.
A new specific reference gene based on growth hormone gene (GH1) used for detection and relative quantification of Aquadvantage® GM salmon (Salmo salar L.) in food products.

PubMed

Hafsa, Ahmed Ben; Nabi, Nesrine; Zellama, Mohamed Salem; Said, Khaled; Chaouachi, Maher

2016-01-01

Genetic transformation of fish is mainly oriented towards the improvement of growth for the benefit of the aquaculture. Actually, Atlantic salmon (Salmo salar) is the species most transformed to achieve growth rates quite large compared to the wild. To anticipate the presence of contaminations with GM salmon in fish markets and the lack of labeling regulations with a mandatory threshold, the proper methods are needed to test the authenticity of the ingredients. A quantitative real-time polymerase chain reaction (QRT-PCR) method was used in this study. Ct values were obtained and validated using 15 processed food containing salmon. The relative and absolute limits of detection were 0.01% and 0.01 ng/μl of genomic DNA, respectively. Results demonstrate that the developed QRT-PCR method is suitable specifically for identification of S. salar in food ingredients based on the salmon growth hormone gene 1 (GH1). The processes used to develop the specific salmon reference gene case study are intended to serve as a model for performing quantification of Aquadvantage® GM salmon on future genetically modified (GM) fish to be commercialized. Copyright © 2015 Elsevier Ltd. All rights reserved.
Validation of reference genes for gene expression analysis in olive (Olea europaea) mesocarp tissue by quantitative real-time RT-PCR

PubMed Central

2014-01-01

Background Gene expression analysis using quantitative reverse transcription PCR (qRT-PCR) is a robust method wherein the expression levels of target genes are normalised using internal control genes, known as reference genes, to derive changes in gene expression levels. Although reference genes have recently been suggested for olive tissues, combined/independent analysis on different cultivars has not yet been tested. Therefore, an assessment of reference genes was required to validate the recent findings and select stably expressed genes across different olive cultivars. Results A total of eight candidate reference genes [glyceraldehyde 3-phosphate dehydrogenase (GAPDH), serine/threonine-protein phosphatase catalytic subunit (PP2A), elongation factor 1 alpha (EF1-alpha), polyubiquitin (OUB2), aquaporin tonoplast intrinsic protein (TIP2), tubulin alpha (TUBA), 60S ribosomal protein L18-3 (60S RBP L18-3) and polypyrimidine tract-binding protein homolog 3 (PTB)] were chosen based on their stability in olive tissues as well as in other plants. Expression stability was examined by qRT-PCR across 12 biological samples, representing mesocarp tissues at various developmental stages in three different olive cultivars, Barnea, Frantoio and Picual, independently and together during the 2009 season with two software programs, GeNorm and BestKeeper. Both software packages identified GAPDH, EF1-alpha and PP2A as the three most stable reference genes across the three cultivars and in the cultivar, Barnea. GAPDH, EF1-alpha and 60S RBP L18-3 were found to be most stable reference genes in the cultivar Frantoio while 60S RBP L18-3, OUB2 and PP2A were found to be most stable reference genes in the cultivar Picual. Conclusions The analyses of expression stability of reference genes using qRT-PCR revealed that GAPDH, EF1-alpha, PP2A, 60S RBP L18-3 and OUB2 are suitable reference genes for expression analysis in developing Olea europaea mesocarp tissues, displaying the highest level of expression stability across three different olive cultivars, Barnea, Frantoio and Picual, however the combination of the three most stable reference genes do vary amongst individual cultivars. This study will provide guidance to other researchers to select reference genes for normalization against target genes by qPCR across tissues obtained from the mesocarp region of the olive fruit in the cultivars, Barnea, Frantoio and Picual. PMID:24884716
aes, the gene encoding the esterase B in Escherichia coli, is a powerful phylogenetic marker of the species.

PubMed

Lescat, Mathilde; Hoede, Claire; Clermont, Olivier; Garry, Louis; Darlu, Pierre; Tuffery, Pierre; Denamur, Erick; Picard, Bertrand

2009-12-29

Previous studies have established a correlation between electrophoretic polymorphism of esterase B, and virulence and phylogeny of Escherichia coli. Strains belonging to the phylogenetic group B2 are more frequently implicated in extraintestinal infections and include esterase B2 variants, whereas phylogenetic groups A, B1 and D contain less virulent strains and include esterase B1 variants. We investigated esterase B as a marker of phylogeny and/or virulence, in a thorough analysis of the esterase B-encoding gene. We identified the gene encoding esterase B as the acetyl-esterase gene (aes) using gene disruption. The analysis of aes nucleotide sequences in a panel of 78 reference strains, including the E. coli reference (ECOR) strains, demonstrated that the gene is under purifying selection. The phylogenetic tree reconstructed from aes sequences showed a strong correlation with the species phylogenetic history, based on multi-locus sequence typing using six housekeeping genes. The unambiguous distinction between variants B1 and B2 by electrophoresis was consistent with Aes amino-acid sequence analysis and protein modelling, which showed that substituted amino acids in the two esterase B variants occurred mostly at different sites on the protein surface. Studies in an experimental mouse model of septicaemia using mutant strains did not reveal a direct link between aes and extraintestinal virulence. Moreover, we did not find any genes in the chromosomal region of aes to be associated with virulence. Our findings suggest that aes does not play a direct role in the virulence of E. coli extraintestinal infection. However, this gene acts as a powerful marker of phylogeny, illustrating the extensive divergence of B2 phylogenetic group strains from the rest of the species.
Identifying clinically relevant drug resistance genes in drug-induced resistant cancer cell lines and post-chemotherapy tissues.

PubMed

Tong, Mengsha; Zheng, Weicheng; Lu, Xingrong; Ao, Lu; Li, Xiangyu; Guan, Qingzhou; Cai, Hao; Li, Mengyao; Yan, Haidan; Guo, You; Chi, Pan; Guo, Zheng

2015-12-01

Until recently, few molecular signatures of drug resistance identified in drug-induced resistant cancer cell models can be translated into clinical practice. Here, we defined differentially expressed genes (DEGs) between pre-chemotherapy colorectal cancer (CRC) tissue samples of non-responders and responders for 5-fluorouracil and oxaliplatin-based therapy as clinically relevant drug resistance genes (CRG5-FU/L-OHP). Taking CRG5-FU/L-OHP as reference, we evaluated the clinical relevance of several types of genes derived from HCT116 CRC cells with resistance to 5-fluorouracil and oxaliplatin, respectively. The results revealed that DEGs between parental and resistant cells, when both were treated with the corresponding drug for a certain time, were significantly consistent with the CRG5-FU/L-OHP as well as the DEGs between the post-chemotherapy CRC specimens of responders and non-responders. This study suggests a novel strategy to extract clinically relevant drug resistance genes from both drug-induced resistant cell models and post-chemotherapy cancer tissue specimens.
Validating internal controls for quantitative plant gene expression studies.

PubMed

Brunner, Amy M; Yakovlev, Igor A; Strauss, Steven H

2004-08-18

Real-time reverse transcription PCR (RT-PCR) has greatly improved the ease and sensitivity of quantitative gene expression studies. However, accurate measurement of gene expression with this method relies on the choice of a valid reference for data normalization. Studies rarely verify that gene expression levels for reference genes are adequately consistent among the samples used, nor compare alternative genes to assess which are most reliable for the experimental conditions analyzed. Using real-time RT-PCR to study the expression of 10 poplar (genus Populus) housekeeping genes, we demonstrate a simple method for determining the degree of stability of gene expression over a set of experimental conditions. Based on a traditional method for analyzing the stability of varieties in plant breeding, it defines measures of gene expression stability from analysis of variance (ANOVA) and linear regression. We found that the potential internal control genes differed widely in their expression stability over the different tissues, developmental stages and environmental conditions studied. Our results support that quantitative comparisons of candidate reference genes are an important part of real-time RT-PCR studies that seek to precisely evaluate variation in gene expression. The method we demonstrated facilitates statistical and graphical evaluation of gene expression stability. Selection of the best reference gene for a given set of experimental conditions should enable detection of biologically significant changes in gene expression that are too small to be revealed by less precise methods, or when highly variable reference genes are unknowingly used in real-time RT-PCR experiments.
Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.).

PubMed

Taylor, Candy M; Jost, Ricarda; Erskine, William; Nelson, Matthew N

2016-01-01

Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop) using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other potentially more suitable reference genes will be identified for this species in future.
Identifying Stable Reference Genes for qRT-PCR Normalisation in Gene Expression Studies of Narrow-Leafed Lupin (Lupinus angustifolius L.)

PubMed Central

Erskine, William; Nelson, Matthew N.

2016-01-01

Quantitative Reverse Transcription PCR (qRT-PCR) is currently one of the most popular, high-throughput and sensitive technologies available for quantifying gene expression. Its accurate application depends heavily upon normalisation of gene-of-interest data with reference genes that are uniformly expressed under experimental conditions. The aim of this study was to provide the first validation of reference genes for Lupinus angustifolius (narrow-leafed lupin, a significant grain legume crop) using a selection of seven genes previously trialed as reference genes for the model legume, Medicago truncatula. In a preliminary evaluation, the seven candidate reference genes were assessed on the basis of primer specificity for their respective targeted region, PCR amplification efficiency, and ability to discriminate between cDNA and gDNA. Following this assessment, expression of the three most promising candidates [Ubiquitin C (UBC), Helicase (HEL), and Polypyrimidine tract-binding protein (PTB)] was evaluated using the NormFinder and RefFinder statistical algorithms in two narrow-leafed lupin lines, both with and without vernalisation treatment, and across seven organ types (cotyledons, stem, leaves, shoot apical meristem, flowers, pods and roots) encompassing three developmental stages. UBC was consistently identified as the most stable candidate and has sufficiently uniform expression that it may be used as a sole reference gene under the experimental conditions tested here. However, as organ type and developmental stage were associated with greater variability in relative expression, it is recommended using UBC and HEL as a pair to achieve optimal normalisation. These results highlight the importance of rigorously assessing candidate reference genes for each species across a diverse range of organs and developmental stages. With emerging technologies, such as RNAseq, and the completion of valuable transcriptome data sets, it is possible that other potentially more suitable reference genes will be identified for this species in future. PMID:26872362

Detection of a Bacteriophage Gene Encoding a Mu-like Portal Protein in Haemophilus parasuis Reference Strains and Field Isolates by Nested Polymerase Chain Reaction

USDA-ARS?s Scientific Manuscript database

A nested PCR assay was developed to determine the presence of a gene encoding a bacteriophage Mu-like portal protein, gp29, in 15 reference strains and 31 field isolates of Haemophilus parasuis. Specific primers, based on the gene’s sequence, were utilized. A majority of the virulent reference strai...
RPL13A and EEF1A1 Are Suitable Reference Genes for qPCR during Adipocyte Differentiation of Vascular Stromal Cells from Patients with Different BMI and HOMA-IR.

PubMed

Gentile, Adriana-Mariel; Lhamyani, Said; Coín-Aragüez, Leticia; Oliva-Olivera, Wilfredo; Zayed, Hatem; Vega-Rioja, Antonio; Monteseirin, Javier; Romero-Zerbo, Silvana-Yanina; Tinahones, Francisco-José; Bermúdez-Silva, Francisco-Javier; El Bekay, Rajaa

2016-01-01

Real-time or quantitative PCR (qPCR) is a useful technique that requires reliable reference genes for data normalization in gene expression analysis. Adipogenesis is among the biological processes suitable for this technique. The selection of adequate reference genes is essential for qPCR gene expression analysis of human Vascular Stromal Cells (hVSCs) during their differentiation into adipocytes. To the best of our knowledge, there are no studies validating reference genes for the analyses of visceral and subcutaneous adipose tissue hVSCs from subjects with different Body Mass Index (BMI) and Homeostatic Model Assessment of Insulin Resistance (HOMA-IR) index. The present study was undertaken to analyze this question. We first analyzed the stability of expression of five potential reference genes: CYC, GAPDH, RPL13A, EEF1A1, and 18S ribosomal RNA, during in vitro adipogenic differentiation, in samples from these types of patients. The expression of RPL13A and EEF1A1 was not affected by differentiation, thus being these genes the most stable candidates, while CYC, GAPDH, and 18S were not suitable for this sort of analysis. This work highlights that RPL13A and EEF1A1 are good candidates as reference genes for qPCR analysis of hVSCs differentiation into adipocytes from subjects with different BMI and HOMA-IR.
RPL13A and EEF1A1 Are Suitable Reference Genes for qPCR during Adipocyte Differentiation of Vascular Stromal Cells from Patients with Different BMI and HOMA-IR

PubMed Central

Gentile, Adriana-Mariel; Lhamyani, Said; Coín-Aragüez, Leticia; Oliva-Olivera, Wilfredo; Zayed, Hatem; Vega-Rioja, Antonio; Monteseirin, Javier; Romero-Zerbo, Silvana-Yanina; Tinahones, Francisco-José; Bermúdez-Silva, Francisco-Javier; El Bekay, Rajaa

2016-01-01

Real-time or quantitative PCR (qPCR) is a useful technique that requires reliable reference genes for data normalization in gene expression analysis. Adipogenesis is among the biological processes suitable for this technique. The selection of adequate reference genes is essential for qPCR gene expression analysis of human Vascular Stromal Cells (hVSCs) during their differentiation into adipocytes. To the best of our knowledge, there are no studies validating reference genes for the analyses of visceral and subcutaneous adipose tissue hVSCs from subjects with different Body Mass Index (BMI) and Homeostatic Model Assessment of Insulin Resistance (HOMA-IR) index. The present study was undertaken to analyze this question. We first analyzed the stability of expression of five potential reference genes: CYC, GAPDH, RPL13A, EEF1A1, and 18S ribosomal RNA, during in vitro adipogenic differentiation, in samples from these types of patients. The expression of RPL13A and EEF1A1 was not affected by differentiation, thus being these genes the most stable candidates, while CYC, GAPDH, and 18S were not suitable for this sort of analysis. This work highlights that RPL13A and EEF1A1 are good candidates as reference genes for qPCR analysis of hVSCs differentiation into adipocytes from subjects with different BMI and HOMA-IR. PMID:27304673
Comparative Genomics of Non-TNL Disease Resistance Genes from Six Plant Species.

PubMed

Nepal, Madhav P; Andersen, Ethan J; Neupane, Surendra; Benson, Benjamin V

2017-09-30

Disease resistance genes (R genes), as part of the plant defense system, have coevolved with corresponding pathogen molecules. The main objectives of this project were to identify non-Toll interleukin receptor, nucleotide-binding site, leucine-rich repeat (nTNL) genes and elucidate their evolutionary divergence across six plant genomes. Using reference sequences from Arabidopsis , we investigated nTNL orthologs in the genomes of common bean, Medicago , soybean, poplar, and rice. We used Hidden Markov Models for sequence identification, performed model-based phylogenetic analyses, visualized chromosomal positioning, inferred gene clustering, and assessed gene expression profiles. We analyzed 908 nTNL R genes in the genomes of the six plant species, and classified them into 12 subgroups based on the presence of coiled-coil (CC), nucleotide binding site (NBS), leucine rich repeat (LRR), resistance to Powdery mildew 8 (RPW8), and BED type zinc finger domains. Traditionally classified CC-NBS-LRR (CNL) genes were nested into four clades (CNL A-D) often with abundant, well-supported homogeneous subclades of Type-II R genes. CNL-D members were absent in rice, indicating a unique R gene retention pattern in the rice genome. Genomes from Arabidopsis , common bean, poplar and soybean had one chromosome without any CNL R genes. Medicago and Arabidopsis had the highest and lowest number of gene clusters, respectively. Gene expression analyses suggested unique patterns of expression for each of the CNL clades. Differential gene expression patterns of the nTNL genes were often found to correlate with number of introns and GC content, suggesting structural and functional divergence.
Comparative Genomics of Non-TNL Disease Resistance Genes from Six Plant Species

PubMed Central

Andersen, Ethan J.; Neupane, Surendra; Benson, Benjamin V.

2017-01-01

Disease resistance genes (R genes), as part of the plant defense system, have coevolved with corresponding pathogen molecules. The main objectives of this project were to identify non-Toll interleukin receptor, nucleotide-binding site, leucine-rich repeat (nTNL) genes and elucidate their evolutionary divergence across six plant genomes. Using reference sequences from Arabidopsis, we investigated nTNL orthologs in the genomes of common bean, Medicago, soybean, poplar, and rice. We used Hidden Markov Models for sequence identification, performed model-based phylogenetic analyses, visualized chromosomal positioning, inferred gene clustering, and assessed gene expression profiles. We analyzed 908 nTNL R genes in the genomes of the six plant species, and classified them into 12 subgroups based on the presence of coiled-coil (CC), nucleotide binding site (NBS), leucine rich repeat (LRR), resistance to Powdery mildew 8 (RPW8), and BED type zinc finger domains. Traditionally classified CC-NBS-LRR (CNL) genes were nested into four clades (CNL A-D) often with abundant, well-supported homogeneous subclades of Type-II R genes. CNL-D members were absent in rice, indicating a unique R gene retention pattern in the rice genome. Genomes from Arabidopsis, common bean, poplar and soybean had one chromosome without any CNL R genes. Medicago and Arabidopsis had the highest and lowest number of gene clusters, respectively. Gene expression analyses suggested unique patterns of expression for each of the CNL clades. Differential gene expression patterns of the nTNL genes were often found to correlate with number of introns and GC content, suggesting structural and functional divergence. PMID:28973974
Establishing references for gene expression analyses by RT-qPCR in Theobroma cacao tissues.

PubMed

Pinheiro, T T; Litholdo, C G; Sereno, M L; Leal, G A; Albuquerque, P S B; Figueira, A

2011-11-17

Lack of continuous progress in Theobroma cacao (Malvaceae) breeding, especially associated with seed quality traits, requires more efficient selection methods based on genomic information. Reverse transcript quantitative PCR (RT-qPCR) has become the method of choice for gene expression analysis, but relative expression analysis requires various reference genes, which must be stable across various biological conditions. We sought suitable reference genes for various tissues of cacao, especially developing seeds. Ten potential reference genes were analyzed for stability at various stages of embryo development, leaves, stems, roots, flowers, and pod epicarp; seven of them were also evaluated in shoot tips treated either with hormones (salicylate; ethefon; methyl-jasmonate) or after inoculation with the fungus Moniliophthora perniciosa (Marasmiaceae sensu lato). For developing embryos, the three most stable genes were actin (ACT), polyubiquitin (PUB), and ribosomal protein L35 (Rpl35). In the analyses of various tissues, the most stable genes were malate dehydrogenase (MDH), glyceraldehyde 3-phosphate dehydrogenase (GAPDH), and acyl-carrier protein B (ACP B). GAPDH, MDH and tubulin (TUB) were the most appropriate for normalization when shoot apexes were treated with hormones, while ACT, TUB and Rpl35 were the most appropriate after inoculation with M. perniciosa. We conclude that for each plant system and biological or ontogenetical condition, there is a need to define suitable reference genes. This is the first report to define reference genes for expression studies in cacao.
Cis-regulatory element based targeted gene finding: genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana.

PubMed

Zhang, Weixiong; Ruan, Jianhua; Ho, Tuan-Hua David; You, Youngsook; Yu, Taotao; Quatrano, Ralph S

2005-07-15

A fundamental problem of computational genomics is identifying the genes that respond to certain endogenous cues and environmental stimuli. This problem can be referred to as targeted gene finding. Since gene regulation is mainly determined by the binding of transcription factors and cis-regulatory DNA sequences, most existing gene annotation methods, which exploit the conservation of open reading frames, are not effective in finding target genes. A viable approach to targeted gene finding is to exploit the cis-regulatory elements that are known to be responsible for the transcription of target genes. Given such cis-elements, putative target genes whose promoters contain the elements can be identified. As a case study, we apply the above approach to predict the genes in model plant Arabidopsis thaliana which are inducible by a phytohormone, abscisic acid (ABA), and abiotic stress, such as drought, cold and salinity. We first construct and analyze two ABA specific cis-elements, ABA-responsive element (ABRE) and its coupling element (CE), in A.thaliana, based on their conservation in rice and other cereal plants. We then use the ABRE-CE module to identify putative ABA-responsive genes in A.thaliana. Based on RT-PCR verification and the results from literature, this method has an accuracy rate of 67.5% for the top 40 predictions. The cis-element based targeted gene finding approach is expected to be widely applicable since a large number of cis-elements in many species are available.
Real-time polymerase chain reaction-based approach for quantification of the pat gene in the T25 Zea mays event.

PubMed

Weighardt, Florian; Barbati, Cristina; Paoletti, Claudia; Querci, Maddalena; Kay, Simon; De Beuckeleer, Marc; Van den Eede, Guy

2004-01-01

In Europe, a growing interest for reliable techniques for the quantification of genetically modified component(s) of food matrixes is arising from the need to comply with the European legislative framework on novel food products. Real-time polymerase chain reaction (PCR) is currently the most powerful technique for the quantification of specific nucleic acid sequences. Several real-time PCR methodologies based on different molecular principles have been developed for this purpose. The most frequently used approach in the field of genetically modified organism (GMO) quantification in food or feed samples is based on the 5'-3'-exonuclease activity of Taq DNA polymerase on specific degradation probes (TaqMan principle). A novel approach was developed for the establishment of a TaqMan quantification system assessing GMO contents around the 1% threshold stipulated under European Union (EU) legislation for the labeling of food products. The Zea mays T25 elite event was chosen as a model for the development of the novel GMO quantification approach. The most innovative aspect of the system is represented by the use of sequences cloned in plasmids as reference standards. In the field of GMO quantification, plasmids are an easy to use, cheap, and reliable alternative to Certified Reference Materials (CRMs), which are only available for a few of the GMOs authorized in Europe, have a relatively high production cost, and require further processing to be suitable for analysis. Strengths and weaknesses of the use of novel plasmid-based standards are addressed in detail. In addition, the quantification system was designed to avoid the use of a reference gene (e.g., a single copy, species-specific gene) as normalizer, i.e., to perform a GMO quantification based on an absolute instead of a relative measurement. In fact, experimental evidences show that the use of reference genes adds variability to the measurement system because a second independent real-time PCR-based measurement must be performed. Moreover, for some reference genes no sufficient information on copy number in and among genomes of different lines is available, making adequate quantification difficult. Once developed, the method was subsequently validated according to IUPAC and ISO 5725 guidelines. Thirteen laboratories from 8 EU countries participated in the trial. Eleven laboratories provided results complying with the predefined study requirements. Repeatability (RSDr) values ranged from 8.7 to 15.9%, with a mean value of 12%. Reproducibility (RSDR) values ranged from 16.3 to 25.5%, with a mean value of 21%. Following Codex Alimentarius Committee guidelines, both the limits of detection and quantitation were determined to be <0.1%.
Quantitative real-time PCR normalization for gene expression studies in the plant pathogenic fungi Lasiodiplodia theobromae.

PubMed

Paolinelli-Alfonso, Marcos; Galindo-Sánchez, Clara Elizabeth; Hernandez-Martinez, Rufina

2016-08-01

Lasiodiplodia theobromae is a highly virulent plant pathogen. It has been suggested that heat stress increases its virulence. The aim of this work was to evaluate, compare, and recommend normalization strategies for gene expression analysis of the fungus growing with grapevine wood under heat stress. Using RT-qPCR-derived data, reference gene stability was evaluated through geNorm, NormFinder and Bestkeeper applications. Based on the geometric mean using the ranking position obtained for each independent analysis, genes were ranked from least to most stable as follows: glyceraldehyde-3-phosphate dehydrogenase (GAPDH), actin (ACT), β-tubulin (TUB) and elongation factor-1α (EF1α). Using RNAseq-derived data based on the calculated tagwise dispersion these genes were ordered by increasing stability as follows: GAPDH, ACT, TUB, and EF1α. The correlation between RNAseq and RTqPCR results was used as criteria to identify the best RT-qPCR normalization approach. The gene TUB is recommended as the best option for normalization among the commonly used reference genes, but alternative fungal reference genes are also suggested. Copyright © 2016 Elsevier B.V. All rights reserved.
AGORA : Organellar genome annotation from the amino acid and nucleotide references.

PubMed

Jung, Jaehee; Kim, Jong Im; Jeong, Young-Sik; Yi, Gangman

2018-03-29

Next-generation sequencing (NGS) technologies have led to the accumulation of highthroughput sequence data from various organisms in biology. To apply gene annotation of organellar genomes for various organisms, more optimized tools for functional gene annotation are required. Almost all gene annotation tools are mainly focused on the chloroplast genome of land plants or the mitochondrial genome of animals.We have developed a web application AGORA for the fast, user-friendly, and improved annotations of organellar genomes. AGORA annotates genes based on a BLAST-based homology search and clustering with selected reference sequences from the NCBI database or user-defined uploaded data. AGORA can annotate the functional genes in almost all mitochondrion and plastid genomes of eukaryotes. The gene annotation of a genome with an exon-intron structure within a gene or inverted repeat region is also available. It provides information of start and end positions of each gene, BLAST results compared with the reference sequence, and visualization of gene map by OGDRAW. Users can freely use the software, and the accessible URL is https://bigdata.dongguk.edu/gene_project/AGORA/.The main module of the tool is implemented by the python and php, and the web page is built by the HTML and CSS to support all browsers. gangman@dongguk.edu.
Selection of reference genes for gene expression studies in virus-infected monocots using quantitative real-time PCR.

PubMed

Zhang, Kun; Niu, Shaofang; Di, Dianping; Shi, Lindan; Liu, Deshui; Cao, Xiuling; Miao, Hongqin; Wang, Xianbing; Han, Chenggui; Yu, Jialin; Li, Dawei; Zhang, Yongliang

2013-10-10

Both genome-wide transcriptomic surveys of the mRNA expression profiles and virus-induced gene silencing-based molecular studies of target gene during virus-plant interaction involve the precise estimation of the transcript abundance. Quantitative real-time PCR (qPCR) is the most widely adopted technique for mRNA quantification. In order to obtain reliable quantification of transcripts, identification of the best reference genes forms the basis of the preliminary work. Nevertheless, the stability of internal controls in virus-infected monocots needs to be fully explored. In this work, the suitability of ten housekeeping genes (ACT, EF1α, FBOX, GAPDH, GTPB, PP2A, SAND, TUBβ, UBC18 and UK) for potential use as reference genes in qPCR were investigated in five different monocot plants (Brachypodium, barley, sorghum, wheat and maize) under infection with different viruses including Barley stripe mosaic virus (BSMV), Brome mosaic virus (BMV), Rice black-streaked dwarf virus (RBSDV) and Sugarcane mosaic virus (SCMV). By using three different algorithms, the most appropriate reference genes or their combinations were identified for different experimental sets and their effectiveness for the normalisation of expression studies were further validated by quantitative analysis of a well-studied PR-1 gene. These results facilitate the selection of desirable reference genes for more accurate gene expression studies in virus-infected monocots. Copyright © 2013 Elsevier B.V. All rights reserved.
Identification and evaluation of reference genes for qRT-PCR studies in Lentinula edodes

PubMed Central

Qin, Peng; He, Maolan; Yu, Xiumei; Zhao, Ke; Zhang, Xiaoping; Ma, Menggen; Chen, Qiang; Chen, Xiaoqiong; Zeng, Xianfu; Gu, Yunfu

2018-01-01

Lentinula edodes (shiitake mushroom) is a common edible mushroom with a number of potential therapeutic and nutritional applications. It contains various medically important molecules, such as polysaccharides, terpenoids, sterols, and lipids, were contained in this mushroom. Quantitative real-time polymerase chain reaction (qRT-PCR) is a powerful tool to analyze the mechanisms underlying the biosynthetic pathways of these substances. qRT-PCR is used for accurate analyses of transcript levels owing to its rapidity, sensitivity, and reliability. However, its accuracy and reliability for the quantification of transcripts rely on the expression stability of the reference genes used for data normalization. To ensure the reliability of gene expression analyses using qRT-PCR in L. edodes molecular biology research, it is necessary to systematically evaluate reference genes. In the current study, ten potential reference genes were selected from L. edodes genomic data and their expression levels were measured by qRT-PCR using various samples. The expression stability of each candidate gene was analyzed by three commonly used software packages: geNorm, NormFinder, and BestKeeper. Base on the results, Rpl4 was the most stable reference gene across all experimental conditions, and Atu was the most stable gene among strains. 18S was found to be the best reference gene for different development stages, and Rpl4 was the most stably expressed gene under various nutrient conditions. The present work will contribute to qRT-PCR studies in L. edodes. PMID:29293626
Identification and evaluation of reference genes for qRT-PCR studies in Lentinula edodes.

PubMed

Xiang, Quanju; Li, Jin; Qin, Peng; He, Maolan; Yu, Xiumei; Zhao, Ke; Zhang, Xiaoping; Ma, Menggen; Chen, Qiang; Chen, Xiaoqiong; Zeng, Xianfu; Gu, Yunfu

2018-01-01

Lentinula edodes (shiitake mushroom) is a common edible mushroom with a number of potential therapeutic and nutritional applications. It contains various medically important molecules, such as polysaccharides, terpenoids, sterols, and lipids, were contained in this mushroom. Quantitative real-time polymerase chain reaction (qRT-PCR) is a powerful tool to analyze the mechanisms underlying the biosynthetic pathways of these substances. qRT-PCR is used for accurate analyses of transcript levels owing to its rapidity, sensitivity, and reliability. However, its accuracy and reliability for the quantification of transcripts rely on the expression stability of the reference genes used for data normalization. To ensure the reliability of gene expression analyses using qRT-PCR in L. edodes molecular biology research, it is necessary to systematically evaluate reference genes. In the current study, ten potential reference genes were selected from L. edodes genomic data and their expression levels were measured by qRT-PCR using various samples. The expression stability of each candidate gene was analyzed by three commonly used software packages: geNorm, NormFinder, and BestKeeper. Base on the results, Rpl4 was the most stable reference gene across all experimental conditions, and Atu was the most stable gene among strains. 18S was found to be the best reference gene for different development stages, and Rpl4 was the most stably expressed gene under various nutrient conditions. The present work will contribute to qRT-PCR studies in L. edodes.
Deep transcriptome sequencing provides new insights into the structural and functional organization of the wheat genome.

PubMed

Pingault, Lise; Choulet, Frédéric; Alberti, Adriana; Glover, Natasha; Wincker, Patrick; Feuillet, Catherine; Paux, Etienne

2015-02-10

Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of the genome structure on gene organization, function, and regulation. However, because of the lack of a reference genome sequence, such studies have long been hampered and our knowledge of the wheat gene space is still limited. The access to the reference sequence of the wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before. By combining this sequence with RNA-seq data, we construct a fine transcriptome map of the chromosome 3B. More than 8,800 transcription sites are identified, that are distributed throughout the entire chromosome. Expression level, expression breadth, alternative splicing as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level. Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.
Validating internal controls for quantitative plant gene expression studies

PubMed Central

Brunner, Amy M; Yakovlev, Igor A; Strauss, Steven H

2004-01-01

Background Real-time reverse transcription PCR (RT-PCR) has greatly improved the ease and sensitivity of quantitative gene expression studies. However, accurate measurement of gene expression with this method relies on the choice of a valid reference for data normalization. Studies rarely verify that gene expression levels for reference genes are adequately consistent among the samples used, nor compare alternative genes to assess which are most reliable for the experimental conditions analyzed. Results Using real-time RT-PCR to study the expression of 10 poplar (genus Populus) housekeeping genes, we demonstrate a simple method for determining the degree of stability of gene expression over a set of experimental conditions. Based on a traditional method for analyzing the stability of varieties in plant breeding, it defines measures of gene expression stability from analysis of variance (ANOVA) and linear regression. We found that the potential internal control genes differed widely in their expression stability over the different tissues, developmental stages and environmental conditions studied. Conclusion Our results support that quantitative comparisons of candidate reference genes are an important part of real-time RT-PCR studies that seek to precisely evaluate variation in gene expression. The method we demonstrated facilitates statistical and graphical evaluation of gene expression stability. Selection of the best reference gene for a given set of experimental conditions should enable detection of biologically significant changes in gene expression that are too small to be revealed by less precise methods, or when highly variable reference genes are unknowingly used in real-time RT-PCR experiments. PMID:15317655
Identification of stable reference genes in differentiating human pluripotent stem cells.

PubMed

Holmgren, Gustav; Ghosheh, Nidal; Zeng, Xianmin; Bogestål, Yalda; Sartipy, Peter; Synnergren, Jane

2015-06-01

Reference genes, often referred to as housekeeping genes (HKGs), are frequently used to normalize gene expression data based on the assumption that they are expressed at a constant level in the cells. However, several studies have shown that there may be a large variability in the gene expression levels of HKGs in various cell types. In a previous study, employing human embryonic stem cells (hESCs) subjected to spontaneous differentiation, we observed that the expression of commonly used HKG varied to a degree that rendered them inappropriate to use as reference genes under those experimental settings. Here we present a substantially extended study of the HKG signature in human pluripotent stem cells (hPSC), including nine global gene expression datasets from both hESC and human induced pluripotent stem cells, obtained during directed differentiation toward endoderm-, mesoderm-, and ectoderm derivatives. Sets of stably expressed genes were compiled, and a handful of genes (e.g., EID2, ZNF324B, CAPN10, and RABEP2) were identified as generally applicable reference genes in hPSCs across all cell lines and experimental conditions. The stability in gene expression profiles was confirmed by reverse transcription quantitative PCR analysis. Taken together, the current results suggest that differentiating hPSCs have a distinct HKG signature, which in some aspects is different from somatic cell types, and underscore the necessity to validate the stability of reference genes under the actual experimental setup used. In addition, the novel putative HKGs identified in this study can preferentially be used for normalization of gene expression data obtained from differentiating hPSCs. Copyright © 2015 the American Physiological Society.
Identification of suitable internal control genes for expression studies in Coffea arabica under different experimental conditions

PubMed Central

Barsalobres-Cavallari, Carla F; Severino, Fábio E; Maluf, Mirian P; Maia, Ivan G

2009-01-01

Background Quantitative data from gene expression experiments are often normalized by transcription levels of reference or housekeeping genes. An inherent assumption for their use is that the expression of these genes is highly uniform in living organisms during various phases of development, in different cell types and under diverse environmental conditions. To date, the validation of reference genes in plants has received very little attention and suitable reference genes have not been defined for a great number of crop species including Coffea arabica. The aim of the research reported herein was to compare the relative expression of a set of potential reference genes across different types of tissue/organ samples of coffee. We also validated the expression profiles of the selected reference genes at various stages of development and under a specific biotic stress. Results The expression levels of five frequently used housekeeping genes (reference genes), namely alcohol dehydrogenase (adh), 14-3-3, polyubiquitin (poly), β-actin (actin) and glyceraldehyde-3-phosphate dehydrogenase (gapdh) was assessed by quantitative real-time RT-PCR over a set of five tissue/organ samples (root, stem, leaf, flower, and fruits) of Coffea arabica plants. In addition to these commonly used internal controls, three other genes encoding a cysteine proteinase (cys), a caffeine synthase (ccs) and the 60S ribosomal protein L7 (rpl7) were also tested. Their stability and suitability as reference genes were validated by geNorm, NormFinder and BestKeeper programs. The obtained results revealed significantly variable expression levels of all reference genes analyzed, with the exception of gapdh, which showed no significant changes in expression among the investigated experimental conditions. Conclusion Our data suggests that the expression of housekeeping genes is not completely stable in coffee. Based on our results, gapdh, followed by 14-3-3 and rpl7 were found to be homogeneously expressed and are therefore adequate for normalization purposes, showing equivalent transcript levels in different tissue/organ samples. Gapdh is therefore the recommended reference gene for measuring gene expression in Coffea arabica. Its use will enable more accurate and reliable normalization of tissue/organ-specific gene expression studies in this important cherry crop plant. PMID:19126214
A selection of reference genes and early-warning mRNA biomarkers for environmental monitoring using Mytilus spp. as sentinel species.

PubMed

Lacroix, C; Coquillé, V; Guyomarch, J; Auffret, M; Moraga, D

2014-09-15

mRNA biomarkers are promising tools for environmental health assessment and reference genes are needed to perform relevant qPCR analyses in tissue samples of sentinel species. In the present study, potential reference genes and mRNA biomarkers were tested in the gills and digestive glands of native and caged mussels (Mytilus spp.) exposed to harbor pollution. Results highlighted the difficulty to find stable reference genes in wild, non-model species and suggested the use of normalization indices instead of single genes as they exhibit a higher stability. Several target genes were found differentially expressed between mussel groups, especially in gills where cyp32, π-gst and CuZn-sod mRNA levels could be biomarker candidates. Multivariate analyses confirmed the ability of mRNA levels to highlight site-effects and suggested the use of several combined markers instead of individual ones. These findings support the use of qPCR technology and mRNA levels as early-warning biomarkers in marine monitoring programs. Copyright © 2014 Elsevier Ltd. All rights reserved.
Selection of Reference Genes for Expression Studies of Xenobiotic Adaptation in Tetranychus urticae.

PubMed

Morales, Mariany Ashanty; Mendoza, Bianca Marie; Lavine, Laura Corley; Lavine, Mark Daniel; Walsh, Douglas Bruce; Zhu, Fang

2016-01-01

Quantitative real-time PCR (qRT-PCR) is an extensively used, high-throughput method to analyze transcriptional expression of genes of interest. An appropriate normalization strategy with reliable reference genes is required for calculating gene expression across diverse experimental conditions. In this study, we aim to identify the most stable reference genes for expression studies of xenobiotic adaptation in Tetranychus urticae, an extremely polyphagous herbivore causing significant yield reduction of agriculture. We chose eight commonly used housekeeping genes as candidates. The qRT-PCR expression data for these genes were evaluated from seven populations: a susceptible and three acaricide resistant populations feeding on lima beans, and three other susceptible populations which had been shifted host from lima beans to three other plant species. The stability of the candidate reference genes was then assessed using four different algorithms (comparative ΔCt method, geNorm, NormFinder, and BestKeeper). Additionally, we used an online web-based tool (RefFinder) to assign an overall final rank for each candidate gene. Our study found that CycA and Rp49 are best for investigating gene expression in acaricide susceptible and resistant populations. GAPDH, Rp49, and Rpl18 are best for host plant shift studies. And GAPDH and Rp49 were the most stable reference genes when investigating gene expression under changes in both experimental conditions. These results will facilitate research in revealing molecular mechanisms underlying the xenobiotic adaptation of this notorious agricultural pest.
Selection of Reference Genes for Expression Studies of Xenobiotic Adaptation in Tetranychus urticae

PubMed Central

Morales, Mariany Ashanty; Mendoza, Bianca Marie; Lavine, Laura Corley; Lavine, Mark Daniel; Walsh, Douglas Bruce; Zhu, Fang

2016-01-01

Quantitative real-time PCR (qRT-PCR) is an extensively used, high-throughput method to analyze transcriptional expression of genes of interest. An appropriate normalization strategy with reliable reference genes is required for calculating gene expression across diverse experimental conditions. In this study, we aim to identify the most stable reference genes for expression studies of xenobiotic adaptation in Tetranychus urticae, an extremely polyphagous herbivore causing significant yield reduction of agriculture. We chose eight commonly used housekeeping genes as candidates. The qRT-PCR expression data for these genes were evaluated from seven populations: a susceptible and three acaricide resistant populations feeding on lima beans, and three other susceptible populations which had been shifted host from lima beans to three other plant species. The stability of the candidate reference genes was then assessed using four different algorithms (comparative ΔCt method, geNorm, NormFinder, and BestKeeper). Additionally, we used an online web-based tool (RefFinder) to assign an overall final rank for each candidate gene. Our study found that CycA and Rp49 are best for investigating gene expression in acaricide susceptible and resistant populations. GAPDH, Rp49, and Rpl18 are best for host plant shift studies. And GAPDH and Rp49 were the most stable reference genes when investigating gene expression under changes in both experimental conditions. These results will facilitate research in revealing molecular mechanisms underlying the xenobiotic adaptation of this notorious agricultural pest. PMID:27570487

Animal models of Parkinson's disease: limits and relevance to neuroprotection studies.

PubMed

Bezard, Erwan; Yue, Zhenyu; Kirik, Deniz; Spillantini, Maria Grazia

2013-01-01

Over the last two decades, significant strides has been made toward acquiring a better knowledge of both the etiology and pathogenesis of Parkinson's disease (PD). Experimental models are of paramount importance to obtain greater insights into the pathogenesis of the disease. Thus far, neurotoxin-based animal models have been the most popular tools employed to produce selective neuronal death in both in vitro and in vivo systems. These models have been commonly referred to as the pathogenic models. The current trend in modeling PD revolves around what can be called the disease gene-based models or etiologic models. The value of utilizing multiple models with a different mechanism of insult rests on the premise that dopamine-producing neurons die by stereotyped cascades that can be activated by a range of insults, from neurotoxins to downregulation and overexpression of disease-related genes. In this position article, we present the relevance of both pathogenic and etiologic models as well as the concept of clinically relevant designs that, we argue, should be utilized in the preclinical development phase of new neuroprotective therapies before embarking into clinical trials. Copyright © 2013 Movement Disorders Society.
The floral transcriptomes of four bamboo species (Bambusoideae; Poaceae): support for common ancestry among woody bamboos.

PubMed

Wysocki, William P; Ruiz-Sanchez, Eduardo; Yin, Yanbin; Duvall, Melvin R

2016-05-20

Next-generation sequencing now allows for total RNA extracts to be sequenced in non-model organisms such as bamboos, an economically and ecologically important group of grasses. Bamboos are divided into three lineages, two of which are woody perennials with bisexual flowers, which undergo gregarious monocarpy. The third lineage, which are herbaceous perennials, possesses unisexual flowers that undergo annual flowering events. Transcriptomes were assembled using both reference-based and de novo methods. These two methods were tested by characterizing transcriptome content using sequence alignment to previously characterized reference proteomes and by identifying Pfam domains. Because of the striking differences in floral morphology and phenology between the herbaceous and woody bamboo lineages, MADS-box genes, transcription factors that control floral development and timing, were characterized and analyzed in this study. Transcripts were identified using phylogenetic methods and categorized as A, B, C, D or E-class genes, which control floral development, or SOC or SVP-like genes, which control the timing of flowering events. Putative nuclear orthologues were also identified in bamboos to use as phylogenetic markers. Instances of gene copies exhibiting topological patterns that correspond to shared phenotypes were observed in several gene families including floral development and timing genes. Alignments and phylogenetic trees were generated for 3,878 genes and for all genes in a concatenated analysis. Both the concatenated analysis and those of 2,412 separate gene trees supported monophyly among the woody bamboos, which is incongruent with previous phylogenetic studies using plastid markers.
Identification of Suitable Reference Genes for mRNA Studies in Bone Marrow in a Mouse Model of Hematopoietic Stem Cell Transplantation.

PubMed

Li, H; Chen, C; Yao, H; Li, X; Yang, N; Qiao, J; Xu, K; Zeng, L

2016-10-01

Bone marrow micro-environment changes during hematopoietic stem cell transplantation (HSCT) with subsequent alteration of genes expression. Quantitative polymerase chain reaction (q-PCR) is a reliable and reproducible technique for the analysis of gene expression. To obtain more accurate results, it is essential to find a reference during HSCT. However, which gene is suitable during HSCT remains unclear. This study aimed to identify suitable reference genes for mRNA studies in bone marrow after HSCT. C57BL/6 mice were treated with either total body irradiation (group T) or busulfan/cyclophosphamide (BU/CY) (group B) followed by infusion of bone marrow cells. Normal mice without treatments were served as a control. All samples (group T + group B + control) were defined as group G. On days 7, 14, and 21 after transplantation, transcription levels of 7 candidate genes, ACTB, B2M, GAPDH, HMBS, HPRT, SDHA, and YWHAZ, in bone marrow cells were measured by use of real-time quantitative PCR. The expression stability of these 7 candidate reference genes were analyzed by 2 statistical software programs, GeNorm and NormFinder. Our results showed that ACTB displayed the highest expression in group G, with lowest expression of PSDHA in group T and HPRT in groups B and G. Analysis of expression stability by use of GeNorm or NormFinder demonstrated that expression of B2M in bone marrow were much more stable during HSCT, compared with other candidate genes including commonly used reference genes GAPDH and ACTB. ACTB could be used as a suitable reference gene for mRNA studies in bone marrow after HSCT. Copyright © 2016 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Marchisio, Mario Andrea, E-mail: marchisio@hit.edu.cn

Published in 2008, Parts & Pools represents one of the first attempts to conceptualize the modular design of bacterial synthetic gene circuits with Standard Biological Parts (DNA segments) and Pools of molecules referred to as common signal carriers (e.g., RNA polymerases and ribosomes). The original framework for modeling bacterial components and designing prokaryotic circuits evolved over the last years and brought, first, to the development of an algorithm for the automatic design of Boolean gene circuits. This is a remarkable achievement since gene digital circuits have a broad range of applications that goes from biosensors for health and environment caremore » to computational devices. More recently, Parts & Pools was enabled to give a proper formal description of eukaryotic biological circuit components. This was possible by employing a rule-based modeling approach, a technique that permits a faithful calculation of all the species and reactions involved in complex systems such as eukaryotic cells and compartments. In this way, Parts & Pools is currently suitable for the visual and modular design of synthetic gene circuits in yeast and mammalian cells too.« less
Reference gene selection for quantitative reverse transcription-polymerase chain reaction normalization during in vitro adventitious rooting in Eucalyptus globulus Labill.

PubMed

de Almeida, Márcia R; Ruedell, Carolina M; Ricachenevsky, Felipe K; Sperotto, Raul A; Pasquali, Giancarlo; Fett-Neto, Arthur G

2010-09-20

Eucalyptus globulus and its hybrids are very important for the cellulose and paper industry mainly due to their low lignin content and frost resistance. However, rooting of cuttings of this species is recalcitrant and exogenous auxin application is often necessary for good root development. To date one of the most accurate methods available for gene expression analysis is quantitative reverse transcription-polymerase chain reaction (qPCR); however, reliable use of this technique requires reference genes for normalization. There is no single reference gene that can be regarded as universal for all experiments and biological materials. Thus, the identification of reliable reference genes must be done for every species and experimental approach. The present study aimed at identifying suitable control genes for normalization of gene expression associated with adventitious rooting in E. globulus microcuttings. By the use of two distinct algorithms, geNorm and NormFinder, we have assessed gene expression stability of eleven candidate reference genes in E. globulus: 18S, ACT2, EF2, EUC12, H2B, IDH, SAND, TIP41, TUA, UBI and 33380. The candidate reference genes were evaluated in microccuttings rooted in vitro, in presence or absence of auxin, along six time-points spanning the process of adventitious rooting. Overall, the stability profiles of these genes determined with each one of the algorithms were very similar. Slight differences were observed in the most stable pair of genes indicated by each program: IDH and SAND for geNorm, and H2B and TUA for NormFinder. Both programs identified UBI and 18S as the most variable genes. To validate these results and select the most suitable reference genes, the expression profile of the ARGONAUTE1 gene was evaluated in relation to the most stable candidate genes indicated by each algorithm. Our study showed that expression stability varied between putative reference genes tested in E. globulus. Based on the AGO1 relative expression profile obtained using the genes suggested by the algorithms, H2B and TUA were considered as the most suitable reference genes for expression studies in E. globulus adventitious rooting. UBI and 18S were unsuitable for use as controls in qPCR related to this process. These findings will enable more accurate and reliable normalization of qPCR results for gene expression studies in this economically important woody plant, particularly related to rooting and clonal propagation.
Reference gene selection for quantitative reverse transcription-polymerase chain reaction normalization during in vitro adventitious rooting in Eucalyptus globulus Labill

PubMed Central

2010-01-01

Background Eucalyptus globulus and its hybrids are very important for the cellulose and paper industry mainly due to their low lignin content and frost resistance. However, rooting of cuttings of this species is recalcitrant and exogenous auxin application is often necessary for good root development. To date one of the most accurate methods available for gene expression analysis is quantitative reverse transcription-polymerase chain reaction (qPCR); however, reliable use of this technique requires reference genes for normalization. There is no single reference gene that can be regarded as universal for all experiments and biological materials. Thus, the identification of reliable reference genes must be done for every species and experimental approach. The present study aimed at identifying suitable control genes for normalization of gene expression associated with adventitious rooting in E. globulus microcuttings. Results By the use of two distinct algorithms, geNorm and NormFinder, we have assessed gene expression stability of eleven candidate reference genes in E. globulus: 18S, ACT2, EF2, EUC12, H2B, IDH, SAND, TIP41, TUA, UBI and 33380. The candidate reference genes were evaluated in microccuttings rooted in vitro, in presence or absence of auxin, along six time-points spanning the process of adventitious rooting. Overall, the stability profiles of these genes determined with each one of the algorithms were very similar. Slight differences were observed in the most stable pair of genes indicated by each program: IDH and SAND for geNorm, and H2B and TUA for NormFinder. Both programs indentified UBI and 18S as the most variable genes. To validate these results and select the most suitable reference genes, the expression profile of the ARGONAUTE1 gene was evaluated in relation to the most stable candidate genes indicated by each algorithm. Conclusion Our study showed that expression stability varied between putative reference genes tested in E. globulus. Based on the AGO1 relative expression profile obtained using the genes suggested by the algorithms, H2B and TUA were considered as the most suitable reference genes for expression studies in E. globulus adventitious rooting. UBI and 18S were unsuitable for use as controls in qPCR related to this process. These findings will enable more accurate and reliable normalization of qPCR results for gene expression studies in this economically important woody plant, particularly related to rooting and clonal propagation. PMID:20854682
Unique Features of the Loblolly Pine (Pinus taeda L.) Megagenome Revealed Through Sequence Annotation

PubMed Central

Wegrzyn, Jill L.; Liechty, John D.; Stevens, Kristian A.; Wu, Le-Shin; Loopstra, Carol A.; Vasquez-Gross, Hans A.; Dougherty, William M.; Lin, Brian Y.; Zieve, Jacob J.; Martínez-García, Pedro J.; Holt, Carson; Yandell, Mark; Zimin, Aleksey V.; Yorke, James A.; Crepeau, Marc W.; Puiu, Daniela; Salzberg, Steven L.; de Jong, Pieter J.; Mockaitis, Keithanne; Main, Doreen; Langley, Charles H.; Neale, David B.

2014-01-01

The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of their genomes (∼20–40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with 13 other plant species resulted in 20,646 gene families, of which 1554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are full of repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In depth analysis of the tandem and interspersed repetitive content yielded a combined estimate of 82%. PMID:24653211
Establishment of a Human Blood-Brain Barrier Co-culture Model Mimicking the Neurovascular Unit Using Induced Pluri- and Multipotent Stem Cells.

PubMed

Appelt-Menzel, Antje; Cubukova, Alevtina; Günther, Katharina; Edenhofer, Frank; Piontek, Jörg; Krause, Gerd; Stüber, Tanja; Walles, Heike; Neuhaus, Winfried; Metzger, Marco

2017-04-11

In vitro models of the human blood-brain barrier (BBB) are highly desirable for drug development. This study aims to analyze a set of ten different BBB culture models based on primary cells, human induced pluripotent stem cells (hiPSCs), and multipotent fetal neural stem cells (fNSCs). We systematically investigated the impact of astrocytes, pericytes, and NSCs on hiPSC-derived BBB endothelial cell function and gene expression. The quadruple culture models, based on these four cell types, achieved BBB characteristics including transendothelial electrical resistance (TEER) up to 2,500 Ω cm 2 and distinct upregulation of typical BBB genes. A complex in vivo-like tight junction (TJ) network was detected by freeze-fracture and transmission electron microscopy. Treatment with claudin-specific TJ modulators caused TEER decrease, confirming the relevant role of claudin subtypes for paracellular tightness. Drug permeability tests with reference substances were performed and confirmed the suitability of the models for drug transport studies. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing.

PubMed

Seoane-Zonjic, Pedro; Cañas, Rafael A; Bautista, Rocío; Gómez-Maldonado, Josefa; Arrillaga, Isabel; Fernández-Pozo, Noé; Claros, M Gonzalo; Cánovas, Francisco M; Ávila, Concepción

2016-02-27

In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82% of the gene structures, and a high proportion (85%) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a Ruby gem for this class of analyses.
Evaluation of the stability of reference genes in bone mesenchymal stem cells from patients with avascular necrosis of the femoral head.

PubMed

Wang, X N; Yang, Q W; Du, Z W; Yu, T; Qin, Y G; Song, Y; Xu, M; Wang, J C

2016-05-25

This study aimed to evaluate 12 genes (18S, GAPDH, B2M, ACTB, ALAS1, GUSB, HPRT1, PBGD, PPIA, PUM1, RPL29, and TBP) for their reliability and stability as reference sequences for real-time quantitative PCR (RT-qPCR) in bone marrow-derived mesenchymal stem cells (BMSCs) isolated from patients with avascular necrosis of the femoral head (ANFH). BMSCs were isolated from 20 ANFH patients divided into four groups according to etiology, and four donors with femoral neck fractures. Total RNA was isolated from BMSCs and reverse transcribed into complementary DNA, which served as a template for RT-qPCR. Three commonly used programs were then used to analyze the results. Reference gene expression varied within each group, between specific groups, and among all five groups. Based on comparisons of all five groups, two of the programs used suggested that HPRT1 was the most stable reference gene, while 18S and ACTB were the most variable. Among the 12 candidate reference genes, HPRT1 exhibited the greatest reliability, followed by PPIA. Thus, these sequences could be used as references for the normalization of RT-qPCR results.
SGP-1: Prediction and Validation of Homologous Genes Based on Sequence Alignments

PubMed Central

Wiehe, Thomas; Gebauer-Jung, Steffi; Mitchell-Olds, Thomas; Guigó, Roderic

2001-01-01

Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors. PMID:11544202
Validation of Reference Genes for Robust qRT-PCR Gene Expression Analysis in the Rice Blast Fungus Magnaporthe oryzae.

PubMed

Che Omar, Sarena; Bentley, Michael A; Morieri, Giulia; Preston, Gail M; Gurr, Sarah J

2016-01-01

The rice blast fungus causes significant annual harvest losses. It also serves as a genetically-tractable model to study fungal ingress. Whilst pathogenicity determinants have been unmasked and changes in global gene expression described, we know little about Magnaporthe oryzae cell wall remodelling. Our interests, in wall remodelling genes expressed during infection, vegetative growth and under exogenous wall stress, demand robust choice of reference genes for quantitative Real Time-PCR (qRT-PCR) data normalisation. We describe the expression stability of nine candidate reference genes profiled by qRT-PCR with cDNAs derived during asexual germling development, from sexual stage perithecia and from vegetative mycelium grown under various exogenous stressors. Our Minimum Information for Publication of qRT-PCR Experiments (MIQE) compliant analysis reveals a set of robust reference genes used to track changes in the expression of the cell wall remodelling gene MGG_Crh2 (MGG_00592). We ranked nine candidate reference genes by their expression stability (M) and report the best gene combination needed for reliable gene expression normalisation, when assayed in three tissue groups (Infective, Vegetative, and Global) frequently used in M. oryzae expression studies. We found that MGG_Actin (MGG_03982) and the 40S 27a ribosomal subunit MGG_40s (MGG_02872) proved to be robust reference genes for the Infection group and MGG_40s and MGG_Ef1 (Elongation Factor1-α) for both Vegetative and Global groups. Using the above validated reference genes, M. oryzae MGG_Crh2 expression was found to be significantly (p<0.05) elevated three-fold during vegetative growth as compared with dormant spores and two fold higher under cell wall stress (Congo Red) compared to growth under optimal conditions. We recommend the combinatorial use of two reference genes, belonging to the cytoskeleton and ribosomal synthesis functional groups, MGG_Actin, MGG_40s, MGG_S8 (Ribosomal subunit 40S S8) or MGG_Ef1, which demonstrated low M values across heterogeneous tissues. By contrast, metabolic pathway genes MGG_Fad (FAD binding domain-containing protein) and MGG_Gapdh (Glyceraldehyde-3-phosphate dehydrogenase) performed poorly, due to their lack of expression stability across samples.
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.

PubMed

Johnson, Benjamin K; Scholz, Matthew B; Teal, Tracy K; Abramovitch, Robert B

2016-02-04

Many tools exist in the analysis of bacterial RNA sequencing (RNA-seq) transcriptional profiling experiments to identify differentially expressed genes between experimental conditions. Generally, the workflow includes quality control of reads, mapping to a reference, counting transcript abundance, and statistical tests for differentially expressed genes. In spite of the numerous tools developed for each component of an RNA-seq analysis workflow, easy-to-use bacterially oriented workflow applications to combine multiple tools and automate the process are lacking. With many tools to choose from for each step, the task of identifying a specific tool, adapting the input/output options to the specific use-case, and integrating the tools into a coherent analysis pipeline is not a trivial endeavor, particularly for microbiologists with limited bioinformatics experience. To make bacterial RNA-seq data analysis more accessible, we developed a Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis (SPARTA). SPARTA is a reference-based bacterial RNA-seq analysis workflow application for single-end Illumina reads. SPARTA is turnkey software that simplifies the process of analyzing RNA-seq data sets, making bacterial RNA-seq analysis a routine process that can be undertaken on a personal computer or in the classroom. The easy-to-install, complete workflow processes whole transcriptome shotgun sequencing data files by trimming reads and removing adapters, mapping reads to a reference, counting gene features, calculating differential gene expression, and, importantly, checking for potential batch effects within the data set. SPARTA outputs quality analysis reports, gene feature counts and differential gene expression tables and scatterplots. SPARTA provides an easy-to-use bacterial RNA-seq transcriptional profiling workflow to identify differentially expressed genes between experimental conditions. This software will enable microbiologists with limited bioinformatics experience to analyze their data and integrate next generation sequencing (NGS) technologies into the classroom. The SPARTA software and tutorial are available at sparta.readthedocs.org.
Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing.

PubMed

Kanda, Kojun; Pflug, James M; Sproul, John S; Dasenko, Mark A; Maddison, David R

2015-01-01

In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced.
Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing

PubMed Central

Dasenko, Mark A.

2015-01-01

In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced. PMID:26716693
Selection of suitable reference genes from bone cells in large gradient high magnetic field based on GeNorm algorithm.

PubMed

Di, Shengmeng; Tian, Zongcheng; Qian, Airong; Gao, Xiang; Yu, Dan; Brandi, Maria Luisa; Shang, Peng

2011-12-01

Studies of animals and humans subjected to spaceflight demonstrate that weightlessness negatively affects the mass and mechanical properties of bone tissue. Bone cells could sense and respond to the gravity unloading, and genes sensitive to gravity change were considered to play a critical role in the mechanotransduction of bone cells. To evaluate the fold-change of gene expression, appropriate reference genes should be identified because there is no housekeeping gene having stable expression in all experimental conditions. Consequently, expression stability of ten candidate housekeeping genes were examined in osteoblast-like MC3T3-E1, osteocyte-like MLO-Y4, and preosteoclast-like FLG29.1 cells under different apparent gravities (μg, 1 g, and 2 g) in the high-intensity gradient magnetic field produced by a superconducting magnet. The results showed that the relative expression of these ten candidate housekeeping genes was different in different bone cells; Moreover, the most suitable reference genes of the same cells in altered gravity conditions were also different from that in strong magnetic field. It demonstrated the importance of selecting suitable reference genes in experimental set-ups. Furthermore, it provides an alternative choice to the traditionally accepted housekeeping genes used so far about studies of gravitational biology and magneto biology.
An extended set of yeast-based functional assays accurately identifies human disease mutations

PubMed Central

Sun, Song; Yang, Fan; Tan, Guihong; Costanzo, Michael; Oughtred, Rose; Hirschman, Jodi; Theesfeld, Chandra L.; Bansal, Pritpal; Sahni, Nidhi; Yi, Song; Yu, Analyn; Tyagi, Tanya; Tie, Cathy; Hill, David E.; Vidal, Marc; Andrews, Brenda J.; Boone, Charles; Dolinski, Kara; Roth, Frederick P.

2016-01-01

We can now routinely identify coding variants within individual human genomes. A pressing challenge is to determine which variants disrupt the function of disease-associated genes. Both experimental and computational methods exist to predict pathogenicity of human genetic variation. However, a systematic performance comparison between them has been lacking. Therefore, we developed and exploited a panel of 26 yeast-based functional complementation assays to measure the impact of 179 variants (101 disease- and 78 non-disease-associated variants) from 22 human disease genes. Using the resulting reference standard, we show that experimental functional assays in a 1-billion-year diverged model organism can identify pathogenic alleles with significantly higher precision and specificity than current computational methods. PMID:26975778
Nearly complete rRNA genes assembled from across the metazoan animals: effects of more taxa, a structure-based alignment, and paired-sites evolutionary models on phylogeny reconstruction.

PubMed

Mallatt, Jon; Craig, Catherine Waggoner; Yoder, Matthew J

2010-04-01

This study (1) uses nearly complete rRNA-gene sequences from across Metazoa (197 taxa) to reconstruct animal phylogeny; (2) presents a highly annotated, manual alignment of these sequences with special reference to rRNA features including paired sites (http://purl.oclc.org/NET/rRNA/Metazoan_alignment) and (3) tests, after eliminating as few disruptive, rogue sequences as possible, if a likelihood framework can recover the main metazoan clades. We found that systematic elimination of approximately 6% of the sequences, including the divergent or unstably placed sequences of cephalopods, arrowworm, symphylan and pauropod myriapods, and of myzostomid and nemertodermatid worms, led to a tree that supported Ecdysozoa, Lophotrochozoa, Protostomia, and Bilateria. Deuterostomia, however, was never recovered, because the rRNA of urochordates goes (nonsignificantly) near the base of the Bilateria. Counterintuitively, when we modeled the evolution of the paired sites, phylogenetic resolution was not increased over traditional tree-building models that assume all sites in rRNA evolve independently. The rRNA genes of non-bilaterians contain a higher % AT than do those of most bilaterians. The rRNA genes of Acoela and Myzostomida were found to be secondarily shortened, AT-enriched, and highly modified, throwing some doubt on the location of these worms at the base of Bilateria in the rRNA tree--especially myzostomids, which other evidence suggests are annelids instead. Other findings are marsupial-with-placental mammals, arrowworms in Ecdysozoa (well supported here but contradicted by morphology), and Placozoa as sister to Cnidaria. Finally, despite the difficulties, the rRNA-gene trees are in strong concordance with trees derived from multiple protein-coding genes in supporting the new animal phylogeny. (c) 2009 Elsevier Inc. All rights reserved.
Identifying clinically relevant drug resistance genes in drug-induced resistant cancer cell lines and post- chemotherapy tissues

PubMed Central

Tong, Mengsha; Zheng, Weicheng; Lu, Xingrong; Ao, Lu; Li, Xiangyu; Guan, Qingzhou; Cai, Hao; Li, Mengyao; Yan, Haidan; Guo, You; Chi, Pan; Guo, Zheng

2015-01-01

Until recently, few molecular signatures of drug resistance identified in drug-induced resistant cancer cell models can be translated into clinical practice. Here, we defined differentially expressed genes (DEGs) between pre-chemotherapy colorectal cancer (CRC) tissue samples of non-responders and responders for 5-fluorouracil and oxaliplatin-based therapy as clinically relevant drug resistance genes (CRG5-FU/L-OHP). Taking CRG5-FU/L-OHP as reference, we evaluated the clinical relevance of several types of genes derived from HCT116 CRC cells with resistance to 5-fluorouracil and oxaliplatin, respectively. The results revealed that DEGs between parental and resistant cells, when both were treated with the corresponding drug for a certain time, were significantly consistent with the CRG5-FU/L-OHP as well as the DEGs between the post-chemotherapy CRC specimens of responders and non-responders. This study suggests a novel strategy to extract clinically relevant drug resistance genes from both drug-induced resistant cell models and post-chemotherapy cancer tissue specimens. PMID:26515599
Biomining active cellulases from a mining bioremediation system.

PubMed

Mewis, Keith; Armstrong, Zachary; Song, Young C; Baldwin, Susan A; Withers, Stephen G; Hallam, Steven J

2013-09-20

Functional metagenomics has emerged as a powerful method for gene model validation and enzyme discovery from natural and human engineered ecosystems. Here we report development of a high-throughput functional metagenomic screen incorporating bioinformatic and biochemical analyses features. A fosmid library containing 6144 clones sourced from a mining bioremediation system was screened for cellulase activity using 2,4-dinitrophenyl β-cellobioside, a previously proven cellulose model substrate. Fifteen active clones were recovered and fully sequenced revealing 9 unique clones with the ability to hydrolyse 1,4-β-D-glucosidic linkages. Transposon mutagenesis identified genes belonging to glycoside hydrolase (GH) 1, 3, or 5 as necessary for mediating this activity. Reference trees for GH 1, 3, and 5 families were generated from sequences in the CAZy database for automated phylogenetic analysis of fosmid end and active clone sequences revealing known and novel cellulase encoding genes. Active cellulase genes recovered in functional screens were subcloned into inducible high copy plasmids, expressed and purified to determine enzymatic properties including thermostability, pH optima, and substrate specificity. The workflow described here provides a general paradigm for recovery and characterization of microbially derived genes and gene products based on genetic logic and contemporary screening technologies developed for model organismal systems. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.

Genetic quality and sexual selection: an integrated framework for good genes and compatible genes.

PubMed

Neff, Bryan D; Pitcher, Trevor E

2005-01-01

Why are females so choosy when it comes to mating? This question has puzzled and marveled evolutionary and behavioral ecologists for decades. In mating systems in which males provide direct benefits to the female or her offspring, such as food or shelter, the answer seems straightforward--females should prefer to mate with males that are able to provide more resources. The answer is less clear in other mating systems in which males provide no resources (other than sperm) to females. Theoretical models that account for the evolution of mate choice in such nonresource-based mating systems require that females obtain a genetic benefit through increased offspring fitness from their choice. Empirical studies of nonresource-based mating systems that are characterized by strong female choice for males with elaborate sexual traits (like the large tail of peacocks) suggest that additive genetic benefits can explain only a small percentage of the variation in fitness. Other research on genetic benefits has examined nonadditive effects as another source of genetic variation in fitness and a potential benefit to female mate choice. In this paper, we review the sexual selection literature on genetic quality to address five objectives. First, we attempt to provide an integrated framework for discussing genetic quality. We propose that the term 'good gene' be used exclusively to refer to additive genetic variation in fitness, 'compatible gene' be used to refer to nonadditive genetic variation in fitness, and 'genetic quality' be defined as the sum of the two effects. Second, we review empirical approaches used to calculate the effect size of genetic quality and discuss these approaches in the context of measuring benefits from good genes, compatible genes and both types of genes. Third, we discuss biological mechanisms for acquiring and promoting offspring genetic quality and categorize these into three stages during breeding: (i) precopulatory (mate choice); (ii) postcopulatory, prefertilization (sperm utilization); and (iii) postcopulatory, postfertilization (differential investment). Fourth, we present a verbal model of the effect of good genes sexual selection and compatible genes sexual selection on population genetic variation in fitness, and discuss the potential trade-offs that might exist between mate choice for good genes and mate choice for compatible genes. Fifth, we discuss some future directions for research on genetic quality and sexual selection.
Text-mined phenotype annotation and vector-based similarity to improve identification of similar phenotypes and causative genes in monogenic disease patients.

PubMed

Saklatvala, Jake R; Dand, Nick; Simpson, Michael A

2018-05-01

The genetic diagnosis of rare monogenic diseases using exome/genome sequencing requires the true causal variant(s) to be identified from tens of thousands of observed variants. Typically a virtual gene panel approach is taken whereby only variants in genes known to cause phenotypes resembling the patient under investigation are considered. With the number of known monogenic gene-disease pairs exceeding 5,000, manual curation of personalized virtual panels using exhaustive knowledge of the genetic basis of the human monogenic phenotypic spectrum is challenging. We present improved probabilistic methods for estimating phenotypic similarity based on Human Phenotype Ontology annotation. A limitation of existing methods for evaluating a disease's similarity to a reference set is that reference diseases are typically represented as a series of binary (present/absent) observations of phenotypic terms. We evaluate a quantified disease reference set, using term frequency in phenotypic text descriptions to approximate term relevance. We demonstrate an improved ability to identify related diseases through the use of a quantified reference set, and that vector space similarity measures perform better than established information content-based measures. These improvements enable the generation of bespoke virtual gene panels, facilitating more accurate and efficient interpretation of genomic variant profiles from individuals with rare Mendelian disorders. These methods are available online at https://atlas.genetics.kcl.ac.uk/~jake/cgi-bin/patient_sim.py. © 2018 Wiley Periodicals, Inc.
A literature search tool for intelligent extraction of disease-associated genes.

PubMed

Jung, Jae-Yoon; DeLuca, Todd F; Nelson, Tristan H; Wall, Dennis P

2014-01-01

To extract disorder-associated genes from the scientific literature in PubMed with greater sensitivity for literature-based support than existing methods. We developed a PubMed query to retrieve disorder-related, original research articles. Then we applied a rule-based text-mining algorithm with keyword matching to extract target disorders, genes with significant results, and the type of study described by the article. We compared our resulting candidate disorder genes and supporting references with existing databases. We demonstrated that our candidate gene set covers nearly all genes in manually curated databases, and that the references supporting the disorder-gene link are more extensive and accurate than other general purpose gene-to-disorder association databases. We implemented a novel publication search tool to find target articles, specifically focused on links between disorders and genotypes. Through comparison against gold-standard manually updated gene-disorder databases and comparison with automated databases of similar functionality we show that our tool can search through the entirety of PubMed to extract the main gene findings for human diseases rapidly and accurately.
Unstable Expression of Commonly Used Reference Genes in Rat Pancreatic Islets Early after Isolation Affects Results of Gene Expression Studies.

PubMed

Kosinová, Lucie; Cahová, Monika; Fábryová, Eva; Týcová, Irena; Koblas, Tomáš; Leontovyč, Ivan; Saudek, František; Kříž, Jan

2016-01-01

The use of RT-qPCR provides a powerful tool for gene expression studies; however, the proper interpretation of the obtained data is crucially dependent on accurate normalization based on stable reference genes. Recently, strong evidence has been shown indicating that the expression of many commonly used reference genes may vary significantly due to diverse experimental conditions. The isolation of pancreatic islets is a complicated procedure which creates severe mechanical and metabolic stress leading possibly to cellular damage and alteration of gene expression. Despite of this, freshly isolated islets frequently serve as a control in various gene expression and intervention studies. The aim of our study was to determine expression of 16 candidate reference genes and one gene of interest (F3) in isolated rat pancreatic islets during short-term cultivation in order to find a suitable endogenous control for gene expression studies. We compared the expression stability of the most commonly used reference genes and evaluated the reliability of relative and absolute quantification using RT-qPCR during 0-120 hrs after isolation. In freshly isolated islets, the expression of all tested genes was markedly depressed and it increased several times throughout the first 48 hrs of cultivation. We observed significant variability among samples at 0 and 24 hrs but substantial stabilization from 48 hrs onwards. During the first 48 hrs, relative quantification failed to reflect the real changes in respective mRNA concentrations while in the interval 48-120 hrs, the relative expression generally paralleled the results determined by absolute quantification. Thus, our data call into question the suitability of relative quantification for gene expression analysis in pancreatic islets during the first 48 hrs of cultivation, as the results may be significantly affected by unstable expression of reference genes. However, this method could provide reliable information from 48 hrs onwards.
Optimizing and benchmarking de novo transcriptome sequencing: from library preparation to assembly evaluation.

PubMed

Hara, Yuichiro; Tatsumi, Kaori; Yoshida, Michio; Kajikawa, Eriko; Kiyonari, Hiroshi; Kuraku, Shigehiro

2015-11-18

RNA-seq enables gene expression profiling in selected spatiotemporal windows and yields massive sequence information with relatively low cost and time investment, even for non-model species. However, there remains a large room for optimizing its workflow, in order to take full advantage of continuously developing sequencing capacity. Transcriptome sequencing for three embryonic stages of Madagascar ground gecko (Paroedura picta) was performed with the Illumina platform. The output reads were assembled de novo for reconstructing transcript sequences. In order to evaluate the completeness of transcriptome assemblies, we prepared a reference gene set consisting of vertebrate one-to-one orthologs. To take advantage of increased read length of >150 nt, we demonstrated shortened RNA fragmentation time, which resulted in a dramatic shift of insert size distribution. To evaluate products of multiple de novo assembly runs incorporating reads with different RNA sources, read lengths, and insert sizes, we introduce a new reference gene set, core vertebrate genes (CVG), consisting of 233 genes that are shared as one-to-one orthologs by all vertebrate genomes examined (29 species)., The completeness assessment performed by the computational pipelines CEGMA and BUSCO referring to CVG, demonstrated higher accuracy and resolution than with the gene set previously established for this purpose. As a result of the assessment with CVG, we have derived the most comprehensive transcript sequence set of the Madagascar ground gecko by means of assembling individual libraries followed by clustering the assembled sequences based on their overall similarities. Our results provide several insights into optimizing de novo RNA-seq workflow, including the coordination between library insert size and read length, which manifested in improved connectivity of assemblies. The approach and assembly assessment with CVG demonstrated here would be applicable to transcriptome analysis of other species as well as whole genome analyses.
Evaluation of reference genes for reverse transcription quantitative real-time PCR (RT-qPCR) studies in Silene vulgaris considering the method of cDNA preparation

PubMed Central

Koloušková, Pavla; Stone, James D.

2017-01-01

Accurate gene expression measurements are essential in studies of both crop and wild plants. Reverse transcription quantitative real-time PCR (RT-qPCR) has become a preferred tool for gene expression estimation. A selection of suitable reference genes for the normalization of transcript levels is an essential prerequisite of accurate RT-qPCR results. We evaluated the expression stability of eight candidate reference genes across roots, leaves, flower buds and pollen of Silene vulgaris (bladder campion), a model plant for the study of gynodioecy. As random priming of cDNA is recommended for the study of organellar transcripts and poly(A) selection is indicated for nuclear transcripts, we estimated gene expression with both random-primed and oligo(dT)-primed cDNA. Accordingly, we determined reference genes that perform well with oligo(dT)- and random-primed cDNA, making it possible to estimate levels of nucleus-derived transcripts in the same cDNA samples as used for organellar transcripts, a key benefit in studies of cyto-nuclear interactions. Gene expression variance was estimated by RefFinder, which integrates four different analytical tools. The SvACT and SvGAPDH genes were the most stable candidates across various organs of S. vulgaris, regardless of whether pollen was included or not. PMID:28817728
Automated identification of reference genes based on RNA-seq data.

PubMed

Carmona, Rosario; Arroyo, Macarena; Jiménez-Quesada, María José; Seoane, Pedro; Zafra, Adoración; Larrosa, Rafael; Alché, Juan de Dios; Claros, M Gonzalo

2017-08-18

Gene expression analyses demand appropriate reference genes (RGs) for normalization, in order to obtain reliable assessments. Ideally, RG expression levels should remain constant in all cells, tissues or experimental conditions under study. Housekeeping genes traditionally fulfilled this requirement, but they have been reported to be less invariant than expected; therefore, RGs should be tested and validated for every particular situation. Microarray data have been used to propose new RGs, but only a limited set of model species and conditions are available; on the contrary, RNA-seq experiments are more and more frequent and constitute a new source of candidate RGs. An automated workflow based on mapped NGS reads has been constructed to obtain highly and invariantly expressed RGs based on a normalized expression in reads per mapped million and the coefficient of variation. This workflow has been tested with Roche/454 reads from reproductive tissues of olive tree (Olea europaea L.), as well as with Illumina paired-end reads from two different accessions of Arabidopsis thaliana and three different human cancers (prostate, small-cell cancer lung and lung adenocarcinoma). Candidate RGs have been proposed for each species and many of them have been previously reported as RGs in literature. Experimental validation of significant RGs in olive tree is provided to support the algorithm. Regardless sequencing technology, number of replicates, and library sizes, when RNA-seq experiments are designed and performed, the same datasets can be analyzed with our workflow to extract suitable RGs for subsequent PCR validation. Moreover, different subset of experimental conditions can provide different suitable RGs.
Genomic DNA-based absolute quantification of gene expression in Vitis

USDA-ARS?s Scientific Manuscript database

Many studies in which gene expression is quantified by polymerase chain reaction represent the expression of a gene of interest (GOI) relative to that of a reference gene (RG). Relative expression is founded on the assumptions that RG expression is stable across samples, treatments, organs, etc., an...
snoU6 and 5S RNAs are not reliable miRNA reference genes in neuronal differentiation.

PubMed

Lim, Q E; Zhou, L; Ho, Y K; Wan, G; Too, H P

2011-12-29

Accurate profiling of microRNAs (miRNAs) is an essential step for understanding the functional significance of these small RNAs in both physiological and pathological processes. Quantitative real-time PCR (qPCR) has gained acceptance as a robust and reliable transcriptomic method to profile subtle changes in miRNA levels and requires reference genes for accurate normalization of gene expression. 5S and snoU6 RNAs are commonly used as reference genes in microRNA quantification. It is currently unknown if these small RNAs are stably expressed during neuronal differentiation. Panels of miRNAs have been suggested as alternative reference genes to 5S and snoU6 in various physiological contexts. To test the hypothesis that miRNAs may serve as stable references during neuronal differentiation, the expressions of eight miRNAs, 5S and snoU6 RNAs in five differentiating neuronal cell types were analyzed using qPCR. The stabilities of the expressions were evaluated using two complementary statistical approaches (geNorm and Normfinder). Expressions of 5S and snoU6 RNAs were stable under some but not all conditions of neuronal differentiation and thus are not suitable reference genes. In contrast, a combination of three miRNAs (miR-103, miR-106b and miR-26b) allowed accurate expression normalization across different models of neuronal differentiation. Copyright © 2011 IBRO. Published by Elsevier Ltd. All rights reserved.
The Pathogen-Host Interactions database (PHI-base): additions and future developments

PubMed Central

Urban, Martin; Pant, Rashmi; Raghunath, Arathi; Irvine, Alistair G.; Pedro, Helder; Hammond-Kosack, Kim E.

2015-01-01

Rapidly evolving pathogens cause a diverse array of diseases and epidemics that threaten crop yield, food security as well as human, animal and ecosystem health. To combat infection greater comparative knowledge is required on the pathogenic process in multiple species. The Pathogen-Host Interactions database (PHI-base) catalogues experimentally verified pathogenicity, virulence and effector genes from bacterial, fungal and protist pathogens. Mutant phenotypes are associated with gene information. The included pathogens infect a wide range of hosts including humans, animals, plants, insects, fish and other fungi. The current version, PHI-base 3.6, available at http://www.phi-base.org, stores information on 2875 genes, 4102 interactions, 110 host species, 160 pathogenic species (103 plant, 3 fungal and 54 animal infecting species) and 181 diseases drawn from 1243 references. Phenotypic and gene function information has been obtained by manual curation of the peer-reviewed literature. A controlled vocabulary consisting of nine high-level phenotype terms permits comparisons and data analysis across the taxonomic space. PHI-base phenotypes were mapped via their associated gene information to reference genomes available in Ensembl Genomes. Virulence genes and hotspots can be visualized directly in genome browsers. Future plans for PHI-base include development of tools facilitating community-led curation and inclusion of the corresponding host target(s). PMID:25414340
Identification of a reference gene for the quantification of mRNA and miRNA expression during skin wound healing.

PubMed

Etich, Julia; Bergmeier, Vera; Pitzler, Lena; Brachvogel, Bent

2017-03-01

Wound healing is a coordinated process to restore tissue homeostasis and reestablish the protective barrier of the skin. miRNAs may modulate the expression of target genes to contribute to repair processes, but due to the complexity of the tissue it is challenging to quantify gene expression during the distinct phases of wound repair. Here, we aimed to identify a common reference gene to quantify changes in miRNA and mRNA expression during skin wound healing. Quantitative real-time PCR and bioinformatic analysis tools were used to identify suitable reference genes during skin repair and their reliability was tested by studying the expression of mRNAs and miRNAs. Morphological assessment of wounds showed that the injury model recapitulates the distinct phases of skin repair. Non-degraded RNA could be isolated from skin and wounds and used to study the expression of non-coding small nuclear RNAs during wound healing. Among those, RNU6B was most constantly expressed during skin repair. Using this reference gene we could confirm the transient upregulation of IL-1β and PTPRC/CD45 during the early phase as well as the increased expression of collagen type I at later stages of repair and validate the differential expression of miR-204, miR-205, and miR-31 in skin wounds. In contrast to Gapdh the normalization to multiple reference genes gave a similar outcome. RNU6B is an accurate alternative normalizer to quantify mRNA and miRNA expression during the distinct phases of skin wound healing when analysis of multiple reference genes is not feasible.
The GP problem: quantifying gene-to-phenotype relationships.

PubMed

Cooper, Mark; Chapman, Scott C; Podlich, Dean W; Hammer, Graeme L

2002-01-01

In this paper we refer to the gene-to-phenotype modeling challenge as the GP problem. Integrating information across levels of organization within a genotype-environment system is a major challenge in computational biology. However, resolving the GP problem is a fundamental requirement if we are to understand and predict phenotypes given knowledge of the genome and model dynamic properties of biological systems. Organisms are consequences of this integration, and it is a major property of biological systems that underlies the responses we observe. We discuss the E(NK) model as a framework for investigation of the GP problem and the prediction of system properties at different levels of organization. We apply this quantitative framework to an investigation of the processes involved in genetic improvement of plants for agriculture. In our analysis, N genes determine the genetic variation for a set of traits that are responsible for plant adaptation to E environment-types within a target population of environments. The N genes can interact in epistatic NK gene-networks through the way that they influence plant growth and development processes within a dynamic crop growth model. We use a sorghum crop growth model, available within the APSIM agricultural production systems simulation model, to integrate the gene-environment interactions that occur during growth and development and to predict genotype-to-phenotype relationships for a given E(NK) model. Directional selection is then applied to the population of genotypes, based on their predicted phenotypes, to simulate the dynamic aspects of genetic improvement by a plant-breeding program. The outcomes of the simulated breeding are evaluated across cycles of selection in terms of the changes in allele frequencies for the N genes and the genotypic and phenotypic values of the populations of genotypes.
Gene Expression Profile Analysis is Directly Affected by the Selected Reference Gene: The Case of Leaf-Cutting Atta Sexdens

PubMed Central

Máximo, Wesley P. F.; Zanetti, Ronald; Paiva, Luciano V.

2018-01-01

Although several ant species are important targets for the development of molecular control strategies, only a few studies focus on identifying and validating reference genes for quantitative reverse transcription polymerase chain reaction (RT-qPCR) data normalization. We provide here an extensive study to identify and validate suitable reference genes for gene expression analysis in the ant Atta sexdens, a threatening agricultural pest in South America. The optimal number of reference genes varies according to each sample and the result generated by RefFinder differed about which is the most suitable reference gene. Results suggest that the RPS16, NADH and SDHB genes were the best reference genes in the sample pool according to stability values. The SNF7 gene expression pattern was stable in all evaluated sample set. In contrast, when using less stable reference genes for normalization a large variability in SNF7 gene expression was recorded. There is no universal reference gene suitable for all conditions under analysis, since these genes can also participate in different cellular functions, thus requiring a systematic validation of possible reference genes for each specific condition. The choice of reference genes on SNF7 gene normalization confirmed that unstable reference genes might drastically change the expression profile analysis of target candidate genes. PMID:29419794
Identifying gene networks underlying the neurobiology of ethanol and alcoholism.

PubMed

Wolen, Aaron R; Miles, Michael F

2012-01-01

For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.
Selection and validation of reference genes for quantitative real-time PCR in Artemisia sphaerocephala based on transcriptome sequence data.

PubMed

Hu, Xiaowei; Zhang, Lijing; Nan, Shuzhen; Miao, Xiumei; Yang, Pengfang; Duan, Guoqin; Fu, Hua

2018-05-30

Artemisia sphaerocephala, a dicotyledonous perennial semi-shrub belonging to the Artemisia genus of the Compositae family, is widely distributed in northwestern China. This shrub is one of the most important pioneer plants which is capable of protecting rangelands from wind erosion. It therefore plays a vital role in maintaining desert ecosystem stability. In addition, to its use as a forage grass, it has excellent prospective applications as a source of plant oil and as a plant-based fuel. The use of internal genes is the basis for accurately assessing Real time quantitative PCR. In this study, based on transcriptome data of A. sphaerocephala, we analyzed 21 candidate internal genes to determine the optimal internal genes in this shrub. The stabilities of candidate genes were evaluated in 16 samples of A. sphaerocephala. Finally, UBC9 and TIP41-like were determined as the optimal reference genes in A. sphaerocephala by Delta Ct and three various programs. There were GeNorm, NormFinder and BestKeeper. Copyright © 2018 Elsevier B.V. All rights reserved.
Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content.

PubMed

Tran, Hue T M; Ramaraj, Thiruvarangan; Furtado, Agnelo; Lee, Leonard Slade; Henry, Robert J

2018-03-07

Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high- or low-caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long-read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway-based analysis, 65 caffeine-associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Digital gene expression for non-model organisms

PubMed Central

Hong, Lewis Z.; Li, Jun; Schmidt-Küntzel, Anne; Warren, Wesley C.; Barsh, Gregory S.

2011-01-01

Next-generation sequencing technologies offer new approaches for global measurements of gene expression but are mostly limited to organisms for which a high-quality assembled reference genome sequence is available. We present a method for gene expression profiling called EDGE, or EcoP15I-tagged Digital Gene Expression, based on ultra-high-throughput sequencing of 27-bp cDNA fragments that uniquely tag the corresponding gene, thereby allowing direct quantification of transcript abundance. We show that EDGE is capable of assaying for expression in >99% of genes in the genome and achieves saturation after 6–8 million reads. EDGE exhibits very little technical noise, reveals a large (106) dynamic range of gene expression, and is particularly suited for quantification of transcript abundance in non-model organisms where a high-quality annotated genome is not available. In a direct comparison with RNA-seq, both methods provide similar assessments of relative transcript abundance, but EDGE does better at detecting gene expression differences for poorly expressed genes and does not exhibit transcript length bias. Applying EDGE to laboratory mice, we show that a loss-of-function mutation in the melanocortin 1 receptor (Mc1r), recognized as a Mendelian determinant of yellow hair color in many different mammals, also causes reduced expression of genes involved in the interferon response. To illustrate the application of EDGE to a non-model organism, we examine skin biopsy samples from a cheetah (Acinonyx jubatus) and identify genes likely to control differences in the color of spotted versus non-spotted regions. PMID:21844123
Selection and Evaluation of Potential Reference Genes for Gene Expression Analysis in the Brown Planthopper, Nilaparvata lugens (Hemiptera: Delphacidae) Using Reverse-Transcription Quantitative PCR

PubMed Central

Zhu, Xun; Wan, Hu; Shakeel, Muhammad; Zhan, Sha; Jin, Byung-Rae; Li, Jianhong

2014-01-01

The brown planthopper (BPH), Nilaparvata lugens (Hemiptera, Delphacidae), is one of the most important rice pests. Abundant genetic studies on BPH have been conducted using reverse-transcription quantitative real-time PCR (qRT-PCR). Using qRT-PCR, the expression levels of target genes are calculated on the basis of endogenous controls. These genes need to be appropriately selected by experimentally assessing whether they are stably expressed under different conditions. However, such studies on potential reference genes in N. lugens are lacking. In this paper, we presented a systematic exploration of eight candidate reference genes in N. lugens, namely, actin 1 (ACT), muscle actin (MACT), ribosomal protein S11 (RPS11), ribosomal protein S15e (RPS15), alpha 2-tubulin (TUB), elongation factor 1 delta (EF), 18S ribosomal RNA (18S), and arginine kinase (AK) and used four alternative methods (BestKeeper, geNorm, NormFinder, and the delta Ct method) to evaluate the suitability of these genes as endogenous controls. We examined their expression levels among different experimental factors (developmental stage, body part, geographic population, temperature variation, pesticide exposure, diet change, and starvation) following the MIQE (Minimum Information for publication of Quantitative real time PCR Experiments) guidelines. Based on the results of RefFinder, which integrates four currently available major software programs to compare and rank the tested candidate reference genes, RPS15, RPS11, and TUB were found to be the most suitable reference genes in different developmental stages, body parts, and geographic populations, respectively. RPS15 was the most suitable gene under different temperature and diet conditions, while RPS11 was the most suitable gene under different pesticide exposure and starvation conditions. This work sheds light on establishing a standardized qRT-PCR procedure in N. lugens, and serves as a starting point for screening for reference genes for expression studies of related insects. PMID:24466124
Selection and evaluation of potential reference genes for gene expression analysis in the brown planthopper, Nilaparvata lugens (Hemiptera: Delphacidae) using reverse-transcription quantitative PCR.

PubMed

Yuan, Miao; Lu, Yanhui; Zhu, Xun; Wan, Hu; Shakeel, Muhammad; Zhan, Sha; Jin, Byung-Rae; Li, Jianhong

2014-01-01

The brown planthopper (BPH), Nilaparvata lugens (Hemiptera, Delphacidae), is one of the most important rice pests. Abundant genetic studies on BPH have been conducted using reverse-transcription quantitative real-time PCR (qRT-PCR). Using qRT-PCR, the expression levels of target genes are calculated on the basis of endogenous controls. These genes need to be appropriately selected by experimentally assessing whether they are stably expressed under different conditions. However, such studies on potential reference genes in N. lugens are lacking. In this paper, we presented a systematic exploration of eight candidate reference genes in N. lugens, namely, actin 1 (ACT), muscle actin (MACT), ribosomal protein S11 (RPS11), ribosomal protein S15e (RPS15), alpha 2-tubulin (TUB), elongation factor 1 delta (EF), 18S ribosomal RNA (18S), and arginine kinase (AK) and used four alternative methods (BestKeeper, geNorm, NormFinder, and the delta Ct method) to evaluate the suitability of these genes as endogenous controls. We examined their expression levels among different experimental factors (developmental stage, body part, geographic population, temperature variation, pesticide exposure, diet change, and starvation) following the MIQE (Minimum Information for publication of Quantitative real time PCR Experiments) guidelines. Based on the results of RefFinder, which integrates four currently available major software programs to compare and rank the tested candidate reference genes, RPS15, RPS11, and TUB were found to be the most suitable reference genes in different developmental stages, body parts, and geographic populations, respectively. RPS15 was the most suitable gene under different temperature and diet conditions, while RPS11 was the most suitable gene under different pesticide exposure and starvation conditions. This work sheds light on establishing a standardized qRT-PCR procedure in N. lugens, and serves as a starting point for screening for reference genes for expression studies of related insects.
sscMap: an extensible Java application for connecting small-molecule drugs using gene-expression signatures.

PubMed

Zhang, Shu-Dong; Gant, Timothy W

2009-07-31

Connectivity mapping is a process to recognize novel pharmacological and toxicological properties in small molecules by comparing their gene expression signatures with others in a database. A simple and robust method for connectivity mapping with increased specificity and sensitivity was recently developed, and its utility demonstrated using experimentally derived gene signatures. This paper introduces sscMap (statistically significant connections' map), a Java application designed to undertake connectivity mapping tasks using the recently published method. The software is bundled with a default collection of reference gene-expression profiles based on the publicly available dataset from the Broad Institute Connectivity Map 02, which includes data from over 7000 Affymetrix microarrays, for over 1000 small-molecule compounds, and 6100 treatment instances in 5 human cell lines. In addition, the application allows users to add their custom collections of reference profiles and is applicable to a wide range of other 'omics technologies. The utility of sscMap is two fold. First, it serves to make statistically significant connections between a user-supplied gene signature and the 6100 core reference profiles based on the Broad Institute expanded dataset. Second, it allows users to apply the same improved method to custom-built reference profiles which can be added to the database for future referencing. The software can be freely downloaded from http://purl.oclc.org/NET/sscMap.

Revealing the missing expressed genes beyond the human reference genome by RNA-Seq.

PubMed

Chen, Geng; Li, Ruiyuan; Shi, Leming; Qi, Junyi; Hu, Pengzhan; Luo, Jian; Liu, Mingyao; Shi, Tieliu

2011-12-02

The complete and accurate human reference genome is important for functional genomics researches. Therefore, the incomplete reference genome and individual specific sequences have significant effects on various studies. we used two RNA-Seq datasets from human brain tissues and 10 mixed cell lines to investigate the completeness of human reference genome. First, we demonstrated that in previously identified ~5 Mb Asian and ~5 Mb African novel sequences that are absent from the human reference genome of NCBI build 36, ~211 kb and ~201 kb of them could be transcribed, respectively. Our results suggest that many of those transcribed regions are not specific to Asian and African, but also present in Caucasian. Then, we found that the expressions of 104 RefSeq genes that are unalignable to NCBI build 37 in brain and cell lines are higher than 0.1 RPKM. 55 of them are conserved across human, chimpanzee and macaque, suggesting that there are still a significant number of functional human genes absent from the human reference genome. Moreover, we identified hundreds of novel transcript contigs that cannot be aligned to NCBI build 37, RefSeq genes and EST sequences. Some of those novel transcript contigs are also conserved among human, chimpanzee and macaque. By positioning those contigs onto the human genome, we identified several large deletions in the reference genome. Several conserved novel transcript contigs were further validated by RT-PCR. Our findings demonstrate that a significant number of genes are still absent from the incomplete human reference genome, highlighting the importance of further refining the human reference genome and curating those missing genes. Our study also shows the importance of de novo transcriptome assembly. The comparative approach between reference genome and other related human genomes based on the transcriptome provides an alternative way to refine the human reference genome.
Quantitative gene expression analysis in Caenorhabditis elegans using single molecule RNA FISH.

PubMed

Bolková, Jitka; Lanctôt, Christian

2016-04-01

Advances in fluorescent probe design and synthesis have allowed the uniform in situ labeling of individual RNA molecules. In a technique referred to as single molecule RNA FISH (smRNA FISH), the labeled RNA molecules can be imaged as diffraction-limited spots and counted using image analysis algorithms. Single RNA counting has provided valuable insights into the process of gene regulation. This microscopy-based method has often revealed a high cell-to-cell variability in expression levels, which has in turn led to a growing interest in investigating the biological significance of gene expression noise. Here we describe the application of the smRNA FISH technique to samples of Caenorhabditis elegans, a well-characterized model organism. Copyright © 2015 Elsevier Inc. All rights reserved.
Defining the Estimated Core Genome of Bacterial Populations Using a Bayesian Decision Model

PubMed Central

van Tonder, Andries J.; Mistry, Shilan; Bray, James E.; Hill, Dorothea M. C.; Cody, Alison J.; Farmer, Chris L.; Klugman, Keith P.; von Gottberg, Anne; Bentley, Stephen D.; Parkhill, Julian; Jolley, Keith A.; Maiden, Martin C. J.; Brueggemann, Angela B.

2014-01-01

The bacterial core genome is of intense interest and the volume of whole genome sequence data in the public domain available to investigate it has increased dramatically. The aim of our study was to develop a model to estimate the bacterial core genome from next-generation whole genome sequencing data and use this model to identify novel genes associated with important biological functions. Five bacterial datasets were analysed, comprising 2096 genomes in total. We developed a Bayesian decision model to estimate the number of core genes, calculated pairwise evolutionary distances (p-distances) based on nucleotide sequence diversity, and plotted the median p-distance for each core gene relative to its genome location. We designed visually-informative genome diagrams to depict areas of interest in genomes. Case studies demonstrated how the model could identify areas for further study, e.g. 25% of the core genes with higher sequence diversity in the Campylobacter jejuni and Neisseria meningitidis genomes encoded hypothetical proteins. The core gene with the highest p-distance value in C. jejuni was annotated in the reference genome as a putative hydrolase, but further work revealed that it shared sequence homology with beta-lactamase/metallo-beta-lactamases (enzymes that provide resistance to a range of broad-spectrum antibiotics) and thioredoxin reductase genes (which reduce oxidative stress and are essential for DNA replication) in other C. jejuni genomes. Our Bayesian model of estimating the core genome is principled, easy to use and can be applied to large genome datasets. This study also highlighted the lack of knowledge currently available for many core genes in bacterial genomes of significant global public health importance. PMID:25144616
Composite transcriptome assembly of RNA-seq data in a sheep model for delayed bone healing.

PubMed

Jäger, Marten; Ott, Claus-Eric; Grünhagen, Johannes; Hecht, Jochen; Schell, Hanna; Mundlos, Stefan; Duda, Georg N; Robinson, Peter N; Lienau, Jasmin

2011-03-24

The sheep is an important model organism for many types of medically relevant research, but molecular genetic experiments in the sheep have been limited by the lack of knowledge about ovine gene sequences. Prior to our study, mRNA sequences for only 1,556 partial or complete ovine genes were publicly available. Therefore, we developed a composite de novo transcriptome assembly method for next-generation sequence data to combine known ovine mRNA and EST sequences, mRNA sequences from mouse and cow, and sequences assembled de novo from short read RNA-Seq data into a composite reference transcriptome, and identified transcripts from over 12 thousand previously undescribed ovine genes. Gene expression analysis based on these data revealed substantially different expression profiles in standard versus delayed bone healing in an ovine tibial osteotomy model. Hundreds of transcripts were differentially expressed between standard and delayed healing and between the time points of the standard and delayed healing groups. We used the sheep sequences to design quantitative RT-PCR assays with which we validated the differential expression of 26 genes that had been identified by RNA-seq analysis. A number of clusters of characteristic expression profiles could be identified, some of which showed striking differences between the standard and delayed healing groups. Gene Ontology (GO) analysis showed that the differentially expressed genes were enriched in terms including extracellular matrix, cartilage development, contractile fiber, and chemokine activity. Our results provide a first atlas of gene expression profiles and differentially expressed genes in standard and delayed bone healing in a large-animal model and provide a number of clues as to the shifts in gene expression that underlie delayed bone healing. In the course of our study, we identified transcripts of 13,987 ovine genes, including 12,431 genes for which no sequence information was previously available. This information will provide a basis for future molecular research involving the sheep as a model organism.
Composite transcriptome assembly of RNA-seq data in a sheep model for delayed bone healing

PubMed Central

2011-01-01

Background The sheep is an important model organism for many types of medically relevant research, but molecular genetic experiments in the sheep have been limited by the lack of knowledge about ovine gene sequences. Results Prior to our study, mRNA sequences for only 1,556 partial or complete ovine genes were publicly available. Therefore, we developed a composite de novo transcriptome assembly method for next-generation sequence data to combine known ovine mRNA and EST sequences, mRNA sequences from mouse and cow, and sequences assembled de novo from short read RNA-Seq data into a composite reference transcriptome, and identified transcripts from over 12 thousand previously undescribed ovine genes. Gene expression analysis based on these data revealed substantially different expression profiles in standard versus delayed bone healing in an ovine tibial osteotomy model. Hundreds of transcripts were differentially expressed between standard and delayed healing and between the time points of the standard and delayed healing groups. We used the sheep sequences to design quantitative RT-PCR assays with which we validated the differential expression of 26 genes that had been identified by RNA-seq analysis. A number of clusters of characteristic expression profiles could be identified, some of which showed striking differences between the standard and delayed healing groups. Gene Ontology (GO) analysis showed that the differentially expressed genes were enriched in terms including extracellular matrix, cartilage development, contractile fiber, and chemokine activity. Conclusions Our results provide a first atlas of gene expression profiles and differentially expressed genes in standard and delayed bone healing in a large-animal model and provide a number of clues as to the shifts in gene expression that underlie delayed bone healing. In the course of our study, we identified transcripts of 13,987 ovine genes, including 12,431 genes for which no sequence information was previously available. This information will provide a basis for future molecular research involving the sheep as a model organism. PMID:21435219
Selection of suitable endogenous reference genes for relative copy number detection in sugarcane.

PubMed

Xue, Bantong; Guo, Jinlong; Que, Youxiong; Fu, Zhiwei; Wu, Luguang; Xu, Liping

2014-05-19

Transgene copy number has a great impact on the expression level and stability of exogenous gene in transgenic plants. Proper selection of endogenous reference genes is necessary for detection of genetic components in genetically modification (GM) crops by quantitative real-time PCR (qPCR) or by qualitative PCR approach, especially in sugarcane with polyploid and aneuploid genomic structure. qPCR technique has been widely accepted as an accurate, time-saving method on determination of copy numbers in transgenic plants and on detection of genetically modified plants to meet the regulatory and legislative requirement. In this study, to find a suitable endogenous reference gene and its real-time PCR assay for sugarcane (Saccharum spp. hybrids) DNA content quantification, we evaluated a set of potential "single copy" genes including P4H, APRT, ENOL, CYC, TST and PRR, through qualitative PCR and absolute quantitative PCR. Based on copy number comparisons among different sugarcane genotypes, including five S. officinarum, one S. spontaneum and two S. spp. hybrids, these endogenous genes fell into three groups: ENOL-3--high copy number group, TST-1 and PRR-1--medium copy number group, P4H-1, APRT-2 and CYC-2--low copy number group. Among these tested genes, P4H, APRT and CYC were the most stable, while ENOL and TST were the least stable across different sugarcane genotypes. Therefore, three primer pairs of P4H-3, APRT-2 and CYC-2 were then selected as the suitable reference gene primer pairs for sugarcane. The test of multi-target reference genes revealed that the APRT gene was a specific amplicon, suggesting this gene is the most suitable to be used as an endogenous reference target for sugarcane DNA content quantification. These results should be helpful for establishing accurate and reliable qualitative and quantitative PCR analysis of GM sugarcane.
Selection of reference genes for gene expression studies related to intramuscular fat deposition in Capra hircus skeletal muscle.

PubMed

Zhu, Wuzheng; Lin, Yaqiu; Liao, Honghai; Wang, Yong

2015-01-01

The identification of suitable reference genes is critical for obtaining reliable results from gene expression studies using quantitative real-time PCR (qPCR) because the expression of reference genes may vary considerably under different experimental conditions. In most cases, however, commonly used reference genes are employed in data normalization without proper validation, which may lead to incorrect data interpretation. Here, we aim to select a set of optimal reference genes for the accurate normalization of gene expression associated with intramuscular fat (IMF) deposition during development. In the present study, eight reference genes (PPIB, HMBS, RPLP0, B2M, YWHAZ, 18S, GAPDH and ACTB) were evaluated by three different algorithms (geNorm, NormFinder and BestKeeper) in two types of muscle tissues (longissimus dorsi muscle and biceps femoris muscle) across different developmental stages. All three algorithms gave similar results. PPIB and HMBS were identified as the most stable reference genes, while the commonly used reference genes 18S and GAPDH were the most variably expressed, with expression varying dramatically across different developmental stages. Furthermore, to reveal the crucial role of appropriate reference genes in obtaining a reliable result, analysis of PPARG expression was performed by normalization to the most and the least stable reference genes. The relative expression levels of PPARG normalized to the most stable reference genes greatly differed from those normalized to the least stable one. Therefore, evaluation of reference genes must be performed for a given experimental condition before the reference genes are used. PPIB and HMBS are the optimal reference genes for analysis of gene expression associated with IMF deposition in skeletal muscle during development.
A framework for analyzing the relationship between gene expression and morphological, topological, and dynamical patterns in neuronal networks.

PubMed

de Arruda, Henrique Ferraz; Comin, Cesar Henrique; Miazaki, Mauro; Viana, Matheus Palhares; Costa, Luciano da Fontoura

2015-04-30

A key point in developmental biology is to understand how gene expression influences the morphological and dynamical patterns that are observed in living beings. In this work we propose a methodology capable of addressing this problem that is based on estimating the mutual information and Pearson correlation between the intensity of gene expression and measurements of several morphological properties of the cells. A similar approach is applied in order to identify effects of gene expression over the system dynamics. Neuronal networks were artificially grown over a lattice by considering a reference model used to generate artificial neurons. The input parameters of the artificial neurons were determined according to two distinct patterns of gene expression and the dynamical response was assessed by considering the integrate-and-fire model. As far as single gene dependence is concerned, we found that the interaction between the gene expression and the network topology, as well as between the former and the dynamics response, is strongly affected by the gene expression pattern. In addition, we observed a high correlation between the gene expression and some topological measurements of the neuronal network for particular patterns of gene expression. To our best understanding, there are no similar analyses to compare with. A proper understanding of gene expression influence requires jointly studying the morphology, topology, and dynamics of neurons. The proposed framework represents a first step towards predicting gene expression patterns from morphology and connectivity. Copyright © 2015. Published by Elsevier B.V.
Gene: a gene-centered information resource at NCBI.

PubMed

Brown, Garth R; Hem, Vichet; Katz, Kenneth S; Ovetsky, Michael; Wallin, Craig; Ermolaeva, Olga; Tolstoy, Igor; Tatusova, Tatiana; Pruitt, Kim D; Maglott, Donna R; Murphy, Terence D

2015-01-01

The National Center for Biotechnology Information's (NCBI) Gene database (www.ncbi.nlm.nih.gov/gene) integrates gene-specific information from multiple data sources. NCBI Reference Sequence (RefSeq) genomes for viruses, prokaryotes and eukaryotes are the primary foundation for Gene records in that they form the critical association between sequence and a tracked gene upon which additional functional and descriptive content is anchored. Additional content is integrated based on the genomic location and RefSeq transcript and protein sequence data. The content of a Gene record represents the integration of curation and automated processing from RefSeq, collaborating model organism databases, consortia such as Gene Ontology, and other databases within NCBI. Records in Gene are assigned unique, tracked integers as identifiers. The content (citations, nomenclature, genomic location, gene products and their attributes, phenotypes, sequences, interactions, variation details, maps, expression, homologs, protein domains and external databases) is available via interactive browsing through NCBI's Entrez system, via NCBI's Entrez programming utilities (E-Utilities and Entrez Direct) and for bulk transfer by FTP. Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Re-annotation, improved large-scale assembly and establishment of a catalogue of noncoding loci for the genome of the model brown alga Ectocarpus.

PubMed

Cormier, Alexandre; Avia, Komlan; Sterck, Lieven; Derrien, Thomas; Wucher, Valentin; Andres, Gwendoline; Monsoor, Misharl; Godfroy, Olivier; Lipinska, Agnieszka; Perrineau, Marie-Mathilde; Van De Peer, Yves; Hitte, Christophe; Corre, Erwan; Coelho, Susana M; Cock, J Mark

2017-04-01

The genome of the filamentous brown alga Ectocarpus was the first to be completely sequenced from within the brown algal group and has served as a key reference genome both for this lineage and for the stramenopiles. We present a complete structural and functional reannotation of the Ectocarpus genome. The large-scale assembly of the Ectocarpus genome was significantly improved and genome-wide gene re-annotation using extensive RNA-seq data improved the structure of 11 108 existing protein-coding genes and added 2030 new loci. A genome-wide analysis of splicing isoforms identified an average of 1.6 transcripts per locus. A large number of previously undescribed noncoding genes were identified and annotated, including 717 loci that produce long noncoding RNAs. Conservation of lncRNAs between Ectocarpus and another brown alga, the kelp Saccharina japonica, suggests that at least a proportion of these loci serve a function. Finally, a large collection of single nucleotide polymorphism-based markers was developed for genetic analyses. These resources are available through an updated and improved genome database. This study significantly improves the utility of the Ectocarpus genome as a high-quality reference for the study of many important aspects of brown algal biology and as a reference for genomic analyses across the stramenopiles. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Prunus transcription factors: breeding perspectives

PubMed Central

Bianchi, Valmor J.; Rubio, Manuel; Trainotti, Livio; Verde, Ignazio; Bonghi, Claudio; Martínez-Gómez, Pedro

2015-01-01

Many plant processes depend on differential gene expression, which is generally controlled by complex proteins called transcription factors (TFs). In peach, 1533 TFs have been identified, accounting for about 5.5% of the 27,852 protein-coding genes. These TFs are the reference for the rest of the Prunus species. TF studies in Prunus have been performed on the gene expression analysis of different agronomic traits, including control of the flowering process, fruit quality, and biotic and abiotic stress resistance. These studies, using quantitative RT-PCR, have mainly been performed in peach, and to a lesser extent in other species, including almond, apricot, black cherry, Fuji cherry, Japanese apricot, plum, and sour and sweet cherry. Other tools have also been used in TF studies, including cDNA-AFLP, LC-ESI-MS, RNA, and DNA blotting or mapping. More recently, new tools assayed include microarray and high-throughput DNA sequencing (DNA-Seq) and RNA sequencing (RNA-Seq). New functional genomics opportunities include genome resequencing and the well-known synteny among Prunus genomes and transcriptomes. These new functional studies should be applied in breeding programs in the development of molecular markers. With the genome sequences available, some strategies that have been used in model systems (such as SNP genotyping assays and genotyping-by-sequencing) may be applicable in the functional analysis of Prunus TFs as well. In addition, the knowledge of the gene functions and position in the peach reference genome of the TFs represents an additional advantage. These facts could greatly facilitate the isolation of genes via QTL (quantitative trait loci) map-based cloning in the different Prunus species, following the association of these TFs with the identified QTLs using the peach reference genome. PMID:26124770
nGASP - the nematode genome annotation assessment project

DOE Office of Scientific and Technical Information (OSTI.GOV)

Coghlan, A; Fiedler, T J; McKay, S J

2008-12-19

While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner'more » algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders. While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders.« less
CoryneRegNet 3.0--an interactive systems biology platform for the analysis of gene regulatory networks in corynebacteria and Escherichia coli.

PubMed

Baumbach, Jan; Wittkop, Tobias; Rademacher, Katrin; Rahmann, Sven; Brinkrolf, Karina; Tauch, Andreas

2007-04-30

CoryneRegNet is an ontology-based data warehouse for the reconstruction and visualization of transcriptional regulatory interactions in prokaryotes. To extend the biological content of CoryneRegNet, we added comprehensive data on transcriptional regulations in the model organism Escherichia coli K-12, originally deposited in the international reference database RegulonDB. The enhanced web interface of CoryneRegNet offers several types of search options. The results of a search are displayed in a table-based style and include a visualization of the genetic organization of the respective gene region. Information on DNA binding sites of transcriptional regulators is depicted by sequence logos. The results can also be displayed by several layouters implemented in the graphical user interface GraphVis, allowing, for instance, the visualization of genome-wide network reconstructions and the homology-based inter-species comparison of reconstructed gene regulatory networks. In an application example, we compare the composition of the gene regulatory networks involved in the SOS response of E. coli and Corynebacterium glutamicum. CoryneRegNet is available at the following URL: http://www.cebitec.uni-bielefeld.de/groups/gi/software/coryneregnet/.
The physical map of wheat chromosome 1BS provides insights into its gene space organization and evolution

PubMed Central

2013-01-01

Background The wheat genome sequence is an essential tool for advanced genomic research and improvements. The generation of a high-quality wheat genome sequence is challenging due to its complex 17 Gb polyploid genome. To overcome these difficulties, sequencing through the construction of BAC-based physical maps of individual chromosomes is employed by the wheat genomics community. Here, we present the construction of the first comprehensive physical map of chromosome 1BS, and illustrate its unique gene space organization and evolution. Results Fingerprinted BAC clones were assembled into 57 long scaffolds, anchored and ordered with 2,438 markers, covering 83% of chromosome 1BS. The BAC-based chromosome 1BS physical map and gene order of the orthologous regions of model grass species were consistent, providing strong support for the reliability of the chromosome 1BS assembly. The gene space for chromosome 1BS spans the entire length of the chromosome arm, with 76% of the genes organized in small gene islands, accompanied by a two-fold increase in gene density from the centromere to the telomere. Conclusions This study provides new evidence on common and chromosome-specific features in the organization and evolution of the wheat genome, including a non-uniform distribution of gene density along the centromere-telomere axis, abundance of non-syntenic genes, the degree of colinearity with other grass genomes and a non-uniform size expansion along the centromere-telomere axis compared with other model cereal genomes. The high-quality physical map constructed in this study provides a solid basis for the assembly of a reference sequence of chromosome 1BS and for breeding applications. PMID:24359668
The Pathogen-Host Interactions database (PHI-base): additions and future developments.

PubMed

Urban, Martin; Pant, Rashmi; Raghunath, Arathi; Irvine, Alistair G; Pedro, Helder; Hammond-Kosack, Kim E

2015-01-01

Rapidly evolving pathogens cause a diverse array of diseases and epidemics that threaten crop yield, food security as well as human, animal and ecosystem health. To combat infection greater comparative knowledge is required on the pathogenic process in multiple species. The Pathogen-Host Interactions database (PHI-base) catalogues experimentally verified pathogenicity, virulence and effector genes from bacterial, fungal and protist pathogens. Mutant phenotypes are associated with gene information. The included pathogens infect a wide range of hosts including humans, animals, plants, insects, fish and other fungi. The current version, PHI-base 3.6, available at http://www.phi-base.org, stores information on 2875 genes, 4102 interactions, 110 host species, 160 pathogenic species (103 plant, 3 fungal and 54 animal infecting species) and 181 diseases drawn from 1243 references. Phenotypic and gene function information has been obtained by manual curation of the peer-reviewed literature. A controlled vocabulary consisting of nine high-level phenotype terms permits comparisons and data analysis across the taxonomic space. PHI-base phenotypes were mapped via their associated gene information to reference genomes available in Ensembl Genomes. Virulence genes and hotspots can be visualized directly in genome browsers. Future plans for PHI-base include development of tools facilitating community-led curation and inclusion of the corresponding host target(s). © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Interactions in the microbiome: communities of organisms and communities of genes

PubMed Central

Boon, Eva; Meehan, Conor J; Whidden, Chris; Wong, Dennis H-J; Langille, Morgan GI; Beiko, Robert G

2014-01-01

A central challenge in microbial community ecology is the delineation of appropriate units of biodiversity, which can be taxonomic, phylogenetic, or functional in nature. The term ‘community’ is applied ambiguously; in some cases, the term refers simply to a set of observed entities, while in other cases, it requires that these entities interact with one another. Microorganisms can rapidly gain and lose genes, potentially decoupling community roles from taxonomic and phylogenetic groupings. Trait-based approaches offer a useful alternative, but many traits can be defined based on gene functions, metabolic modules, and genomic properties, and the optimal set of traits to choose is often not obvious. An analysis that considers taxon assignment and traits in concert may be ideal, with the strengths of each approach offsetting the weaknesses of the other. Individual genes also merit consideration as entities in an ecological analysis, with characteristics such as diversity, turnover, and interactions modeled using genes rather than organisms as entities. We identify some promising avenues of research that are likely to yield a deeper understanding of microbial communities that shift from observation-based questions of ‘Who is there?’ and ‘What are they doing?’ to the mechanistically driven question of ‘How will they respond?’ PMID:23909933
Genetics and fine mapping of a purple leaf gene, BoPr, in ornamental kale (Brassica oleracea L. var. acephala).

PubMed

Liu, Xiao-Ping; Gao, Bao-Zhen; Han, Feng-Qing; Fang, Zhi-Yuan; Yang, Li-Mei; Zhuang, Mu; Lv, Hong-Hao; Liu, Yu-Mei; Li, Zhan-Sheng; Cai, Cheng-Cheng; Yu, Hai-Long; Li, Zhi-Yuan; Zhang, Yang-Yong

2017-03-14

Due to its variegated and colorful leaves, ornamental kale (Brassica oleracea L. var. acephala) has become a popular ornamental plant. In this study, we report the fine mapping and analysis of a candidate purple leaf gene using a backcross population and an F 2 population derived from two parental lines: W1827 (with white leaves) and P1835 (with purple leaves). Genetic analysis indicated that the purple leaf trait is controlled by a single dominant gene, which we named BoPr. Using markers developed based on the reference genome '02-12', the BoPr gene was preliminarily mapped to a 280-kb interval of chromosome C09, with flanking markers M17 and BoID4714 at genetic distances of 4.3 cM and 1.5 cM, respectively. The recombination rate within this interval is almost 12 times higher than the usual level, which could be caused by assembly error for reference genome '02-12' at this interval. Primers were designed based on 'TO1000', another B. oleracea reference genome. Among the newly designed InDel markers, BRID485 and BRID490 were found to be the closest to BoPr, flanking the gene at genetic distances of 0.1 cM and 0.2 cM, respectively; the interval between the two markers is 44.8 kb (reference genome 'TO1000'). Seven annotated genes are located within the 44.8 kb genomic region, of which only Bo9g058630 shows high homology to AT5G42800 (dihydroflavonol reductase), which was identified as a candidate gene for BoPr. Blast analysis revealed that this 44.8 kb interval is located on an unanchored scaffold (Scaffold000035_P2) of '02-12', confirming the existence of assembly error at the interval between M17 and BoID4714 for reference genome '02-12'. This study identified a candidate gene for BoPr and lays a foundation for the cloning and functional analysis of this gene.
On Utilizing Optimal and Information Theoretic Syntactic Modeling for Peptide Classification

NASA Astrophysics Data System (ADS)

Aygün, Eser; Oommen, B. John; Cataltepe, Zehra

Syntactic methods in pattern recognition have been used extensively in bioinformatics, and in particular, in the analysis of gene and protein expressions, and in the recognition and classification of bio-sequences. These methods are almost universally distance-based. This paper concerns the use of an Optimal and Information Theoretic (OIT) probabilistic model [11] to achieve peptide classification using the information residing in their syntactic representations. The latter has traditionally been achieved using the edit distances required in the respective peptide comparisons. We advocate that one can model the differences between compared strings as a mutation model consisting of random Substitutions, Insertions and Deletions (SID) obeying the OIT model. Thus, in this paper, we show that the probability measure obtained from the OIT model can be perceived as a sequence similarity metric, using which a Support Vector Machine (SVM)-based peptide classifier, referred to as OIT_SVM, can be devised.
Validation of reference genes for RT-qPCR studies of gene expression in banana fruit under different experimental conditions.

PubMed

Chen, Lei; Zhong, Hai-ying; Kuang, Jian-fei; Li, Jian-guo; Lu, Wang-jin; Chen, Jian-ye

2011-08-01

Reverse transcription quantitative real-time PCR (RT-qPCR) is a sensitive technique for quantifying gene expression, but its success depends on the stability of the reference gene(s) used for data normalization. Only a few studies on validation of reference genes have been conducted in fruit trees and none in banana yet. In the present work, 20 candidate reference genes were selected, and their expression stability in 144 banana samples were evaluated and analyzed using two algorithms, geNorm and NormFinder. The samples consisted of eight sample sets collected under different experimental conditions, including various tissues, developmental stages, postharvest ripening, stresses (chilling, high temperature, and pathogen), and hormone treatments. Our results showed that different suitable reference gene(s) or combination of reference genes for normalization should be selected depending on the experimental conditions. The RPS2 and UBQ2 genes were validated as the most suitable reference genes across all tested samples. More importantly, our data further showed that the widely used reference genes, ACT and GAPDH, were not the most suitable reference genes in many banana sample sets. In addition, the expression of MaEBF1, a gene of interest that plays an important role in regulating fruit ripening, under different experimental conditions was used to further confirm the validated reference genes. Taken together, our results provide guidelines for reference gene(s) selection under different experimental conditions and a foundation for more accurate and widespread use of RT-qPCR in banana.
Identification and validation of reference genes for qRT-PCR studies of the obligate aphid pathogenic fungus Pandora neoaphidis during different developmental stages.

PubMed

Zhang, Shutao; Chen, Chun; Xie, Tingna; Ye, Sudan

2017-01-01

The selection of stable reference genes is a critical step for the accurate quantification of gene expression. To identify and validate the reference genes in Pandora neoaphidis-an obligate aphid pathogenic fungus-the expression of 13classical candidate reference genes were evaluated by quantitative real-time reverse transcriptase polymerase chain reaction(qPCR) at four developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae). Four statistical algorithms, including geNorm, NormFinder, BestKeeper and Delta Ct method were used to rank putative reference genes according to their expression stability and indicate the best reference gene or combination of reference genes for accurate normalization. The analysis of comprehensive ranking revealed that ACT1and 18Swas the most stably expressed genes throughout the developmental stages. To further validate the suitability of the reference genes identified in this study, the expression of cell division control protein 25 (CDC25) and Chitinase 1(CHI1) genes were used to further confirm the validated candidate reference genes. Our study presented the first systematic study of reference gene(s) selection for P. neoaphidis study and provided guidelines to obtain more accurate qPCR results for future developmental efforts.

Evaluation of Reference Genes for Normalization of Gene Expression Using Quantitative RT-PCR under Aluminum, Cadmium, and Heat Stresses in Soybean.

PubMed

Gao, Mengmeng; Liu, Yaping; Ma, Xiao; Shuai, Qin; Gai, Junyi; Li, Yan

2017-01-01

Quantitative reverse transcription polymerase chain reaction (qRT-PCR) is widely used to analyze the relative gene expression level, however, the accuracy of qRT-PCR is greatly affected by the stability of reference genes, which is tissue- and environment- dependent. Therefore, choosing the most stable reference gene in a specific tissue and environment is critical to interpret gene expression patterns. Aluminum (Al), cadmium (Cd), and heat stresses are three important abiotic factors limiting soybean (Glycine max) production in southern China. To identify the suitable reference genes for normalizing the expression levels of target genes by qRT-PCR in soybean response to Al, Cd and heat stresses, we studied the expression stability of ten commonly used housekeeping genes in soybean roots and leaves under these three abiotic stresses, using five approaches, BestKeeper, Delta Ct, geNorm, NormFinder and RefFinder. We found TUA4 is the most stable reference gene in soybean root tips under Al stress. Under Cd stress, Fbox and UKN2 are the most stable reference genes in roots and leaves, respectively, while 60S is the most suitable reference gene when analyzing both roots and leaves together. For heat stress, TUA4 and UKN2 are the most stable housekeeping genes in roots and leaves, respectively, and UKN2 is the best reference gene for analysis of roots and leaves together. To validate the reference genes, we quantified the relative expression levels of six target genes that were involved in soybean response to Al, Cd or heat stresses, respectively. The expression patterns of these target genes differed between using the most and least stable reference genes, suggesting the selection of a suitable reference gene is critical for gene expression studies.
Gene expression studies of reference genes for quantitative real-time PCR: an overview in insects.

PubMed

Shakeel, Muhammad; Rodriguez, Alicia; Tahir, Urfa Bin; Jin, Fengliang

2018-02-01

Whenever gene expression is being examined, it is essential that a normalization process is carried out to eliminate non-biological variations. The use of reference genes, such as glyceraldehyde-3-phosphate dehydrogenase, actin, and ribosomal protein genes, is the usual method of choice for normalizing gene expression. Although reference genes are used to normalize target gene expression, a major problem is that the stability of these genes differs among tissues, developmental stages, species, and responses to abiotic factors. Therefore, the use and validation of multiple reference genes are required. This review discusses the reasons that why RT-qPCR has become the preferred method for validating results of gene expression profiles, the use of specific and non-specific dyes and the importance of use of primers and probes for qPCR as well as to discuss several statistical algorithms developed to help the validation of potential reference genes. The conflicts arising in the use of classical reference genes in gene normalization and their replacement with novel references are also discussed by citing the high stability and low stability of classical and novel reference genes under various biotic and abiotic experimental conditions by employing various methods applied for the reference genes amplification.
Selection and evaluation of reference genes for RT-qPCR expression studies on Burkholderia tropica strain Ppe8, a sugarcane-associated diazotrophic bacterium grown with different carbon sources or sugarcane juice.

PubMed

da Silva, Paula Renata Alves; Vidal, Marcia Soares; de Paula Soares, Cleiton; Polese, Valéria; Simões-Araújo, Jean Luís; Baldani, José Ivo

2016-11-01

Among the members of the genus Burkholderia, Burkholderia tropica has the ability to fix nitrogen and promote sugarcane plant growth as well as act as a biological control agent. There is little information about how this bacterium metabolizes carbohydrates as well as those carbon sources found in the sugarcane juice that accumulates in stems during plant growth. Reverse transcription quantitative PCR (RT-qPCR) can be used to evaluate changes in gene expression during bacterial growth on different carbon sources. Here we tested the expression of six reference genes, lpxC, gyrB, recA, rpoA, rpoB, and rpoD, when cells were grown with glucose, fructose, sucrose, mannitol, aconitic acid, and sugarcane juice as carbon sources. The lpxC, gyrB, and recA were selected as the most stable reference genes based on geNorm and NormFinder software analyses. Validation of these three reference genes during strain Ppe8 growth on the same carbon sources showed that genes involved in glycogen biosynthesis (glgA, glgB, glgC) and trehalose biosynthesis (treY and treZ) were highly expressed when Ppe8 was grown in aconitic acid relative to other carbon sources, while otsA expression (trehalose biosynthesis) was reduced with all carbon sources. In addition, the expression level of the ORF_6066 (gluconolactonase) gene was reduced on sugarcane juice. The results confirmed the stability of the three selected reference genes (lpxC, gyrB, and recA) during the RT-qPCR and also their robustness by evaluating the relative expression of genes involved in glycogen and trehalose biosynthesis when strain Ppe8 was grown on different carbon sources and sugarcane juice.
Discovering Implicit Entity Relation with the Gene-Citation-Gene Network

PubMed Central

Song, Min; Han, Nam-Gi; Kim, Yong-Hwan; Ding, Ying; Chambers, Tamy

2013-01-01

In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG) network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveal that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner. PMID:24358368
Survey of gene splicing algorithms based on reads.

PubMed

Si, Xiuhua; Wang, Qian; Zhang, Lei; Wu, Ruo; Ma, Jiquan

2017-11-02

Gene splicing is the process of assembling a large number of unordered short sequence fragments to the original genome sequence as accurately as possible. Several popular splicing algorithms based on reads are reviewed in this article, including reference genome algorithms and de novo splicing algorithms (Greedy-extension, Overlap-Layout-Consensus graph, De Bruijn graph). We also discuss a new splicing method based on the MapReduce strategy and Hadoop. By comparing these algorithms, some conclusions are drawn and some suggestions on gene splicing research are made.
Temporal Expression-based Analysis of Metabolism

PubMed Central

Segrè, Daniel

2012-01-01

Metabolic flux is frequently rerouted through cellular metabolism in response to dynamic changes in the intra- and extra-cellular environment. Capturing the mechanisms underlying these metabolic transitions in quantitative and predictive models is a prominent challenge in systems biology. Progress in this regard has been made by integrating high-throughput gene expression data into genome-scale stoichiometric models of metabolism. Here, we extend previous approaches to perform a Temporal Expression-based Analysis of Metabolism (TEAM). We apply TEAM to understanding the complex metabolic dynamics of the respiratorily versatile bacterium Shewanella oneidensis grown under aerobic, lactate-limited conditions. TEAM predicts temporal metabolic flux distributions using time-series gene expression data. Increased predictive power is achieved by supplementing these data with a large reference compendium of gene expression, which allows us to take into account the unique character of the distribution of expression of each individual gene. We further propose a straightforward method for studying the sensitivity of TEAM to changes in its fundamental free threshold parameter θ, and reveal that discrete zones of distinct metabolic behavior arise as this parameter is changed. By comparing the qualitative characteristics of these zones to additional experimental data, we are able to constrain the range of θ to a small, well-defined interval. In parallel, the sensitivity analysis reveals the inherently difficult nature of dynamic metabolic flux modeling: small errors early in the simulation propagate to relatively large changes later in the simulation. We expect that handling such “history-dependent” sensitivities will be a major challenge in the future development of dynamic metabolic-modeling techniques. PMID:23209390
Targeting the histone methyltransferase G9a activates imprinted genes and improves survival of a mouse model of Prader–Willi syndrome

PubMed Central

Kim, Yuna; Lee, Hyeong-Min; Xiong, Yan; Sciaky, Noah; Hulbert, Samuel W; Cao, Xinyu; Everitt, Jeffrey I; Jin, Jian; Roth, Bryan L; Jiang, Yong-hui

2017-01-01

Prader–Willi syndrome (PWS) is an imprinting disorder caused by a deficiency of paternally expressed gene(s) in the 15q11–q13 chromosomal region. The regulation of imprinted gene expression in this region is coordinated by an imprinting center (PWS-IC). In individuals with PWS, genes responsible for PWS on the maternal chromosome are present, but repressed epigenetically, which provides an opportunity for the use of epigenetic therapy to restore expression from the maternal copies of PWS-associated genes. Through a high-content screen (HCS) of >9,000 small molecules, we discovered that UNC0638 and UNC0642—two selective inhibitors of euchromatic histone lysine N-methyltransferase-2 (EHMT2, also known as G9a)—activated the maternal (m) copy of candidate genes underlying PWS, including the SnoRNA cluster SNORD116, in cells from humans with PWS and also from a mouse model of PWS carrying a paternal (p) deletion from small nuclear ribonucleoprotein N (Snrpn (S)) to ubiquitin protein ligase E3A (Ube3a (U)) (mouse model referred to hereafter as m+/pΔS−U). Both UNC0642 and UNC0638 caused a selective reduction of the dimethylation of histone H3 lysine 9 (H3K9me2) at PWS-IC, without changing DNA methylation, when analyzed by bisulfite genomic sequencing. This indicates that histone modification is essential for the imprinting of candidate genes underlying PWS. UNC0642 displayed therapeutic effects in the PWS mouse model by improving the survival and the growth of m+/pΔS−U newborn pups. This study provides the first proof of principle for an epigenetics-based therapy for PWS. PMID:28024084
Selection of relatively exact reference genes for gene expression studies in goosegrass (Eleusine indica) under herbicide stress.

PubMed

Chen, Jingchao; Huang, Zhaofeng; Huang, Hongjuan; Wei, Shouhui; Liu, Yan; Jiang, Cuilan; Zhang, Jie; Zhang, Chaoxian

2017-04-21

Goosegrass (Eleusine indica) is one of the most serious annual grassy weeds worldwide, and its evolved herbicide-resistant populations are more difficult to control. Quantitative real-time PCR (qPCR) is a common technique for investigating the resistance mechanism; however, there is as yet no report on the systematic selection of stable reference genes for goosegrass. This study proposed to test the expression stability of 9 candidate reference genes in goosegrass in different tissues and developmental stages and under stress from three types of herbicide. The results show that for different developmental stages and organs (control), eukaryotic initiation factor 4 A (eIF-4) is the most stable reference gene. Chloroplast acetolactate synthase (ALS) is the most stable reference gene under glyphosate stress. Under glufosinate stress, eIF-4 is the best reference gene. Ubiquitin-conjugating enzyme (UCE) is the most stable reference gene under quizalofop-p-ethyl stress. The gene eIF-4 is the recommended reference gene for goosegrass under the stress of all three herbicides. Moreover, pairwise analysis showed that seven reference genes were sufficient to normalize the gene expression data under three herbicides treatment. This study provides a list of reliable reference genes for transcript normalization in goosegrass, which will facilitate resistance mechanism studies in this weed species.
Normalizing gene expression by quantitative PCR during somatic embryogenesis in two representative conifer species: Pinus pinaster and Picea abies.

PubMed

de Vega-Bartol, José J; Santos, Raquen Raissa; Simões, Marta; Miguel, Célia M

2013-05-01

Suitable internal control genes to normalize qPCR data from different stages of embryo development and germination were identified in two representative conifer species. Clonal propagation by somatic embryogenesis has a great application potentiality in conifers. Quantitative PCR (qPCR) is widely used for gene expression analysis during somatic embryogenesis and embryo germination. No single reference gene is universal, so a systematic characterization of endogenous genes for concrete conditions is fundamental for accuracy. We identified suitable internal control genes to normalize qPCR data obtained at different steps of somatic embryogenesis (embryonal mass proliferation, embryo maturation and germination) in two representative conifer species, Pinus pinaster and Picea abies. Candidate genes included endogenous genes commonly used in conifers, genes previously tested in model plants, and genes with a lower variation of the expression along embryo development according to genome-wide transcript profiling studies. Three different algorithms were used to evaluate expression stability. The geometric average of the expression values of elongation factor-1α, α-tubulin and histone 3 in P. pinaster, and elongation factor-1α, α-tubulin, adenosine kinase and CAC in P. abies were adequate for expression studies throughout somatic embryogenesis. However, improved accuracy was achieved when using other gene combinations in experiments with samples at a single developmental stage. The importance of studies selecting reference genes to use in different tissues or developmental stages within one or close species, and the instability of commonly used reference genes, is highlighted.
Identification of Suitable Reference Genes for Gene Expression Normalization in qRT-PCR Analysis in Watermelon

PubMed Central

Gao, Lingyun; Zhao, Shuang; Jiang, Wei; Huang, Yuan; Bie, Zhilong

2014-01-01

Watermelon is one of the major Cucurbitaceae crops and the recent availability of genome sequence greatly facilitates the fundamental researches on it. Quantitative real-time reverse transcriptase PCR (qRT–PCR) is the preferred method for gene expression analyses, and using validated reference genes for normalization is crucial to ensure the accuracy of this method. However, a systematic validation of reference genes has not been conducted on watermelon. In this study, transcripts of 15 candidate reference genes were quantified in watermelon using qRT–PCR, and the stability of these genes was compared using geNorm and NormFinder. geNorm identified ClTUA and ClACT, ClEF1α and ClACT, and ClCAC and ClTUA as the best pairs of reference genes in watermelon organs and tissues under normal growth conditions, abiotic stress, and biotic stress, respectively. NormFinder identified ClYLS8, ClUBCP, and ClCAC as the best single reference genes under the above experimental conditions, respectively. ClYLS8 and ClPP2A were identified as the best reference genes across all samples. Two to nine reference genes were required for more reliable normalization depending on the experimental conditions. The widely used watermelon reference gene 18SrRNA was less stable than the other reference genes under the experimental conditions. Catalase family genes were identified in watermelon genome, and used to validate the reliability of the identified reference genes. ClCAT1and ClCAT2 were induced and upregulated in the first 24 h, whereas ClCAT3 was downregulated in the leaves under low temperature stress. However, the expression levels of these genes were significantly overestimated and misinterpreted when 18SrRNA was used as a reference gene. These results provide a good starting point for reference gene selection in qRT–PCR analyses involving watermelon. PMID:24587403
Identification of suitable reference genes for gene expression normalization in qRT-PCR analysis in watermelon.

PubMed

Kong, Qiusheng; Yuan, Jingxian; Gao, Lingyun; Zhao, Shuang; Jiang, Wei; Huang, Yuan; Bie, Zhilong

2014-01-01

Watermelon is one of the major Cucurbitaceae crops and the recent availability of genome sequence greatly facilitates the fundamental researches on it. Quantitative real-time reverse transcriptase PCR (qRT-PCR) is the preferred method for gene expression analyses, and using validated reference genes for normalization is crucial to ensure the accuracy of this method. However, a systematic validation of reference genes has not been conducted on watermelon. In this study, transcripts of 15 candidate reference genes were quantified in watermelon using qRT-PCR, and the stability of these genes was compared using geNorm and NormFinder. geNorm identified ClTUA and ClACT, ClEF1α and ClACT, and ClCAC and ClTUA as the best pairs of reference genes in watermelon organs and tissues under normal growth conditions, abiotic stress, and biotic stress, respectively. NormFinder identified ClYLS8, ClUBCP, and ClCAC as the best single reference genes under the above experimental conditions, respectively. ClYLS8 and ClPP2A were identified as the best reference genes across all samples. Two to nine reference genes were required for more reliable normalization depending on the experimental conditions. The widely used watermelon reference gene 18SrRNA was less stable than the other reference genes under the experimental conditions. Catalase family genes were identified in watermelon genome, and used to validate the reliability of the identified reference genes. ClCAT1and ClCAT2 were induced and upregulated in the first 24 h, whereas ClCAT3 was downregulated in the leaves under low temperature stress. However, the expression levels of these genes were significantly overestimated and misinterpreted when 18SrRNA was used as a reference gene. These results provide a good starting point for reference gene selection in qRT-PCR analyses involving watermelon.
Validation of Reference Genes for Gene Expression Studies in Virus-Infected Nicotiana benthamiana Using Quantitative Real-Time PCR

PubMed Central

Han, Chenggui; Yu, Jialin; Li, Dawei; Zhang, Yongliang

2012-01-01

Nicotiana benthamiana is the most widely-used experimental host in plant virology. The recent release of the draft genome sequence for N. benthamiana consolidates its role as a model for plant–pathogen interactions. Quantitative real-time PCR (qPCR) is commonly employed for quantitative gene expression analysis. For valid qPCR analysis, accurate normalisation of gene expression against an appropriate internal control is required. Yet there has been little systematic investigation of reference gene stability in N. benthamiana under conditions of viral infections. In this study, the expression profiles of 16 commonly used housekeeping genes (GAPDH, 18S, EF1α, SAMD, L23, UK, PP2A, APR, UBI3, SAND, ACT, TUB, GBP, F-BOX, PPR and TIP41) were determined in N. benthamiana and those with acceptable expression levels were further selected for transcript stability analysis by qPCR of complementary DNA prepared from N. benthamiana leaf tissue infected with one of five RNA plant viruses (Tobacco necrosis virus A, Beet black scorch virus, Beet necrotic yellow vein virus, Barley stripe mosaic virus and Potato virus X). Gene stability was analysed in parallel by three commonly-used dedicated algorithms: geNorm, NormFinder and BestKeeper. Statistical analysis revealed that the PP2A, F-BOX and L23 genes were the most stable overall, and that the combination of these three genes was sufficient for accurate normalisation. In addition, the suitability of PP2A, F-BOX and L23 as reference genes was illustrated by expression-level analysis of AGO2 and RdR6 in virus-infected N. benthamiana leaves. This is the first study to systematically examine and evaluate the stability of different reference genes in N. benthamiana. Our results not only provide researchers studying these viruses a shortlist of potential housekeeping genes to use as normalisers for qPCR experiments, but should also guide the selection of appropriate reference genes for gene expression studies of N. benthamiana under other biotic and abiotic stress conditions. PMID:23029521
Evaluation of New Reference Genes in Papaya for Accurate Transcript Normalization under Different Experimental Conditions

PubMed Central

Chen, Weixin; Chen, Jianye; Lu, Wangjin; Chen, Lei; Fu, Danwen

2012-01-01

Real-time reverse transcription PCR (RT-qPCR) is a preferred method for rapid and accurate quantification of gene expression studies. Appropriate application of RT-qPCR requires accurate normalization though the use of reference genes. As no single reference gene is universally suitable for all experiments, thus reference gene(s) validation under different experimental conditions is crucial for RT-qPCR analysis. To date, only a few studies on reference genes have been done in other plants but none in papaya. In the present work, we selected 21 candidate reference genes, and evaluated their expression stability in 246 papaya fruit samples using three algorithms, geNorm, NormFinder and RefFinder. The samples consisted of 13 sets collected under different experimental conditions, including various tissues, different storage temperatures, different cultivars, developmental stages, postharvest ripening, modified atmosphere packaging, 1-methylcyclopropene (1-MCP) treatment, hot water treatment, biotic stress and hormone treatment. Our results demonstrated that expression stability varied greatly between reference genes and that different suitable reference gene(s) or combination of reference genes for normalization should be validated according to the experimental conditions. In general, the internal reference genes EIF (Eukaryotic initiation factor 4A), TBP1 (TATA binding protein 1) and TBP2 (TATA binding protein 2) genes had a good performance under most experimental conditions, whereas the most widely present used reference genes, ACTIN (Actin 2), 18S rRNA (18S ribosomal RNA) and GAPDH (Glyceraldehyde-3-phosphate dehydrogenase) were not suitable in many experimental conditions. In addition, two commonly used programs, geNorm and Normfinder, were proved sufficient for the validation. This work provides the first systematic analysis for the selection of superior reference genes for accurate transcript normalization in papaya under different experimental conditions. PMID:22952972
Improved maize reference genome with single-molecule technologies.

PubMed

Jiao, Yinping; Peluso, Paul; Shi, Jinghua; Liang, Tiffany; Stitzer, Michelle C; Wang, Bo; Campbell, Michael S; Stein, Joshua C; Wei, Xuehong; Chin, Chen-Shan; Guill, Katherine; Regulski, Michael; Kumari, Sunita; Olson, Andrew; Gent, Jonathan; Schneider, Kevin L; Wolfgruber, Thomas K; May, Michael R; Springer, Nathan M; Antoniou, Eric; McCombie, W Richard; Presting, Gernot G; McMullen, Michael; Ross-Ibarra, Jeffrey; Dawe, R Kelly; Hastie, Alex; Rank, David R; Ware, Doreen

2017-06-22

Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.
Reference genes for measuring mRNA expression.

PubMed

Dundas, Jitesh; Ling, Maurice

2012-12-01

The aim of this review is to find answers to some of the questions surrounding reference genes and their reliability for quantitative experiments. Reference genes are assumed to be at a constant expression level, over a range of conditions such as temperature. These genes, such as GADPH and beta-actin, are used extensively for gene expression studies using techniques like quantitative PCR. There have been several studies carried out on identifying reference genes. However, a lot of evidence indicates issues to the general suitability of these genes. Recent studies had shown that different factors, including the environment and methods, play an important role in changing the expression levels of the reference genes. Thus, we conclude that there is no reference gene that can deemed suitable for all the experimental conditions. In addition, we believe that every experiment will require the scientific evaluation and selection of the best candidate gene for use as a reference gene to obtain reliable scientific results.
A specific endogenous reference for genetically modified common bean (Phaseolus vulgaris L.) DNA quantification by real-time PCR targeting lectin gene.

PubMed

Venturelli, Gustavo L; Brod, Fábio C A; Rossi, Gabriela B; Zimmermann, Naíra F; Oliveira, Jaison P; Faria, Josias C; Arisi, Ana C M

2014-11-01

The Embrapa 5.1 genetically modified (GM) common bean was approved for commercialization in Brazil. Methods for the quantification of this new genetically modified organism (GMO) are necessary. The development of a suitable endogenous reference is essential for GMO quantification by real-time PCR. Based on this, a new taxon-specific endogenous reference quantification assay was developed for Phaseolus vulgaris L. Three genes encoding common bean proteins (phaseolin, arcelin, and lectin) were selected as candidates for endogenous reference. Primers targeting these candidate genes were designed and the detection was evaluated using the SYBR Green chemistry. The assay targeting lectin gene showed higher specificity than the remaining assays, and a hydrolysis probe was then designed. This assay showed high specificity for 50 common bean samples from two gene pools, Andean and Mesoamerican. For GM common bean varieties, the results were similar to those obtained for non-GM isogenic varieties with PCR efficiency values ranging from 92 to 101 %. Moreover, this assay presented a limit of detection of ten haploid genome copies. The primers and probe developed in this work are suitable to detect and quantify either GM or non-GM common bean.
APPRIS 2017: principal isoforms for multiple gene sets

PubMed Central

Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso

2018-01-01

Abstract The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475
Ensembl comparative genomics resources.

PubMed

Herrero, Javier; Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J; Searle, Stephen M J; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

2016-01-01

Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. © The Author(s) 2016. Published by Oxford University Press.
Ensembl comparative genomics resources

PubMed Central

Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J.; Searle, Stephen M. J.; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

2016-01-01

Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. PMID:26896847
CrossLink: a novel method for cross-condition classification of cancer subtypes.

PubMed

Ma, Chifeng; Sastry, Konduru S; Flore, Mario; Gehani, Salah; Al-Bozom, Issam; Feng, Yusheng; Serpedin, Erchin; Chouchane, Lotfi; Chen, Yidong; Huang, Yufei

2016-08-22

We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the gene signatures in the reference and prediction datasets always have the same distribution for all different conditions as the class-specific gene signatures change with the condition. Therefore, the trained classifier would work well under one condition but not under another. To address the problem of current normalization approaches, we propose a novel algorithm called CrossLink (CL). CL recognizes that there is no universal, condition-independent normalization mapping of signatures. In contrast, it exploits the fact that the signature is unique to its associated class under any condition and thus employs an unsupervised clustering algorithm to discover this unique signature. We assessed the performance of CL for cross-condition predictions of PAM50 subtypes of breast cancer by using a simulated dataset modeled after TCGA BRCA tumor samples with a cross-validation scheme, and datasets with known and unknown PAM50 classification. CL achieved prediction accuracy >73 %, highest among other methods we evaluated. We also applied the algorithm to a set of breast cancer tumors derived from Arabic population to assign a PAM50 classification to each tumor based on their gene expression profiles. A novel algorithm CrossLink for cross-condition prediction of cancer classes was proposed. In all test datasets, CL showed robust and consistent improvement in prediction performance over other state-of-the-art normalization and classification algorithms.

Validation of reference genes for real-time quantitative PCR normalization in soybean developmental and germinating seeds.

PubMed

Li, Qing; Fan, Cheng-Ming; Zhang, Xiao-Mei; Fu, Yong-Fu

2012-10-01

Most of traditional reference genes chosen for real-time quantitative PCR normalization were assumed to be ubiquitously and constitutively expressed in vegetative tissues. However, seeds show distinct transcriptomes compared with the vegetative tissues. Therefore, there is a need for re-validation of reference genes in samples of seed development and germination, especially for soybean seeds. In this study, we aimed at identifying reference genes suitable for the quantification of gene expression level in soybean seeds. In order to identify the best reference genes for soybean seeds, 18 putative reference genes were tested with various methods in different seed samples. We combined the outputs of both geNorm and NormFinder to assess the expression stability of these genes. The reference genes identified as optimums for seed development were TUA5 and UKN2, whereas for seed germination they were novel reference genes Glyma05g37470 and Glyma08g28550. Furthermore, for total seed samples it was necessary to combine four genes of Glyma05g37470, Glyma08g28550, Glyma18g04130 and UKN2 [corrected] for normalization. Key message We identified several reference genes that stably expressed in soybean seed developmental and germinating processes.
Selection of Reference Genes for Quantitative Gene Expression in Porcine Mesenchymal Stem Cells Derived from Various Sources along with Differentiation into Multilineages

PubMed Central

Lee, Won-Jae; Jeon, Ryoung-Hoon; Jang, Si-Jung; Park, Ji-Sung; Lee, Seung-Chan; Baregundi Subbarao, Raghavendra; Lee, Sung-Lim; Park, Bong-Wook; King, William Allan; Rho, Gyu-Jin

2015-01-01

The identification of stable reference genes is a prerequisite for ensuring accurate validation of gene expression, yet too little is known about stable reference genes of porcine MSCs. The present study was, therefore, conducted to assess the stability of reference genes in porcine MSCs derived from bone marrow (BMSCs), adipose (AMSCs), and skin (SMSCs) with their in vitro differentiated cells into mesenchymal lineages such as adipocytes, osteocytes, and chondrocytes. Twelve commonly used reference genes were investigated for their threshold cycle (Ct) values by qRT-PCR. The Ct values of candidate reference genes were analyzed by geNorm software to clarify stable expression regardless of experimental conditions. Thus, Pearson's correlation was applied to determine correlation between the three most stable reference genes (NF3) and optimal number of reference genes (NFopt). In assessment of stability of reference gene across experimental conditions by geNorm analysis, undifferentiated MSCs and each differentiated status into mesenchymal lineages showed slightly different results but similar patterns about more or less stable rankings. Furthermore, Pearson's correlation revealed high correlation (r > 0.9) between NF3 and NFopt. Overall, the present study showed that HMBS, YWHAZ, SDHA, and TBP are suitable reference genes for qRT-PCR in porcine MSCs. PMID:25972899
Lotus Base: An integrated information portal for the model legume Lotus japonicus

PubMed Central

Mun, Terry; Bachmann, Asger; Gupta, Vikas; Stougaard, Jens; Andersen, Stig U.

2016-01-01

Lotus japonicus is a well-characterized model legume widely used in the study of plant-microbe interactions. However, datasets from various Lotus studies are poorly integrated and lack interoperability. We recognize the need for a comprehensive repository that allows comprehensive and dynamic exploration of Lotus genomic and transcriptomic data. Equally important are user-friendly in-browser tools designed for data visualization and interpretation. Here, we present Lotus Base, which opens to the research community a large, established LORE1 insertion mutant population containing an excess of 120,000 lines, and serves the end-user tightly integrated data from Lotus, such as the reference genome, annotated proteins, and expression profiling data. We report the integration of expression data from the L. japonicus gene expression atlas project, and the development of tools to cluster and export such data, allowing users to construct, visualize, and annotate co-expression gene networks. Lotus Base takes advantage of modern advances in browser technology to deliver powerful data interpretation for biologists. Its modular construction and publicly available application programming interface enable developers to tap into the wealth of integrated Lotus data. Lotus Base is freely accessible at: https://lotus.au.dk. PMID:28008948
Identification of appropriate reference genes for human mesenchymal stem cell analysis by quantitative real-time PCR.

PubMed

Li, Xiuying; Yang, Qiwei; Bai, Jinping; Xuan, Yali; Wang, Yimin

2015-01-01

Normalization to a reference gene is the method of choice for quantitative reverse transcription-PCR (RT-qPCR) analysis. The stability of reference genes is critical for accurate experimental results and conclusions. We have evaluated the expression stability of eight commonly used reference genes found in four different human mesenchymal stem cells (MSC). Using geNorm, NormFinder and BestKeeper algorithms, we show that beta-2-microglobulin and peptidyl-prolylisomerase A were the optimal reference genes for normalizing RT-qPCR data obtained from MSC, whereas the TATA box binding protein was not suitable due to its extensive variability in expression. Our findings emphasize the significance of validating reference genes for qPCR analyses. We offer a short list of reference genes to use for normalization and recommend some commercially-available software programs as a rapid approach to validate reference genes. We also demonstrate that the two reference genes, β-actin and glyceraldehyde-3-phosphate dehydrogenase, are frequently used are not always successful in many cases.
Application of droplet digital PCR to determine copy number of endogenous genes and transgenes in sugarcane.

PubMed

Sun, Yue; Joyce, Priya Aiyar

2017-11-01

Droplet digital PCR combined with the low copy ACT allele as endogenous reference gene, makes accurate and rapid estimation of gene copy number in Q208 A and Q240 A attainable. Sugarcane is an important cultivated crop with both high polyploidy and aneuploidy in its 10 Gb genome. Without a known copy number reference gene, it is difficult to accurately estimate the copy number of any gene of interest by PCR-based methods in sugarcane. Recently, a new technology, known as droplet digital PCR (ddPCR) has been developed which can measure the absolute amount of the target DNA in a given sample. In this study, we deduced the true copy number of three endogenous genes, actin depolymerizing factor (ADF), adenine phosphoribosyltransferase (APRT) and actin (ACT) in three Australian sugarcane varieties, using ddPCR by comparing the absolute amounts of the above genes with a transgene of known copy number. A single copy of the ACT allele was detected in Q208 A , two copies in Q240 A , but was absent in Q117. Copy number variation was also observed for both APRT and ADF, and ranged from 9 to 11 in the three tested varieties. Using this newly developed ddPCR method, transgene copy number was successfully determined in 19 transgenic Q208 A and Q240 A events using ACT as the reference endogenous gene. Our study demonstrates that ddPCR can be used for high-throughput genetic analysis and is a quick, accurate and reliable alternative method for gene copy number determination in sugarcane. This discovered ACT allele would be a suitable endogenous reference gene for future gene copy number variation and dosage studies of functional genes in Q208 A and Q240 A .
Selection of relatively exact reference genes for gene expression studies in goosegrass (Eleusine indica) under herbicide stress

PubMed Central

Chen, Jingchao; Huang, Zhaofeng; Huang, Hongjuan; Wei, Shouhui; Liu, Yan; Jiang, Cuilan; Zhang, Jie; Zhang, Chaoxian

2017-01-01

Goosegrass (Eleusine indica) is one of the most serious annual grassy weeds worldwide, and its evolved herbicide-resistant populations are more difficult to control. Quantitative real-time PCR (qPCR) is a common technique for investigating the resistance mechanism; however, there is as yet no report on the systematic selection of stable reference genes for goosegrass. This study proposed to test the expression stability of 9 candidate reference genes in goosegrass in different tissues and developmental stages and under stress from three types of herbicide. The results show that for different developmental stages and organs (control), eukaryotic initiation factor 4 A (eIF-4) is the most stable reference gene. Chloroplast acetolactate synthase (ALS) is the most stable reference gene under glyphosate stress. Under glufosinate stress, eIF-4 is the best reference gene. Ubiquitin-conjugating enzyme (UCE) is the most stable reference gene under quizalofop-p-ethyl stress. The gene eIF-4 is the recommended reference gene for goosegrass under the stress of all three herbicides. Moreover, pairwise analysis showed that seven reference genes were sufficient to normalize the gene expression data under three herbicides treatment. This study provides a list of reliable reference genes for transcript normalization in goosegrass, which will facilitate resistance mechanism studies in this weed species. PMID:28429727
Using imputed genotype data in the joint score tests for genetic association and gene-environment interactions in case-control studies.

PubMed

Song, Minsun; Wheeler, William; Caporaso, Neil E; Landi, Maria Teresa; Chatterjee, Nilanjan

2018-03-01

Genome-wide association studies (GWAS) are now routinely imputed for untyped single nucleotide polymorphisms (SNPs) based on various powerful statistical algorithms for imputation trained on reference datasets. The use of predicted allele counts for imputed SNPs as the dosage variable is known to produce valid score test for genetic association. In this paper, we investigate how to best handle imputed SNPs in various modern complex tests for genetic associations incorporating gene-environment interactions. We focus on case-control association studies where inference for an underlying logistic regression model can be performed using alternative methods that rely on varying degree on an assumption of gene-environment independence in the underlying population. As increasingly large-scale GWAS are being performed through consortia effort where it is preferable to share only summary-level information across studies, we also describe simple mechanisms for implementing score tests based on standard meta-analysis of "one-step" maximum-likelihood estimates across studies. Applications of the methods in simulation studies and a dataset from GWAS of lung cancer illustrate ability of the proposed methods to maintain type-I error rates for the underlying testing procedures. For analysis of imputed SNPs, similar to typed SNPs, the retrospective methods can lead to considerable efficiency gain for modeling of gene-environment interactions under the assumption of gene-environment independence. Methods are made available for public use through CGEN R software package. © 2017 WILEY PERIODICALS, INC.
Research Resource: A Reference Transcriptome for Constitutive Androstane Receptor and Pregnane X Receptor Xenobiotic Signaling

PubMed Central

Ochsner, Scott A.; Tsimelzon, Anna; Dong, Jianrong; Coarfa, Cristian

2016-01-01

The pregnane X receptor (PXR) (PXR/NR1I3) and constitutive androstane receptor (CAR) (CAR/NR1I2) members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors are well-characterized mediators of xenobiotic and endocrine-disrupting chemical signaling. The Nuclear Receptor Signaling Atlas maintains a growing library of transcriptomic datasets involving perturbations of NR signaling pathways, many of which involve perturbations relevant to PXR and CAR xenobiotic signaling. Here, we generated a reference transcriptome based on the frequency of differential expression of genes across 159 experiments compiled from 22 datasets involving perturbations of CAR and PXR signaling pathways. In addition to the anticipated overrepresentation in the reference transcriptome of genes encoding components of the xenobiotic stress response, the ranking of genes involved in carbohydrate metabolism and gonadotropin action sheds mechanistic light on the suspected role of xenobiotics in metabolic syndrome and reproductive disorders. Gene Set Enrichment Analysis showed that although acetaminophen, chlorpromazine, and phenobarbital impacted many similar gene sets, differences in direction of regulation were evident in a variety of processes. Strikingly, gene sets representing genes linked to Parkinson's, Huntington's, and Alzheimer's diseases were enriched in all 3 transcriptomes. The reference xenobiotic transcriptome will be supplemented with additional future datasets to provide the community with a continually updated reference transcriptomic dataset for CAR- and PXR-mediated xenobiotic signaling. Our study demonstrates how aggregating and annotating transcriptomic datasets, and making them available for routine data mining, facilitates research into the mechanisms by which xenobiotics and endocrine-disrupting chemicals subvert conventional NR signaling modalities. PMID:27409825
Research Resource: A Reference Transcriptome for Constitutive Androstane Receptor and Pregnane X Receptor Xenobiotic Signaling.

PubMed

Ochsner, Scott A; Tsimelzon, Anna; Dong, Jianrong; Coarfa, Cristian; McKenna, Neil J

2016-08-01

The pregnane X receptor (PXR) (PXR/NR1I3) and constitutive androstane receptor (CAR) (CAR/NR1I2) members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors are well-characterized mediators of xenobiotic and endocrine-disrupting chemical signaling. The Nuclear Receptor Signaling Atlas maintains a growing library of transcriptomic datasets involving perturbations of NR signaling pathways, many of which involve perturbations relevant to PXR and CAR xenobiotic signaling. Here, we generated a reference transcriptome based on the frequency of differential expression of genes across 159 experiments compiled from 22 datasets involving perturbations of CAR and PXR signaling pathways. In addition to the anticipated overrepresentation in the reference transcriptome of genes encoding components of the xenobiotic stress response, the ranking of genes involved in carbohydrate metabolism and gonadotropin action sheds mechanistic light on the suspected role of xenobiotics in metabolic syndrome and reproductive disorders. Gene Set Enrichment Analysis showed that although acetaminophen, chlorpromazine, and phenobarbital impacted many similar gene sets, differences in direction of regulation were evident in a variety of processes. Strikingly, gene sets representing genes linked to Parkinson's, Huntington's, and Alzheimer's diseases were enriched in all 3 transcriptomes. The reference xenobiotic transcriptome will be supplemented with additional future datasets to provide the community with a continually updated reference transcriptomic dataset for CAR- and PXR-mediated xenobiotic signaling. Our study demonstrates how aggregating and annotating transcriptomic datasets, and making them available for routine data mining, facilitates research into the mechanisms by which xenobiotics and endocrine-disrupting chemicals subvert conventional NR signaling modalities.
Technical note: Selection of suitable reference genes for studying gene expression in milk somatic cell of yak (Bos grunniens) during the lactation cycle.

PubMed

Bai, W L; Yin, R H; Zhao, S J; Jiang, W Q; Yin, R L; Ma, Z J; Wang, Z Y; Zhu, Y B; Luo, G B; Yang, R J; Zhao, Z H

2014-02-01

Quantitative real-time PCR is the most sensitive technique for gene expression analysis. Data normalization is essential to correct for potential errors incurred in all steps from RNA isolation to PCR amplification. The commonly accepted approach for normalization is the use of reference gene. Until now, no suitable reference genes have been available for data normalization of gene expression in milk somatic cells of lactating yaks across lactation. In the present study, we evaluated the transcriptional stability of 10 candidate reference genes in milk somatic cells of lactating yak, including ACTB, B2M, GAPDH, GTP, MRPL39, PPP1R11, RPS9, RPS15, UXT, and RN18S1. Four genes, RPS9, PPP1R11, UXT, and MRPL39, were identified as being the most stable genes in milk somatic cells of lactating yak. Using the combination of RPS9, PPP1R11, UXT, and MRPL39 as reference genes, we further assessed the relative expression of 4 genes of interest in milk somatic cells of yak across lactation, including ELF5, ABCG2, SREBF2, and DGAT1. Compared with expression in colostrum, the overall transcription levels of ELF5, ABCG2, and SREBF2 in milk were found to be significantly upregulated in early, peak, and late lactation, and significantly downregulated thereafter, before the dry period. A similar pattern was observed in the relative expression of DGAT1, but no significant difference was revealed in its expression in milk from late lactation compared with colostrum. Based on these results, we suggest that the geometric mean of RPS9, PPP1R11, UXT, and MRPL39 can be used for normalization of real-time PCR data in milk somatic cells of lactating yak, if similar experiments are performed. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Microarray-based comparative genomic profiling of reference strains and selected Canadian field isolates of Actinobacillus pleuropneumoniae

PubMed Central

Gouré, Julien; Findlay, Wendy A; Deslandes, Vincent; Bouevitch, Anne; Foote, Simon J; MacInnes, Janet I; Coulton, James W; Nash, John HE; Jacques, Mario

2009-01-01

Background Actinobacillus pleuropneumoniae, the causative agent of porcine pleuropneumonia, is a highly contagious respiratory pathogen that causes severe losses to the swine industry worldwide. Current commercially-available vaccines are of limited value because they do not induce cross-serovar immunity and do not prevent development of the carrier state. Microarray-based comparative genomic hybridizations (M-CGH) were used to estimate whole genomic diversity of representative Actinobacillus pleuropneumoniae strains. Our goal was to identify conserved genes, especially those predicted to encode outer membrane proteins and lipoproteins because of their potential for the development of more effective vaccines. Results Using hierarchical clustering, our M-CGH results showed that the majority of the genes in the genome of the serovar 5 A. pleuropneumoniae L20 strain were conserved in the reference strains of all 15 serovars and in representative field isolates. Fifty-eight conserved genes predicted to encode for outer membrane proteins or lipoproteins were identified. As well, there were several clusters of diverged or absent genes including those associated with capsule biosynthesis, toxin production as well as genes typically associated with mobile elements. Conclusion Although A. pleuropneumoniae strains are essentially clonal, M-CGH analysis of the reference strains of the fifteen serovars and representative field isolates revealed several classes of genes that were divergent or absent. Not surprisingly, these included genes associated with capsule biosynthesis as the capsule is associated with sero-specificity. Several of the conserved genes were identified as candidates for vaccine development, and we conclude that M-CGH is a valuable tool for reverse vaccinology. PMID:19239696
Predicting Gene Structure Changes Resulting from Genetic Variants via Exon Definition Features.

PubMed

Majoros, William H; Holt, Carson; Campbell, Michael S; Ware, Doreen; Yandell, Mark; Reddy, Timothy E

2018-04-25

Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Underlying that discrepancy in predictive ability are the common assumptions by reference gene finding algorithms that genes are conserved, well-formed, and produce functional proteins. We describe a probabilistic approach for predicting recent changes to gene structure that may or may not conserve function. The model is applicable to both coding and noncoding genes, and can be trained on existing gene annotations without requiring curated examples of aberrant splicing. We apply this model to the problem of predicting altered splicing patterns in the genomes of individual humans, and we demonstrate that performing gene-structure prediction without relying on conserved coding features is feasible. The model predicts an unexpected abundance of variants that create de novo splice sites, an observation supported by both simulations and empirical data from RNA-seq experiments. While these de novo splice variants are commonly misinterpreted by other tools as coding or noncoding variants of little or no effect, we find that in some cases they can have large effects on splicing activity and protein products, and we propose that they may commonly act as cryptic factors in disease. The software is available from geneprediction.org/SGRF. bmajoros@duke.edu. Supplementary information is available at Bioinformatics online.
PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.

PubMed

Gan, Ruei-Chi; Chen, Ting-Wen; Wu, Timothy H; Huang, Po-Jung; Lee, Chi-Ching; Yeh, Yuan-Ming; Chiu, Cheng-Hsun; Huang, Hsien-Da; Tang, Petrus

2016-12-22

Next-generation sequencing promises the de novo genomic and transcriptomic analysis of samples of interests. However, there are only a few organisms having reference genomic sequences and even fewer having well-defined or curated annotations. For transcriptome studies focusing on organisms lacking proper reference genomes, the common strategy is de novo assembly followed by functional annotation. However, things become even more complicated when multiple transcriptomes are compared. Here, we propose a new analysis strategy and quantification methods for quantifying expression level which not only generate a virtual reference from sequencing data, but also provide comparisons between transcriptomes. First, all reads from the transcriptome datasets are pooled together for de novo assembly. The assembled contigs are searched against NCBI NR databases to find potential homolog sequences. Based on the searched result, a set of virtual transcripts are generated and served as a reference transcriptome. By using the same reference, normalized quantification values including RC (read counts), eRPKM (estimated RPKM) and eTPM (estimated TPM) can be obtained that are comparable across transcriptome datasets. In order to demonstrate the feasibility of our strategy, we implement it in the web service PARRoT. PARRoT stands for Pipeline for Analyzing RNA Reads of Transcriptomes. It analyzes gene expression profiles for two transcriptome sequencing datasets. For better understanding of the biological meaning from the comparison among transcriptomes, PARRoT further provides linkage between these virtual transcripts and their potential function through showing best hits in SwissProt, NR database, assigning GO terms. Our demo datasets showed that PARRoT can analyze two paired-end transcriptomic datasets of approximately 100 million reads within just three hours. In this study, we proposed and implemented a strategy to analyze transcriptomes from non-reference organisms which offers the opportunity to quantify and compare transcriptome profiles through a homolog based virtual transcriptome reference. By using the homolog based reference, our strategy effectively avoids the problems that may cause from inconsistencies among transcriptomes. This strategy will shed lights on the field of comparative genomics for non-model organism. We have implemented PARRoT as a web service which is freely available at http://parrot.cgu.edu.tw .
Sequencing of individual chromosomes of plant pathogenic Fusarium oxysporum.

PubMed

Kashiwa, Takeshi; Kozaki, Toshinori; Ishii, Kazuo; Turgeon, B Gillian; Teraoka, Tohru; Komatsu, Ken; Arie, Tsutomu

2017-01-01

A small chromosome in reference isolate 4287 of F. oxysporum f. sp. lycopersici (Fol) has been designated as a 'pathogenicity chromosome' because it carries several pathogenicity related genes such as the Secreted In Xylem (SIX) genes. Sequence assembly of small chromosomes in other isolates, based on a reference genome template, is difficult because of karyotype variation among isolates and a high number of sequences associated with transposable elements. These factors often result in misassembly of sequences, making it unclear whether other isolates possess the same pathogenicity chromosome harboring SIX genes as in the reference isolate. To overcome this difficulty, single chromosome sequencing after Contour-clamped Homogeneous Electric Field (CHEF) separation of chromosomes was performed, followed by de novo assembly of sequences. The assembled sequences of individual chromosomes were consistent with results of probing gels of CHEF separated chromosomes with SIX genes. Individual chromosome sequencing revealed that several SIX genes are located on a single small chromosome in two pathogenic forms of F. oxysporum, beyond the reference isolate 4287, and in the cabbage yellows fungus F. oxysporum f. sp. conglutinans. The particular combination of SIX genes on each small chromosome varied. Moreover, not all SIX genes were found on small chromosomes; depending on the isolate, some were on big chromosomes. This suggests that recombination of chromosomes and/or translocation of SIX genes may occur frequently. Our method improves sequence comparison of small chromosomes among isolates. Copyright © 2016 Elsevier Inc. All rights reserved.
A New Chicken Genome Assembly Provides Insight into Avian Genome Structure.

PubMed

Warren, Wesley C; Hillier, LaDeana W; Tomlinson, Chad; Minx, Patrick; Kremitzki, Milinn; Graves, Tina; Markovic, Chris; Bouk, Nathan; Pruitt, Kim D; Thibaud-Nissen, Francoise; Schneider, Valerie; Mansour, Tamer A; Brown, C Titus; Zimin, Aleksey; Hawken, Rachel; Abrahamsen, Mitch; Pyrkosz, Alexis B; Morisson, Mireille; Fillon, Valerie; Vignal, Alain; Chow, William; Howe, Kerstin; Fulton, Janet E; Miller, Marcia M; Lovell, Peter; Mello, Claudio V; Wirthlin, Morgan; Mason, Andrew S; Kuo, Richard; Burt, David W; Dodgson, Jerry B; Cheng, Hans H

2017-01-05

The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3), built from combined long single molecule sequencing technology, finished BACs, and improved physical maps. In overall assembled bases, we see a gain of 183 Mb, including 16.4 Mb in placed chromosomes with a corresponding gain in the percentage of intact repeat elements characterized. Of the 1.21 Gb genome, we include three previously missing autosomes, GGA30, 31, and 33, and improve sequence contig length 10-fold over the previous Gallus_gallus-4.0. Despite the significant base representation improvements made, 138 Mb of sequence is not yet located to chromosomes. When annotated for gene content, Gallus_gallus-5.0 shows an increase of 4679 annotated genes (2768 noncoding and 1911 protein-coding) over those in Gallus_gallus-4.0. We also revisited the question of what genes are missing in the avian lineage, as assessed by the highest quality avian genome assembly to date, and found that a large fraction of the original set of missing genes are still absent in sequenced bird species. Finally, our new data support a detailed map of MHC-B, encompassing two segments: one with a highly stable gene copy number and another in which the gene copy number is highly variable. The chicken model has been a critical resource for many other fields of study, and this new reference assembly will substantially further these efforts. Copyright © 2017 Warren et al.
Identification of Reference Genes for Quantitative Real Time PCR Assays in Aortic Tissue of Syrian Hamsters with Bicuspid Aortic Valve

PubMed Central

Rueda-Martínez, Carmen; Fernández, M. Carmen; Soto-Navarrete, María Teresa; Jiménez-Navarro, Manuel; Durán, Ana Carmen; Fernández, Borja

2016-01-01

Bicuspid aortic valve (BAV) is the most frequent congenital cardiac malformation in humans, and appears frequently associated with dilatation of the ascending aorta. This association is likely the result of a common aetiology. Currently, a Syrian hamster strain with a relatively high (∼40%) incidence of BAV constitutes the only spontaneous animal model of BAV disease. The characterization of molecular alterations in the aorta of hamsters with BAV may serve to identify pathophysiological mechanisms and molecular markers of disease in humans. In this report, we evaluate the expression of ten candidate reference genes in aortic tissue of hamsters in order to identify housekeeping genes for normalization using quantitative real time PCR (RT-qPCR) assays. A total of 51 adult (180–240 days old) and 56 old (300–440 days old) animals were used. They belonged to a control strain of hamsters with normal, tricuspid aortic valve (TAV; n = 30), or to the affected strain of hamsters with TAV (n = 45) or BAV (n = 32). The expression stability of the candidate reference genes was determined by RT-qPCR using three statistical algorithms, GeNorm, NormFinder and Bestkeeper. The expression analyses showed that the most stable reference genes for the three algorithms employed were Cdkn1β, G3pdh and Polr2a. We propose the use of Cdkn1β, or both Cdkn1β and G3pdh as reference genes for mRNA expression analyses in Syrian hamster aorta. PMID:27711171
Identification of Reference Genes for Quantitative Real Time PCR Assays in Aortic Tissue of Syrian Hamsters with Bicuspid Aortic Valve.

PubMed

Rueda-Martínez, Carmen; Fernández, M Carmen; Soto-Navarrete, María Teresa; Jiménez-Navarro, Manuel; Durán, Ana Carmen; Fernández, Borja

2016-01-01

Bicuspid aortic valve (BAV) is the most frequent congenital cardiac malformation in humans, and appears frequently associated with dilatation of the ascending aorta. This association is likely the result of a common aetiology. Currently, a Syrian hamster strain with a relatively high (∼40%) incidence of BAV constitutes the only spontaneous animal model of BAV disease. The characterization of molecular alterations in the aorta of hamsters with BAV may serve to identify pathophysiological mechanisms and molecular markers of disease in humans. In this report, we evaluate the expression of ten candidate reference genes in aortic tissue of hamsters in order to identify housekeeping genes for normalization using quantitative real time PCR (RT-qPCR) assays. A total of 51 adult (180-240 days old) and 56 old (300-440 days old) animals were used. They belonged to a control strain of hamsters with normal, tricuspid aortic valve (TAV; n = 30), or to the affected strain of hamsters with TAV (n = 45) or BAV (n = 32). The expression stability of the candidate reference genes was determined by RT-qPCR using three statistical algorithms, GeNorm, NormFinder and Bestkeeper. The expression analyses showed that the most stable reference genes for the three algorithms employed were Cdkn1β, G3pdh and Polr2a. We propose the use of Cdkn1β, or both Cdkn1β and G3pdh as reference genes for mRNA expression analyses in Syrian hamster aorta.
Validation of miRNA genes suitable as reference genes in qPCR analyses of miRNA gene expression in Atlantic salmon (Salmo salar).

PubMed

Johansen, Ilona; Andreassen, Rune

2014-12-23

MicroRNAs (miRNAs) are an abundant class of endogenous small RNA molecules that downregulate gene expression at the post-transcriptional level. They play important roles by regulating genes that control multiple biological processes, and recent years there has been an increased interest in studying miRNA genes and miRNA gene expression. The most common method applied to study gene expression of single genes is quantitative PCR (qPCR). However, before expression of mature miRNAs can be studied robust qPCR methods (miRNA-qPCR) must be developed. This includes identification and validation of suitable reference genes. We are particularly interested in Atlantic salmon (Salmo salar). This is an economically important aquaculture species, but no reference genes dedicated for use in miRNA-qPCR methods has been validated for this species. Our aim was, therefore, to identify suitable reference genes for miRNA-qPCR methods in Salmo salar. We used a systematic approach where we utilized similar studies in other species, some biological criteria, results from deep sequencing of small RNAs and, finally, experimental validation of candidate reference genes by qPCR to identify the most suitable reference genes. Ssa-miR-25-3p was identified as most suitable single reference gene. The best combinations of two reference genes were ssa-miR-25-3p and ssa-miR-455-5p. These two genes were constitutively and stably expressed across many different tissues. Furthermore, infectious salmon anaemia did not seem to affect their expression levels. These genes were amplified with high specificity, good efficiency and the qPCR assays showed a good linearity when applying a simple cybergreen miRNA-PCR method using miRNA gene specific forward primers. We have identified suitable reference genes for miRNA-qPCR in Atlantic salmon. These results will greatly facilitate further studies on miRNA genes in this species. The reference genes identified are conserved genes that are identical in their mature sequence in many aquaculture species. Therefore, they may also be suitable as reference genes in other teleosts. Finally, the systematic approach used in our study successfully identified suitable reference genes, suggesting that this may be a useful strategy to apply in similar validation studies in other aquaculture species.
Selection of reliable reference genes for quantitative real-time PCR gene expression analysis in Jute (Corchorus capsularis) under stress treatments

PubMed Central

Niu, Xiaoping; Qi, Jianmin; Zhang, Gaoyang; Xu, Jiantang; Tao, Aifen; Fang, Pingping; Su, Jianguang

2015-01-01

To accurately measure gene expression using quantitative reverse transcription PCR (qRT-PCR), reliable reference gene(s) are required for data normalization. Corchorus capsularis, an annual herbaceous fiber crop with predominant biodegradability and renewability, has not been investigated for the stability of reference genes with qRT-PCR. In this study, 11 candidate reference genes were selected and their expression levels were assessed using qRT-PCR. To account for the influence of experimental approach and tissue type, 22 different jute samples were selected from abiotic and biotic stress conditions as well as three different tissue types. The stability of the candidate reference genes was evaluated using geNorm, NormFinder, and BestKeeper programs, and the comprehensive rankings of gene stability were generated by aggregate analysis. For the biotic stress and NaCl stress subsets, ACT7 and RAN were suitable as stable reference genes for gene expression normalization. For the PEG stress subset, UBC, and DnaJ were sufficient for accurate normalization. For the tissues subset, four reference genes TUBβ, UBI, EF1α, and RAN were sufficient for accurate normalization. The selected genes were further validated by comparing expression profiles of WRKY15 in various samples, and two stable reference genes were recommended for accurate normalization of qRT-PCR data. Our results provide researchers with appropriate reference genes for qRT-PCR in C. capsularis, and will facilitate gene expression study under these conditions. PMID:26528312
Genome-Wide Identification and Evaluation of Reference Genes for Quantitative RT-PCR Analysis during Tomato Fruit Development.

PubMed

Cheng, Yuan; Bian, Wuying; Pang, Xin; Yu, Jiahong; Ahammed, Golam J; Zhou, Guozhi; Wang, Rongqing; Ruan, Meiying; Li, Zhimiao; Ye, Qingjing; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

2017-01-01

Gene expression analysis in tomato fruit has drawn increasing attention nowadays. Quantitative real-time PCR (qPCR) is a routine technique for gene expression analysis. In qPCR operation, reliability of results largely depends on the choice of appropriate reference genes (RGs). Although tomato is a model for fruit biology study, few RGs for qPCR analysis in tomato fruit had yet been developed. In this study, we initially identified 38 most stably expressed genes based on tomato transcriptome data set, and their expression stabilities were further determined in a set of tomato fruit samples of four different fruit developmental stages (Immature, mature green, breaker, mature red) using qPCR analysis. Two statistical algorithms, geNorm and Normfinder, concordantly determined the superiority of these identified putative RGs. Notably, SlFRG05 (Solyc01g104170), SlFRG12 (Solyc04g009770), SlFRG16 (Solyc10g081190), SlFRG27 (Solyc06g007510), and SlFRG37 (Solyc11g005330) were proved to be suitable RGs for tomato fruit development study. Further analysis using geNorm indicate that the combined use of SlFRG03 (Solyc02g063070) and SlFRG27 would provide more reliable normalization results in qPCR experiments. The identified RGs in this study will be beneficial for future qPCR analysis of tomato fruit developmental study, as well as for the potential identification of optimal normalization controls in other plant species.

Genome-Wide Identification and Evaluation of Reference Genes for Quantitative RT-PCR Analysis during Tomato Fruit Development

PubMed Central

Cheng, Yuan; Bian, Wuying; Pang, Xin; Yu, Jiahong; Ahammed, Golam J.; Zhou, Guozhi; Wang, Rongqing; Ruan, Meiying; Li, Zhimiao; Ye, Qingjing; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

2017-01-01

Gene expression analysis in tomato fruit has drawn increasing attention nowadays. Quantitative real-time PCR (qPCR) is a routine technique for gene expression analysis. In qPCR operation, reliability of results largely depends on the choice of appropriate reference genes (RGs). Although tomato is a model for fruit biology study, few RGs for qPCR analysis in tomato fruit had yet been developed. In this study, we initially identified 38 most stably expressed genes based on tomato transcriptome data set, and their expression stabilities were further determined in a set of tomato fruit samples of four different fruit developmental stages (Immature, mature green, breaker, mature red) using qPCR analysis. Two statistical algorithms, geNorm and Normfinder, concordantly determined the superiority of these identified putative RGs. Notably, SlFRG05 (Solyc01g104170), SlFRG12 (Solyc04g009770), SlFRG16 (Solyc10g081190), SlFRG27 (Solyc06g007510), and SlFRG37 (Solyc11g005330) were proved to be suitable RGs for tomato fruit development study. Further analysis using geNorm indicate that the combined use of SlFRG03 (Solyc02g063070) and SlFRG27 would provide more reliable normalization results in qPCR experiments. The identified RGs in this study will be beneficial for future qPCR analysis of tomato fruit developmental study, as well as for the potential identification of optimal normalization controls in other plant species. PMID:28900431
Selection and validation of reliable housekeeping genes to evaluate Piscirickettsia salmonis gene expression.

PubMed

Flores-Herrera, Patricio; Arredondo-Zelada, Oscar; Marshall, Sergio H; Gómez, Fernando A

2018-06-01

Piscirickettsia salmonis is a highly aggressive facultative intracellular bacterium that challenges the sustainability of Chilean salmon production. Due to the limited knowledge of its biology, there is a need to identify key molecular markers that could help define the pathogenic potential of this bacterium. We think a model system should be implemented that efficiently evaluates the expression of putative bacterial markers by using validated, stable, and highly specific housekeeping genes to properly select target genes, which could lead to identifying those responsible for infection and disease induction in naturally infected fish. Here, we selected a set of validated reference or housekeeping genes for RT-qPCR expression analyses of P. salmonis under different growth and stress conditions, including an in vitro infection kinetic. After a thorough screening, we selected sdhA as the most reliable housekeeping gene able to represent stable and highly specific host reference genes for RT-qPCR-driven P. salmonis analysis. Copyright © 2018. Published by Elsevier B.V.
Identification of Single- and Multiple-Class Specific Signature Genes from Gene Expression Profiles by Group Marker Index

PubMed Central

Tsai, Yu-Shuen; Aguan, Kripamoy; Pal, Nikhil R.; Chung, I-Fang

2011-01-01

Informative genes from microarray data can be used to construct prediction model and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low-computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method gives a promising way to find genes that can involve in pathways of multiple diseases and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases. PMID:21909426
Endogenous Reference Genes for Gene Expression Studies on Bicuspid Aortic Valve Associated Aortopathy in Humans.

PubMed

Harrison, Oliver J; Moorjani, Narain; Torrens, Christopher; Ohri, Sunil K; Cagampang, Felino R

2016-01-01

Bicuspid aortic valve (BAV) disease is the most common congenital cardiac abnormality and predisposes patients to life-threatening aortic complications including aortic aneurysm. Quantitative real-time reverse transcription PCR (qRT-PCR) is one of the most commonly used methods to investigate underlying molecular mechanisms involved in aortopathy. The accuracy of the gene expression data is dependent on normalization by appropriate housekeeping (HK) genes, whose expression should remain constant regardless of aortic valve morphology, aortic diameter and other factors associated with aortopathy. Here, we identified an appropriate set of HK genes to be used as endogenous reference for quantifying gene expression in ascending aortic tissue using a spin column-based RNA extraction method. Ascending aortic biopsies were collected intra-operatively from patients undergoing aortic valve and/or ascending aortic surgery. These patients had BAV or tricuspid aortic valve (TAV), and the aortas were either dilated (≥4.5cm) or undilated. The cohort had an even distribution of gender, valve disease and hypertension. The expression stability of 12 reference genes were investigated (ATP5B, ACTB, B2M, CYC1, EIF4A2, GAPDH, SDHA, RPL13A, TOP1, UBC, YWHAZ, and 18S) using geNorm software. The most stable HK genes were found to be GAPDH, UBC and ACTB. Both GAPDH and UBC demonstrated relative stability regardless of valve morphology, aortic diameter, gender and age. The expression of B2M and SDHA were found to be the least stable HK genes. We propose the use of GAPDH, UBC and ACTB as reference genes for gene expression studies of BAV aortopathy using ascending aortic tissue.
Pre-Clinical Drug Prioritization via Prognosis-Guided Genetic Interaction Networks

PubMed Central

Xiong, Jianghui; Liu, Juan; Rayner, Simon; Tian, Ze; Li, Yinghui; Chen, Shanguang

2010-01-01

The high rates of failure in oncology drug clinical trials highlight the problems of using pre-clinical data to predict the clinical effects of drugs. Patient population heterogeneity and unpredictable physiology complicate pre-clinical cancer modeling efforts. We hypothesize that gene networks associated with cancer outcome in heterogeneous patient populations could serve as a reference for identifying drug effects. Here we propose a novel in vivo genetic interaction which we call ‘synergistic outcome determination’ (SOD), a concept similar to ‘Synthetic Lethality’. SOD is defined as the synergy of a gene pair with respect to cancer patients' outcome, whose correlation with outcome is due to cooperative, rather than independent, contributions of genes. The method combines microarray gene expression data with cancer prognostic information to identify synergistic gene-gene interactions that are then used to construct interaction networks based on gene modules (a group of genes which share similar function). In this way, we identified a cluster of important epigenetically regulated gene modules. By projecting drug sensitivity-associated genes on to the cancer-specific inter-module network, we defined a perturbation index for each drug based upon its characteristic perturbation pattern on the inter-module network. Finally, by calculating this index for compounds in the NCI Standard Agent Database, we significantly discriminated successful drugs from a broad set of test compounds, and further revealed the mechanisms of drug combinations. Thus, prognosis-guided synergistic gene-gene interaction networks could serve as an efficient in silico tool for pre-clinical drug prioritization and rational design of combinatorial therapies. PMID:21085674
[Characterization of Black and Dichothrix Cyanobacteria Based on the 16S Ribosomal RNA Gene Sequence

NASA Technical Reports Server (NTRS)

Ortega, Maya

2010-01-01

My project focuses on characterizing different cyanobacteria in thrombolitic mats found on the island of Highborn Cay, Bahamas. Thrombolites are interesting ecosystems because of the ability of bacteria in these mats to remove carbon dioxide from the atmosphere and mineralize it as calcium carbonate. In the future they may be used as models to develop carbon sequestration technologies, which could be used as part of regenerative life systems in space. These thrombolitic communities are also significant because of their similarities to early communities of life on Earth. I targeted two cyanobacteria in my research, Dichothrix spp. and whatever black is, since they are believed to be important to carbon sequestration in these thrombolitic mats. The goal of my summer research project was to molecularly identify these two cyanobacteria. DNA was isolated from each organism through mat dissections and DNA extractions. I ran Polymerase Chain Reactions (PCR) to amplify the 16S ribosomal RNA (rRNA) gene in each cyanobacteria. This specific gene is found in almost all bacteria and is highly conserved, meaning any changes in the sequence are most likely due to evolution. As a result, the 16S rRNA gene can be used for bacterial identification of different species based on the sequence of their 16S rRNA gene. Since the exact sequence of the Dichothrix gene was unknown, I designed different primers that flanked the gene based on the known sequences from other taxonomically similar cyanobacteria. Once the 16S rRNA gene was amplified, I cloned the gene into specialized Escherichia coli cells and sent the gene products for sequencing. Once the sequence is obtained, it will be added to a genetic database for future reference to and classification of other Dichothrix sp.
Systematic prediction of gene function in Arabidopsis thaliana using a probabilistic functional gene network

PubMed Central

Hwang, Sohyun; Rhee, Seung Y; Marcotte, Edward M; Lee, Insuk

2012-01-01

AraNet is a functional gene network for the reference plant Arabidopsis and has been constructed in order to identify new genes associated with plant traits. It is highly predictive for diverse biological pathways and can be used to prioritize genes for functional screens. Moreover, AraNet provides a web-based tool with which plant biologists can efficiently discover novel functions of Arabidopsis genes (http://www.functionalnet.org/aranet/). This protocol explains how to conduct network-based prediction of gene functions using AraNet and how to interpret the prediction results. Functional discovery in plant biology is facilitated by combining candidate prioritization by AraNet with focused experimental tests. PMID:21886106
Doubled Haploid ‘CUDH2107’ as a Reference for Bulb Onion (Allium cepa L.) Research: Development of a Transcriptome Catalogue and Identification of Transcripts Associated with Male Fertility

PubMed Central

Khosa, Jiffinvir S.; Lee, Robyn; Bräuning, Sophia; Lord, Janice; Pither-Joyce, Meeghan; McCallum, John; Macknight, Richard C.

2016-01-01

Researchers working on model plants have derived great benefit from developing genomic and genetic resources using ‘reference’ genotypes. Onion has a large and highly heterozygous genome making the sharing of germplasm and analysis of sequencing data complicated. To simplify the discovery and analysis of genes underlying important onion traits, we are promoting the use of the homozygous double haploid line ‘CUDH2107’ by the onion research community. In the present investigation, we performed transcriptome sequencing on vegetative and reproductive tissues of CUDH2107 to develop a multi-organ reference transcriptome catalogue. A total of 396 million 100 base pair paired reads was assembled using the Trinity pipeline, resulting in 271,665 transcript contigs. This dataset was analysed for gene ontology and transcripts were classified on the basis of putative biological processes, molecular function and cellular localization. Significant differences were observed in transcript expression profiles between different tissues. To demonstrate the utility of our CUDH2107 transcriptome catalogue for understanding the genetic and molecular basis of various traits, we identified orthologues of rice genes involved in male fertility and flower development. These genes provide an excellent starting point for studying the molecular regulation, and the engineering of reproductive traits. PMID:27861615
Impact of Gene Patents and Licensing Practices on Access to Genetic Testing for Long QT Syndrome

PubMed Central

Angrist, Misha; Chandrasekharan, Subhashini; Heaney, Christopher; Cook-Deegan, Robert

2010-01-01

Genetic testing for Long QT syndrome (LQTS) exemplifies patenting and exclusive licensing with different outcomes at different times. Exclusive licensing from the University of Utah changed the business model from sole provider to two US providers of LQTS testing. LQTS is associated with mutations in many genes, ten of which are now tested by two competing firms in the United States, PGxHealth and GeneDx. Until 2009, PGxHealth was sole provider, based largely on exclusive rights to patents from the University of Utah and other academic institutions. University of Utah patents were initially licensed to DNA Sciences, whose patent rights were acquired by Gennaissance, and then by Clinical Data, Inc., which owns PGxHealth. In 2002, DNA Sciences “cleared the market” by sending cease and desist patent enforcement letters to university and reference laboratories offering LQTS genetic testing. There was no test on the market for a one- to two-year period. From 2005-2008, most LQTS-related patents were controlled by Clinical Data, Inc., and its subsidiary PGxHealth. BioReference Laboratories, Inc., secured countervailing exclusive patent rights starting in 2006, also from the University of Utah, and broke the PGxHealth monopoly in early 2009, creating a duopoly for genetic testing in the United States, and expanding the number of genes for which commercial testing is available from five to ten. PMID:20393304
Estimation of gene induction enables a relevance-based ranking of gene sets.

PubMed

Bartholomé, Kilian; Kreutz, Clemens; Timmer, Jens

2009-07-01

In order to handle and interpret the vast amounts of data produced by microarray experiments, the analysis of sets of genes with a common biological functionality has been shown to be advantageous compared to single gene analyses. Some statistical methods have been proposed to analyse the differential gene expression of gene sets in microarray experiments. However, most of these methods either require threshhold values to be chosen for the analysis, or they need some reference set for the determination of significance. We present a method that estimates the number of differentially expressed genes in a gene set without requiring a threshold value for significance of genes. The method is self-contained (i.e., it does not require a reference set for comparison). In contrast to other methods which are focused on significance, our approach emphasizes the relevance of the regulation of gene sets. The presented method measures the degree of regulation of a gene set and is a useful tool to compare the induction of different gene sets and place the results of microarray experiments into the biological context. An R-package is available.
PMS2 gene mutational analysis: direct cDNA sequencing to circumvent pseudogene interference.

PubMed

Wimmer, Katharina; Wernstedt, Annekatrin

2014-01-01

The presence of highly homologous pseudocopies can compromise the mutation analysis of a gene of interest. In particular, when using PCR-based strategies, pseudogene co-amplification has to be effectively prevented. This is often achieved by using primers designed to be parental gene specific according to the reference sequence and by applying stringent PCR conditions. However, there are cases in which this approach is of limited utility. For example, it has been shown that the PMS2 gene exchanges sequences with one of its pseudogenes, named PMS2CL. This results in functional PMS2 alleles containing pseudogene-derived sequences at their 3'-end and in nonfunctional PMS2CL pseudogene alleles that contain gene-derived sequences. Hence, the paralogues cannot be distinguished according to the reference sequence. This shortcoming can be effectively circumvented by using direct cDNA sequencing. This approach is based on the selective amplification of PMS2 transcripts in two overlapping 1.6-kb RT-PCR products. In addition to avoiding pseudogene co-amplification and allele dropout, this method has also the advantage that it allows to effectively identify deletions, splice mutations, and de novo retrotransposon insertions that escape the detection of most DNA-based mutation analysis protocols.
De novo characterization of the gene-rich transcriptomes of two color-polymorphic spiders, Theridion grallator and T. californicum (Araneae: Theridiidae), with special reference to pigment genes.

PubMed

Croucher, Peter J P; Brewer, Michael S; Winchell, Christopher J; Oxford, Geoff S; Gillespie, Rosemary G

2013-12-08

A number of spider species within the family Theridiidae exhibit a dramatic abdominal (opisthosomal) color polymorphism. The polymorphism is inherited in a broadly Mendelian fashion and in some species consists of dozens of discrete morphs that are convergent across taxa and populations. Few genomic resources exist for spiders. Here, as a first necessary step towards identifying the genetic basis for this trait we present the near complete transcriptomes of two species: the Hawaiian happy-face spider Theridion grallator and Theridion californicum. We mined the gene complement for pigment-pathway genes and examined differential expression (DE) between morphs that are unpatterned (plain yellow) and patterned (yellow with superimposed patches of red, white or very dark brown). By deep sequencing both RNA-seq and normalized cDNA libraries from pooled specimens of each species we were able to assemble a comprehensive gene set for both species that we estimate to be 98-99% complete. It is likely that these species express more than 20,000 protein-coding genes, perhaps 4.5% (ca. 870) of which might be unique to spiders. Mining for pigment-associated Drosophila melanogaster genes indicated the presence of all ommochrome pathway genes and most pteridine pathway genes and DE analyses further indicate a possible role for the pteridine pathway in theridiid color patterning. Based upon our estimates, T. grallator and T. californicum express a large inventory of protein-coding genes. Our comprehensive assembly illustrates the continuing value of sequencing normalized cDNA libraries in addition to RNA-seq in order to generate a reference transcriptome for non-model species. The identification of pteridine-related genes and their possible involvement in color patterning is a novel finding in spiders and one that suggests a biochemical link between guanine deposits and the pigments exhibited by these species.
Validation of Reference Genes for Real-Time Quantitative PCR (qPCR) Analysis of Avibacterium paragallinarum.

PubMed

Wen, Shuxiang; Chen, Xiaoling; Xu, Fuzhou; Sun, Huiling

2016-01-01

Real-time quantitative reverse transcription PCR (qRT-PCR) offers a robust method for measurement of gene expression levels. Selection of reliable reference gene(s) for gene expression study is conducive to reduce variations derived from different amounts of RNA and cDNA, the efficiency of the reverse transcriptase or polymerase enzymes. Until now reference genes identified for other members of the family Pasteurellaceae have not been validated for Avibacterium paragallinarum. The aim of this study was to validate nine reference genes of serovars A, B, and C strains of A. paragallinarum in different growth phase by qRT-PCR. Three of the most widely used statistical algorithms, geNorm, NormFinder and ΔCT method were used to evaluate the expression stability of reference genes. Data analyzed by overall rankings showed that in exponential and stationary phase of serovar A, the most stable reference genes were gyrA and atpD respectively; in exponential and stationary phase of serovar B, the most stable reference genes were atpD and recN respectively; in exponential and stationary phase of serovar C, the most stable reference genes were rpoB and recN respectively. This study provides recommendations for stable endogenous control genes for use in further studies involving measurement of gene expression levels.
DNA-Based Methods in the Immunohematology Reference Laboratory

PubMed Central

Denomme, Gregory A

2010-01-01

Although hemagglutination serves the immunohematology reference laboratory well, when used alone, it has limited capability to resolve complex problems. This overview discusses how molecular approaches can be used in the immunohematology reference laboratory. In order to apply molecular approaches to immunohematology, knowledge of genes, DNA-based methods, and the molecular bases of blood groups are required. When applied correctly, DNA-based methods can predict blood groups to resolve ABO/Rh discrepancies, identify variant alleles, and screen donors for antigen-negative units. DNA-based testing in immunohematology is a valuable tool used to resolve blood group incompatibilities and to support patients in their transfusion needs. PMID:21257350
Genome variations associated with viral susceptibility and calcification in Emiliania huxleyi.

PubMed

Kegel, Jessica U; John, Uwe; Valentin, Klaus; Frickenhaus, Stephan

2013-01-01

Emiliania huxleyi, a key player in the global carbon cycle is one of the best studied coccolithophores with respect to biogeochemical cycles, climatology, and host-virus interactions. Strains of E. huxleyi show phenotypic plasticity regarding growth behaviour, light-response, calcification, acidification, and virus susceptibility. This phenomenon is likely a consequence of genomic differences, or transcriptomic responses, to environmental conditions or threats such as viral infections. We used an E. huxleyi genome microarray based on the sequenced strain CCMP1516 (reference strain) to perform comparative genomic hybridizations (CGH) of 16 E. huxleyi strains of different geographic origin. We investigated the genomic diversity and plasticity and focused on the identification of genes related to virus susceptibility and coccolith production (calcification). Among the tested 31940 gene models a core genome of 14628 genes was identified by hybridization among 16 E. huxleyi strains. 224 probes were characterized as specific for the reference strain CCMP1516. Compared to the sequenced E. huxleyi strain CCMP1516 variation in gene content of up to 30 percent among strains was observed. Comparison of core and non-core transcripts sets in terms of annotated functions reveals a broad, almost equal functional coverage over all KOG-categories of both transcript sets within the whole annotated genome. Within the variable (non-core) genome we identified genes associated with virus susceptibility and calcification. Genes associated with virus susceptibility include a Bax inhibitor-1 protein, three LRR receptor-like protein kinases, and mitogen-activated protein kinase. Our list of transcripts associated with coccolith production will stimulate further research, e.g. by genetic manipulation. In particular, the V-type proton ATPase 16 kDa proteolipid subunit is proposed to be a plausible target gene for further calcification studies.
Genome Variations Associated with Viral Susceptibility and Calcification in Emiliania huxleyi

PubMed Central

Kegel, Jessica U.; John, Uwe; Valentin, Klaus; Frickenhaus, Stephan

2013-01-01

Emiliania huxleyi, a key player in the global carbon cycle is one of the best studied coccolithophores with respect to biogeochemical cycles, climatology, and host-virus interactions. Strains of E. huxleyi show phenotypic plasticity regarding growth behaviour, light-response, calcification, acidification, and virus susceptibility. This phenomenon is likely a consequence of genomic differences, or transcriptomic responses, to environmental conditions or threats such as viral infections. We used an E. huxleyi genome microarray based on the sequenced strain CCMP1516 (reference strain) to perform comparative genomic hybridizations (CGH) of 16 E. huxleyi strains of different geographic origin. We investigated the genomic diversity and plasticity and focused on the identification of genes related to virus susceptibility and coccolith production (calcification). Among the tested 31940 gene models a core genome of 14628 genes was identified by hybridization among 16 E. huxleyi strains. 224 probes were characterized as specific for the reference strain CCMP1516. Compared to the sequenced E. huxleyi strain CCMP1516 variation in gene content of up to 30 percent among strains was observed. Comparison of core and non-core transcripts sets in terms of annotated functions reveals a broad, almost equal functional coverage over all KOG-categories of both transcript sets within the whole annotated genome. Within the variable (non-core) genome we identified genes associated with virus susceptibility and calcification. Genes associated with virus susceptibility include a Bax inhibitor-1 protein, three LRR receptor-like protein kinases, and mitogen-activated protein kinase. Our list of transcripts associated with coccolith production will stimulate further research, e.g. by genetic manipulation. In particular, the V-type proton ATPase 16 kDa proteolipid subunit is proposed to be a plausible target gene for further calcification studies. PMID:24260453
Trainable Gene Regulation Networks with Applications to Drosophila Pattern Formation

NASA Technical Reports Server (NTRS)

Mjolsness, Eric

2000-01-01

This chapter will very briefly introduce and review some computational experiments in using trainable gene regulation network models to simulate and understand selected episodes in the development of the fruit fly, Drosophila melanogaster. For details the reader is referred to the papers introduced below. It will then introduce a new gene regulation network model which can describe promoter-level substructure in gene regulation. As described in chapter 2, gene regulation may be thought of as a combination of cis-acting regulation by the extended promoter of a gene (including all regulatory sequences) by way of the transcription complex, and of trans-acting regulation by the transcription factor products of other genes. If we simplify the cis-action by using a phenomenological model which can be tuned to data, such as a unit or other small portion of an artificial neural network, then the full transacting interaction between multiple genes during development can be modelled as a larger network which can again be tuned or trained to data. The larger network will in general need to have recurrent (feedback) connections since at least some real gene regulation networks do. This is the basic modeling approach taken, which describes how a set of recurrent neural networks can be used as a modeling language for multiple developmental processes including gene regulation within a single cell, cell-cell communication, and cell division. Such network models have been called "gene circuits", "gene regulation networks", or "genetic regulatory networks", sometimes without distinguishing the models from the actual modeled systems.
Selection of novel reference genes for use in the human central nervous system: a BrainNet Europe Study.

PubMed

Durrenberger, Pascal F; Fernando, Francisca S; Magliozzi, Roberta; Kashefi, Samira N; Bonnert, Timothy P; Ferrer, Isidro; Seilhean, Danielle; Nait-Oumesmar, Brahim; Schmitt, Andrea; Gebicke-Haerter, Peter J; Falkai, Peter; Grünblatt, Edna; Palkovits, Miklos; Parchi, Piero; Capellari, Sabina; Arzberger, Thomas; Kretzschmar, Hans; Roncaroli, Federico; Dexter, David T; Reynolds, Richard

2012-12-01

The use of an appropriate reference gene to ensure accurate normalisation is crucial for the correct quantification of gene expression using qPCR assays and RNA arrays. The main criterion for a gene to qualify as a reference gene is a stable expression across various cell types and experimental settings. Several reference genes are commonly in use but more and more evidence reveals variations in their expression due to the presence of on-going neuropathological disease processes, raising doubts concerning their use. We conducted an analysis of genome-wide changes of gene expression in the human central nervous system (CNS) covering several neurological disorders and regions, including the spinal cord, and were able to identify a number of novel stable reference genes. We tested the stability of expression of eight novel (ATP5E, AARS, GAPVD1, CSNK2B, XPNPEP1, OSBP, NAT5 and DCTN2) and four more commonly used (BECN1, GAPDH, QARS and TUBB) reference genes in a smaller cohort using RT-qPCR. The most stable genes out of the 12 reference genes were tested as normaliser to validate increased levels of a target gene in CNS disease. We found that in human post-mortem tissue the novel reference genes, XPNPEP1 and AARS, were efficient in replicating microarray target gene expression levels and that XPNPEP1 was more efficient as a normaliser than BECN1, which has been shown to change in expression as a consequence of neuronal cell loss. We provide herein one more suitable novel reference gene, XPNPEP1, with no current neuroinflammatory or neurodegenerative associations that can be used for gene quantitative gene expression studies with human CNS post-mortem tissue and also suggest a list of potential other candidates. These data also emphasise the importance of organ/tissue-specific stably expressed genes as reference genes for RNA studies.
Effect of endogenous reference genes on digital PCR assessment of genetically engineered canola events.

PubMed

Demeke, Tigst; Eng, Monika

2018-05-01

Droplet digital PCR (ddPCR) has been used for absolute quantification of genetically engineered (GE) events. Absolute quantification of GE events by duplex ddPCR requires the use of appropriate primers and probes for target and reference gene sequences in order to accurately determine the amount of GE materials. Single copy reference genes are generally preferred for absolute quantification of GE events by ddPCR. Study has not been conducted on a comparison of reference genes for absolute quantification of GE canola events by ddPCR. The suitability of four endogenous reference sequences ( HMG-I/Y , FatA(A), CruA and Ccf) for absolute quantification of GE canola events by ddPCR was investigated. The effect of DNA extraction methods and DNA quality on the assessment of reference gene copy numbers was also investigated. ddPCR results were affected by the use of single vs. two copy reference genes. The single copy, FatA(A), reference gene was found to be stable and suitable for absolute quantification of GE canola events by ddPCR. For the copy numbers measured, the HMG-I/Y reference gene was less consistent than FatA(A) reference gene. The expected ddPCR values were underestimated when CruA and Ccf (two copy endogenous Cruciferin sequences) were used because of high number of copies. It is important to make an adjustment if two copy reference genes are used for ddPCR in order to obtain accurate results. On the other hand, real-time quantitative PCR results were not affected by the use of single vs. two copy reference genes.
Gene annotation from scientific literature using mappings between keyword systems.

PubMed

Pérez, Antonio J; Perez-Iratxeta, Carolina; Bork, Peer; Thode, Guillermo; Andrade, Miguel A

2004-09-01

The description of genes in databases by keywords helps the non-specialist to quickly grasp the properties of a gene and increases the efficiency of computational tools that are applied to gene data (e.g. searching a gene database for sequences related to a particular biological process). However, the association of keywords to genes or protein sequences is a difficult process that ultimately implies examination of the literature related to a gene. To support this task, we present a procedure to derive keywords from the set of scientific abstracts related to a gene. Our system is based on the automated extraction of mappings between related terms from different databases using a model of fuzzy associations that can be applied with all generality to any pair of linked databases. We tested the system by annotating genes of the SWISS-PROT database with keywords derived from the abstracts linked to their entries (stored in the MEDLINE database of scientific references). The performance of the annotation procedure was much better for SWISS-PROT keywords (recall of 47%, precision of 68%) than for Gene Ontology terms (recall of 8%, precision of 67%). The algorithm can be publicly accessed and used for the annotation of sequences through a web server at http://www.bork.embl.de/kat

Congruent Deep Relationships in the Grape Family (Vitaceae) Based on Sequences of Chloroplast Genomes and Mitochondrial Genes via Genome Skimming

PubMed Central

Zhang, Ning; Wen, Jun; Zimmer, Elizabeth A.

2015-01-01

Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study, next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina HiSeq 2500 instrument. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera) methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data: especially from chloroplast and mitochondrial DNAs. PMID:26656830
Congruent Deep Relationships in the Grape Family (Vitaceae) Based on Sequences of Chloroplast Genomes and Mitochondrial Genes via Genome Skimming.

PubMed

Zhang, Ning; Wen, Jun; Zimmer, Elizabeth A

2015-01-01

Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study,next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina NextSeq 500 instrument [corrected]. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera) methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data: especially from chloroplast and mitochondrial DNAs.
Low-level lasers alter mRNA levels from traditional reference genes used in breast cancer cells

NASA Astrophysics Data System (ADS)

Teixeira, A. F.; Canuto, K. S.; Rodrigues, J. A.; Fonseca, A. S.; Mencalha, A. L.

2017-07-01

Cancer is among the leading causes of mortality worldwide, increasing the importance of treatment development. Low-level lasers are used in several diseases, but some concerns remains on cancers. Reverse transcriptase quantitative polymerase chain reaction (RT-qPCR) is a technique used to understand cellular behavior through quantification of mRNA levels. Output data from target genes are commonly relative to a reference that cannot vary according to treatment. This study evaluated reference genes levels from MDA-MB-231 cells exposed to red or infrared lasers at different fluences. Cultures were exposed to red and infrared lasers, incubated (4 h, 37 °C), total RNA was extracted and cDNA synthesis was performed to evaluate mRNA levels from ACTB, GUSB and TRFC genes by RT-qPCR. Specific amplification was verified by melting curves and agarose gel electrophoresis. RefFinder enabled data analysis by geNorm, NormFinder and BestKeeper. Specific amplifications were obtained and, although mRNA levels from ACTB, GUSB or TRFC genes presented no significant variation through traditional statistical analysis, Excel-based tools revealed that the use of these reference genes are dependent of laser characteristics. Our data showed that exposure to low-level red and infrared lasers at different fluences alter the mRNA levels from ACTB, GUSB and TRFC in MDA-MB-231 cells.
Selection of reference genes for miRNA qRT-PCR under abiotic stress in grapevine.

PubMed

Luo, Meng; Gao, Zhen; Li, Hui; Li, Qin; Zhang, Caixi; Xu, Wenping; Song, Shiren; Ma, Chao; Wang, Shiping

2018-03-13

Grapevine is among the fruit crops with high economic value, and because of the economic losses caused by abiotic stresses, the stress resistance of Vitis vinifera has become an increasingly important research area. Among the mechanisms responding to environmental stresses, the role of miRNA has received much attention recently. qRT-PCR is a powerful method for miRNA quantitation, but the accuracy of the method strongly depends on the appropriate reference genes. To determine the most suitable reference genes for grapevine miRNA qRT-PCR, 15 genes were chosen as candidate reference genes. After eliminating 6 candidate reference genes with unsatisfactory amplification efficiency, the expression stability of the remaining candidate reference genes under salinity, cold and drought was analysed using four algorithms, geNorm, NormFinder, deltaCt and Bestkeeper. The results indicated that U6 snRNA was the most suitable reference gene under salinity and cold stresses; whereas miR168 was the best for drought stress. The best reference gene sets for salinity, cold and drought stresses were miR160e + miR164a, miR160e + miR168 and ACT + UBQ + GAPDH, respectively. The selected reference genes or gene sets were verified using miR319 or miR408 as the target gene.
[Selection of reference genes of Siraitia grosvenorii by real-time PCR].

PubMed

Tu, Dong-ping; Mo, Chang-ming; Ma, Xiao-jun; Zhao, Huan; Tang, Qi; Huang, Jie; Pan, Li-mei; Wei, Rong-chang

2015-01-01

Siraitia grosvenorii is a traditional Chinese medicine also as edible food. This study selected six candidate reference genes by real-time quantitative PCR, the expression stability of the candidate reference genes in the different samples was analyzed by using the software and methods of geNorm, NormFinder, BestKeeper, Delta CT method and RefFinder, reference genes for S. grosvenorii were selected for the first time. The results showed that 18SrRNA expressed most stable in all samples, was the best reference gene in the genetic analysis. The study has a guiding role for the analysis of gene expression using qRT-PCR methods, providing a suitable reference genes to ensure the results in the study on differential expressed gene in synthesis and biological pathways, also other genes of S. grosvenorii.
Evaluation of Reference Genes for Quantitative Real-Time PCR Analysis of the Gene Expression in Laticifers on the Basis of Latex Flow in Rubber Tree (Hevea brasiliensis Muell. Arg.)

PubMed Central

Chao, Jinquan; Yang, Shuguang; Chen, Yueyi; Tian, Wei-Min

2016-01-01

Latex exploitation-caused latex flow is effective in enhancing latex regeneration in laticifer cells of rubber tree. It should be suitable for screening appropriate reference gene for analysis of the expression of latex regeneration-related genes by quantitative real-time PCR (qRT-PCR). In the present study, the expression stability of 23 candidate reference genes was evaluated on the basis of latex flow by using geNorm and NormFinder algorithms. Ubiquitin-protein ligase 2a (UBC2a) and ubiquitin-protein ligase 2b (UBC2b) were the two most stable genes among the selected candidate references in rubber tree clones with differential duration of latex flow. The two genes were also high-ranked in previous reference gene screening across different tissues and experimental conditions. By contrast, the transcripts of latex regeneration-related genes fluctuated significantly during latex flow. The results suggest that screening reference gene during latex flow should be an efficient and effective clue for selection of reference genes in qRT-PCR. PMID:27524995
Identification and Evaluation of Reliable Reference Genes for Quantitative Real-Time PCR Analysis in Tea Plant (Camellia sinensis (L.) O. Kuntze)

PubMed Central

Hao, Xinyuan; Horvath, David P.; Chao, Wun S.; Yang, Yajun; Wang, Xinchao; Xiao, Bin

2014-01-01

Reliable reference selection for the accurate quantification of gene expression under various experimental conditions is a crucial step in qRT-PCR normalization. To date, only a few housekeeping genes have been identified and used as reference genes in tea plant. The validity of those reference genes are not clear since their expression stabilities have not been rigorously examined. To identify more appropriate reference genes for qRT-PCR studies on tea plant, we examined the expression stability of 11 candidate reference genes from three different sources: the orthologs of Arabidopsis traditional reference genes and stably expressed genes identified from whole-genome GeneChip studies, together with three housekeeping gene commonly used in tea plant research. We evaluated the transcript levels of these genes in 94 experimental samples. The expression stabilities of these 11 genes were ranked using four different computation programs including geNorm, Normfinder, BestKeeper, and the comparative ∆CT method. Results showed that the three commonly used housekeeping genes of CsTUBULIN1, CsACINT1 and Cs18S rRNA1 together with CsUBQ1 were the most unstable genes in all sample ranking order. However, CsPTB1, CsEF1, CsSAND1, CsCLATHRIN1 and CsUBC1 were the top five appropriate reference genes for qRT-PCR analysis in complex experimental conditions. PMID:25474086
Analysis of bHLH coding genes using gene co-expression network approach.

PubMed

Srivastava, Swati; Sanchita; Singh, Garima; Singh, Noopur; Srivastava, Gaurava; Sharma, Ashok

2016-07-01

Network analysis provides a powerful framework for the interpretation of data. It uses novel reference network-based metrices for module evolution. These could be used to identify module of highly connected genes showing variation in co-expression network. In this study, a co-expression network-based approach was used for analyzing the genes from microarray data. Our approach consists of a simple but robust rank-based network construction. The publicly available gene expression data of Solanum tuberosum under cold and heat stresses were considered to create and analyze a gene co-expression network. The analysis provide highly co-expressed module of bHLH coding genes based on correlation values. Our approach was to analyze the variation of genes expression, according to the time period of stress through co-expression network approach. As the result, the seed genes were identified showing multiple connections with other genes in the same cluster. Seed genes were found to be vary in different time periods of stress. These analyzed seed genes may be utilized further as marker genes for developing the stress tolerant plant species.
Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

PubMed

Caracausi, Maria; Piovesan, Allison; Antonaros, Francesca; Strippoli, Pierluigi; Vitale, Lorenza; Pelleri, Maria Chiara

2017-09-01

The ideal reference, or control, gene for the study of gene expression in a given organism should be expressed at a medium‑high level for easy detection, should be expressed at a constant/stable level throughout different cell types and within the same cell type undergoing different treatments, and should maintain these features through as many different tissues of the organism. From a biological point of view, these theoretical requirements of an ideal reference gene appear to be best suited to housekeeping (HK) genes. Recent advancements in the quality and completeness of human expression microarray data and in their statistical analysis may provide new clues toward the quantitative standardization of human gene expression studies in biology and medicine, both cross‑ and within‑tissue. The systematic approach used by the present study is based on the Transcriptome Mapper tool and exploits the automated reassignment of probes to corresponding genes, intra‑ and inter‑sample normalization, elaboration and representation of gene expression values in linear form within an indexed and searchable database with a graphical interface recording quantitative levels of expression, expression variability and cross‑tissue width of expression for more than 31,000 transcripts. The present study conducted a meta‑analysis of a pool of 646 expression profile data sets from 54 different human tissues and identified actin γ 1 as the HK gene that best fits the combination of all the traditional criteria to be used as a reference gene for general use; two ribosomal protein genes, RPS18 and RPS27, and one aquaporin gene, POM121 transmembrane nucleporin C, were also identified. The present study provided a list of tissue‑ and organ‑specific genes that may be most suited for the following individual tissues/organs: Adipose tissue, bone marrow, brain, heart, kidney, liver, lung, ovary, skeletal muscle and testis; and also provides in these cases a representative, quantitative portrait of the relative, typical gene‑expression profile in the form of searchable database tables.
Phylogeny-dominant classification of J-proteins in Arabidopsis thaliana and Brassica oleracea.

PubMed

Zhang, Bin; Qiu, Han-Lin; Qu, Dong-Hai; Ruan, Ying; Chen, Dong-Hong

2018-04-05

Hsp40s or DnaJ/J-proteins are evolutionarily conserved in all organisms as co-chaperones of molecular chaperone HSP70s that mainly participate in maintaining cellular protein homeostasis, such as protein folding, assembly, stabilization, and translocation under normal conditions as well as refolding and degradation under environmental stresses. It has been reported that Arabidopsis J-proteins are classified into four classes (types A-D) according to domain organization, but their phylogenetic relationships are unknown. Here, we identified 129 J-proteins in the world-wide popular vegetable Brassica oleracea, a close relative of the model plant Arabidopsis, and also revised the information of Arabidopsis J-proteins based on the latest online bioresources. According to phylogenetic analysis with domain organization and gene structure as references, the J-proteins from Arabidopsis and B. oleracea were classified into 15 main clades (I-XV) separated by a number of undefined small branches with remote relationship. Based on the number of members, they respectively belong to multigene clades, oligo-gene clades, and mono-gene clades. The J-protein genes from different clades may function together or separately to constitute a complicated regulatory network. This study provides a constructive viewpoint for J-protein classification and an informative platform for further functional dissection and resistant genes discovery related to genetic improvement of crop plants.
Selection of Reference Genes for RT-qPCR Analysis in Coccinella septempunctata to Assess Un-intended Effects of RNAi Transgenic Plants.

PubMed

Yang, Chunxiao; Preisser, Evan L; Zhang, Hongjun; Liu, Yong; Dai, Liangying; Pan, Huipeng; Zhou, Xuguo

2016-01-01

The development of genetically engineered plants that employ RNA interference (RNAi) to suppress invertebrate pests opens up new avenues for insect control. While this biotechnology shows tremendous promise, the potential for both non-target and off-target impacts, which likely manifest via altered mRNA expression in the exposed organisms, remains a major concern. One powerful tool for the analysis of these un-intended effects is reverse transcriptase-quantitative polymerase chain reaction, a technique for quantifying gene expression using a suite of reference genes for normalization. The seven-spotted ladybeetle Coccinella septempunctata , a commonly used predator in both classical and augmentative biological controls, is a model surrogate species used in the environmental risk assessment (ERA) of plant incorporated protectants (PIPs). Here, we assessed the suitability of eight reference gene candidates for the normalization and analysis of C. septempunctata v-ATPase A gene expression under both biotic and abiotic conditions. Five computational tools with distinct algorisms, geNorm, Normfinder, BestKeeper , the Δ C t method, and RefFinder , were used to evaluate the stability of these candidates. As a result, unique sets of reference genes were recommended, respectively, for experiments involving different developmental stages, tissues, and ingested dsRNAs. By providing a foundation for standardized RT-qPCR analysis in C. septempunctata , our work improves the accuracy and replicability of the ERA of PIPs involving RNAi transgenic plants.
Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis.

PubMed

dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

2015-01-01

Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis.
Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis

PubMed Central

dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

2015-01-01

Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis. PMID:26393928
Identification and validation of reference genes for normalization of gene expression analysis using qRT-PCR in Helicoverpa armigera (Lepidoptera: Noctuidae).

PubMed

Zhang, Songdou; An, Shiheng; Li, Zhen; Wu, Fengming; Yang, Qingpo; Liu, Yichen; Cao, Jinjun; Zhang, Huaijiang; Zhang, Qingwen; Liu, Xiaoxia

2015-01-25

Recent studies have focused on determining functional genes and microRNAs in the pest Helicoverpa armigera (Lepidoptera: Noctuidae). Most of these studies used quantitative real-time PCR (qRT-PCR). Suitable reference genes are necessary to normalize gene expression data of qRT-PCR. However, a comprehensive study on the reference genes in H. armigera remains lacking. Twelve candidate reference genes of H. armigera were selected and evaluated for their expression stability under different biotic and abiotic conditions. The comprehensive stability ranking of candidate reference genes was recommended by RefFinder and the optimal number of reference genes was calculated by geNorm. Two target genes, thioredoxin (TRX) and Cu/Zn superoxide dismutase (SOD), were used to validate the selection of reference genes. Results showed that the most suitable candidate combinations of reference genes were as follows: 28S and RPS15 for developmental stages; RPS15 and RPL13 for larvae tissues; EF and RPL27 for adult tissues; GAPDH, RPL27, and β-TUB for nuclear polyhedrosis virus infection; RPS15 and RPL32 for insecticide treatment; RPS15 and RPL27 for temperature treatment; and RPL32, RPS15, and RPL27 for all samples. This study not only establishes an accurate method for normalizing qRT-PCR data in H. armigera but also serve as a reference for further study on gene transcription in H. armigera and other insects. Copyright © 2014 Elsevier B.V. All rights reserved.
Improved annotation of the insect vector of citrus greening disease: biocuration by a diverse genomics community

PubMed Central

Hosmani, Prashant S.; Villalobos-Ayala, Krystal; Miller, Sherry; Shippy, Teresa; Flores, Mirella; Rosendale, Andrew; Cordola, Chris; Bell, Tracey; Mann, Hannah; DeAvila, Gabe; DeAvila, Daniel; Moore, Zachary; Buller, Kyle; Ciolkevich, Kathryn; Nandyal, Samantha; Mahoney, Robert; Van Voorhis, Joshua; Dunlevy, Megan; Farrow, David; Hunter, David; Morgan, Taylar; Shore, Kayla; Guzman, Victoria; Izsak, Allison; Dixon, Danielle E.; Cridge, Andrew; Cano, Liliana; Cao, Xiaolong; Jiang, Haobo; Leng, Nan; Johnson, Shannon; Cantarel, Brandi L.; Richards, Stephen; English, Adam; Shatters, Robert G.; Childers, Chris; Chen, Mei-Ju; Hunter, Wayne; Cilia, Michelle; Mueller, Lukas A.; Munoz-Torres, Monica; Nelson, David; Poelchau, Monica F.; Benoit, Joshua B.; Wiersma-Koch, Helen; D’Elia, Tom; Brown, Susan J.

2017-01-01

Abstract The Asian citrus psyllid (Diaphorina citri Kuwayama) is the insect vector of the bacterium Candidatus Liberibacter asiaticus (CLas), the pathogen associated with citrus Huanglongbing (HLB, citrus greening). HLB threatens citrus production worldwide. Suppression or reduction of the insect vector using chemical insecticides has been the primary method to inhibit the spread of citrus greening disease. Accurate structural and functional annotation of the Asian citrus psyllid genome, as well as a clear understanding of the interactions between the insect and CLas, are required for development of new molecular-based HLB control methods. A draft assembly of the D. citri genome has been generated and annotated with automated pipelines. However, knowledge transfer from well-curated reference genomes such as that of Drosophila melanogaster to newly sequenced ones is challenging due to the complexity and diversity of insect genomes. To identify and improve gene models as potential targets for pest control, we manually curated several gene families with a focus on genes that have key functional roles in D. citri biology and CLas interactions. This community effort produced 530 manually curated gene models across developmental, physiological, RNAi regulatory and immunity-related pathways. As previously shown in the pea aphid, RNAi machinery genes putatively involved in the microRNA pathway have been specifically duplicated. A comprehensive transcriptome enabled us to identify a number of gene families that are either missing or misassembled in the draft genome. In order to develop biocuration as a training experience, we included undergraduate and graduate students from multiple institutions, as well as experienced annotators from the insect genomics research community. The resulting gene set (OGS v1.0) combines both automatically predicted and manually curated gene models. Database URL: https://citrusgreening.org/ PMID:29220441
New Rodent Population Models May Inform Human Health Risk Assessment and Identification of Genetic Susceptibility to Environmental Exposures.

PubMed

Harrill, Alison H; McAllister, Kimberly A

2017-08-15

This paper provides an introduction for environmental health scientists to emerging population-based rodent resources. Mouse reference populations provide an opportunity to model environmental exposures and gene-environment interactions in human disease and to inform human health risk assessment. This review will describe several mouse populations for toxicity assessment, including older models such as the Mouse Diversity Panel (MDP), and newer models that include the Collaborative Cross (CC) and Diversity Outbred (DO) models. This review will outline the features of the MDP, CC, and DO mouse models and will discuss published case studies investigating the use of these mouse population resources in each step of the risk assessment paradigm. These unique resources have the potential to be powerful tools for generating hypotheses related to gene-environment interplay in human disease, performing controlled exposure studies to understand the differential responses in humans for susceptibility or resistance to environmental exposures, and identifying gene variants that influence sensitivity to toxicity and disease states. These new resources offer substantial advances to classical toxicity testing paradigms by including genetically sensitive individuals that may inform toxicity risks for sensitive subpopulations. Both in vivo and complementary in vitro resources provide platforms with which to reduce uncertainty by providing population-level data around biological variability. https://doi.org/10.1289/EHP1274.
Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

PubMed Central

Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

2012-01-01

The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697
Defining suitable reference genes for RT-qPCR analysis on human sertoli cells after 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) exposure.

PubMed

Ribeiro, Mariana Antunes; dos Reis, Mariana Bisarro; de Moraes, Leonardo Nazário; Briton-Jones, Christine; Rainho, Cláudia Aparecida; Scarano, Wellerson Rodrigo

2014-11-01

Quantitative real-time RT-PCR (qPCR) has proven to be a valuable molecular technique to quantify gene expression. There are few studies in the literature that describe suitable reference genes to normalize gene expression data. Studies of transcriptionally disruptive toxins, like tetrachlorodibenzo-p-dioxin (TCDD), require careful consideration of reference genes. The present study was designed to validate potential reference genes in human Sertoli cells after exposure to TCDD. 32 candidate reference genes were analyzed to determine their applicability. geNorm and NormFinder softwares were used to obtain an estimation of the expression stability of the 32 genes and to identify the most suitable genes for qPCR data normalization.
A Novel Strategy for Selection and Validation of Reference Genes in Dynamic Multidimensional Experimental Design in Yeast

PubMed Central

Cankorur-Cetinkaya, Ayca; Dereli, Elif; Eraslan, Serpil; Karabekmez, Erkan; Dikicioglu, Duygu; Kirdar, Betul

2012-01-01

Background Understanding the dynamic mechanism behind the transcriptional organization of genes in response to varying environmental conditions requires time-dependent data. The dynamic transcriptional response obtained by real-time RT-qPCR experiments could only be correctly interpreted if suitable reference genes are used in the analysis. The lack of available studies on the identification of candidate reference genes in dynamic gene expression studies necessitates the identification and the verification of a suitable gene set for the analysis of transient gene expression response. Principal Findings In this study, a candidate reference gene set for RT-qPCR analysis of dynamic transcriptional changes in Saccharomyces cerevisiae was determined using 31 different publicly available time series transcriptome datasets. Ten of the twelve candidates (TPI1, FBA1, CCW12, CDC19, ADH1, PGK1, GCN4, PDC1, RPS26A and ARF1) we identified were not previously reported as potential reference genes. Our method also identified the commonly used reference genes ACT1 and TDH3. The most stable reference genes from this pool were determined as TPI1, FBA1, CDC19 and ACT1 in response to a perturbation in the amount of available glucose and as FBA1, TDH3, CCW12 and ACT1 in response to a perturbation in the amount of available ammonium. The use of these newly proposed gene sets outperformed the use of common reference genes in the determination of dynamic transcriptional response of the target genes, HAP4 and MEP2, in response to relaxation from glucose and ammonium limitations, respectively. Conclusions A candidate reference gene set to be used in dynamic real-time RT-qPCR expression profiling in yeast was proposed for the first time in the present study. Suitable pools of stable reference genes to be used under different experimental conditions could be selected from this candidate set in order to successfully determine the expression profiles for the genes of interest. PMID:22675547
Evaluation of Suitable Reference Genes for Normalization of qPCR Gene Expression Studies in Brinjal (Solanum melongena L.) During Fruit Developmental Stages.

PubMed

Kanakachari, Mogilicherla; Solanke, Amolkumar U; Prabhakaran, Narayanasamy; Ahmad, Israr; Dhandapani, Gurusamy; Jayabalan, Narayanasamy; Kumar, Polumetla Ananda

2016-02-01

Brinjal/eggplant/aubergine is one of the major solanaceous vegetable crops. Recent availability of genome information greatly facilitates the fundamental research on brinjal. Gene expression patterns during different stages of fruit development can provide clues towards the understanding of its biological functions. Quantitative real-time PCR (qPCR) has become one of the most widely used methods for rapid and accurate quantification of gene expression. However, its success depends on the use of a suitable reference gene for data normalization. For qPCR analysis, a single reference gene is not universally suitable for all experiments. Therefore, reference gene validation is a crucial step. Suitable reference genes for qPCR analysis of brinjal fruit development have not been investigated so far. In this study, we have selected 21 candidate reference genes from the Brinjal (Solanum melongena) Plant Gene Indices database (compbio.dfci.harvard.edu/tgi/plant.html) and studied their expression profiles by qPCR during six different fruit developmental stages (0, 5, 10, 20, 30, and 50 days post anthesis) along with leaf samples of the Pusa Purple Long (PPL) variety. To evaluate the stability of gene expression, geNorm and NormFinder analytical softwares were used. geNorm identified SAND (SAND family protein) and TBP (TATA binding protein) as the best pairs of reference genes in brinjal fruit development. The results showed that for brinjal fruit development, individual or a combination of reference genes should be selected for data normalization. NormFinder identified Expressed gene (expressed sequence) as the best single reference gene in brinjal fruit development. In this study, we have identified and validated for the first time reference genes to provide accurate transcript normalization and quantification at various fruit developmental stages of brinjal which can also be useful for gene expression studies in other Solanaceae plant species.

Selection of reference genes for tissue/organ samples on day 3 fifth-instar larvae in silkworm, Bombyx mori.

PubMed

Wang, Genhong; Chen, Yanfei; Zhang, Xiaoying; Bai, Bingchuan; Yan, Hao; Qin, Daoyuan; Xia, Qingyou

2018-06-01

The silkworm, Bombyx mori, is one of the world's most economically important insect. Surveying variations in gene expression among multiple tissue/organ samples will provide clues for gene function assignments and will be helpful for identifying genes related to economic traits or specific cellular processes. To ensure their accuracy, commonly used gene expression quantification methods require a set of stable reference genes for data normalization. In this study, 24 candidate reference genes were assessed in 10 tissue/organ samples of day 3 fifth-instar B. mori larvae using geNorm and NormFinder. The results revealed that, using the combination of the expression of BGIBMGA003186 and BGIBMGA008209 was the optimum choice for normalizing the expression data of the B. mori tissue/organ samples. The most stable gene, BGIBMGA003186, is recommended if just one reference gene is used. Moreover, the commonly used reference gene encoding cytoplasmic actin was the least appropriate reference gene of the samples investigated. The reliability of the selected reference genes was further confirmed by evaluating the expression profiles of two cathepsin genes. Our results may be useful for future studies involving the quantification of relative gene expression levels of different tissue/organ samples in B. mori. © 2018 Wiley Periodicals, Inc.
Candidate Reference Genes Selection and Application for RT-qPCR Analysis in Kenaf with Cytoplasmic Male Sterility Background

PubMed Central

Zhou, Bujin; Chen, Peng; Khan, Aziz; Zhao, Yanhong; Chen, Lihong; Liu, Dongmei; Liao, Xiaofang; Kong, Xiangjun; Zhou, Ruiyang

2017-01-01

Cytoplasmic male sterility (CMS) is a maternally inherited trait that results in the production of dysfunctional pollen. Based on reliable reference gene-normalized real-time quantitative PCR (RT-qPCR) data, examining gene expression profile can provide valuable information on the molecular mechanism of kenaf CMS. However, studies have not been conducted regarding selection of reference genes for normalizing RT-qPCR data in the CMS and maintainer lines of kenaf crop. Therefore, we studied 10 candidate reference genes (ACT3, ELF1A, G6PD, PEPKR1, TUB, TUA, CYP, GAPDH, H3, and 18S) to assess their expression stability at three stages of pollen development in CMS line 722A and maintainer line 722B of kenaf. Five computational statistical approaches (GeNorm, NormFinder, ΔCt, BestKeeper, and RefFinder) were used to evaluate the expression stability levels of these genes. According to RefFinder and GeNorm, the combination of TUB, CYP, and PEPKR1 was identified as an internal control for the accurate normalization across all sample set, which was further confirmed by validating the expression of HcPDIL5-2a. Furthermore, the combination of TUB, CYP, and PEPKR1 was used to differentiate the expression pattern of five mitochondria F1F0-ATPase subunit genes (atp1, atp4, atp6, atp8, and atp9) by RT-qPCR during pollen development in CMS line 722A and maintainer line 722B. We found that atp1, atp6, and atp9 exhibited significantly different expression patterns during pollen development in line 722A compared with line 722B. This is the first systematic study of reference genes selection for CMS and will provide useful information for future research on the gene expressions and molecular mechanisms underlying CMS in kenaf. PMID:28919905
Assessment of reference gene stability in Rice stripe virus and Rice black streaked dwarf virus infection rice by quantitative Real-time PCR.

PubMed

Fang, Peng; Lu, Rongfei; Sun, Feng; Lan, Ying; Shen, Wenbiao; Du, Linlin; Zhou, Yijun; Zhou, Tong

2015-10-24

Stably expressed reference gene(s) normalization is important for the understanding of gene expression patterns by quantitative Real-time PCR (RT-qPCR), particularly for Rice stripe virus (RSV) and Rice black streaked dwarf virus (RBSDV) that caused seriously damage on rice plants in China and Southeast Asia. The expression of fourteen common used reference genes of Oryza sativa L. were evaluated by RT-qPCR in RSV and RBSDV infected rice plants. Suitable normalization reference gene(s) were identified by geNorm and NormFinder algorithms. UBQ 10 + GAPDH and UBC + Actin1 were identified as suitable reference genes for RT-qPCR normalization under RSV and RBSDV infection, respectively. When using multiple reference genes, the expression patterns of OsPRIb and OsWRKY, two virus resistance genes, were approximately similar with that reported previously. Comparatively, by using single reference gene (TIP41-Like), a weaker inducible response was observed. We proposed that the combination of two reference genes could obtain more accurate and reliable normalization of RT-qPCR results in RSV- and RBSDV-infected plants. This work therefore sheds light on establishing a standardized RT-qPCR procedure in RSV- and RBSDV-infected rice plants, and might serve as an important point for discovering complex regulatory networks and identifying genes relevant to biological processes or implicated in virus.
Identification and validation of suitable reference genes for RT-qPCR analysis in mouse testis development.

PubMed

Gong, Zu-Kang; Wang, Shuang-Jie; Huang, Yong-Qi; Zhao, Rui-Qiang; Zhu, Qi-Fang; Lin, Wen-Zhen

2014-12-01

RT-qPCR is a commonly used method for evaluating gene expression; however, its accuracy and reliability are dependent upon the choice of appropriate reference gene(s), and there is limited information available on suitable reference gene(s) that can be used in mouse testis at different stages. In this study, using the RT-qPCR method, we investigated the expression variations of six reference genes representing different functional classes (Actb, Gapdh, Ppia, Tbp, Rps29, Hprt1) in mice testis during embryonic and postnatal development. The expression stabilities of putative reference genes were evaluated using five algorithms: geNorm, NormFinder, Bestkeeper, the comparative delta C(t) method and integrated tool RefFinder. Analysis of the results showed that Ppia, Gapdh and Actb were identified as the most stable genes and the geometric mean of Ppia, Gapdh and Actb constitutes an appropriate normalization factor for gene expression studies. The mRNA expression of AT1 as a test gene of interest varied depending upon which of the reference gene(s) was used as an internal control(s). This study suggested that Ppia, Gapdh and Actb are suitable reference genes among the six genes used for RT-qPCR normalization and provide crucial information for transcriptional analyses in future studies of gene expression in the developing mouse testis.
Deep Sequencing of Urinary RNAs for Bladder Cancer Molecular Diagnostics.

PubMed

Sin, Mandy L Y; Mach, Kathleen E; Sinha, Rahul; Wu, Fan; Trivedi, Dharati R; Altobelli, Emanuela; Jensen, Kristin C; Sahoo, Debashis; Lu, Ying; Liao, Joseph C

2017-07-15

Purpose: The majority of bladder cancer patients present with localized disease and are managed by transurethral resection. However, the high rate of recurrence necessitates lifetime cystoscopic surveillance. Developing a sensitive and specific urine-based test would significantly improve bladder cancer screening, detection, and surveillance. Experimental Design: RNA-seq was used for biomarker discovery to directly assess the gene expression profile of exfoliated urothelial cells in urine derived from bladder cancer patients ( n = 13) and controls ( n = 10). Eight bladder cancer specific and 3 reference genes identified by RNA-seq were quantitated by qPCR in a training cohort of 102 urine samples. A diagnostic model based on the training cohort was constructed using multiple logistic regression. The model was further validated in an independent cohort of 101 urines. Results: A total of 418 genes were found to be differentially expressed between bladder cancer and controls. Validation of a subset of these genes was used to construct an equation for computing a probability of bladder cancer score (P BC ) based on expression of three markers ( ROBO1, WNT5A , and CDC42BPB ). Setting P BC = 0.45 as the cutoff for a positive test, urine testing using the three-marker panel had overall 88% sensitivity and 92% specificity in the training cohort. The accuracy of the three-marker panel in the independent validation cohort yielded an AUC of 0.87 and overall 83% sensitivity and 89% specificity. Conclusions: Urine-based molecular diagnostics using this three-marker signature could provide a valuable adjunct to cystoscopy and may lead to a reduction of unnecessary procedures for bladder cancer diagnosis. Clin Cancer Res; 23(14); 3700-10. ©2017 AACR . ©2017 American Association for Cancer Research.
Selection of reference genes for RT-qPCR analysis in a predatory biological control agent, Coleomegilla maculata (Coleoptera: Coccinellidae).

PubMed

Yang, Chunxiao; Pan, Huipeng; Noland, Jeffrey Edward; Zhang, Deyong; Zhang, Zhanhong; Liu, Yong; Zhou, Xuguo

2015-12-10

Reverse transcriptase-quantitative polymerase chain reaction (RT-qPCR) is a reliable technique for quantifying gene expression across various biological processes, of which requires a set of suited reference genes to normalize the expression data. Coleomegilla maculata (Coleoptera: Coccinellidae), is one of the most extensively used biological control agents in the field to manage arthropod pest species. In this study, expression profiles of 16 housekeeping genes selected from C. maculata were cloned and investigated. The performance of these candidates as endogenous controls under specific experimental conditions was evaluated by dedicated algorithms, including geNorm, Normfinder, BestKeeper, and ΔCt method. In addition, RefFinder, a comprehensive platform integrating all the above-mentioned algorithms, ranked the overall stability of these candidate genes. As a result, various sets of suitable reference genes were recommended specifically for experiments involving different tissues, developmental stages, sex, and C. maculate larvae treated with dietary double stranded RNA. This study represents the critical first step to establish a standardized RT-qPCR protocol for the functional genomics research in a ladybeetle C. maculate. Furthermore, it lays the foundation for conducting ecological risk assessment of RNAi-based gene silencing biotechnologies on non-target organisms; in this case, a key predatory biological control agent.
An integrated and comparative approach towards identification, characterization and functional annotation of candidate genes for drought tolerance in sorghum (Sorghum bicolor (L.) Moench).

PubMed

Woldesemayat, Adugna Abdi; Van Heusden, Peter; Ndimba, Bongani K; Christoffels, Alan

2017-12-22

Drought is the most disastrous abiotic stress that severely affects agricultural productivity worldwide. Understanding the biological basis of drought-regulated traits, requires identification and an in-depth characterization of genetic determinants using model organisms and high-throughput technologies. However, studies on drought tolerance have generally been limited to traditional candidate gene approach that targets only a single gene in a pathway that is related to a trait. In this study, we used sorghum, one of the model crops that is well adapted to arid regions, to mine genes and define determinants for drought tolerance using drought expression libraries and RNA-seq data. We provide an integrated and comparative in silico candidate gene identification, characterization and annotation approach, with an emphasis on genes playing a prominent role in conferring drought tolerance in sorghum. A total of 470 non-redundant functionally annotated drought responsive genes (DRGs) were identified using experimental data from drought responses by employing pairwise sequence similarity searches, pathway and interpro-domain analysis, expression profiling and orthology relation. Comparison of the genomic locations between these genes and sorghum quantitative trait loci (QTLs) showed that 40% of these genes were co-localized with QTLs known for drought tolerance. The genome reannotation conducted using the Program to Assemble Spliced Alignment (PASA), resulted in 9.6% of existing single gene models being updated. In addition, 210 putative novel genes were identified using AUGUSTUS and PASA based analysis on expression dataset. Among these, 50% were single exonic, 69.5% represented drought responsive and 5.7% were complete gene structure models. Analysis of biochemical metabolism revealed 14 metabolic pathways that are related to drought tolerance and also had a strong biological network, among categories of genes involved. Identification of these pathways, signifies the interplay of biochemical reactions that make up the metabolic network, constituting fundamental interface for sorghum defence mechanism against drought stress. This study suggests untapped natural variability in sorghum that could be used for developing drought tolerance. The data presented here, may be regarded as an initial reference point in functional and comparative genomics in the Gramineae family.
Stability evaluation of reference genes for gene expression analysis by RT-qPCR in soybean under different conditions.

PubMed

Wan, Qiao; Chen, Shuilian; Shan, Zhihui; Yang, Zhonglu; Chen, Limiao; Zhang, Chanjuan; Yuan, Songli; Hao, Qinnan; Zhang, Xiaojuan; Qiu, Dezhen; Chen, Haifeng; Zhou, Xinan

2017-01-01

Real-time quantitative reverse transcription PCR is a sensitive and widely used technique to quantify gene expression. To achieve a reliable result, appropriate reference genes are highly required for normalization of transcripts in different samples. In this study, 9 previously published reference genes (60S, Fbox, ELF1A, ELF1B, ACT11, TUA5, UBC4, G6PD, CYP2) of soybean [Glycine max (L.) Merr.] were selected. The expression stability of the 9 genes was evaluated under conditions of biotic stress caused by infection with soybean mosaic virus, nitrogen stress, across different cultivars and developmental stages. ΔCt and geNorm algorithms were used to evaluate and rank the expression stability of the 9 reference genes. Results obtained from two algorithms showed high consistency. Moreover, results of pairwise variation showed that two reference genes were sufficient to normalize the expression levels of target genes under each experimental setting. For virus infection, ELF1A and ELF1B were the most stable reference genes for accurate normalization. For different developmental stages, Fbox and G6PD had the highest expression stability between two soybean cultivars (Tanlong No. 1 and Tanlong No. 2). ELF1B and ACT11 were identified as the most stably expressed reference genes both under nitrogen stress and among different cultivars. The results showed that none of the candidate reference genes were uniformly expressed at different conditions, and selecting appropriate reference genes was pivotal for gene expression studies with particular condition and tissue. The most stable combination of genes identified in this study will help to achieve more accurate and reliable results in a wide variety of samples in soybean.
Identification and validation of superior reference gene for gene expression normalization via RT-qPCR in staminate and pistillate flowers of Jatropha curcas - A biodiesel plant.

PubMed

Karuppaiya, Palaniyandi; Yan, Xiao-Xue; Liao, Wang; Wu, Jun; Chen, Fang; Tang, Lin

2017-01-01

Physic nut (Jatropha curcas L) seed oil is a natural resource for the alternative production of fossil fuel. Seed oil production is mainly depended on seed yield, which was restricted by the low ratio of staminate flowers to pistillate flowers. Further, the mechanism of physic nut flower sex differentiation has not been fully understood yet. Quantitative Real Time-Polymerase Chain Reaction is a reliable and widely used technique to quantify the gene expression pattern in biological samples. However, for accuracy of qRT-PCR, appropriate reference gene is highly desirable to quantify the target gene level. Hence, the present study was aimed to identify the stable reference genes in staminate and pistillate flowers of J. curcas. In this study, 10 candidate reference genes were selected and evaluated for their expression stability in staminate and pistillate flowers, and their stability was validated by five different algorithms (ΔCt, BestKeeper, NormFinder, GeNorm and RefFinder). Resulting, TUB and EF found to be the two most stably expressed reference for staminate flower; while GAPDH1 and EF found to be the most stably expressed reference gene for pistillate flowers. Finally, RT-qPCR assays of target gene AGAMOUS using the identified most stable reference genes confirmed the reliability of selected reference genes in different stages of flower development. AGAMOUS gene expression levels at different stages were further proved by gene copy number analysis. Therefore, the present study provides guidance for selecting appropriate reference genes for analyzing the expression pattern of floral developmental genes in staminate and pistillate flowers of J. curcas.
Identification and validation of superior reference gene for gene expression normalization via RT-qPCR in staminate and pistillate flowers of Jatropha curcas – A biodiesel plant

PubMed Central

Karuppaiya, Palaniyandi; Yan, Xiao-Xue; Liao, Wang; Chen, Fang; Tang, Lin

2017-01-01

Physic nut (Jatropha curcas L) seed oil is a natural resource for the alternative production of fossil fuel. Seed oil production is mainly depended on seed yield, which was restricted by the low ratio of staminate flowers to pistillate flowers. Further, the mechanism of physic nut flower sex differentiation has not been fully understood yet. Quantitative Real Time—Polymerase Chain Reaction is a reliable and widely used technique to quantify the gene expression pattern in biological samples. However, for accuracy of qRT-PCR, appropriate reference gene is highly desirable to quantify the target gene level. Hence, the present study was aimed to identify the stable reference genes in staminate and pistillate flowers of J. curcas. In this study, 10 candidate reference genes were selected and evaluated for their expression stability in staminate and pistillate flowers, and their stability was validated by five different algorithms (ΔCt, BestKeeper, NormFinder, GeNorm and RefFinder). Resulting, TUB and EF found to be the two most stably expressed reference for staminate flower; while GAPDH1 and EF found to be the most stably expressed reference gene for pistillate flowers. Finally, RT-qPCR assays of target gene AGAMOUS using the identified most stable reference genes confirmed the reliability of selected reference genes in different stages of flower development. AGAMOUS gene expression levels at different stages were further proved by gene copy number analysis. Therefore, the present study provides guidance for selecting appropriate reference genes for analyzing the expression pattern of floral developmental genes in staminate and pistillate flowers of J. curcas. PMID:28234941
Bacterial reference genes for gene expression studies by RT-qPCR: survey and analysis.

PubMed

Rocha, Danilo J P; Santos, Carolina S; Pacheco, Luis G C

2015-09-01

The appropriate choice of reference genes is essential for accurate normalization of gene expression data obtained by the method of reverse transcription quantitative real-time PCR (RT-qPCR). In 2009, a guideline called the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) highlighted the importance of the selection and validation of more than one suitable reference gene for obtaining reliable RT-qPCR results. Herein, we searched the recent literature in order to identify the bacterial reference genes that have been most commonly validated in gene expression studies by RT-qPCR (in the first 5 years following publication of the MIQE guidelines). Through a combination of different search parameters with the text mining tool MedlineRanker, we identified 145 unique bacterial genes that were recently tested as candidate reference genes. Of these, 45 genes were experimentally validated and, in most of the cases, their expression stabilities were verified using the software tools geNorm and NormFinder. It is noteworthy that only 10 of these reference genes had been validated in two or more of the studies evaluated. An enrichment analysis using Gene Ontology classifications demonstrated that genes belonging to the functional categories of DNA Replication (GO: 0006260) and Transcription (GO: 0006351) rendered a proportionally higher number of validated reference genes. Three genes in the former functional class were also among the top five most stable genes identified through an analysis of gene expression data obtained from the Pathosystems Resource Integration Center. These results may provide a guideline for the initial selection of candidate reference genes for RT-qPCR studies in several different bacterial species.
Genome-wide identification of suitable zebrafish Danio rerio reference genes for normalization of gene expression data by RT-qPCR.

PubMed

Xu, H; Li, C; Zeng, Q; Agrawal, I; Zhu, X; Gong, Z

2016-06-01

In this study, to systematically identify the most stably expressed genes for internal reference in zebrafish Danio rerio investigations, 37 D. rerio transcriptomic datasets (both RNA sequencing and microarray data) were collected from gene expression omnibus (GEO) database and unpublished data, and gene expression variations were analysed under three experimental conditions: tissue types, developmental stages and chemical treatments. Forty-four putative candidate genes were identified with the c.v. <0·2 from all datasets. Following clustering into different functional groups, 21 genes, in addition to four conventional housekeeping genes (eef1a1l1, b2m, hrpt1l and actb1), were selected from different functional groups for further quantitative real-time (qrt-)PCR validation using 25 RNA samples from different adult tissues, developmental stages and chemical treatments. The qrt-PCR data were then analysed using the statistical algorithm refFinder for gene expression stability. Several new candidate genes showed better expression stability than the conventional housekeeping genes in all three categories. It was found that sep15 and metap1 were the top two stable genes for tissue types, ube2a and tmem50a the top two for different developmental stages, and rpl13a and rp1p0 the top two for chemical treatments. Thus, based on the extensive transcriptomic analyses and qrt-PCR validation, these new reference genes are recommended for normalization of D. rerio qrt-PCR data respectively for the three different experimental conditions. © 2016 The Fisheries Society of the British Isles.
Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes).

PubMed

Dessimoz, Christophe; Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

2011-09-01

Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.
Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)

PubMed Central

Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

2011-01-01

Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references. PMID:21712341
Reference Gene Selection for qPCR Normalization of Kosteletzkya virginica under Salt Stress

PubMed Central

Tang, Xiaoli; Wang, Hongyan; Shao, Chuyang; Shao, Hongbo

2015-01-01

Kosteletzkya virginica (L.) is a newly introduced perennial halophytic plant. Presently, reverse transcription quantitative real-time PCR (qPCR) is regarded as the best choice for analyzing gene expression and its accuracy mainly depends on the reference genes which are used for gene expression normalization. In this study, we employed qPCR to select the most stable reference gene in K. virginica which showed stable expression profiles under our experimental conditions. The candidate reference genes were 18S ribosomal RNA (18SrRNA), β-actin (ACT), α-tubulin (TUA), and elongation factor (EF). We tracked the gene expression profiles of the candidate genes and analyzed their stabilities through BestKeeper, geNorm, and NormFinder software programs. The results of the three programs were identical and 18SrRNA was assessed to be the most stable reference gene in this study. However, TUA was identified to be the most unstable. Our study proved again that the traditional reference genes indeed displayed a certain degree of variations under given experimental conditions. Importantly, our research also provides guidance for selecting most suitable reference genes and lays the foundation for further studies in K. virginica. PMID:26581422
Finding approximate gene clusters with Gecko 3.

PubMed

Winter, Sascha; Jahn, Katharina; Wehner, Stefanie; Kuchenbecker, Leon; Marz, Manja; Stoye, Jens; Böcker, Sebastian

2016-11-16

Gene-order-based comparison of multiple genomes provides signals for functional analysis of genes and the evolutionary process of genome organization. Gene clusters are regions of co-localized genes on genomes of different species. The rapid increase in sequenced genomes necessitates bioinformatics tools for finding gene clusters in hundreds of genomes. Existing tools are often restricted to few (in many cases, only two) genomes, and often make restrictive assumptions such as short perfect conservation, conserved gene order or monophyletic gene clusters. We present Gecko 3, an open-source software for finding gene clusters in hundreds of bacterial genomes, that comes with an easy-to-use graphical user interface. The underlying gene cluster model is intuitive, can cope with low degrees of conservation as well as misannotations and is complemented by a sound statistical evaluation. To evaluate the biological benefit of Gecko 3 and to exemplify our method, we search for gene clusters in a dataset of 678 bacterial genomes using Synechocystis sp. PCC 6803 as a reference. We confirm detected gene clusters reviewing the literature and comparing them to a database of operons; we detect two novel clusters, which were confirmed by publicly available experimental RNA-Seq data. The computational analysis is carried out on a laptop computer in <40 min. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
RNA-Seq reveals complex genetic response to Deepwater Horizon oil release in Fundulus grandis.

PubMed

Garcia, Tzintzuni I; Shen, Yingjia; Crawford, Douglas; Oleksiak, Marjorie F; Whitehead, Andrew; Walter, Ronald B

2012-09-12

The release of oil resulting from the blowout of the Deepwater Horizon (DH) drilling platform was one of the largest in history discharging more than 189 million gallons of oil and subject to widespread application of oil dispersants. This event impacted a wide range of ecological habitats with a complex mix of pollutants whose biological impact is still not yet fully understood. To better understand the effects on a vertebrate genome, we studied gene expression in the salt marsh minnow Fundulus grandis, which is local to the northern coast of the Gulf of Mexico and is a sister species of the ecotoxicological model Fundulus heteroclitus. To assess genomic changes, we quantified mRNA expression using high throughput sequencing technologies (RNA-Seq) in F. grandis populations in the marshes and estuaries impacted by DH oil release. This application of RNA-Seq to a non-model, wild, and ecologically significant organism is an important evaluation of the technology to quickly assess similar events in the future. Our de novo assembly of RNA-Seq data produced a large set of sequences which included many duplicates and fragments. In many cases several of these could be associated with a common reference sequence using blast to query a reference database. This reduced the set of significant genes to 1,070 down-regulated and 1,251 up-regulated genes. These genes indicate a broad and complex genomic response to DH oil exposure including the expected AHR-mediated response and CYP genes. In addition a response to hypoxic conditions and an immune response are also indicated. Several genes in the choriogenin family were down-regulated in the exposed group; a response that is consistent with AH exposure. These analyses are in agreement with oligonucleotide-based microarray analyses, and describe only a subset of significant genes with aberrant regulation in the exposed set. RNA-Seq may be successfully applied to feral and extremely polymorphic organisms that do not have an underlying genome sequence assembly to address timely environmental problems. Additionally, the observed changes in a large set of transcript expression levels are indicative of a complex response to the varied petroleum components to which the fish were exposed.
DEFINING THE PLAYERS IN HIGHER-ORDER NETWORKS: PREDICTIVE MODELING FOR REVERSE ENGINEERING FUNCTIONAL INFLUENCE NETWORKS

DOE Office of Scientific and Technical Information (OSTI.GOV)

McDermott, Jason E.; Costa, Michelle N.; Stevens, S.L.

A difficult problem that is currently growing rapidly due to the sharp increase in the amount of high-throughput data available for many systems is that of determining useful and informative causative influence networks. These networks can be used to predict behavior given observation of a small number of components, predict behavior at a future time point, or identify components that are critical to the functioning of the system under particular conditions. In these endeavors incorporating observations of systems from a wide variety of viewpoints can be particularly beneficial, but has often been undertaken with the objective of inferring networks thatmore » are generally applicable. The focus of the current work is to integrate both general observations and measurements taken for a particular pathology, that of ischemic stroke, to provide improved ability to produce useful predictions of systems behavior. A number of hybrid approaches have recently been proposed for network generation in which the Gene Ontology is used to filter or enrich network links inferred from gene expression data through reverse engineering methods. These approaches have been shown to improve the biological plausibility of the inferred relationships determined, but still treat knowledge-based and machine-learning inferences as incommensurable inputs. In this paper, we explore how further improvements may be achieved through a full integration of network inference insights achieved through application of the Gene Ontology and reverse engineering methods with specific reference to the construction of dynamic models of transcriptional regulatory networks. We show that integrating two approaches to network construction, one based on reverse-engineering from conditional transcriptional data, one based on reverse-engineering from in situ hybridization data, and another based on functional associations derived from Gene Ontology, using probabilities can improve results of clustering as evaluated by a predictive model of transcriptional expression levels.« less
Selection of suitable reference genes for gene expression studies in Staphylococcus capitis during growth under erythromycin stress.

PubMed

Cui, Bintao; Smooker, Peter M; Rouch, Duncan A; Deighton, Margaret A

2016-08-01

Accurate and reproducible measurement of gene transcription requires appropriate reference genes, which are stably expressed under different experimental conditions to provide normalization. Staphylococcus capitis is a human pathogen that produces biofilm under stress, such as imposed by antimicrobial agents. In this study, a set of five commonly used staphylococcal reference genes (gyrB, sodA, recA, tuf and rpoB) were systematically evaluated in two clinical isolates of Staphylococcus capitis (S. capitis subspecies urealyticus and capitis, respectively) under erythromycin stress in mid-log and stationary phases. Two public software programs (geNorm and NormFinder) and two manual calculation methods, reference residue normalization (RRN) and relative quantitative (RQ), were applied. The potential reference genes selected by the four algorithms were further validated by comparing the expression of a well-studied biofilm gene (icaA) with phenotypic biofilm formation in S. capitis under four different experimental conditions. The four methods differed considerably in their ability to predict the most suitable reference gene or gene combination for comparing icaA expression under different conditions. Under the conditions used here, the RQ method provided better selection of reference genes than the other three algorithms; however, this finding needs to be confirmed with a larger number of isolates. This study reinforces the need to assess the stability of reference genes for analysis of target gene expression under different conditions and the use of more than one algorithm in such studies. Although this work was conducted using a specific human pathogen, it emphasizes the importance of selecting suitable reference genes for accurate normalization of gene expression more generally.
Evaluation of reference gene suitability for quantitative expression analysis by quantitative polymerase chain reaction in the mandibular condyle of sheep.

PubMed

Jiang, Xin; Xue, Yang; Zhou, Hongzhi; Li, Shouhong; Zhang, Zongmin; Hou, Rui; Ding, Yuxiang; Hu, Kaijin

2015-10-01

Reference genes are commonly used as a reliable approach to normalize the results of quantitative polymerase chain reaction (qPCR), and to reduce errors in the relative quantification of gene expression. Suitable reference genes belonging to numerous functional classes have been identified for various types of species and tissue. However, little is currently known regarding the most suitable reference genes for bone, specifically for the sheep mandibular condyle. Sheep are important for the study of human bone diseases, particularly for temporomandibular diseases. The present study aimed to identify a set of reference genes suitable for the normalization of qPCR data from the mandibular condyle of sheep. A total of 12 reference genes belonging to various functional classes were selected, and the expression stability of the reference genes was determined in both the normal and fractured area of the sheep mandibular condyle. RefFinder, which integrates the following currently available computational algorithms: geNorm, NormFinder, BestKeeper, and the comparative ΔCt method, was used to compare and rank the candidate reference genes. The results obtained from the four methods demonstrated a similar trend: RPL19, ACTB, and PGK1 were the most stably expressed reference genes in the sheep mandibular condyle. As determined by RefFinder comprehensive analysis, the results of the present study suggested that RPL19 is the most suitable reference gene for studies associated with the sheep mandibular condyle. In addition, ACTB and PGK1 may be considered suitable alternatives.

Expression Profiling in Bemisia tabaci under Insecticide Treatment: Indicating the Necessity for Custom Reference Gene Selection

PubMed Central

Zhou, Xuguo; Gao, Xiwu

2014-01-01

Finding a suitable reference gene is the key for qRT-PCR analysis. However, none of the reference gene discovered thus far can be utilized universally under various biotic and abiotic experimental conditions. In this study, we further examine the stability of candidate reference genes under a single abiotic factor, insecticide treatment. After being exposed to eight commercially available insecticides, which belong to five different classes, the expression profiles of eight housekeeping genes in the sweetpotato whitefly, Bemisia tabaci, one of the most invasive and destructive pests in the world, were investigated using qRT-PCR analysis. In summary, elongation factor 1α (EF1α), α-tubulin (TUB1α) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH) were identified as the most stable reference genes under the insecticide treatment. The initial assessment of candidate reference genes was further validated with the expression of two target genes, a P450 (Cyp6cm1) and a glutathione S-transferase (GST). However, ranking of reference genes varied substantially among intra- and inter-classes of insecticides. These combined data strongly suggested the necessity of conducting custom reference gene selection designed for each and every experimental condition, even when examining the same abiotic or biotic factor. PMID:24498122
Identification of stable reference genes for quantitative PCR in cells derived from chicken lymphoid organs.

PubMed

Borowska, D; Rothwell, L; Bailey, R A; Watson, K; Kaiser, P

2016-02-01

Quantitative polymerase chain reaction (qPCR) is a powerful technique for quantification of gene expression, especially genes involved in immune responses. Although qPCR is a very efficient and sensitive tool, variations in the enzymatic efficiency, quality of RNA and the presence of inhibitors can lead to errors. Therefore, qPCR needs to be normalised to obtain reliable results and allow comparison. The most common approach is to use reference genes as internal controls in qPCR analyses. In this study, expression of seven genes, including β-actin (ACTB), β-2-microglobulin (B2M), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), β-glucuronidase (GUSB), TATA box binding protein (TBP), α-tubulin (TUBAT) and 28S ribosomal RNA (r28S), was determined in cells isolated from chicken lymphoid tissues and stimulated with three different mitogens. The stability of the genes was measured using geNorm, NormFinder and BestKeeper software. The results from both geNorm and NormFinder were that the three most stably expressed genes in this panel were TBP, GAPDH and r28S. BestKeeper did not generate clear answers because of the highly heterogeneous sample set. Based on these data we will include TBP in future qPCR normalisation. The study shows the importance of appropriate reference gene normalisation in other tissues before qPCR analysis. Copyright © 2016 Elsevier B.V. All rights reserved.
Reranking candidate gene models with cross-species comparison for improved gene prediction

PubMed Central

Liu, Qian; Crammer, Koby; Pereira, Fernando CN; Roos, David S

2008-01-01

Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models. PMID:18854050
Careful Selection of Reference Genes Is Required for Reliable Performance of RT-qPCR in Human Normal and Cancer Cell Lines

PubMed Central

Jacob, Francis; Guertler, Rea; Naim, Stephanie; Nixdorf, Sheri; Fedier, André; Hacker, Neville F.; Heinzelmann-Schwarz, Viola

2013-01-01

Reverse Transcription - quantitative Polymerase Chain Reaction (RT-qPCR) is a standard technique in most laboratories. The selection of reference genes is essential for data normalization and the selection of suitable reference genes remains critical. Our aim was to 1) review the literature since implementation of the MIQE guidelines in order to identify the degree of acceptance; 2) compare various algorithms in their expression stability; 3) identify a set of suitable and most reliable reference genes for a variety of human cancer cell lines. A PubMed database review was performed and publications since 2009 were selected. Twelve putative reference genes were profiled in normal and various cancer cell lines (n = 25) using 2-step RT-qPCR. Investigated reference genes were ranked according to their expression stability by five algorithms (geNorm, Normfinder, BestKeeper, comparative ΔCt, and RefFinder). Our review revealed 37 publications, with two thirds patient samples and one third cell lines. qPCR efficiency was given in 68.4% of all publications, but only 28.9% of all studies provided RNA/cDNA amount and standard curves. GeNorm and Normfinder algorithms were used in 60.5% in combination. In our selection of 25 cancer cell lines, we identified HSPCB, RRN18S, and RPS13 as the most stable expressed reference genes. In the subset of ovarian cancer cell lines, the reference genes were PPIA, RPS13 and SDHA, clearly demonstrating the necessity to select genes depending on the research focus. Moreover, a cohort of at least three suitable reference genes needs to be established in advance to the experiments, according to the guidelines. For establishing a set of reference genes for gene normalization we recommend the use of ideally three reference genes selected by at least three stability algorithms. The unfortunate lack of compliance to the MIQE guidelines reflects that these need to be further established in the research community. PMID:23554992
Validation of Reference Genes in mRNA Expression Analysis Applied to the Study of Asthma.

PubMed

Segundo-Val, Ignacio San; Sanz-Lozano, Catalina S

2016-01-01

The quantitative Polymerase Chain Reaction is the most used technique for the study of gene expression. To correct putative experimental errors of this technique is necessary normalizing the expression results of the gene of interest with the obtained for reference genes. Here, we describe an example of the process to select reference genes. In this particular case, we select reference genes for expression studies in the peripheral blood mononuclear cells of asthmatic patients.
Transcriptome-wide selection of a reliable set of reference genes for gene expression studies in potato cyst nematodes (Globodera spp.).

PubMed

Sabeh, Michael; Duceppe, Marc-Olivier; St-Arnaud, Marc; Mimee, Benjamin

2018-01-01

Relative gene expression analyses by qRT-PCR (quantitative reverse transcription PCR) require an internal control to normalize the expression data of genes of interest and eliminate the unwanted variation introduced by sample preparation. A perfect reference gene should have a constant expression level under all the experimental conditions. However, the same few housekeeping genes selected from the literature or successfully used in previous unrelated experiments are often routinely used in new conditions without proper validation of their stability across treatments. The advent of RNA-Seq and the availability of public datasets for numerous organisms are opening the way to finding better reference genes for expression studies. Globodera rostochiensis is a plant-parasitic nematode that is particularly yield-limiting for potato. The aim of our study was to identify a reliable set of reference genes to study G. rostochiensis gene expression. Gene expression levels from an RNA-Seq database were used to identify putative reference genes and were validated with qRT-PCR analysis. Three genes, GR, PMP-3, and aaRS, were found to be very stable within the experimental conditions of this study and are proposed as reference genes for future work.
Conditions for success of engineered underdominance gene drive systems.

PubMed

Edgington, Matthew P; Alphey, Luke S

2017-10-07

Engineered underdominance is one of a number of different gene drive strategies that have been proposed for the genetic control of insect vectors of disease. Here we model a two-locus engineered underdominance based gene drive system that is based on the concept of mutually suppressing lethals. In such a system two genetic constructs are introduced, each possessing a lethal element and a suppressor of the lethal at the other locus. Specifically, we formulate and analyse a population genetics model of this system to assess when different combinations of release strategies (i.e. single or multiple releases of both sexes or males only) and genetic systems (i.e. bisex lethal or female-specific lethal elements and different strengths of suppressors) will give population replacement or fail to do so. We anticipate that results presented here will inform the future design of engineered underdominance gene drive systems as well as providing a point of reference regarding release strategies for those looking to test such a system. Our discussion is framed in the context of genetic control of insect vectors of disease. One of several serious threats in this context are Aedes aegypti mosquitoes as they are the primary vectors of dengue viruses. However, results are also applicable to Ae. aegypti as vectors of Zika, yellow fever and chikungunya viruses and also to the control of a number of other insect species and thereby of insect-vectored pathogens. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
New alternatives for reference evapotranspiration estimation in West Africa using limited weather data and ancillary data supply strategies.

NASA Astrophysics Data System (ADS)

Landeras, Gorka; Bekoe, Emmanuel; Ampofo, Joseph; Logah, Frederick; Diop, Mbaye; Cisse, Madiama; Shiri, Jalal

2018-05-01

Accurate estimation of reference evapotranspiration ( ET 0 ) is essential for the computation of crop water requirements, irrigation scheduling, and water resources management. In this context, having a battery of alternative local calibrated ET 0 estimation methods is of great interest for any irrigation advisory service. The development of irrigation advisory services will be a major breakthrough for West African agriculture. In the case of many West African countries, the high number of meteorological inputs required by the Penman-Monteith equation has been indicated as constraining. The present paper investigates for the first time in Ghana, the estimation ability of artificial intelligence-based models (Artificial Neural Networks (ANNs) and Gene Expression Programing (GEPs)), and ancillary/external approaches for modeling reference evapotranspiration ( ET 0 ) using limited weather data. According to the results of this study, GEPs have emerged as a very interesting alternative for ET 0 estimation at all the locations of Ghana which have been evaluated in this study under different scenarios of meteorological data availability. The adoption of ancillary/external approaches has been also successful, moreover in the southern locations. The interesting results obtained in this study using GEPs and some ancillary approaches could be a reference for future studies about ET 0 estimation in West Africa.
An evaluation of potential reference genes for stability of expression in two salmonid cell lines after infection with either Piscirickettsia salmonis or IPNV

PubMed Central

2010-01-01

Background Due to the limited number of species specific antibodies against fish proteins, differential gene expression analyses are vital for the study of host immune responses. Quantitative real-time reverse transcription PCR (qRT-PCR) is one of the most powerful tools for this purpose. Nevertheless, the accuracy of the method will depend on the careful selection of genes whose expression are stable and can be used as internal controls for a particular experimental setting. Findings The expression stability of five commonly used housekeeping genes [beta-actin (ACTB), elongation factor 1-alpha (EF1A), ubiquitin (UBQ), glyceraldehyd-3-phosphate dehydrogenase (GAPDH) and tubulin alpha (TUBA)] were monitored in salmonid cell lines CHSE-214 and RTS11 after infection with two of the most fastidious fish pathogens, the facultative bacterium Piscirickettsia salmonis and the aquabirnavirus IPNV (Infectious Pancreatic Necrosis Virus). After geNorm analysis, UBQ and EF1A appeared as the most stable, although EF1A was slightly upregulated at late stages of P. salmonis infection in RTS11. ACTB instead, showed a good performance in each case, being always considered within the three most stable genes of the panel. In contrast, infection-dependent differential regulation of GAPDH and TUBA was also demonstrated. Conclusion Based on the data presented here with the cell culture models CHSE-214 and RTS11, we suggest the initial choice of UBQ, ACTB and EF1A as reference genes in qRT-PCR assays for studying the effect of P. salmonis and IPNV on the host immune response. PMID:20398263
Evaluation and Selection of Appropriate Reference Genes for Real-Time Quantitative PCR Analysis of Gene Expression in Nile Tilapia (Oreochromis niloticus) during Vaccination and Infection

PubMed Central

Wang, Erlong; Wang, Kaiyu; Chen, Defang; Wang, Jun; He, Yang; Long, Bo; Yang, Lei; Yang, Qian; Geng, Yi; Huang, Xiaoli; Ouyang, Ping; Lai, Weimin

2015-01-01

qPCR as a powerful and attractive methodology has been widely applied to aquaculture researches for gene expression analyses. However, the suitable reference selection is critical for normalizing target genes expression in qPCR. In the present study, six commonly used endogenous controls were selected as candidate reference genes to evaluate and analyze their expression levels, stabilities and normalization to immune-related gene IgM expression during vaccination and infection in spleen of tilapia with RefFinder and GeNorm programs. The results showed that all of these candidate reference genes exhibited transcriptional variations to some extent at different periods. Among them, EF1A was the most stable reference with RefFinder, followed by 18S rRNA, ACTB, UBCE, TUBA and GAPDH respectively and the optimal number of reference genes for IgM normalization under different experiment sets was two with GeNorm. Meanwhile, combination the Cq (quantification cycle) value and the recommended comprehensive ranking of reference genes, EF1A and ACTB, the two optimal reference genes, were used together as reference genes for accurate analysis of immune-related gene expression during vaccination and infection in Nile tilapia with qPCR. Moreover, the highest IgM expression level was at two weeks post-vaccination when normalized to EF1A, 18S rRNA, ACTB, and EF1A together with ACTB compared to one week post-vaccination before normalizing, which was also consistent with the IgM antibody titers detection by ELISA. PMID:25941937
Virtual Genome Walking across the 32 Gb Ambystoma mexicanum genome; assembling gene models and intronic sequence.

PubMed

Evans, Teri; Johnson, Andrew D; Loose, Matthew

2018-01-12

Large repeat rich genomes present challenges for assembly using short read technologies. The 32 Gb axolotl genome is estimated to contain ~19 Gb of repetitive DNA making an assembly from short reads alone effectively impossible. Indeed, this model species has been sequenced to 20× coverage but the reads could not be conventionally assembled. Using an alternative strategy, we have assembled subsets of these reads into scaffolds describing over 19,000 gene models. We call this method Virtual Genome Walking as it locally assembles whole genome reads based on a reference transcriptome, identifying exons and iteratively extending them into surrounding genomic sequence. These assemblies are then linked and refined to generate gene models including upstream and downstream genomic, and intronic, sequence. Our assemblies are validated by comparison with previously published axolotl bacterial artificial chromosome (BAC) sequences. Our analyses of axolotl intron length, intron-exon structure, repeat content and synteny provide novel insights into the genic structure of this model species. This resource will enable new experimental approaches in axolotl, such as ChIP-Seq and CRISPR and aid in future whole genome sequencing efforts. The assembled sequences and annotations presented here are freely available for download from https://tinyurl.com/y8gydc6n . The software pipeline is available from https://github.com/LooseLab/iterassemble .
Relative codon adaptation: a generic codon bias index for prediction of gene expression.

PubMed

Fox, Jesse M; Erill, Ivan

2010-06-01

The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal that how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.
BeeSpace Navigator: exploratory analysis of gene function using semantic indexing of biological literature.

PubMed

Sen Sarma, Moushumi; Arcoleo, David; Khetani, Radhika S; Chee, Brant; Ling, Xu; He, Xin; Jiang, Jing; Mei, Qiaozhu; Zhai, ChengXiang; Schatz, Bruce

2011-07-01

With the rapid decrease in cost of genome sequencing, the classification of gene function is becoming a primary problem. Such classification has been performed by human curators who read biological literature to extract evidence. BeeSpace Navigator is a prototype software for exploratory analysis of gene function using biological literature. The software supports an automatic analogue of the curator process to extract functions, with a simple interface intended for all biologists. Since extraction is done on selected collections that are semantically indexed into conceptual spaces, the curation can be task specific. Biological literature containing references to gene lists from expression experiments can be analyzed to extract concepts that are computational equivalents of a classification such as Gene Ontology, yielding discriminating concepts that differentiate gene mentions from other mentions. The functions of individual genes can be summarized from sentences in biological literature, to produce results resembling a model organism database entry that is automatically computed. Statistical frequency analysis based on literature phrase extraction generates offline semantic indexes to support these gene function services. The website with BeeSpace Navigator is free and open to all; there is no login requirement at www.beespace.illinois.edu for version 4. Materials from the 2010 BeeSpace Software Training Workshop are available at www.beespace.illinois.edu/bstwmaterials.php.
Identification of reference genes and validation for gene expression studies in diverse axolotl (Ambystoma mexicanum) tissues.

PubMed

Guelke, Eileen; Bucan, Vesna; Liebsch, Christina; Lazaridis, Andrea; Radtke, Christine; Vogt, Peter M; Reimers, Kerstin

2015-04-10

For the precise quantitative RT-PCR normalization a set of valid reference genes is obligatory. Moreover have to be taken into concern the experimental conditions as they bias the regulation of reference genes. Up till now, no reference targets have been described for the axolotl (Ambystoma mexicanum). In a search in the public database SalSite for genetic information of the axolotl we identified fourteen presumptive reference genes, eleven of which were further tested for their gene expression stability. This study characterizes the expressional patterns of 11 putative endogenous control genes during axolotl limb regeneration and in an axolotl tissue panel. All 11 reference genes showed variable expression. Strikingly, ACTB was to be found most stable expressed in all comparative tissue groups, so we reason it to be suitable for all different kinds of axolotl tissue-type investigations. Moreover do we suggest GAPDH and RPLP0 as suitable for certain axolotl tissue analysis. When it comes to axolotl limb regeneration, a validated pair of reference genes is ODC and RPLP0. With these findings, new insights into axolotl gene expression profiling might be gained. Copyright © 2015 Elsevier B.V. All rights reserved.
Tunicate mitogenomics and phylogenetics: peculiarities of the Herdmania momus mitochondrial genome and support for the new chordate phylogeny

PubMed Central

2009-01-01

Background Tunicates represent a key metazoan group as the sister-group of vertebrates within chordates. The six complete mitochondrial genomes available so far for tunicates have revealed distinctive features. Extensive gene rearrangements and particularly high evolutionary rates have been evidenced with regard to other chordates. This peculiar evolutionary dynamics has hampered the reconstruction of tunicate phylogenetic relationships within chordates based on mitogenomic data. Results In order to further understand the atypical evolutionary dynamics of the mitochondrial genome of tunicates, we determined the complete sequence of the solitary ascidian Herdmania momus. This genome from a stolidobranch ascidian presents the typical tunicate gene content with 13 protein-coding genes, 2 rRNAs and 24 tRNAs which are all encoded on the same strand. However, it also presents a novel gene arrangement, highlighting the extreme plasticity of gene order observed in tunicate mitochondrial genomes. Probabilistic phylogenetic inferences were conducted on the concatenation of the 13 mitochondrial protein-coding genes from representatives of major metazoan phyla. We show that whereas standard homogeneous amino acid models support an artefactual sister position of tunicates relative to all other bilaterians, the CAT and CAT+BP site- and time-heterogeneous mixture models place tunicates as the sister-group of vertebrates within monophyletic chordates. Moreover, the reference phylogeny indicates that tunicate mitochondrial genomes have experienced a drastic acceleration in their evolutionary rate that equally affects protein-coding and ribosomal-RNA genes. Conclusion This is the first mitogenomic study supporting the new chordate phylogeny revealed by recent phylogenomic analyses. It illustrates the beneficial effects of an increased taxon sampling coupled with the use of more realistic amino acid substitution models for the reconstruction of animal phylogeny. PMID:19922605
Tunicate mitogenomics and phylogenetics: peculiarities of the Herdmania momus mitochondrial genome and support for the new chordate phylogeny.

PubMed

Singh, Tiratha Raj; Tsagkogeorga, Georgia; Delsuc, Frédéric; Blanquart, Samuel; Shenkar, Noa; Loya, Yossi; Douzery, Emmanuel Jp; Huchon, Dorothée

2009-11-17

Tunicates represent a key metazoan group as the sister-group of vertebrates within chordates. The six complete mitochondrial genomes available so far for tunicates have revealed distinctive features. Extensive gene rearrangements and particularly high evolutionary rates have been evidenced with regard to other chordates. This peculiar evolutionary dynamics has hampered the reconstruction of tunicate phylogenetic relationships within chordates based on mitogenomic data. In order to further understand the atypical evolutionary dynamics of the mitochondrial genome of tunicates, we determined the complete sequence of the solitary ascidian Herdmania momus. This genome from a stolidobranch ascidian presents the typical tunicate gene content with 13 protein-coding genes, 2 rRNAs and 24 tRNAs which are all encoded on the same strand. However, it also presents a novel gene arrangement, highlighting the extreme plasticity of gene order observed in tunicate mitochondrial genomes. Probabilistic phylogenetic inferences were conducted on the concatenation of the 13 mitochondrial protein-coding genes from representatives of major metazoan phyla. We show that whereas standard homogeneous amino acid models support an artefactual sister position of tunicates relative to all other bilaterians, the CAT and CAT+BP site- and time-heterogeneous mixture models place tunicates as the sister-group of vertebrates within monophyletic chordates. Moreover, the reference phylogeny indicates that tunicate mitochondrial genomes have experienced a drastic acceleration in their evolutionary rate that equally affects protein-coding and ribosomal-RNA genes. This is the first mitogenomic study supporting the new chordate phylogeny revealed by recent phylogenomic analyses. It illustrates the beneficial effects of an increased taxon sampling coupled with the use of more realistic amino acid substitution models for the reconstruction of animal phylogeny.
RNA-Seq for gene identification and transcript profiling of three Stevia rebaudiana genotypes.

PubMed

Chen, Junwen; Hou, Kai; Qin, Peng; Liu, Hongchang; Yi, Bin; Yang, Wenting; Wu, Wei

2014-07-07

Stevia (Stevia rebaudiana) is an important medicinal plant that yields diterpenoid steviol glycosides (SGs). SGs are currently used in the preparation of medicines, food products and neutraceuticals because of its sweetening property (zero calories and about 300 times sweeter than sugar). Recently, some progress has been made in understanding the biosynthesis of SGs in Stevia, but little is known about the molecular mechanisms underlying this process. Additionally, the genomics of Stevia, a non-model species, remains uncharacterized. The recent advent of RNA-Seq, a next generation sequencing technology, provides an opportunity to expand the identification of Stevia genes through in-depth transcript profiling. We present a comprehensive landscape of the transcriptome profiles of three genotypes of Stevia with divergent SG compositions characterized using RNA-seq. 191,590,282 high-quality reads were generated and then assembled into 171,837 transcripts with an average sequence length of 969 base pairs. A total of 80,160 unigenes were annotated, and 14,211 of the unique sequences were assigned to specific metabolic pathways by the Kyoto Encyclopedia of Genes and Genomes. Gene sequences of all enzymes known to be involved in SG synthesis were examined. A total of 143 UDP-glucosyltransferase (UGT) unigenes were identified, some of which might be involved in SG biosynthesis. The expression patterns of eight of these genes were further confirmed by RT-QPCR. RNA-seq analysis identified candidate genes encoding enzymes responsible for the biosynthesis of SGs in Stevia, a non-model plant without a reference genome. The transcriptome data from this study yielded new insights into the process of SG accumulation in Stevia. Our results demonstrate that RNA-Seq can be successfully used for gene identification and transcript profiling in a non-model species.
Low-level lasers and mRNA levels of reference genes used in Escherichia coli

NASA Astrophysics Data System (ADS)

Teixeira, A. F.; Machado, Y. L. R. C.; Fonseca, A. S.; Mencalha, A. L.

2016-11-01

Low-level lasers are widely used for the treatment of diseases and antimicrobial photodynamic therapy. Reverse transcriptase quantitative polymerase chain reaction (RT-qPCR) is widely used to evaluate mRNA levels and output data from a target gene are commonly relative to a reference mRNA that cannot vary according to treatment. In this study, the level of reference genes from Escherichia coli exposed to red or infrared lasers at different fluences was evaluated. E. coli AB1157 cultures were exposed to red (660 nm) and infrared (808 nm) lasers, incubated (20 min, 37 °C), the total RNA was extracted, and cDNA synthesis was performed to evaluate mRNA levels from arcA, gyrA and rpoA genes by RT-qPCR. Melting curves and agarose gel electrophoresis were carried out to evaluate specific amplification. Data were analyzed by geNorm, NormFinder and BestKeeper. The melting curve and agarose gel electrophoresis showed specific amplification. Although mRNA levels from arcA, gyrA or rpoA genes presented no significant variations trough a traditional statistical analysis, Excel-based tools revealed that these reference genes are not suitable for E. coli cultures exposed to lasers. Our data showed that exposure to low-level red and infrared lasers at different fluences alter the mRNA levels from arcA, gyrA and rpoA in E. coli cells.
Selection of suitable endogenous reference genes for qPCR in kidney and hypothalamus of rats under testosterone influence

PubMed Central

2017-01-01

Real-time quantitative PCR (qPCR) is the most reliable and accurate technique for analyses of gene expression. Endogenous reference genes are being used to normalize qPCR data even though their expression may vary under different conditions and in different tissues. Nonetheless, verification of expression of reference genes in selected studied tissue is essential in order to accurately assess the level of expression of target genes of interest. Therefore, in this study, we attempted to examine six commonly used reference genes in order to identify the gene being expressed most constantly under the influence of testosterone in the kidneys and hypothalamus. The reference genes include glyceraldehyde-3-phosphate dehydrogenase (GAPDH), actin beta (ACTB), beta-2 microglobulin (B2m), hypoxanthine phosphoribosyltransferase 1 (HPRT), peptidylprolylisomerase A (Ppia) and hydroxymethylbilane synthase (Hmbs). The cycle threshold (Ct) value for each gene was determined and data obtained were analyzed using the software programs NormFinder, geNorm, BestKeeper, and rank aggregation. Results showed that Hmbs and Ppia genes were the most stably expressed in the hypothalamus. Meanwhile, in kidneys, Hmbs and GAPDH appeared to be the most constant genes. In conclusion, variations in expression levels of reference genes occur in kidneys and hypothalamus under similar conditions; thus, it is important to verify reference gene levels in these tissues prior to commencing any studies. PMID:28591185
GeneTopics - interpretation of gene sets via literature-driven topic models

PubMed Central

2013-01-01

Background Annotation of a set of genes is often accomplished through comparison to a library of labelled gene sets such as biological processes or canonical pathways. However, this approach might fail if the employed libraries are not up to date with the latest research, don't capture relevant biological themes or are curated at a different level of granularity than is required to appropriately analyze the input gene set. At the same time, the vast biomedical literature offers an unstructured repository of the latest research findings that can be tapped to provide thematic sub-groupings for any input gene set. Methods Our proposed method relies on a gene-specific text corpus and extracts commonalities between documents in an unsupervised manner using a topic model approach. We automatically determine the number of topics summarizing the corpus and calculate a gene relevancy score for each topic allowing us to eliminate non-specific topics. As a result we obtain a set of literature topics in which each topic is associated with a subset of the input genes providing directly interpretable keywords and corresponding documents for literature research. Results We validate our method based on labelled gene sets from the KEGG metabolic pathway collection and the genetic association database (GAD) and show that the approach is able to detect topics consistent with the labelled annotation. Furthermore, we discuss the results on three different types of experimentally derived gene sets, (1) differentially expressed genes from a cardiac hypertrophy experiment in mice, (2) altered transcript abundance in human pancreatic beta cells, and (3) genes implicated by GWA studies to be associated with metabolite levels in a healthy population. In all three cases, we are able to replicate findings from the original papers in a quick and semi-automated manner. Conclusions Our approach provides a novel way of automatically generating meaningful annotations for gene sets that are directly tied to relevant articles in the literature. Extending a general topic model method, the approach introduced here establishes a workflow for the interpretation of gene sets generated from diverse experimental scenarios that can complement the classical approach of comparison to reference gene sets. PMID:24564875

PRGdb: a bioinformatics platform for plant resistance gene analysis

PubMed Central

Sanseverino, Walter; Roma, Guglielmo; De Simone, Marco; Faino, Luigi; Melito, Sara; Stupka, Elia; Frusciante, Luigi; Ercolano, Maria Raffaella

2010-01-01

PRGdb is a web accessible open-source (http://www.prgdb.org) database that represents the first bioinformatic resource providing a comprehensive overview of resistance genes (R-genes) in plants. PRGdb holds more than 16 000 known and putative R-genes belonging to 192 plant species challenged by 115 different pathogens and linked with useful biological information. The complete database includes a set of 73 manually curated reference R-genes, 6308 putative R-genes collected from NCBI and 10463 computationally predicted putative R-genes. Thanks to a user-friendly interface, data can be examined using different query tools. A home-made prediction pipeline called Disease Resistance Analysis and Gene Orthology (DRAGO), based on reference R-gene sequence data, was developed to search for plant resistance genes in public datasets such as Unigene and Genbank. New putative R-gene classes containing unknown domain combinations were discovered and characterized. The development of the PRG platform represents an important starting point to conduct various experimental tasks. The inferred cross-link between genomic and phenotypic information allows access to a large body of information to find answers to several biological questions. The database structure also permits easy integration with other data types and opens up prospects for future implementations. PMID:19906694
Establishment and analysis of a reference transcriptome for Spodoptera frugiperda.

PubMed

Legeai, Fabrice; Gimenez, Sylvie; Duvic, Bernard; Escoubas, Jean-Michel; Gosselin Grenet, Anne-Sophie; Blanc, Florence; Cousserans, François; Séninet, Imène; Bretaudeau, Anthony; Mutuel, Doriane; Girard, Pierre-Alain; Monsempes, Christelle; Magdelenat, Ghislaine; Hilliou, Frédérique; Feyereisen, René; Ogliastro, Mylène; Volkoff, Anne-Nathalie; Jacquin-Joly, Emmanuelle; d'Alençon, Emmanuelle; Nègre, Nicolas; Fournier, Philippe

2014-08-23

Spodoptera frugiperda (Noctuidae) is a major agricultural pest throughout the American continent. The highly polyphagous larvae are frequently devastating crops of importance such as corn, sorghum, cotton and grass. In addition, the Sf9 cell line, widely used in biochemistry for in vitro protein production, is derived from S. frugiperda tissues. Many research groups are using S. frugiperda as a model organism to investigate questions such as plant adaptation, pest behavior or resistance to pesticides. In this study, we constructed a reference transcriptome assembly (Sf_TR2012b) of RNA sequences obtained from more than 35 S. frugiperda developmental time-points and tissue samples. We assessed the quality of this reference transcriptome by annotating a ubiquitous gene family--ribosomal proteins--as well as gene families that have a more constrained spatio-temporal expression and are involved in development, immunity and olfaction. We also provide a time-course of expression that we used to characterize the transcriptional regulation of the gene families studied. We conclude that the Sf_TR2012b transcriptome is a valid reference transcriptome. While its reliability decreases for the detection and annotation of genes under strong transcriptional constraint we still recover a fair percentage of tissue-specific transcripts. That allowed us to explore the spatial and temporal expression of genes and to observe that some olfactory receptors are expressed in antennae and palps but also in other non related tissues such as fat bodies. Similarly, we observed an interesting interplay of gene families involved in immunity between fat bodies and antennae.
Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling

PubMed Central

Shin, Junha; Lee, Insuk

2015-01-01

Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life—Archaea, Bacteria, and Eukaryota—suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co-inheritance analysis within the domains of life will greatly potentiate the use of the expected onslaught of sequenced genomes in the study of molecular pathways in higher eukaryotes. PMID:26394049
Reference gene selection for quantitative real-time PCR in Solanum lycopersicum L. inoculated with the mycorrhizal fungus Rhizophagus irregularis.

PubMed

Fuentes, Alejandra; Ortiz, Javier; Saavedra, Nicolás; Salazar, Luis A; Meneses, Claudio; Arriagada, Cesar

2016-04-01

The gene expression stability of candidate reference genes in the roots and leaves of Solanum lycopersicum inoculated with arbuscular mycorrhizal fungi was investigated. Eight candidate reference genes including elongation factor 1 α (EF1), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), phosphoglycerate kinase (PGK), protein phosphatase 2A (PP2Acs), ribosomal protein L2 (RPL2), β-tubulin (TUB), ubiquitin (UBI) and actin (ACT) were selected, and their expression stability was assessed to determine the most stable internal reference for quantitative PCR normalization in S. lycopersicum inoculated with the arbuscular mycorrhizal fungus Rhizophagus irregularis. The stability of each gene was analysed in leaves and roots together and separated using the geNorm and NormFinder algorithms. Differences were detected between leaves and roots, varying among the best-ranked genes depending on the algorithm used and the tissue analysed. PGK, TUB and EF1 genes showed higher stability in roots, while EF1 and UBI had higher stability in leaves. Statistical algorithms indicated that the GAPDH gene was the least stable under the experimental conditions assayed. Then, we analysed the expression levels of the LePT4 gene, a phosphate transporter whose expression is induced by fungal colonization in host plant roots. No differences were observed when the most stable genes were used as reference genes. However, when GAPDH was used as the reference gene, we observed an overestimation of LePT4 expression. In summary, our results revealed that candidate reference genes present variable stability in S. lycopersicum arbuscular mycorrhizal symbiosis depending on the algorithm and tissue analysed. Thus, reference gene selection is an important issue for obtaining reliable results in gene expression quantification. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
16S rRNA gene-based phylogenetic microarray for simultaneous identification of members of the genus Burkholderia.

PubMed

Schönmann, Susan; Loy, Alexander; Wimmersberger, Céline; Sobek, Jens; Aquino, Catharine; Vandamme, Peter; Frey, Beat; Rehrauer, Hubert; Eberl, Leo

2009-04-01

For cultivation-independent and highly parallel analysis of members of the genus Burkholderia, an oligonucleotide microarray (phylochip) consisting of 131 hierarchically nested 16S rRNA gene-targeted oligonucleotide probes was developed. A novel primer pair was designed for selective amplification of a 1.3 kb 16S rRNA gene fragment of Burkholderia species prior to microarray analysis. The diagnostic performance of the microarray for identification and differentiation of Burkholderia species was tested with 44 reference strains of the genera Burkholderia, Pandoraea, Ralstonia and Limnobacter. Hybridization patterns based on presence/absence of probe signals were interpreted semi-automatically using the novel likelihood-based strategy of the web-tool Phylo- Detect. Eighty-eight per cent of the reference strains were correctly identified at the species level. The evaluated microarray was applied to investigate shifts in the Burkholderia community structure in acidic forest soil upon addition of cadmium, a condition that selected for Burkholderia species. The microarray results were in agreement with those obtained from phylogenetic analysis of Burkholderia 16S rRNA gene sequences recovered from the same cadmiumcontaminated soil, demonstrating the value of the Burkholderia phylochip for determinative and environmental studies.
Validation of reference genes for gene expression studies in soybean aphid, Aphis glycines Matsumura

USDA-ARS?s Scientific Manuscript database

Quantitative real-time PCR (qRT-PCR) is a common tool for quantifying mRNA transcripts. To normalize results, a reference gene is mandatory. Aphis glycines is a significant soybean pest, yet gene expression and functional genomics studies are hindered by a lack of stable reference genes. We evalu...
Identification of Suitable Reference Genes for Investigating Gene Expression in Anterior Cruciate Ligament Injury by Using Reverse Transcription-Quantitative PCR.

PubMed

Leal, Mariana Ferreira; Astur, Diego Costa; Debieux, Pedro; Arliani, Gustavo Gonçalves; Silveira Franciozi, Carlos Eduardo; Loyola, Leonor Casilla; Andreoli, Carlos Vicente; Smith, Marília Cardoso; Pochini, Alberto de Castro; Ejnisman, Benno; Cohen, Moises

2015-01-01

The anterior cruciate ligament (ACL) is one of the most frequently injured structures during high-impact sporting activities. Gene expression analysis may be a useful tool for understanding ACL tears and healing failure. Reverse transcription-quantitative polymerase chain reaction (RT-qPCR) has emerged as an effective method for such studies. However, this technique requires the use of suitable reference genes for data normalization. Here, we evaluated the suitability of six reference genes (18S, ACTB, B2M, GAPDH, HPRT1, and TBP) by using ACL samples of 39 individuals with ACL tears (20 with isolated ACL tears and 19 with ACL tear and combined meniscal injury) and of 13 controls. The stability of the candidate reference genes was determined by using the NormFinder, geNorm, BestKeeper DataAssist, and RefFinder software packages and the comparative ΔCt method. ACTB was the best single reference gene and ACTB+TBP was the best gene pair. The GenEx software showed that the accumulated standard deviation is reduced when a larger number of reference genes is used for gene expression normalization. However, the use of a single reference gene may not be suitable. To identify the optimal combination of reference genes, we evaluated the expression of FN1 and PLOD1. We observed that at least 3 reference genes should be used. ACTB+HPRT1+18S is the best trio for the analyses involving isolated ACL tears and controls. Conversely, ACTB+TBP+18S is the best trio for the analyses involving (1) injured ACL tears and controls, and (2) ACL tears of patients with meniscal tears and controls. Therefore, if the gene expression study aims to compare non-injured ACL, isolated ACL tears and ACL tears from patients with meniscal tear as three independent groups ACTB+TBP+18S+HPRT1 should be used. In conclusion, 3 or more genes should be used as reference genes for analysis of ACL samples of individuals with and without ACL tears.
An endogenous reference gene of common and durum wheat for detection of genetically modified wheat.

PubMed

Imai, Shinjiro; Tanaka, Keiko; Nishitsuji, Yasuyuki; Kikuchi, Yosuke; Matsuoka, Yasuyuki; Arami, Shin-Ichiro; Sato, Megumi; Haraguchi, Hiroyuki; Kurimoto, Youichi; Mano, Junichi; Furui, Satoshi; Kitta, Kazumi

2012-01-01

To develop a method for detecting GM wheat that may be marketed in the near future, we evaluated the proline-rich protein (PRP) gene as an endogenous reference gene of common wheat (Triticum aestivum L.) and durum wheat (Triticum durum L.). Real-time PCR analysis showed that only DNA of wheat was amplified and no amplification product was observed for phylogenetically related cereals, indicating that the PRP detection system is specific to wheat. The intensities of the amplification products and Ct values among all wheat samples used in this study were very similar, with no nonspecific or additional amplification, indicating that the PRP detection system has high sequence stability. The limit of detection was estimated at 5 haploid genome copies. The PRP region was demonstrated to be present as a single or double copy in the common wheat haploid genome. Furthermore, the PRP detection system showed a highly linear relationship between Ct values and the amount of plasmid DNA, indicating that an appropriate calibration curve could be constructed for quantitative detection of GM wheat. All these results indicate that the PRP gene is a suitable endogenous reference gene for PCR-based detection of GM wheat.
Reference genes for normalization of qPCR assays in sugarcane plants under water deficit.

PubMed

de Andrade, Larissa Mara; Dos Santos Brito, Michael; Fávero Peixoto Junior, Rafael; Marchiori, Paulo Eduardo Ribeiro; Nóbile, Paula Macedo; Martins, Alexandre Palma Boer; Ribeiro, Rafael Vasconcelos; Creste, Silvana

2017-01-01

Sugarcane ( Saccharum spp.) is the main raw material for sugar and ethanol production. Among the abiotic stress, drought is the main one that negatively impact sugarcane yield. Although gene expression analysis through quantitative PCR (qPCR) has increased our knowledge about biological processes related to drought, gene network that mediates sugarcane responses to water deficit remains elusive. In such scenario, validation of reference gene is a major requirement for successful analyzes involving qPCR. In this study, candidate genes were tested for their suitable as reference genes for qPCR analyses in two sugarcane cultivars with varying drought tolerance. Eight candidate reference genes were evaluated in leaves sampled in plants subjected to water deficit in both field and greenhouse conditions. In addition, five genes were evaluated in shoot roots of plants subjected to water deficit by adding PEG8000 to the nutrient solution. NormFinder and RefFinder algorithms were used to identify the most stable gene(s) among genotypes and under different experimental conditions. Both algorithms revealed that in leaf samples, UBQ1 and GAPDH genes were more suitable as reference genes, whereas GAPDH was the best reference one in shoot roots. Reference genes suitable for sugarcane under water deficit were identified, which would lead to a more accurate and reliable analysis of qPCR. Thus, results obtained in this study may guide future research on gene expression in sugarcane under varying water conditions.
Optimal Reference Gene Selection for Expression Studies in Human Reticulocytes.

PubMed

Aggarwal, Anu; Jamwal, Manu; Viswanathan, Ganesh K; Sharma, Prashant; Sachdeva, ManUpdesh S; Bansal, Deepak; Malhotra, Pankaj; Das, Reena

2018-05-01

Reference genes are indispensable for normalizing mRNA levels across samples in real-time quantitative PCR. Their expression levels vary under different experimental conditions and because of several inherent characteristics. Appropriate reference gene selection is thus critical for gene-expression studies. This study aimed at selecting optimal reference genes for gene-expression analysis of reticulocytes and at validating them in hereditary spherocytosis (HS) and β-thalassemia intermedia (βTI) patients. Seven reference genes (PGK1, MPP1, HPRT1, ACTB, GAPDH, RN18S1, and SDHA) were selected because of published reports. Real-time quantitative PCR was performed on reticulocytes in 20 healthy volunteers, 15 HS patients, and 10 βTI patients. Threshold cycle values were compared with fold-change method and RefFinder software. The stable reference genes recommended by RefFinder were validated with SLC4A1 and flow cytometric eosin-5'-maleimide binding assay values in HS patients and HBG2 and high performance liquid chromatography-derived percentage of hemoglobin F in βTI. Comprehensive ranking predicted MPP1 and GAPDH as optimal reference genes for reticulocytes that were not affected in HS and βTI. This was further confirmed on validation with eosin-5'-maleimide results and percentage of hemoglobin F in HS and βTI patients, respectively. Hence, MPP1 and GAPDH are good reference genes for reticulocyte expression studies compared with ACTB and RN18S1, the two most commonly used reference genes. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Determination of internal controls for quantitative gene expression of Isochrysis zhangjiangensis at nitrogen stress condition

NASA Astrophysics Data System (ADS)

Wu, Shuang; Zhou, Jiannan; Cao, Xupeng; Xue, Song

2016-02-01

Isochrysis zhangjiangensis is a potential marine microalga for biodiesel production, which accumulates lipid under nitrogen limitation conditions, but the mechanism on molecular level is veiled. Quantitative real-time polymerase chain reaction (qPCR) provides the possibility to investigate the gene expression levels, and a valid reference for data normalization is an essential prerequisite for firing up the analysis. In this study, five housekeeping genes, actin (ACT), α-tubulin (TUA), ß-tubulin (TUB), ubiquitin (UBI), 18S rRNA (18S) and one target gene, diacylglycerol acyltransferase (DGAT), were used for determining the reference. By analyzing the stabilities based on calculation of the stability index and on operating the two types of software, geNorm and bestkeeper, it showed that the reference genes widely used in higher plant and microalgae, such as UBI, TUA and 18S, were not the most stable ones in nitrogen-stressed I. zhangjiangensis, and thus are not suitable for exploring the mRNA expression levels under these experimental conditions. Our results show that ACT together with TUB is the most feasible internal control for investigating gene expression under nitrogen-stressed conditions. Our findings will contribute not only to future qPCR studies of I. zhangjiangensis, but also to verification of comparative transcriptomics studies of the microalgae under similar conditions.
Validation of Reference Genes for Relative Quantitative Gene Expression Studies in Cassava (Manihot esculenta Crantz) by Using Quantitative Real-Time PCR

PubMed Central

Hu, Meizhen; Hu, Wenbin; Xia, Zhiqiang; Zhou, Xincheng; Wang, Wenquan

2016-01-01

Reverse transcription quantitative real-time polymerase chain reaction (real-time PCR, also referred to as quantitative RT-PCR or RT-qPCR) is a highly sensitive and high-throughput method used to study gene expression. Despite the numerous advantages of RT-qPCR, its accuracy is strongly influenced by the stability of internal reference genes used for normalizations. To date, few studies on the identification of reference genes have been performed on cassava (Manihot esculenta Crantz). Therefore, we selected 26 candidate reference genes mainly via the three following channels: reference genes used in previous studies on cassava, the orthologs of the most stable Arabidopsis genes, and the sequences obtained from 32 cassava transcriptome sequence data. Then, we employed ABI 7900 HT and SYBR Green PCR mix to assess the expression of these genes in 21 materials obtained from various cassava samples under different developmental and environmental conditions. The stability of gene expression was analyzed using two statistical algorithms, namely geNorm and NormFinder. geNorm software suggests the combination of cassava4.1_017977 and cassava4.1_006391 as sufficient reference genes for major cassava samples, the union of cassava4.1_014335 and cassava4.1_006884 as best choice for drought stressed samples, and the association of cassava4.1_012496 and cassava4.1_006391 as optimal choice for normally grown samples. NormFinder software recommends cassava4.1_006884 or cassava4.1_006776 as superior reference for qPCR analysis of different materials and organs of drought stressed or normally grown cassava, respectively. Results provide an important resource for cassava reference genes under specific conditions. The limitations of these findings were also discussed. Furthermore, we suggested some strategies that may be used to select candidate reference genes. PMID:27242878
Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance

PubMed Central

2011-01-01

Background Until recently, read lengths on the Solexa/Illumina system were too short to reliably assemble transcriptomes without a reference sequence, especially for non-model organisms. However, with read lengths up to 100 nucleotides available in the current version, an assembly without reference genome should be possible. For this study we created an EST data set for the common pond snail Radix balthica by Illumina sequencing of a normalized transcriptome. Performance of three different short read assemblers was compared with respect to: the number of contigs, their length, depth of coverage, their quality in various BLAST searches and the alignment to mitochondrial genes. Results A single sequencing run of a normalized RNA pool resulted in 16,923,850 paired end reads with median read length of 61 bases. The assemblies generated by VELVET, OASES, and SeqMan NGEN differed in the total number of contigs, contig length, the number and quality of gene hits obtained by BLAST searches against various databases, and contig performance in the mt genome comparison. While VELVET produced the highest overall number of contigs, a large fraction of these were of small size (< 200bp), and gave redundant hits in BLAST searches and the mt genome alignment. The best overall contig performance resulted from the NGEN assembly. It produced the second largest number of contigs, which on average were comparable to the OASES contigs but gave the highest number of gene hits in two out of four BLAST searches against different reference databases. A subsequent meta-assembly of the four contig sets resulted in larger contigs, less redundancy and a higher number of BLAST hits. Conclusion Our results document the first de novo transcriptome assembly of a non-model species using Illumina sequencing data. We show that de novo transcriptome assembly using this approach yields results useful for downstream applications, in particular if a meta-assembly of contig sets is used to increase contig quality. These results highlight the ongoing need for improvements in assembly methodology. PMID:21679424
Superior Cross-Species Reference Genes: A Blueberry Case Study

PubMed Central

Die, Jose V.; Rowland, Lisa J.

2013-01-01

The advent of affordable Next Generation Sequencing technologies has had major impact on studies of many crop species, where access to genomic technologies and genome-scale data sets has been extremely limited until now. The recent development of genomic resources in blueberry will enable the application of high throughput gene expression approaches that should relatively quickly increase our understanding of blueberry physiology. These studies, however, require a highly accurate and robust workflow and make necessary the identification of reference genes with high expression stability for correct target gene normalization. To create a set of superior reference genes for blueberry expression analyses, we mined a publicly available transcriptome data set from blueberry for orthologs to a set of Arabidopsis genes that showed the most stable expression in a developmental series. In total, the expression stability of 13 putative reference genes was evaluated by qPCR and a set of new references with high stability values across a developmental series in fruits and floral buds of blueberry were identified. We also demonstrated the need to use at least two, preferably three, reference genes to avoid inconsistencies in results, even when superior reference genes are used. The new references identified here provide a valuable resource for accurate normalization of gene expression in Vaccinium spp. and may be useful for other members of the Ericaceae family as well. PMID:24058469
Storm Water Management Model Reference Manual Volume I, Hydrology

EPA Science Inventory

SWMM is a dynamic rainfall-runoff simulation model used for single event or long-term (continuous) simulation of runoff quantity and quality from primarily urban areas. The runoff component of SWMM operates on a collection of subcatchment areas that receive precipitation and gene...
Storm Water Management Model Reference Manual Volume II – Hydraulics

EPA Science Inventory

SWMM is a dynamic rainfall-runoff simulation model used for single event or long-term (continuous) simulation of runoff quantity and quality from primarily urban areas. The runoff component of SWMM operates on a collection of subcatchment areas that receive precipitation and gene...
A gene expression biomarker accurately predicts estrogen ...

EPA Pesticide Factsheets

The EPA’s vision for the Endocrine Disruptor Screening Program (EDSP) in the 21st Century (EDSP21) includes utilization of high-throughput screening (HTS) assays coupled with computational modeling to prioritize chemicals with the goal of eventually replacing current Tier 1 screening tests. The ToxCast program currently includes 18 HTS in vitro assays that evaluate the ability of chemicals to modulate estrogen receptor α (ERα), an important endocrine target. We propose microarray-based gene expression profiling as a complementary approach to predict ERα modulation and have developed computational methods to identify ERα modulators in an existing database of whole-genome microarray data. The ERα biomarker consisted of 46 ERα-regulated genes with consistent expression patterns across 7 known ER agonists and 3 known ER antagonists. The biomarker was evaluated as a predictive tool using the fold-change rank-based Running Fisher algorithm by comparison to annotated gene expression data sets from experiments in MCF-7 cells. Using 141 comparisons from chemical- and hormone-treated cells, the biomarker gave a balanced accuracy for prediction of ERα activation or suppression of 94% or 93%, respectively. The biomarker was able to correctly classify 18 out of 21 (86%) OECD ER reference chemicals including “very weak” agonists and replicated predictions based on 18 in vitro ER-associated HTS assays. For 114 chemicals present in both the HTS data and the MCF-7 c
High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

PubMed

Lagarde, Julien; Uszczynska-Ratajczak, Barbara; Carbonell, Silvia; Pérez-Lluch, Sílvia; Abad, Amaya; Davis, Carrie; Gingeras, Thomas R; Frankish, Adam; Harrow, Jennifer; Guigo, Roderic; Johnson, Rory

2017-12-01

Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete-many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.
Toxicogenomics in the 3T3-L1 cell line, a new approach for screening of obesogenic compounds.

PubMed

Pereira-Fernandes, Anna; Vanparys, Caroline; Vergauwen, Lucia; Knapen, Dries; Jorens, Philippe Germaines; Blust, Ronny

2014-08-01

The obesogen hypothesis states that together with an energy imbalance between calories consumed and calories expended, exposure to environmental compounds early in life or throughout lifetime might have an influence on obesity development. In this work, we propose a new approach for obesogen screening, i.e., the use of transcriptomics in the 3T3-L1 pre-adipocyte cell line. Based on the data from a previous study of our group using a lipid accumulation based adipocyte differentiation assay, several human-relevant obesogenic compounds were selected: reference obesogens (Rosiglitazone, Tributyltin), test obesogens (Butylbenzyl phthalate, butylparaben, propylparaben, Bisphenol A), and non-obesogens (Ethylene Brassylate, Bis (2-ethylhexyl)phthalate). The high stability and reproducibility of the 3T3-L1 gene transcription patterns over different experiments and cell batches is demonstrated by this study. Obesogens and non-obesogen gene transcription profiles were clearly distinguished using hierarchical clustering. Furthermore, a gradual distinction corresponding to differences in induction of lipid accumulation could be made between test and reference obesogens based on transcription patterns, indicating the potential use of this strategy for classification of obesogens. Marker genes that are able to distinguish between non, test, and reference obesogens were identified. Well-known genes involved in adipocyte differentiation as well as genes with unknown functions were selected, implying a potential adipocyte-related function of the latter. Cell-physiological lipid accumulation was well estimated based on transcription levels of the marker genes, indicating the biological relevance of omics data. In conclusion, this study shows the high relevance and reproducibility of this 3T3-L1 based in vitro toxicogenomics tool for classification of obesogens and biomarker discovery. Although the results presented here are promising, further confirmation of the predictive value of the set of candidate biomarkers identified as well as the validation of their clinical role will be needed. © The Author 2014. Published by Oxford University Press on behalf of the Society of Toxicology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Comparison of droplet digital PCR with quantitative real-time PCR for determination of zygosity in transgenic maize.

PubMed

Xu, Xiaoli; Peng, Cheng; Wang, Xiaofu; Chen, Xiaoyun; Wang, Qiang; Xu, Junfeng

2016-12-01

This study evaluated the applicability of droplet digital PCR (ddPCR) as a tool for maize zygosity determination using quantitative real-time PCR (qPCR) as a reference technology. Quantitative real-time PCR is commonly used to determine transgene copy number or GMO zygosity characterization. However, its effectiveness is based on identical reaction efficiencies for the transgene and the endogenous reference gene. Additionally, a calibrator sample should be utilized for accuracy. Droplet digital PCR is a DNA molecule counting technique that directly counts the absolute number of target and reference DNA molecules in a sample, independent of assay efficiency or external calibrators. The zygosity of the transgene can be easily determined using the ratio of the quantity of the target gene to the reference single copy endogenous gene. In this study, both the qPCR and ddPCR methods were used to determine insect-resistant transgenic maize IE034 zygosity. Both methods performed well, but the ddPCR method was more convenient because of its absolute quantification property.

Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data

PubMed Central

Zhao, Xin; Cheung, Leo Wang-Kit

2007-01-01

Background Designing appropriate machine learning methods for identifying genes that have a significant discriminating power for disease outcomes has become more and more important for our understanding of diseases at genomic level. Although many machine learning methods have been developed and applied to the area of microarray gene expression data analysis, the majority of them are based on linear models, which however are not necessarily appropriate for the underlying connection between the target disease and its associated explanatory genes. Linear model based methods usually also bring in false positive significant features more easily. Furthermore, linear model based algorithms often involve calculating the inverse of a matrix that is possibly singular when the number of potentially important genes is relatively large. This leads to problems of numerical instability. To overcome these limitations, a few non-linear methods have recently been introduced to the area. Many of the existing non-linear methods have a couple of critical problems, the model selection problem and the model parameter tuning problem, that remain unsolved or even untouched. In general, a unified framework that allows model parameters of both linear and non-linear models to be easily tuned is always preferred in real-world applications. Kernel-induced learning methods form a class of approaches that show promising potentials to achieve this goal. Results A hierarchical statistical model named kernel-imbedded Gaussian process (KIGP) is developed under a unified Bayesian framework for binary disease classification problems using microarray gene expression data. In particular, based on a probit regression setting, an adaptive algorithm with a cascading structure is designed to find the appropriate kernel, to discover the potentially significant genes, and to make the optimal class prediction accordingly. A Gibbs sampler is built as the core of the algorithm to make Bayesian inferences. Simulation studies showed that, even without any knowledge of the underlying generative model, the KIGP performed very close to the theoretical Bayesian bound not only in the case with a linear Bayesian classifier but also in the case with a very non-linear Bayesian classifier. This sheds light on its broader usability to microarray data analysis problems, especially to those that linear methods work awkwardly. The KIGP was also applied to four published microarray datasets, and the results showed that the KIGP performed better than or at least as well as any of the referred state-of-the-art methods did in all of these cases. Conclusion Mathematically built on the kernel-induced feature space concept under a Bayesian framework, the KIGP method presented in this paper provides a unified machine learning approach to explore both the linear and the possibly non-linear underlying relationship between the target features of a given binary disease classification problem and the related explanatory gene expression data. More importantly, it incorporates the model parameter tuning into the framework. The model selection problem is addressed in the form of selecting a proper kernel type. The KIGP method also gives Bayesian probabilistic predictions for disease classification. These properties and features are beneficial to most real-world applications. The algorithm is naturally robust in numerical computation. The simulation studies and the published data studies demonstrated that the proposed KIGP performs satisfactorily and consistently. PMID:17328811
Reference Gene Selection for Quantitative Real-Time Reverse-Transcriptase PCR in Annual Ryegrass (Lolium multiflorum) Subjected to Various Abiotic Stresses.

PubMed

Liu, Qiuxu; Qi, Xiao; Yan, Haidong; Huang, Linkai; Nie, Gang; Zhang, Xinquan

2018-01-16

To select the most stable reference genes in annual ryegrass ( Lolium multiflorum ), we studied annual ryegrass leaf tissues exposed to various abiotic stresses by qRT-PCR and selected 11 candidate reference genes, i.e., 18S rRNA, E2, GAPDH, eIF4A, HIS3, SAMDC, TBP-1, Unigene71, Unigene77, Unigene755, and Unigene14912. We then used GeNorm, NormFinder, and BestKeeper to analyze the expression stability of these 11 genes, and used RefFinder to comprehensively rank genes according to stability. Under different stress conditions, the most suitable reference genes for studies of leaf tissues of annual ryegrass were different. The expression of the eIF4A gene was the most stable under drought stress. Under saline-alkali stress, Unigene14912 has the highest expression stability. Under acidic aluminum stress, SAMDC expression stability was highest. Under heavy metal stress, Unigene71 expression had the highest stability. According to the software analyses, Unigene14912, HIS3, and eIF4A were the most suitable for analyses of abiotic stress in tissues of annual ryegrass. GAPDH was the least suitable reference gene. In conclusion, selecting appropriate reference genes under abiotic stress not only improves the accuracy of annual ryegrass gene expression analyses, but also provides a theoretical reference for the development of reference genes in plants of the genus Lolium .
Society, Land, Love or Money (A Strategic Model of How to Glue the Generations Together),

DTIC Science & Technology

1981-01-01

carrying of extra capital stock. 33 REFERENCES Dawkins , R. (1976), The Selfish Gene . New York: Oxford University Press. Dubey, P. and N. Shubik (1981...a gene would rapidly disappear. The care for offspring needs to be forthcoming until the last of the new generation is selfsufficient and able to...payoff than the firs:. it is more complicated. 20 4.3. Selfish Individuals and Threat Strategies (Model 1) In this section an example is fully
Selection of Suitable Reference Genes for RT-qPCR Normalization under Abiotic Stresses and Hormone Stimulation in Persimmon (Diospyros kaki Thunb)

PubMed Central

Wang, Peihong; Xiong, Aisheng; Gao, Zhihong; Yu, Xinyi; Li, Man; Hou, Yingjun; Sun, Chao; Qu, Shenchun

2016-01-01

The success of quantitative real-time reverse transcription polymerase chain reaction (RT-qPCR) to quantify gene expression depends on the stability of the reference genes used for data normalization. To date, systematic screening for reference genes in persimmon (Diospyros kaki Thunb) has never been reported. In this study, 13 candidate reference genes were cloned from 'Nantongxiaofangshi' using information available in the transcriptome database. Their expression stability was assessed by geNorm and NormFinder algorithms under abiotic stress and hormone stimulation. Our results showed that the most suitable reference genes across all samples were UBC and GAPDH, and not the commonly used persimmon reference gene ACT. In addition, UBC combined with RPII or TUA were found to be appropriate for the "abiotic stress" group and α-TUB combined with PP2A were found to be appropriate for the "hormone stimuli" group. For further validation, the transcript level of the DkDREB2C homologue under heat stress was studied with the selected genes (CYP, GAPDH, TUA, UBC, α-TUB, and EF1-α). The results suggested that it is necessary to choose appropriate reference genes according to the test materials or experimental conditions. Our study will be useful for future studies on gene expression in persimmon. PMID:27513755
Housekeeping while brain's storming Validation of normalizing factors for gene expression studies in a murine model of traumatic brain injury

PubMed Central

Rhinn, Hervé; Marchand-Leroux, Catherine; Croci, Nicole; Plotkine, Michel; Scherman, Daniel; Escriou, Virginie

2008-01-01

Background Traumatic brain injury models are widely studied, especially through gene expression, either to further understand implied biological mechanisms or to assess the efficiency of potential therapies. A large number of biological pathways are affected in brain trauma models, whose elucidation might greatly benefit from transcriptomic studies. However the suitability of reference genes needed for quantitative RT-PCR experiments is missing for these models. Results We have compared five potential reference genes as well as total cDNA level monitored using Oligreen reagent in order to determine the best normalizing factors for quantitative RT-PCR expression studies in the early phase (0–48 h post-trauma (PT)) of a murine model of diffuse brain injury. The levels of 18S rRNA, and of transcripts of β-actin, glyceraldehyde-3P-dehydrogenase (GAPDH), β-microtubulin and S100β were determined in the injured brain region of traumatized mice sacrificed at 30 min, 3 h, 6 h, 12 h, 24 h and 48 h post-trauma. The stability of the reference genes candidates and of total cDNA was evaluated by three different methods, leading to the following rankings as normalization factors, from the most suitable to the less: by using geNorm VBA applet, we obtained the following sequence: cDNA(Oligreen); GAPDH > 18S rRNA > S100β > β-microtubulin > β-actin; by using NormFinder Excel Spreadsheet, we obtained the following sequence: GAPDH > cDNA(Oligreen) > S100β > 18S rRNA > β-actin > β-microtubulin; by using a Confidence-Interval calculation, we obtained the following sequence: cDNA(Oligreen) > 18S rRNA; GAPDH > S100β > β-microtubulin > β-actin. Conclusion This work suggests that Oligreen cDNA measurements, 18S rRNA and GAPDH or a combination of them may be used to efficiently normalize qRT-PCR gene expression in mouse brain trauma injury, and that β-actin and β-microtubulin should be avoided. The potential of total cDNA as measured by Oligreen as a first-intention normalizing factor with a broad field of applications is highlighted. Pros and cons of the three methods of normalization factors selection are discussed. A generic time- and cost-effective procedure for normalization factor validation is proposed. PMID:18611280
Mixture models for detecting differentially expressed genes in microarrays.

PubMed

Jones, Liat Ben-Tovim; Bean, Richard; McLachlan, Geoffrey J; Zhu, Justin Xi

2006-10-01

An important and common problem in microarray experiments is the detection of genes that are differentially expressed in a given number of classes. As this problem concerns the selection of significant genes from a large pool of candidate genes, it needs to be carried out within the framework of multiple hypothesis testing. In this paper, we focus on the use of mixture models to handle the multiplicity issue. With this approach, a measure of the local FDR (false discovery rate) is provided for each gene. An attractive feature of the mixture model approach is that it provides a framework for the estimation of the prior probability that a gene is not differentially expressed, and this probability can subsequently be used in forming a decision rule. The rule can also be formed to take the false negative rate into account. We apply this approach to a well-known publicly available data set on breast cancer, and discuss our findings with reference to other approaches.
Validation of Suitable Reference Genes for Expression Normalization in Echinococcus spp. Larval Stages

PubMed Central

Espínola, Sergio Martin; Ferreira, Henrique Bunselmeyer; Zaha, Arnaldo

2014-01-01

In recent years, a significant amount of sequence data (both genomic and transcriptomic) for Echinococcus spp. has been published, thereby facilitating the analysis of genes expressed during a specific stage or involved in parasite development. To perform a suitable gene expression quantification analysis, the use of validated reference genes is strongly recommended. Thus, the aim of this work was to identify suitable reference genes to allow reliable expression normalization for genes of interest in Echinococcus granulosus sensu stricto (s.s.) (G1) and Echinococcus ortleppi upon induction of the early pre-adult development. Untreated protoscoleces (PS) and pepsin-treated protoscoleces (PSP) from E. granulosus s.s. (G1) and E. ortleppi metacestode were used. The gene expression stability of eleven candidate reference genes (βTUB, NDUFV2, RPL13, TBP, CYP-1, RPII, EF-1α, βACT-1, GAPDH, ETIF4A-III and MAPK3) was assessed using geNorm, Normfinder, and RefFinder. Our qPCR data showed a good correlation with the recently published RNA-seq data. Regarding expression stability, EF-1α and TBP were the most stable genes for both species. Interestingly, βACT-1 (the most commonly used reference gene), and GAPDH and ETIF4A-III (previously identified as housekeeping genes) did not behave stably in our assay conditions. We propose the use of EF-1α as a reference gene for studies involving gene expression analysis in both PS and PSP experimental conditions for E. granulosus s.s. and E. ortleppi. To demonstrate its applicability, EF-1α was used as a normalizer gene in the relative quantification of transcripts from genes coding for antigen B subunits. The same EF-1α reference gene may be used in studies with other Echinococcus sensu lato species. This report validates suitable reference genes for species of class Cestoda, phylum Platyhelminthes, thus providing a foundation for further validation in other epidemiologically important cestode species, such as those from the Taenia genus. PMID:25014071
Validation of suitable reference genes for expression normalization in Echinococcus spp. larval stages.

PubMed

Espínola, Sergio Martin; Ferreira, Henrique Bunselmeyer; Zaha, Arnaldo

2014-01-01

In recent years, a significant amount of sequence data (both genomic and transcriptomic) for Echinococcus spp. has been published, thereby facilitating the analysis of genes expressed during a specific stage or involved in parasite development. To perform a suitable gene expression quantification analysis, the use of validated reference genes is strongly recommended. Thus, the aim of this work was to identify suitable reference genes to allow reliable expression normalization for genes of interest in Echinococcus granulosus sensu stricto (s.s.) (G1) and Echinococcus ortleppi upon induction of the early pre-adult development. Untreated protoscoleces (PS) and pepsin-treated protoscoleces (PSP) from E. granulosus s.s. (G1) and E. ortleppi metacestode were used. The gene expression stability of eleven candidate reference genes (βTUB, NDUFV2, RPL13, TBP, CYP-1, RPII, EF-1α, βACT-1, GAPDH, ETIF4A-III and MAPK3) was assessed using geNorm, Normfinder, and RefFinder. Our qPCR data showed a good correlation with the recently published RNA-seq data. Regarding expression stability, EF-1α and TBP were the most stable genes for both species. Interestingly, βACT-1 (the most commonly used reference gene), and GAPDH and ETIF4A-III (previously identified as housekeeping genes) did not behave stably in our assay conditions. We propose the use of EF-1α as a reference gene for studies involving gene expression analysis in both PS and PSP experimental conditions for E. granulosus s.s. and E. ortleppi. To demonstrate its applicability, EF-1α was used as a normalizer gene in the relative quantification of transcripts from genes coding for antigen B subunits. The same EF-1α reference gene may be used in studies with other Echinococcus sensu lato species. This report validates suitable reference genes for species of class Cestoda, phylum Platyhelminthes, thus providing a foundation for further validation in other epidemiologically important cestode species, such as those from the Taenia genus.
Identification and analysis of unitary loss of long-established protein-coding genes in Poaceae shows evidences for biased gene loss and putatively functional transcription of relics.

PubMed

Zhao, Yi; Tang, Liang; Li, Zhe; Jin, Jinpu; Luo, Jingchu; Gao, Ge

2015-04-18

Long-established protein-coding genes may lose their coding potential during evolution ("unitary gene loss"). Members of the Poaceae family are a major food source and represent an ideal model clade for plant evolution research. However, the global pattern of unitary gene loss in Poaceae genomes as well as the evolutionary fate of lost genes are still less-investigated and remain largely elusive. Using a locally developed pipeline, we identified 129 unitary gene loss events for long-established protein-coding genes from four representative species of Poaceae, i.e. brachypodium, rice, sorghum and maize. Functional annotation suggested that the lost genes in all or most of Poaceae species are enriched for genes involved in development and response to endogenous stimulus. We also found that 44 mutated genomic loci of lost genes, which we referred as relics, were still actively transcribed, and of which 84% (37 of 44) showed significantly differential expression across different tissues. More interestingly, we found that there were totally five expressed relics may function as competitive endogenous RNA in brachypodium, rice and sorghum genome. Based on comparative genomics and transcriptome data, we firstly compiled a comprehensive catalogue of unitary gene loss events in Poaceae species and characterized a statistically significant functional preference for these lost genes as well showed the potential of relics functioning as competitive endogenous RNAs in Poaceae genomes.
Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

PubMed

Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

2015-01-01

In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.
A Risk Stratification Model for Lung Cancer Based on Gene Coexpression Network and Deep Learning

PubMed Central

2018-01-01

Risk stratification model for lung cancer with gene expression profile is of great interest. Instead of previous models based on individual prognostic genes, we aimed to develop a novel system-level risk stratification model for lung adenocarcinoma based on gene coexpression network. Using multiple microarray, gene coexpression network analysis was performed to identify survival-related networks. A deep learning based risk stratification model was constructed with representative genes of these networks. The model was validated in two test sets. Survival analysis was performed using the output of the model to evaluate whether it could predict patients' survival independent of clinicopathological variables. Five networks were significantly associated with patients' survival. Considering prognostic significance and representativeness, genes of the two survival-related networks were selected for input of the model. The output of the model was significantly associated with patients' survival in two test sets and training set (p < 0.00001, p < 0.0001 and p = 0.02 for training and test sets 1 and 2, resp.). In multivariate analyses, the model was associated with patients' prognosis independent of other clinicopathological features. Our study presents a new perspective on incorporating gene coexpression networks into the gene expression signature and clinical application of deep learning in genomic data science for prognosis prediction. PMID:29581968
Development of an ELA-DRA gene typing method based on pyrosequencing technology.

PubMed

Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G

2008-11-01

The polymorphism of equine lymphocyte antigen (ELA) class II DRA gene had been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed to identify 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.
The zebrafish reference genome sequence and its relationship to the human genome.

PubMed

Howe, Kerstin; Clark, Matthew D; Torroja, Carlos F; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T; Guerra-Assunção, José A; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F; Laird, Gavin K; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Elliot, David; Eliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Begum, Sharmin; Mortimore, Beverley; Mortimer, Beverly; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Lloyd, Christine; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James D; Cooper, James; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Lanz, Christa; Raddatz, Günter; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Schuster, Stephan C; Carter, Nigel P; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M J; Enright, Anton; Geisler, Robert; Plasterk, Ronald H A; Lee, Charles; Westerfield, Monte; de Jong, Pieter J; Zon, Leonard I; Postlethwait, John H; Nüsslein-Volhard, Christiane; Hubbard, Tim J P; Roest Crollius, Hugues; Rogers, Jane; Stemple, Derek L

2013-04-25

Zebrafish have become a popular organism for the study of vertebrate gene function. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination.
The zebrafish reference genome sequence and its relationship to the human genome

PubMed Central

Howe, Kerstin; Clark, Matthew D.; Torroja, Carlos F.; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E.; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C.; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T.; Guerra-Assunção, José A.; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F.; Laird, Gavin K.; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Eliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Mortimer, Beverly; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M.; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Carter, Nigel P.; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M. J.; Enright, Anton; Geisler, Robert; Plasterk, Ronald H. A.; Lee, Charles; Westerfield, Monte; de Jong, Pieter J.; Zon, Leonard I.; Postlethwait, John H.; Nüsslein-Volhard, Christiane; Hubbard, Tim J. P.; Crollius, Hugues Roest; Rogers, Jane; Stemple, Derek L.

2013-01-01

Zebrafish have become a popular organism for the study of vertebrate gene function1,2. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease3–5. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes6, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination. PMID:23594743
Identification of reference genes in human myelomonocytic cells for gene expression studies in altered gravity.

PubMed

Thiel, Cora S; Hauschild, Swantje; Tauber, Svantje; Paulsen, Katrin; Raig, Christiane; Raem, Arnold; Biskup, Josefine; Gutewort, Annett; Hürlimann, Eva; Unverdorben, Felix; Buttron, Isabell; Lauber, Beatrice; Philpot, Claudia; Lier, Hartwin; Engelmann, Frank; Layer, Liliana E; Ullrich, Oliver

2015-01-01

Gene expression studies are indispensable for investigation and elucidation of molecular mechanisms. For the process of normalization, reference genes ("housekeeping genes") are essential to verify gene expression analysis. Thus, it is assumed that these reference genes demonstrate similar expression levels over all experimental conditions. However, common recommendations about reference genes were established during 1 g conditions and therefore their applicability in studies with altered gravity has not been demonstrated yet. The microarray technology is frequently used to generate expression profiles under defined conditions and to determine the relative difference in expression levels between two or more different states. In our study, we searched for potential reference genes with stable expression during different gravitational conditions (microgravity, normogravity, and hypergravity) which are additionally not altered in different hardware systems. We were able to identify eight genes (ALB, B4GALT6, GAPDH, HMBS, YWHAZ, ABCA5, ABCA9, and ABCC1) which demonstrated no altered gene expression levels in all tested conditions and therefore represent good candidates for the standardization of gene expression studies in altered gravity.
Split-plot microarray experiments: issues of design, power and sample size.

PubMed

Tsai, Pi-Wen; Lee, Mei-Ling Ting

2005-01-01

This article focuses on microarray experiments with two or more factors in which treatment combinations of the factors corresponding to the samples paired together onto arrays are not completely random. A main effect of one (or more) factor(s) is confounded with arrays (the experimental blocks). This is called a split-plot microarray experiment. We utilise an analysis of variance (ANOVA) model to assess differentially expressed genes for between-array and within-array comparisons that are generic under a split-plot microarray experiment. Instead of standard t- or F-test statistics that rely on mean square errors of the ANOVA model, we use a robust method, referred to as 'a pooled percentile estimator', to identify genes that are differentially expressed across different treatment conditions. We illustrate the design and analysis of split-plot microarray experiments based on a case application described by Jin et al. A brief discussion of power and sample size for split-plot microarray experiments is also presented.
Storm Water Management Model Reference Manual Volume III – Water Quality

EPA Science Inventory

SWMM is a dynamic rainfall-runoff simulation model used for single event or long-term (continuous) simulation of runoff quantity and quality from primarily urban areas. The runoff component of SWMM operates on a collection of subcatchment areas that receive precipitation and gene...
Assessment of primer/template mismatch effects on real-time PCR amplification of target taxa for GMO quantification.

PubMed

Ghedira, Rim; Papazova, Nina; Vuylsteke, Marnik; Ruttink, Tom; Taverniers, Isabel; De Loose, Marc

2009-10-28

GMO quantification, based on real-time PCR, relies on the amplification of an event-specific transgene assay and a species-specific reference assay. The uniformity of the nucleotide sequences targeted by both assays across various transgenic varieties is an important prerequisite for correct quantification. Single nucleotide polymorphisms (SNPs) frequently occur in the maize genome and might lead to nucleotide variation in regions used to design primers and probes for reference assays. Further, they may affect the annealing of the primer to the template and reduce the efficiency of DNA amplification. We assessed the effect of a minor DNA template modification, such as a single base pair mismatch in the primer attachment site, on real-time PCR quantification. A model system was used based on the introduction of artificial mismatches between the forward primer and the DNA template in the reference assay targeting the maize starch synthase (SSIIb) gene. The results show that the presence of a mismatch between the primer and the DNA template causes partial to complete failure of the amplification of the initial DNA template depending on the type and location of the nucleotide mismatch. With this study, we show that the presence of a primer/template mismatch affects the estimated total DNA quantity to a varying degree.
UPLC/Q-TOF MS-Based Metabolomics and qRT-PCR in Enzyme Gene Screening with Key Role in Triterpenoid Saponin Biosynthesis of Polygala tenuifolia

PubMed Central

Li, Zhenyu; Xu, Xiaoshuang; Peng, Bing; Qin, Xuemei; Du, Guanhua

2014-01-01

Background The dried root of Polygala tenuifolia, named Radix Polygalae, is a well-known traditional Chinese medicine. Triterpenoid saponins are some of the most important components of Radix Polygalae extracts and are widely studied because of their valuable pharmacological properties. However, the relationship between gene expression and triterpenoid saponin biosynthesis in P. tenuifolia is unclear. Methodology/Findings In this study, ultra-performance liquid chromatography (UPLC) coupled with quadrupole time-of-flight mass spectrometry (Q-TOF MS)-based metabolomic analysis was performed to identify and quantify the different chemical constituents of the roots, stems, leaves, and seeds of P. tenuifolia. A total of 22 marker compounds (VIP>1) were explored, and significant differences in all 7 triterpenoid saponins among the different tissues were found. We also observed an efficient reference gene GAPDH for different tissues in this plant and determined the expression level of some genes in the triterpenoid saponin biosynthetic pathway. Results showed that MVA pathway has more important functions in the triterpenoid saponin biosynthesis of P. tenuifolia. The expression levels of squalene synthase (SQS), squalene monooxygenase (SQE), and beta-amyrin synthase (β-AS) were highly correlated with the peak area intensity of triterpenoid saponins compared with data from UPLC/Q-TOF MS-based metabolomic analysis. Conclusions/Significance This finding suggested that a combination of UPLC/Q-TOF MS-based metabolomics and gene expression analysis can effectively elucidate the mechanism of triterpenoid saponin biosynthesis and can provide useful information on gene discovery. These findings can serve as a reference for using the overexpression of genes encoding for SQS, SQE, and/or β-AS to increase the triterpenoid saponin production of P. tenuifolia. PMID:25148032
UPLC/Q-TOF MS-based metabolomics and qRT-PCR in enzyme gene screening with key role in triterpenoid saponin biosynthesis of Polygala tenuifolia.

PubMed

Zhang, Fusheng; Li, Xiaowei; Li, Zhenyu; Xu, Xiaoshuang; Peng, Bing; Qin, Xuemei; Du, Guanhua

2014-01-01

The dried root of Polygala tenuifolia, named Radix Polygalae, is a well-known traditional Chinese medicine. Triterpenoid saponins are some of the most important components of Radix Polygalae extracts and are widely studied because of their valuable pharmacological properties. However, the relationship between gene expression and triterpenoid saponin biosynthesis in P. tenuifolia is unclear. In this study, ultra-performance liquid chromatography (UPLC) coupled with quadrupole time-of-flight mass spectrometry (Q-TOF MS)-based metabolomic analysis was performed to identify and quantify the different chemical constituents of the roots, stems, leaves, and seeds of P. tenuifolia. A total of 22 marker compounds (VIP>1) were explored, and significant differences in all 7 triterpenoid saponins among the different tissues were found. We also observed an efficient reference gene GAPDH for different tissues in this plant and determined the expression level of some genes in the triterpenoid saponin biosynthetic pathway. Results showed that MVA pathway has more important functions in the triterpenoid saponin biosynthesis of P. tenuifolia. The expression levels of squalene synthase (SQS), squalene monooxygenase (SQE), and beta-amyrin synthase (β-AS) were highly correlated with the peak area intensity of triterpenoid saponins compared with data from UPLC/Q-TOF MS-based metabolomic analysis. This finding suggested that a combination of UPLC/Q-TOF MS-based metabolomics and gene expression analysis can effectively elucidate the mechanism of triterpenoid saponin biosynthesis and can provide useful information on gene discovery. These findings can serve as a reference for using the overexpression of genes encoding for SQS, SQE, and/or β-AS to increase the triterpenoid saponin production of P. tenuifolia.

Stable Reference Gene Selection for RT-qPCR Analysis in Nonviruliferous and Viruliferous Frankliniella occidentalis.

PubMed

Yang, Chunxiao; Li, Hui; Pan, Huipeng; Ma, Yabin; Zhang, Deyong; Liu, Yong; Zhang, Zhanhong; Zheng, Changying; Chu, Dong

2015-01-01

Reverse transcriptase-quantitative polymerase chain reaction (RT-qPCR) is a reliable technique for measuring and evaluating gene expression during variable biological processes. To facilitate gene expression studies, normalization of genes of interest relative to stable reference genes is crucial. The western flower thrips Frankliniella occidentalis (Pergande) (Thysanoptera: Thripidae), the main vector of tomato spotted wilt virus (TSWV), is a destructive invasive species. In this study, the expression profiles of 11 candidate reference genes from nonviruliferous and viruliferous F. occidentalis were investigated. Five distinct algorithms, geNorm, NormFinder, BestKeeper, the ΔCt method, and RefFinder, were used to determine the performance of these genes. geNorm, NormFinder, BestKeeper, and RefFinder identified heat shock protein 70 (HSP70), heat shock protein 60 (HSP60), elongation factor 1 α, and ribosomal protein l32 (RPL32) as the most stable reference genes, and the ΔCt method identified HSP60, HSP70, RPL32, and heat shock protein 90 as the most stable reference genes. Additionally, two reference genes were sufficient for reliable normalization in nonviruliferous and viruliferous F. occidentalis. This work provides a foundation for investigating the molecular mechanisms of TSWV and F. occidentalis interactions.
Selection of reference genes for expression analysis in the entomophthoralean fungus Pandora neoaphidis.

PubMed

Chen, Chun; Xie, Tingna; Ye, Sudan; Jensen, Annette Bruun; Eilenberg, Jørgen

2016-01-01

The selection of suitable reference genes is crucial for accurate quantification of gene expression and can add to our understanding of host-pathogen interactions. To identify suitable reference genes in Pandora neoaphidis, an obligate aphid pathogenic fungus, the expression of three traditional candidate genes including 18S rRNA(18S), 28S rRNA(28S) and elongation factor 1 alpha-like protein (EF1), were measured by quantitative polymerase chain reaction at different developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae), and under different nutritional conditions. We calculated the expression stability of candidate reference genes using four algorithms including geNorm, NormFinder, BestKeeper and Delta Ct. The analysis results revealed that the comprehensive ranking of candidate reference genes from the most stable to the least stable was 18S (1.189), 28S (1.414) and EF1 (3). The 18S was, therefore, the most suitable reference gene for real-time RT-PCR analysis of gene expression under all conditions. These results will support further studies on gene expression in P. neoaphidis. Copyright © 2015 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Computational Gene Expression Modeling Identifies Salivary Biomarker Analysis that Predict Oral Feeding Readiness in the Newborn

PubMed Central

Maron, Jill L.; Hwang, Jooyeon S.; Pathak, Subash; Ruthazer, Robin; Russell, Ruby L.; Alterovitz, Gil

2014-01-01

Objective To combine mathematical modeling of salivary gene expression microarray data and systems biology annotation with RT-qPCR amplification to identify (phase I) and validate (phase II) salivary biomarker analysis for the prediction of oral feeding readiness in preterm infants. Study design Comparative whole transcriptome microarray analysis from 12 preterm newborns pre- and post-oral feeding success was used for computational modeling and systems biology analysis to identify potential salivary transcripts associated with oral feeding success (phase I). Selected gene expression biomarkers (15 from computational modeling; 6 evidence-based; and 3 reference) were evaluated by RT-qPCR amplification on 400 salivary samples from successful (n=200) and unsuccessful (n=200) oral feeders (phase II). Genes, alone and in combination, were evaluated by a multivariate analysis controlling for sex and post-conceptional age (PCA) to determine the probability that newborns achieved successful oral feeding. Results Advancing post-conceptional age (p < 0.001) and female sex (p = 0.05) positively predicted an infant’s ability to feed orally. A combination of five genes, NPY2R (hunger signaling), AMPK (energy homeostasis), PLXNA1 (olfactory neurogenesis), NPHP4 (visual behavior) and WNT3 (facial development), in addition to PCA and sex, demonstrated good accuracy for determining feeding success (AUROC = 0.78). Conclusions We have identified objective and biologically relevant salivary biomarkers that noninvasively assess a newborn’s developing brain, sensory and facial development as they relate to oral feeding success. Understanding the mechanisms that underlie the development of oral feeding readiness through translational and computational methods may improve clinical decision making while decreasing morbidities and health care costs. PMID:25620512
A Nonlinear Model for Gene-Based Gene-Environment Interaction.

PubMed

Sa, Jian; Liu, Xu; He, Tao; Liu, Guifen; Cui, Yuehua

2016-06-04

A vast amount of literature has confirmed the role of gene-environment (G×E) interaction in the etiology of complex human diseases. Traditional methods are predominantly focused on the analysis of interaction between a single nucleotide polymorphism (SNP) and an environmental variable. Given that genes are the functional units, it is crucial to understand how gene effects (rather than single SNP effects) are influenced by an environmental variable to affect disease risk. Motivated by the increasing awareness of the power of gene-based association analysis over single variant based approach, in this work, we proposed a sparse principle component regression (sPCR) model to understand the gene-based G×E interaction effect on complex disease. We first extracted the sparse principal components for SNPs in a gene, then the effect of each principal component was modeled by a varying-coefficient (VC) model. The model can jointly model variants in a gene in which their effects are nonlinearly influenced by an environmental variable. In addition, the varying-coefficient sPCR (VC-sPCR) model has nice interpretation property since the sparsity on the principal component loadings can tell the relative importance of the corresponding SNPs in each component. We applied our method to a human birth weight dataset in Thai population. We analyzed 12,005 genes across 22 chromosomes and found one significant interaction effect using the Bonferroni correction method and one suggestive interaction. The model performance was further evaluated through simulation studies. Our model provides a system approach to evaluate gene-based G×E interaction.
Selection and Validation of Reference Genes for Accurate RT-qPCR Data Normalization in Coffea spp. under a Climate Changes Context of Interacting Elevated [CO2] and Temperature

PubMed Central

Martins, Madlles Q.; Fortunato, Ana S.; Rodrigues, Weverton P.; Partelli, Fábio L.; Campostrini, Eliemar; Lidon, Fernando C.; DaMatta, Fábio M.; Ramalho, José C.; Ribeiro-Barros, Ana I.

2017-01-01

World coffee production has faced increasing challenges associated with ongoing climatic changes. Several studies, which have been almost exclusively based on temperature increase, have predicted extensive reductions (higher than half by 2,050) of actual coffee cropped areas. However, recent studies showed that elevated [CO2] can strongly mitigate the negative impacts of heat stress at the physiological and biochemical levels in coffee leaves. In addition, it has also been shown that coffee genotypes can successfully cope with temperatures above what has been traditionally accepted. Altogether, this information suggests that the real impact of climate changes on coffee growth and production could be significantly lower than previously estimated. Gene expression studies are an important tool to unravel crop acclimation ability, demanding the use of adequate reference genes. We have examined the transcript stability of 10 candidate reference genes to normalize RT-qPCR expression studies using a set of 24 cDNAs from leaves of three coffee genotypes (CL153, Icatu, and IPR108), grown under 380 or 700 μL CO2 L−1, and submitted to increasing temperatures from 25/20°C (day/night) to 42/34°C. Samples were analyzed according to genotype, [CO2], temperature, multiple stress interaction ([CO2], temperature) and total stress interaction (genotype, [CO2], and temperature). The transcript stability of each gene was assessed through a multiple analytical approach combining the Coeficient of Variation method and three algorithms (geNorm, BestKeeper, NormFinder). The transcript stability varied according to the type of stress for most genes, but the consensus ranking obtained with RefFinder, classified MDH as the gene with the highest mRNA stability to a global use, followed by ACT and S15, whereas α-TUB and CYCL showed the least stable mRNA contents. Using the coffee expression profiles of the gene encoding the large-subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (RLS), results from the in silico aggregation and experimental validation of the best number of reference genes showed that two reference genes are adequate to normalize RT-qPCR data. Altogether, this work highlights the importance of an adequate selection of reference genes for each single or combined experimental condition and constitutes the basis to accurately study molecular responses of Coffea spp. in a context of climate changes and global warming. PMID:28326094
Selection and Validation of Reference Genes for Accurate RT-qPCR Data Normalization in Coffea spp. under a Climate Changes Context of Interacting Elevated [CO2] and Temperature.

PubMed

Martins, Madlles Q; Fortunato, Ana S; Rodrigues, Weverton P; Partelli, Fábio L; Campostrini, Eliemar; Lidon, Fernando C; DaMatta, Fábio M; Ramalho, José C; Ribeiro-Barros, Ana I

2017-01-01

World coffee production has faced increasing challenges associated with ongoing climatic changes. Several studies, which have been almost exclusively based on temperature increase, have predicted extensive reductions (higher than half by 2,050) of actual coffee cropped areas. However, recent studies showed that elevated [CO 2 ] can strongly mitigate the negative impacts of heat stress at the physiological and biochemical levels in coffee leaves. In addition, it has also been shown that coffee genotypes can successfully cope with temperatures above what has been traditionally accepted. Altogether, this information suggests that the real impact of climate changes on coffee growth and production could be significantly lower than previously estimated. Gene expression studies are an important tool to unravel crop acclimation ability, demanding the use of adequate reference genes. We have examined the transcript stability of 10 candidate reference genes to normalize RT-qPCR expression studies using a set of 24 cDNAs from leaves of three coffee genotypes (CL153, Icatu, and IPR108), grown under 380 or 700 μL CO 2 L -1 , and submitted to increasing temperatures from 25/20°C (day/night) to 42/34°C. Samples were analyzed according to genotype, [CO 2 ], temperature, multiple stress interaction ([CO 2 ], temperature) and total stress interaction (genotype, [CO 2 ], and temperature). The transcript stability of each gene was assessed through a multiple analytical approach combining the Coeficient of Variation method and three algorithms (geNorm, BestKeeper, NormFinder). The transcript stability varied according to the type of stress for most genes, but the consensus ranking obtained with RefFinder, classified MDH as the gene with the highest mRNA stability to a global use, followed by ACT and S15 , whereas α -TUB and CYCL showed the least stable mRNA contents. Using the coffee expression profiles of the gene encoding the large-subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase ( RLS ), results from the in silico aggregation and experimental validation of the best number of reference genes showed that two reference genes are adequate to normalize RT-qPCR data. Altogether, this work highlights the importance of an adequate selection of reference genes for each single or combined experimental condition and constitutes the basis to accurately study molecular responses of Coffea spp. in a context of climate changes and global warming.
Microarray-based cancer prediction using soft computing approach.

PubMed

Wang, Xiaosheng; Gotoh, Osamu

2009-05-26

One of the difficulties in using gene expression profiles to predict cancer is how to effectively select a few informative genes to construct accurate prediction models from thousands or ten thousands of genes. We screen highly discriminative genes and gene pairs to create simple prediction models involved in single genes or gene pairs on the basis of soft computing approach and rough set theory. Accurate cancerous prediction is obtained when we apply the simple prediction models for four cancerous gene expression datasets: CNS tumor, colon tumor, lung cancer and DLBCL. Some genes closely correlated with the pathogenesis of specific or general cancers are identified. In contrast with other models, our models are simple, effective and robust. Meanwhile, our models are interpretable for they are based on decision rules. Our results demonstrate that very simple models may perform well on cancerous molecular prediction and important gene markers of cancer can be detected if the gene selection approach is chosen reasonably.
Reference Genes for qPCR Analysis in Resin-Tapped Adult Slash Pine As a Tool to Address the Molecular Basis of Commercial Resinosis

PubMed Central

de Lima, Júlio C.; de Costa, Fernanda; Füller, Thanise N.; Rodrigues-Corrêa, Kelly C. da Silva; Kerber, Magnus R.; Lima, Mariano S.; Fett, Janette P.; Fett-Neto, Arthur G.

2016-01-01

Pine oleoresin is a major source of terpenes, consisting of turpentine (mono- and sesquiterpenes) and rosin (diterpenes) fractions. Higher oleoresin yields are of economic interest, since oleoresin derivatives make up a valuable source of materials for chemical industries. Oleoresin can be extracted from living trees, often by the bark streak method, in which bark removal is done periodically, followed by application of stimulant paste containing sulfuric acid and other chemicals on the freshly wounded exposed surface. To better understand the molecular basis of chemically-stimulated and wound induced oleoresin production, we evaluated the stability of 11 putative reference genes for the purpose of normalization in studying Pinus elliottii gene expression during oleoresinosis. Samples for RNA extraction were collected from field-grown adult trees under tapping operations using stimulant pastes with different compositions and at various time points after paste application. Statistical methods established by geNorm, NormFinder, and BestKeeper softwares were consistent in pointing as adequate reference genes HISTO3 and UBI. To confirm expression stability of the candidate reference genes, expression profiles of putative P. elliottii orthologs of resin biosynthesis-related genes encoding Pinus contorta β-pinene synthase [PcTPS-(−)β-pin1], P. contorta levopimaradiene/abietadiene synthase (PcLAS1), Pinus taeda α-pinene synthase [PtTPS-(+)αpin], and P. taeda α-farnesene synthase (PtαFS) were examined following stimulant paste application. Increased oleoresin yields observed in stimulated treatments using phytohormone-based pastes were consistent with higher expression of pinene synthases. Overall, the expression of all genes examined matched the expected profiles of oleoresin-related transcript changes reported for previously examined conifers. PMID:27379135
Reference Genes for qPCR Analysis in Resin-Tapped Adult Slash Pine As a Tool to Address the Molecular Basis of Commercial Resinosis.

PubMed

de Lima, Júlio C; de Costa, Fernanda; Füller, Thanise N; Rodrigues-Corrêa, Kelly C da Silva; Kerber, Magnus R; Lima, Mariano S; Fett, Janette P; Fett-Neto, Arthur G

2016-01-01

Pine oleoresin is a major source of terpenes, consisting of turpentine (mono- and sesquiterpenes) and rosin (diterpenes) fractions. Higher oleoresin yields are of economic interest, since oleoresin derivatives make up a valuable source of materials for chemical industries. Oleoresin can be extracted from living trees, often by the bark streak method, in which bark removal is done periodically, followed by application of stimulant paste containing sulfuric acid and other chemicals on the freshly wounded exposed surface. To better understand the molecular basis of chemically-stimulated and wound induced oleoresin production, we evaluated the stability of 11 putative reference genes for the purpose of normalization in studying Pinus elliottii gene expression during oleoresinosis. Samples for RNA extraction were collected from field-grown adult trees under tapping operations using stimulant pastes with different compositions and at various time points after paste application. Statistical methods established by geNorm, NormFinder, and BestKeeper softwares were consistent in pointing as adequate reference genes HISTO3 and UBI. To confirm expression stability of the candidate reference genes, expression profiles of putative P. elliottii orthologs of resin biosynthesis-related genes encoding Pinus contorta β-pinene synthase [PcTPS-(-)β-pin1], P. contorta levopimaradiene/abietadiene synthase (PcLAS1), Pinus taeda α-pinene synthase [PtTPS-(+)αpin], and P. taeda α-farnesene synthase (PtαFS) were examined following stimulant paste application. Increased oleoresin yields observed in stimulated treatments using phytohormone-based pastes were consistent with higher expression of pinene synthases. Overall, the expression of all genes examined matched the expected profiles of oleoresin-related transcript changes reported for previously examined conifers.
Genetic Bases of Stuttering: The State of the Art, 2011

PubMed Central

Kraft, Shelly Jo; Yairi, Ehud

2011-01-01

Objective The literature on the genetics of stuttering is reviewed with special reference to the historical development from psychosocial explanations leading up to current biological research of gene identification. Summary A gradual progression has been made from the early crude methods of counting percentages of stuttering probands who have relatives who stutter to recent studies using entire genomes of DNA collected from each participant. Despite the shortcomings of some early studies, investigators have accumulated a substantial body of data showing a large presence of familial stuttering. This encouraged more refined research in the form of twin studies. Concordance rates among twins were sufficiently high to lend additional support to the genetic perspective of stuttering. More sophisticated aggregation studies and segregation analyses followed, producing data that matched recognized genetic models, providing the final ‘go ahead’ to proceed from the behavior/statistical genetics into the sphere of biological genetics. Recent linkage and association studies have begun to reveal contributing genes to the disorder. Conclusion No definitive findings have been made regarding which transmission model, chromosomes, genes, or sex factors are involved in the expression of stuttering in the population at large. Future research and clinical implications are discussed. PMID:22067705
Evaluation of Reference Genes for RT-qPCR Studies in the Seagrass Zostera muelleri Exposed to Light Limitation

PubMed Central

Schliep, M.; Pernice, M.; Sinutok, S.; Bryant, C. V.; York, P. H.; Rasheed, M. A.; Ralph, P. J.

2015-01-01

Seagrass meadows are threatened by coastal development and global change. In the face of these pressures, molecular techniques such as reverse transcription quantitative real-time PCR (RT-qPCR) have great potential to improve management of these ecosystems by allowing early detection of chronic stress. In RT-qPCR, the expression levels of target genes are estimated on the basis of reference genes, in order to control for RNA variations. Although determination of suitable reference genes is critical for RT-qPCR studies, reports on the evaluation of reference genes are still absent for the major Australian species Zostera muelleri subsp. capricorni (Z. muelleri). Here, we used three different software (geNorm, NormFinder and Bestkeeper) to evaluate ten widely used reference genes according to their expression stability in Z. muelleri exposed to light limitation. We then combined results from different software and used a consensus rank of four best reference genes to validate regulation in Photosystem I reaction center subunit IV B and Heat Stress Transcription factor A- gene expression in Z. muelleri under light limitation. This study provides the first comprehensive list of reference genes in Z. muelleri and demonstrates RT-qPCR as an effective tool to identify early responses to light limitation in seagrass. PMID:26592440
Identification of Reference Genes in Human Myelomonocytic Cells for Gene Expression Studies in Altered Gravity

PubMed Central

Thiel, Cora S.; Hauschild, Swantje; Tauber, Svantje; Paulsen, Katrin; Raig, Christiane; Raem, Arnold; Biskup, Josefine; Gutewort, Annett; Hürlimann, Eva; Philpot, Claudia; Lier, Hartwin; Engelmann, Frank; Layer, Liliana E.

2015-01-01

Gene expression studies are indispensable for investigation and elucidation of molecular mechanisms. For the process of normalization, reference genes (“housekeeping genes”) are essential to verify gene expression analysis. Thus, it is assumed that these reference genes demonstrate similar expression levels over all experimental conditions. However, common recommendations about reference genes were established during 1 g conditions and therefore their applicability in studies with altered gravity has not been demonstrated yet. The microarray technology is frequently used to generate expression profiles under defined conditions and to determine the relative difference in expression levels between two or more different states. In our study, we searched for potential reference genes with stable expression during different gravitational conditions (microgravity, normogravity, and hypergravity) which are additionally not altered in different hardware systems. We were able to identify eight genes (ALB, B4GALT6, GAPDH, HMBS, YWHAZ, ABCA5, ABCA9, and ABCC1) which demonstrated no altered gene expression levels in all tested conditions and therefore represent good candidates for the standardization of gene expression studies in altered gravity. PMID:25654098
Evaluating the generalizability of GEP models for estimating reference evapotranspiration in distant humid and arid locations

NASA Astrophysics Data System (ADS)

Kiafar, Hamed; Babazadeh, Hosssien; Marti, Pau; Kisi, Ozgur; Landeras, Gorka; Karimi, Sepideh; Shiri, Jalal

2017-10-01

Evapotranspiration estimation is of crucial importance in arid and hyper-arid regions, which suffer from water shortage, increasing dryness and heat. A modeling study is reported here to cross-station assessment between hyper-arid and humid conditions. The derived equations estimate ET0 values based on temperature-, radiation-, and mass transfer-based configurations. Using data from two meteorological stations in a hyper-arid region of Iran and two meteorological stations in a humid region of Spain, different local and cross-station approaches are applied for developing and validating the derived equations. The comparison of the gene expression programming (GEP)-based-derived equations with corresponding empirical-semi empirical ET0 estimation equations reveals the superiority of new formulas in comparison with the corresponding empirical equations. Therefore, the derived models can be successfully applied in these hyper-arid and humid regions as well as similar climatic contexts especially in data-lack situations. The results also show that when relying on proper input configurations, cross-station might be a promising alternative for locally trained models for the stations with data scarcity.
Estimation of daily reference evapotranspiration (ETo) using artificial intelligence methods: Offering a new approach for lagged ETo data-based modeling

NASA Astrophysics Data System (ADS)

Mehdizadeh, Saeid

2018-04-01

Evapotranspiration (ET) is considered as a key factor in hydrological and climatological studies, agricultural water management, irrigation scheduling, etc. It can be directly measured using lysimeters. Moreover, other methods such as empirical equations and artificial intelligence methods can be used to model ET. In the recent years, artificial intelligence methods have been widely utilized to estimate reference evapotranspiration (ETo). In the present study, local and external performances of multivariate adaptive regression splines (MARS) and gene expression programming (GEP) were assessed for estimating daily ETo. For this aim, daily weather data of six stations with different climates in Iran, namely Urmia and Tabriz (semi-arid), Isfahan and Shiraz (arid), Yazd and Zahedan (hyper-arid) were employed during 2000-2014. Two types of input patterns consisting of weather data-based and lagged ETo data-based scenarios were considered to develop the models. Four statistical indicators including root mean square error (RMSE), mean absolute error (MAE), coefficient of determination (R2), and mean absolute percentage error (MAPE) were used to check the accuracy of models. The local performance of models revealed that the MARS and GEP approaches have the capability to estimate daily ETo using the meteorological parameters and the lagged ETo data as inputs. Nevertheless, the MARS had the best performance in the weather data-based scenarios. On the other hand, considerable differences were not observed in the models' accuracy for the lagged ETo data-based scenarios. In the innovation of this study, novel hybrid models were proposed in the lagged ETo data-based scenarios through combination of MARS and GEP models with autoregressive conditional heteroscedasticity (ARCH) time series model. It was concluded that the proposed novel models named MARS-ARCH and GEP-ARCH improved the performance of ETo modeling compared to the single MARS and GEP. In addition, the external analysis of the performance of models at stations with similar climatic conditions denoted the applicability of nearby station' data for estimation of the daily ETo at target station.
Identification of reference genes for RT-qPCR in ovine mammary tissue during late pregnancy and lactation and in response to maternal nutritional programming.

PubMed

Paten, A M; Pain, S J; Peterson, S W; Blair, H T; Kenyon, P R; Dearden, P K; Duncan, E J

2014-08-01

The mammary gland is a complex tissue consisting of multiple cell types which, over the lifetime of an animal, go through repeated cycles of development associated with pregnancy, lactation and involution. The mammary gland is also known to be sensitive to maternal programming by environmental stimuli such as nutrition. The molecular basis of these adaptations is of significant interest, but requires robust methods to measure gene expression. Reverse-transcription quantitative PCR (RT-qPCR) is commonly used to measure gene expression, and is currently the method of choice for validating genome-wide expression studies. RT-qPCR requires the selection of reference genes that are stably expressed over physiological states and treatments. In this study we identify suitable reference genes to normalize RT-qPCR data for the ovine mammary gland in two physiological states; late pregnancy and lactation. Biopsies were collected from offspring of ewes that had been subjected to different nutritional paradigms during pregnancy to examine effects of maternal programming on the mammary gland of the offspring. We evaluated eight candidate reference genes and found that two reference genes (PRPF3 and CUL1) are required for normalising RT-qPCR data from pooled RNA samples, but five reference genes are required for analyzing gene expression in individual animals (SENP2, EIF6, MRPL39, ATP1A1, CUL1). Using these stable reference genes, we showed that TET1, a key regulator of DNA methylation, is responsive to maternal programming and physiological state. The identification of these novel reference genes will be of utility to future studies of gene expression in the ovine mammary gland. Copyright © 2014 the American Physiological Society.
Identification of a novel reference gene for apple transcriptional profiling under postharvest conditions.

PubMed

Storch, Tatiane Timm; Pegoraro, Camila; Finatto, Taciane; Quecini, Vera; Rombaldi, Cesar Valmor; Girardi, César Luis

2015-01-01

Reverse Transcription quantitative PCR (RT-qPCR) is one of the most important techniques for gene expression profiling due to its high sensibility and reproducibility. However, the reliability of the results is highly dependent on data normalization, performed by comparisons between the expression profiles of the genes of interest against those of constitutively expressed, reference genes. Although the technique is widely used in fruit postharvest experiments, the transcription stability of reference genes has not been thoroughly investigated under these experimental conditions. Thus, we have determined the transcriptional profile, under these conditions, of three genes commonly used as reference--ACTIN (MdACT), PROTEIN DISULPHIDE ISOMERASE (MdPDI) and UBIQUITIN-CONJUGATING ENZYME E2 (MdUBC)--along with two novel candidates--HISTONE 1 (MdH1) and NUCLEOSSOME ASSEMBLY 1 PROTEIN (MdNAP1). The expression profile of the genes was investigated throughout five experiments, with three of them encompassing the postharvest period and the other two, consisting of developmental and spatial phases. The transcriptional stability was comparatively investigated using four distinct software packages: BestKeeper, NormFinder, geNorm and DataAssist. Gene ranking results for transcriptional stability were similar for the investigated software packages, with the exception of BestKeeper. The classic reference gene MdUBC ranked among the most stably transcribed in all investigated experimental conditions. Transcript accumulation profiles for the novel reference candidate gene MdH1 were stable throughout the tested conditions, especially in experiments encompassing the postharvest period. Thus, our results present a novel reference gene for postharvest experiments in apple and reinforce the importance of checking the transcription profile of reference genes under the experimental conditions of interest.
Selection of Reliable Reference Genes for Gene Expression Studies on Rhododendron molle G. Don.

PubMed

Xiao, Zheng; Sun, Xiaobo; Liu, Xiaoqing; Li, Chang; He, Lisi; Chen, Shangping; Su, Jiale

2016-01-01

The quantitative real-time polymerase chain reaction (qRT-PCR) approach has become a widely used method to analyze expression patterns of target genes. The selection of an optimal reference gene is a prerequisite for the accurate normalization of gene expression in qRT-PCR. The present study constitutes the first systematic evaluation of potential reference genes in Rhododendron molle G. Don. Eleven candidate reference genes in different tissues and flowers at different developmental stages of R. molle were assessed using the following three software packages: GeNorm, NormFinder, and BestKeeper. The results showed that EF1- α (elongation factor 1-alpha), 18S (18s ribosomal RNA), and RPL3 (ribosomal protein L3) were the most stable reference genes in developing rhododendron flowers and, thus, in all of the tested samples, while tublin ( TUB ) was the least stable. ACT5 (actin), RPL3 , 18S , and EF1- α were found to be the top four choices for different tissues, whereas TUB was not found to favor qRT-PCR normalization in these tissues. Three stable reference genes are recommended for the normalization of qRT-PCR data in R. molle . Furthermore, the expression profiles of RmPSY (phytoene synthase) and RmPDS (phytoene dehydrogenase) were assessed using EF1- α, 18S , ACT5 , RPL3 , and their combination as internals. Similar trends were found, but these trends varied when the least stable reference gene TUB was used. The results further prove that it is necessary to validate the stability of reference genes prior to their use for normalization under different experimental conditions. This study provides useful information for reliable qRT-PCR data normalization in gene studies of R. molle .
Validation of Reference Genes for RT-qPCR Studies of Gene Expression in Preharvest and Postharvest Longan Fruits under Different Experimental Conditions

PubMed Central

Wu, Jianyang; Zhang, Hongna; Liu, Liqin; Li, Weicai; Wei, Yongzan; Shi, Shengyou

2016-01-01

Reverse transcription quantitative PCR (RT-qPCR) as the accurate and sensitive method is use for gene expression analysis, but the veracity and reliability result depends on whether select appropriate reference gene or not. To date, several reliable reference gene validations have been reported in fruits trees, but none have been done on preharvest and postharvest longan fruits. In this study, 12 candidate reference genes, namely, CYP, RPL, GAPDH, TUA, TUB, Fe-SOD, Mn-SOD, Cu/Zn-SOD, 18SrRNA, Actin, Histone H3, and EF-1a, were selected. Expression stability of these genes in 150 longan samples was evaluated and analyzed using geNorm and NormFinder algorithms. Preharvest samples consisted of seven experimental sets, including different developmental stages, organs, hormone stimuli (NAA, 2,4-D, and ethephon) and abiotic stresses (bagging and girdling with defoliation). Postharvest samples consisted of different temperature treatments (4 and 22°C) and varieties. Our findings indicate that appropriate reference gene(s) should be picked for each experimental condition. Our data further showed that the commonly used reference gene Actin does not exhibit stable expression across experimental conditions in longan. Expression levels of the DlACO gene, which is a key gene involved in regulating fruit abscission under girdling with defoliation treatment, was evaluated to validate our findings. In conclusion, our data provide a useful framework for choice of suitable reference genes across different experimental conditions for RT-qPCR analysis of preharvest and postharvest longan fruits. PMID:27375640
Reference genes for normalization of gene expression studies in human osteoarthritic articular cartilage.

PubMed

Pombo-Suarez, Manuel; Calaza, Manuel; Gomez-Reino, Juan J; Gonzalez, Antonio

2008-01-29

Assessment of gene expression is an important component of osteoarthritis (OA) research, greatly improved by the development of quantitative real-time PCR (qPCR). This technique requires normalization for precise results, yet no suitable reference genes have been identified in human articular cartilage. We have examined ten well-known reference genes to determine the most adequate for this application. Analyses of expression stability in cartilage from 10 patients with hip OA, 8 patients with knee OA and 10 controls without OA were done with classical statistical tests and the software programs geNorm and NormFinder. Results from the three methods of analysis were broadly concordant. Some of the commonly used reference genes, GAPDH, ACTB and 18S RNA, performed poorly in our analysis. In contrast, the rarely used TBP, RPL13A and B2M genes were the best. It was necessary to use together several of these three genes to obtain the best results. The specific combination depended, to some extent, on the type of samples being compared. Our results provide a satisfactory set of previously unused reference genes for qPCR in hip and knee OA This confirms the need to evaluate the suitability of reference genes in every tissue and experimental situation before starting the quantitative assessment of gene expression by qPCR.
Measurement of Gene Expression in Archival Paraffin-Embedded Tissues

PubMed Central

Cronin, Maureen; Pho, Mylan; Dutta, Debjani; Stephans, James C.; Shak, Steven; Kiefer, Michael C.; Esteban, Jose M.; Baker, Joffre B.

2004-01-01

Throughout the last decade many laboratories have shown that mRNA levels in formalin-fixed and paraffin-embedded (FPE) tissue specimens can be quantified by reverse transcriptase-polymerase chain reaction (RT-PCR) techniques despite the extensive RNA fragmentation that occurs in tissues so preserved. We have developed RT-PCR methods that are sensitive, precise, and that have multianalyte capability for potential wide use in clinical research and diagnostic assays. Here it is shown that the extent of fragmentation of extracted FPE tissue RNA significantly increases with archive storage time. Probe and primer sets for RT-PCR assays based on amplicons that are both short and homogeneous in length enable effective reference gene-based data normalization for cross comparison of specimens that differ substantially in age. A 48-gene assay used to compare gene expression profiles from the same breast cancer tissue that had been either frozen or FPE showed very similar profiles after reference gene-based normalization. A 92-gene assay, using RNA extracted from three 10-μm FPE sections of archival breast cancer specimens (dating from 1985 to 2001) yielded analyzable data for these genes in all 62 tested specimens. The results were substantially concordant when estrogen receptor, progesterone receptor, and HER2 receptor status determined by RT-PCR was compared with immunohistochemistry assays for these receptors. Furthermore, the results highlight the advantages of RT-PCR over immunohistochemistry with respect to quantitation and dynamic range. These findings support the development of RT-PCR analysis of FPE tissue RNA as a platform for multianalyte clinical diagnostic tests. PMID:14695316

Technical note: Equivalent genomic models with a residual polygenic effect.

PubMed

Liu, Z; Goddard, M E; Hayes, B J; Reinhardt, F; Reents, R

2016-03-01

Routine genomic evaluations in animal breeding are usually based on either a BLUP with genomic relationship matrix (GBLUP) or single nucleotide polymorphism (SNP) BLUP model. For a multi-step genomic evaluation, these 2 alternative genomic models were proven to give equivalent predictions for genomic reference animals. The model equivalence was verified also for young genotyped animals without phenotypes. Due to incomplete linkage disequilibrium of SNP markers to genes or causal mutations responsible for genetic inheritance of quantitative traits, SNP markers cannot explain all the genetic variance. A residual polygenic effect is normally fitted in the genomic model to account for the incomplete linkage disequilibrium. In this study, we start by showing the proof that the multi-step GBLUP and SNP BLUP models are equivalent for the reference animals, when they have a residual polygenic effect included. Second, the equivalence of both multi-step genomic models with a residual polygenic effect was also verified for young genotyped animals without phenotypes. Additionally, we derived formulas to convert genomic estimated breeding values of the GBLUP model to its components, direct genomic values and residual polygenic effect. Third, we made a proof that the equivalence of these 2 genomic models with a residual polygenic effect holds also for single-step genomic evaluation. Both the single-step GBLUP and SNP BLUP models lead to equal prediction for genotyped animals with phenotypes (e.g., reference animals), as well as for (young) genotyped animals without phenotypes. Finally, these 2 single-step genomic models with a residual polygenic effect were proven to be equivalent for estimation of SNP effects, too. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Development and Validation of a Computational Model for Androgen Receptor Activity

PubMed Central

2016-01-01

Testing thousands of chemicals to identify potential androgen receptor (AR) agonists or antagonists would cost millions of dollars and take decades to complete using current validated methods. High-throughput in vitro screening (HTS) and computational toxicology approaches can more rapidly and inexpensively identify potential androgen-active chemicals. We integrated 11 HTS ToxCast/Tox21 in vitro assays into a computational network model to distinguish true AR pathway activity from technology-specific assay interference. The in vitro HTS assays probed perturbations of the AR pathway at multiple points (receptor binding, coregulator recruitment, gene transcription, and protein production) and multiple cell types. Confirmatory in vitro antagonist assay data and cytotoxicity information were used as additional flags for potential nonspecific activity. Validating such alternative testing strategies requires high-quality reference data. We compiled 158 putative androgen-active and -inactive chemicals from a combination of international test method validation efforts and semiautomated systematic literature reviews. Detailed in vitro assay information and results were compiled into a single database using a standardized ontology. Reference chemical concentrations that activated or inhibited AR pathway activity were identified to establish a range of potencies with reproducible reference chemical results. Comparison with existing Tier 1 AR binding data from the U.S. EPA Endocrine Disruptor Screening Program revealed that the model identified binders at relevant test concentrations (<100 μM) and was more sensitive to antagonist activity. The AR pathway model based on the ToxCast/Tox21 assays had balanced accuracies of 95.2% for agonist (n = 29) and 97.5% for antagonist (n = 28) reference chemicals. Out of 1855 chemicals screened in the AR pathway model, 220 chemicals demonstrated AR agonist or antagonist activity and an additional 174 chemicals were predicted to have potential weak AR pathway activity. PMID:27933809
Optimal consistency in microRNA expression analysis using reference-gene-based normalization.

PubMed

Wang, Xi; Gardiner, Erin J; Cairns, Murray J

2015-05-01

Normalization of high-throughput molecular expression profiles secures differential expression analysis between samples of different phenotypes or biological conditions, and facilitates comparison between experimental batches. While the same general principles apply to microRNA (miRNA) normalization, there is mounting evidence that global shifts in their expression patterns occur in specific circumstances, which pose a challenge for normalizing miRNA expression data. As an alternative to global normalization, which has the propensity to flatten large trends, normalization against constitutively expressed reference genes presents an advantage through their relative independence. Here we investigated the performance of reference-gene-based (RGB) normalization for differential miRNA expression analysis of microarray expression data, and compared the results with other normalization methods, including: quantile, variance stabilization, robust spline, simple scaling, rank invariant, and Loess regression. The comparative analyses were executed using miRNA expression in tissue samples derived from subjects with schizophrenia and non-psychiatric controls. We proposed a consistency criterion for evaluating methods by examining the overlapping of differentially expressed miRNAs detected using different partitions of the whole data. Based on this criterion, we found that RGB normalization generally outperformed global normalization methods. Thus we recommend the application of RGB normalization for miRNA expression data sets, and believe that this will yield a more consistent and useful readout of differentially expressed miRNAs, particularly in biological conditions characterized by large shifts in miRNA expression.
1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function

PubMed Central

Gorski, Mathias; van der Most, Peter J.; Teumer, Alexander; Chu, Audrey Y.; Li, Man; Mijatovic, Vladan; Nolte, Ilja M.; Cocca, Massimiliano; Taliun, Daniel; Gomez, Felicia; Li, Yong; Tayo, Bamidele; Tin, Adrienne; Feitosa, Mary F.; Aspelund, Thor; Attia, John; Biffar, Reiner; Bochud, Murielle; Boerwinkle, Eric; Borecki, Ingrid; Bottinger, Erwin P.; Chen, Ming-Huei; Chouraki, Vincent; Ciullo, Marina; Coresh, Josef; Cornelis, Marilyn C.; Curhan, Gary C.; d’Adamo, Adamo Pio; Dehghan, Abbas; Dengler, Laura; Ding, Jingzhong; Eiriksdottir, Gudny; Endlich, Karlhans; Enroth, Stefan; Esko, Tõnu; Franco, Oscar H.; Gasparini, Paolo; Gieger, Christian; Girotto, Giorgia; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Hancock, Stephen J.; Harris, Tamara B.; Helmer, Catherine; Höllerer, Simon; Hofer, Edith; Hofman, Albert; Holliday, Elizabeth G.; Homuth, Georg; Hu, Frank B.; Huth, Cornelia; Hutri-Kähönen, Nina; Hwang, Shih-Jen; Imboden, Medea; Johansson, Åsa; Kähönen, Mika; König, Wolfgang; Kramer, Holly; Krämer, Bernhard K.; Kumar, Ashish; Kutalik, Zoltan; Lambert, Jean-Charles; Launer, Lenore J.; Lehtimäki, Terho; de Borst, Martin; Navis, Gerjan; Swertz, Morris; Liu, Yongmei; Lohman, Kurt; Loos, Ruth J. F.; Lu, Yingchang; Lyytikäinen, Leo-Pekka; McEvoy, Mark A.; Meisinger, Christa; Meitinger, Thomas; Metspalu, Andres; Metzger, Marie; Mihailov, Evelin; Mitchell, Paul; Nauck, Matthias; Oldehinkel, Albertine J.; Olden, Matthias; WJH Penninx, Brenda; Pistis, Giorgio; Pramstaller, Peter P.; Probst-Hensch, Nicole; Raitakari, Olli T.; Rettig, Rainer; Ridker, Paul M.; Rivadeneira, Fernando; Robino, Antonietta; Rosas, Sylvia E.; Ruderfer, Douglas; Ruggiero, Daniela; Saba, Yasaman; Sala, Cinzia; Schmidt, Helena; Schmidt, Reinhold; Scott, Rodney J.; Sedaghat, Sanaz; Smith, Albert V.; Sorice, Rossella; Stengel, Benedicte; Stracke, Sylvia; Strauch, Konstantin; Toniolo, Daniela; Uitterlinden, Andre G.; Ulivi, Sheila; Viikari, Jorma S.; Völker, Uwe; Vollenweider, Peter; Völzke, Henry; Vuckovic, Dragana; Waldenberger, Melanie; Jin Wang, Jie; Yang, Qiong; Chasman, Daniel I.; Tromp, Gerard; Snieder, Harold; Heid, Iris M.; Fox, Caroline S.; Köttgen, Anna; Pattaro, Cristian; Böger, Carsten A.; Fuchsberger, Christian

2017-01-01

HapMap imputed genome-wide association studies (GWAS) have revealed >50 loci at which common variants with minor allele frequency >5% are associated with kidney function. GWAS using more complete reference sets for imputation, such as those from The 1000 Genomes project, promise to identify novel loci that have been missed by previous efforts. To investigate the value of such a more complete variant catalog, we conducted a GWAS meta-analysis of kidney function based on the estimated glomerular filtration rate (eGFR) in 110,517 European ancestry participants using 1000 Genomes imputed data. We identified 10 novel loci with p-value < 5 × 10−8 previously missed by HapMap-based GWAS. Six of these loci (HOXD8, ARL15, PIK3R1, EYA4, ASTN2, and EPB41L3) are tagged by common SNPs unique to the 1000 Genomes reference panel. Using pathway analysis, we identified 39 significant (FDR < 0.05) genes and 127 significantly (FDR < 0.05) enriched gene sets, which were missed by our previous analyses. Among those, the 10 identified novel genes are part of pathways of kidney development, carbohydrate metabolism, cardiac septum development and glucose metabolism. These results highlight the utility of re-imputing from denser reference panels, until whole-genome sequencing becomes feasible in large samples. PMID:28452372
1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function.

PubMed

Gorski, Mathias; van der Most, Peter J; Teumer, Alexander; Chu, Audrey Y; Li, Man; Mijatovic, Vladan; Nolte, Ilja M; Cocca, Massimiliano; Taliun, Daniel; Gomez, Felicia; Li, Yong; Tayo, Bamidele; Tin, Adrienne; Feitosa, Mary F; Aspelund, Thor; Attia, John; Biffar, Reiner; Bochud, Murielle; Boerwinkle, Eric; Borecki, Ingrid; Bottinger, Erwin P; Chen, Ming-Huei; Chouraki, Vincent; Ciullo, Marina; Coresh, Josef; Cornelis, Marilyn C; Curhan, Gary C; d'Adamo, Adamo Pio; Dehghan, Abbas; Dengler, Laura; Ding, Jingzhong; Eiriksdottir, Gudny; Endlich, Karlhans; Enroth, Stefan; Esko, Tõnu; Franco, Oscar H; Gasparini, Paolo; Gieger, Christian; Girotto, Giorgia; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Hancock, Stephen J; Harris, Tamara B; Helmer, Catherine; Höllerer, Simon; Hofer, Edith; Hofman, Albert; Holliday, Elizabeth G; Homuth, Georg; Hu, Frank B; Huth, Cornelia; Hutri-Kähönen, Nina; Hwang, Shih-Jen; Imboden, Medea; Johansson, Åsa; Kähönen, Mika; König, Wolfgang; Kramer, Holly; Krämer, Bernhard K; Kumar, Ashish; Kutalik, Zoltan; Lambert, Jean-Charles; Launer, Lenore J; Lehtimäki, Terho; de Borst, Martin; Navis, Gerjan; Swertz, Morris; Liu, Yongmei; Lohman, Kurt; Loos, Ruth J F; Lu, Yingchang; Lyytikäinen, Leo-Pekka; McEvoy, Mark A; Meisinger, Christa; Meitinger, Thomas; Metspalu, Andres; Metzger, Marie; Mihailov, Evelin; Mitchell, Paul; Nauck, Matthias; Oldehinkel, Albertine J; Olden, Matthias; Wjh Penninx, Brenda; Pistis, Giorgio; Pramstaller, Peter P; Probst-Hensch, Nicole; Raitakari, Olli T; Rettig, Rainer; Ridker, Paul M; Rivadeneira, Fernando; Robino, Antonietta; Rosas, Sylvia E; Ruderfer, Douglas; Ruggiero, Daniela; Saba, Yasaman; Sala, Cinzia; Schmidt, Helena; Schmidt, Reinhold; Scott, Rodney J; Sedaghat, Sanaz; Smith, Albert V; Sorice, Rossella; Stengel, Benedicte; Stracke, Sylvia; Strauch, Konstantin; Toniolo, Daniela; Uitterlinden, Andre G; Ulivi, Sheila; Viikari, Jorma S; Völker, Uwe; Vollenweider, Peter; Völzke, Henry; Vuckovic, Dragana; Waldenberger, Melanie; Jin Wang, Jie; Yang, Qiong; Chasman, Daniel I; Tromp, Gerard; Snieder, Harold; Heid, Iris M; Fox, Caroline S; Köttgen, Anna; Pattaro, Cristian; Böger, Carsten A; Fuchsberger, Christian

2017-04-28

HapMap imputed genome-wide association studies (GWAS) have revealed >50 loci at which common variants with minor allele frequency >5% are associated with kidney function. GWAS using more complete reference sets for imputation, such as those from The 1000 Genomes project, promise to identify novel loci that have been missed by previous efforts. To investigate the value of such a more complete variant catalog, we conducted a GWAS meta-analysis of kidney function based on the estimated glomerular filtration rate (eGFR) in 110,517 European ancestry participants using 1000 Genomes imputed data. We identified 10 novel loci with p-value < 5 × 10 -8 previously missed by HapMap-based GWAS. Six of these loci (HOXD8, ARL15, PIK3R1, EYA4, ASTN2, and EPB41L3) are tagged by common SNPs unique to the 1000 Genomes reference panel. Using pathway analysis, we identified 39 significant (FDR < 0.05) genes and 127 significantly (FDR < 0.05) enriched gene sets, which were missed by our previous analyses. Among those, the 10 identified novel genes are part of pathways of kidney development, carbohydrate metabolism, cardiac septum development and glucose metabolism. These results highlight the utility of re-imputing from denser reference panels, until whole-genome sequencing becomes feasible in large samples.
Research on Multi - Person Parallel Modeling Method Based on Integrated Model Persistent Storage

NASA Astrophysics Data System (ADS)

Qu, MingCheng; Wu, XiangHu; Tao, YongChao; Liu, Ying

2018-03-01

This paper mainly studies the multi-person parallel modeling method based on the integrated model persistence storage. The integrated model refers to a set of MDDT modeling graphics system, which can carry out multi-angle, multi-level and multi-stage description of aerospace general embedded software. Persistent storage refers to converting the data model in memory into a storage model and converting the storage model into a data model in memory, where the data model refers to the object model and the storage model is a binary stream. And multi-person parallel modeling refers to the need for multi-person collaboration, the role of separation, and even real-time remote synchronization modeling.
Transcriptome Assembly, Gene Annotation and Tissue Gene Expression Atlas of the Rainbow Trout

PubMed Central

Salem, Mohamed; Paneru, Bam; Al-Tobasei, Rafet; Abdouni, Fatima; Thorgaard, Gary H.; Rexroad, Caird E.; Yao, Jianbo

2015-01-01

Efforts to obtain a comprehensive genome sequence for rainbow trout are ongoing and will be complemented by transcriptome information that will enhance genome assembly and annotation. Previously, transcriptome reference sequences were reported using data from different sources. Although the previous work added a great wealth of sequences, a complete and well-annotated transcriptome is still needed. In addition, gene expression in different tissues was not completely addressed in the previous studies. In this study, non-normalized cDNA libraries were sequenced from 13 different tissues of a single doubled haploid rainbow trout from the same source used for the rainbow trout genome sequence. A total of ~1.167 billion paired-end reads were de novo assembled using the Trinity RNA-Seq assembler yielding 474,524 contigs > 500 base-pairs. Of them, 287,593 had homologies to the NCBI non-redundant protein database. The longest contig of each cluster was selected as a reference, yielding 44,990 representative contigs. A total of 4,146 contigs (9.2%), including 710 full-length sequences, did not match any mRNA sequences in the current rainbow trout genome reference. Mapping reads to the reference genome identified an additional 11,843 transcripts not annotated in the genome. A digital gene expression atlas revealed 7,678 housekeeping and 4,021 tissue-specific genes. Expression of about 16,000–32,000 genes (35–71% of the identified genes) accounted for basic and specialized functions of each tissue. White muscle and stomach had the least complex transcriptomes, with high percentages of their total mRNA contributed by a small number of genes. Brain, testis and intestine, in contrast, had complex transcriptomes, with a large numbers of genes involved in their expression patterns. This study provides comprehensive de novo transcriptome information that is suitable for functional and comparative genomics studies in rainbow trout, including annotation of the genome. PMID:25793877
Dynamic Network-Based Epistasis Analysis: Boolean Examples

PubMed Central

Azpeitia, Eugenio; Benítez, Mariana; Padilla-Longoria, Pablo; Espinosa-Soto, Carlos; Alvarez-Buylla, Elena R.

2011-01-01

In this article we focus on how the hierarchical and single-path assumptions of epistasis analysis can bias the inference of gene regulatory networks. Here we emphasize the critical importance of dynamic analyses, and specifically illustrate the use of Boolean network models. Epistasis in a broad sense refers to gene interactions, however, as originally proposed by Bateson, epistasis is defined as the blocking of a particular allelic effect due to the effect of another allele at a different locus (herein, classical epistasis). Classical epistasis analysis has proven powerful and useful, allowing researchers to infer and assign directionality to gene interactions. As larger data sets are becoming available, the analysis of classical epistasis is being complemented with computer science tools and system biology approaches. We show that when the hierarchical and single-path assumptions are not met in classical epistasis analysis, the access to relevant information and the correct inference of gene interaction topologies is hindered, and it becomes necessary to consider the temporal dynamics of gene interactions. The use of dynamical networks can overcome these limitations. We particularly focus on the use of Boolean networks that, like classical epistasis analysis, relies on logical formalisms, and hence can complement classical epistasis analysis and relax its assumptions. We develop a couple of theoretical examples and analyze them from a dynamic Boolean network model perspective. Boolean networks could help to guide additional experiments and discern among alternative regulatory schemes that would be impossible or difficult to infer without the elimination of these assumption from the classical epistasis analysis. We also use examples from the literature to show how a Boolean network-based approach has resolved ambiguities and guided epistasis analysis. Our article complements previous accounts, not only by focusing on the implications of the hierarchical and single-path assumption, but also by demonstrating the importance of considering temporal dynamics, and specifically introducing the usefulness of Boolean network models and also reviewing some key properties of network approaches. PMID:22645556
Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.).

PubMed

Huis, Rudy; Hawkins, Simon; Neutelings, Godfrey

2010-04-19

Quantitative real-time PCR (qRT-PCR) is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs). Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L). Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs) and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH) as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups.qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59). LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both geNorm-designated- and NormFinder-designated-reference genes. The use of 2 different statistical algorithms results in the identification of different combinations of flax HKGs for expression data normalization. Despite such differences, the use of geNorm-designated- and NormFinder-designated-reference genes enabled us to accurately compare the expression levels of a flax MYB gene in different organs and tissues. Our identification and validation of suitable flax HKGs will facilitate future developmental transcriptomic studies in this economically-important plant.
CORUM: the comprehensive resource of mammalian protein complexes

PubMed Central

Ruepp, Andreas; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Stransky, Michael; Waegele, Brigitte; Schmidt, Thorsten; Doudieu, Octave Noubibou; Stümpflen, Volker; Mewes, H. Werner

2008-01-01

Protein complexes are key molecular entities that integrate multiple gene products to perform cellular functions. The CORUM (http://mips.gsf.de/genre/proj/corum/index.html) database is a collection of experimentally verified mammalian protein complexes. Information is manually derived by critical reading of the scientific literature from expert annotators. Information about protein complexes includes protein complex names, subunits, literature references as well as the function of the complexes. For functional annotation, we use the FunCat catalogue that enables to organize the protein complex space into biologically meaningful subsets. The database contains more than 1750 protein complexes that are built from 2400 different genes, thus representing 12% of the protein-coding genes in human. A web-based system is available to query, view and download the data. CORUM provides a comprehensive dataset of protein complexes for discoveries in systems biology, analyses of protein networks and protein complex-associated diseases. Comparable to the MIPS reference dataset of protein complexes from yeast, CORUM intends to serve as a reference for mammalian protein complexes. PMID:17965090
Measurement of gene expression in archival paraffin-embedded tissues: development and performance of a 92-gene reverse transcriptase-polymerase chain reaction assay.

PubMed

Cronin, Maureen; Pho, Mylan; Dutta, Debjani; Stephans, James C; Shak, Steven; Kiefer, Michael C; Esteban, Jose M; Baker, Joffre B

2004-01-01

Throughout the last decade many laboratories have shown that mRNA levels in formalin-fixed and paraffin-embedded (FPE) tissue specimens can be quantified by reverse transcriptase-polymerase chain reaction (RT-PCR) techniques despite the extensive RNA fragmentation that occurs in tissues so preserved. We have developed RT-PCR methods that are sensitive, precise, and that have multianalyte capability for potential wide use in clinical research and diagnostic assays. Here it is shown that the extent of fragmentation of extracted FPE tissue RNA significantly increases with archive storage time. Probe and primer sets for RT-PCR assays based on amplicons that are both short and homogeneous in length enable effective reference gene-based data normalization for cross comparison of specimens that differ substantially in age. A 48-gene assay used to compare gene expression profiles from the same breast cancer tissue that had been either frozen or FPE showed very similar profiles after reference gene-based normalization. A 92-gene assay, using RNA extracted from three 10- micro m FPE sections of archival breast cancer specimens (dating from 1985 to 2001) yielded analyzable data for these genes in all 62 tested specimens. The results were substantially concordant when estrogen receptor, progesterone receptor, and HER2 receptor status determined by RT-PCR was compared with immunohistochemistry assays for these receptors. Furthermore, the results highlight the advantages of RT-PCR over immunohistochemistry with respect to quantitation and dynamic range. These findings support the development of RT-PCR analysis of FPE tissue RNA as a platform for multianalyte clinical diagnostic tests.
Assessment of reference genes for reliable analysis of gene transcription by RT-qPCR in ovine leukocytes.

PubMed

Mahakapuge, T A N; Scheerlinck, J-P Y; Rojas, C A Alvarez; Every, A L; Hagen, J

2016-03-01

With the availability of genetic sequencing data, quantitative reverse transcription PCR (RT-qPCR) is increasingly being used for the quantification of gene transcription across species. Too often there is little regard to the selection of reference genes and the impact that a poor choice has on data interpretation. Indeed, RT-qPCR provides a snapshot of relative gene transcription at a given time-point, and hence is highly dependent on the stability of the transcription of the reference gene(s). Using ovine efferent lymph cells and peripheral blood mono-nuclear cells (PBMCs), the two most frequently used leukocytes in immunological studies, we have compared the stability of transcription of the most commonly used ovine reference genes: YWHAZ, RPL-13A, PGK1, B2M, GAPDH, HPRT, SDHA and ACTB. Using established algorithms for reference gene normalization "geNorm" and "Norm Finder", PGK1, GAPDH and YWHAZ were deemed the most stably transcribed genes for efferent leukocytes and PGK1, YWHAZ and SDHA were optimal in PBMCs. These genes should therefore be considered for accurate and reproducible RT-qPCR data analysis of gene transcription in sheep. Copyright © 2016. Published by Elsevier B.V.
Identification and evaluation of reference genes for qRT-PCR normalization in Ganoderma lucidum.

PubMed

Xu, Jiang; Xu, ZhiChao; Zhu, YingJie; Luo, HongMei; Qian, Jun; Ji, AiJia; Hu, YuanLei; Sun, Wei; Wang, Bo; Song, JingYuan; Sun, Chao; Chen, ShiLin

2014-01-01

Quantitative real-time reverse transcription PCR (qRT-PCR) is a rapid, sensitive, and reliable technique for gene expression studies. The accuracy and reliability of qRT-PCR results depend on the stability of the reference genes used for gene normalization. Therefore, a systematic process of reference gene evaluation is needed. Ganoderma lucidum is a famous medicinal mushroom in East Asia. In the current study, 10 potential reference genes were selected from the G. lucidum genomic data. The sequences of these genes were manually curated, and primers were designed following strict criteria. The experiment was conducted using qRT-PCR, and the stability of each candidate gene was assessed using four commonly used statistical programs-geNorm, NormFinder, BestKeeper, and RefFinder. According to our results, PP2A was expressed at the most stable levels under different fermentation conditions, and RPL4 was the most stably expressed gene in different tissues. RPL4, PP2A, and β-tubulin are the most commonly recommended reference genes for normalizing gene expression in the entire sample set. The current study provides a foundation for the further use of qRT-PCR in G. lucidum gene analysis.
Characterization of Proteoforms with Unknown Post-translational Modifications Using the MIScore

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kou, Qiang; Zhu, Binhai; Wu, Si

Various proteoforms may be generated from a single gene due to primary structure alterations (PSAs) such as genetic variations, alternative splicing, and post-translational modifications (PTMs). Top-down mass spectrometry is capable of analyzing intact proteins and identifying patterns of multiple PSAs, making it the method of choice for studying complex proteoforms. In top-down proteomics, proteoform identification is often performed by searching tandem mass spectra against a protein sequence database that contains only one reference protein sequence for each gene or transcript variant in a proteome. Because of the incompleteness of the protein database, an identified proteoform may contain unknown PSAs comparedmore » with the reference sequence. Proteoform characterization is to identify and localize PSAs in a proteoform. Although many software tools have been proposed for proteoform identification by top-down mass spectrometry, the characterization of proteoforms in identified proteoform-spectrum matches still relies mainly on manual annotation. We propose to use the Modification Identification Score (MIScore), which is based on Bayesian models, to automatically identify and localize PTMs in proteoforms. Experiments showed that the MIScore is accurate in identifying and localizing one or two modifications.« less
Identification of appropriate reference genes for normalizing transcript expression by quantitative real-time PCR in Litsea cubeba.

PubMed

Lin, Liyuan; Han, Xiaojiao; Chen, Yicun; Wu, Qingke; Wang, Yangdong

2013-12-01

Quantitative real-time PCR has emerged as a highly sensitive and widely used method for detection of gene expression profiles, via which accurate detection depends on reliable normalization. Since no single control is appropriate for all experimental treatments, it is generally advocated to select suitable internal controls prior to use for normalization. This study reported the evaluation of the expression stability of twelve potential reference genes in different tissue/organs and six fruit developmental stages of Litsea cubeba in order to screen the superior internal reference genes for data normalization. Two softwares-geNorm, and NormFinder-were used to identify stability of these candidate genes. The cycle threshold difference and coefficient of variance were also calculated to evaluate the expression stability of candidate genes. F-BOX, EF1α, UBC, and TUA were selected as the most stable reference genes across 11 sample pools. F-BOX, EF1α, and EIF4α exhibited the highest expression stability in different tissue/organs and different fruit developmental stages. Besides, a combination of two stable reference genes would be sufficient for gene expression normalization in different fruit developmental stages. In addition, the relative expression profiles of DXS and DXR were evaluated by EF1α, UBC, and SAMDC. The results further validated the reliability of stable reference genes and also highlighted the importance of selecting suitable internal controls for L. cubeba. These reference genes will be of great importance for transcript normalization in future gene expression studies on L. cubeba.
Identification of Importin 8 (IPO8) as the most accurate reference gene for the clinicopathological analysis of lung specimens

PubMed Central

Nguewa, Paul A; Agorreta, Jackeline; Blanco, David; Lozano, Maria Dolores; Gomez-Roman, Javier; Sanchez, Blas A; Valles, Iñaki; Pajares, Maria J; Pio, Ruben; Rodriguez, Maria Jose; Montuenga, Luis M; Calvo, Alfonso

2008-01-01

Background The accurate normalization of differentially expressed genes in lung cancer is essential for the identification of novel therapeutic targets and biomarkers by real time RT-PCR and microarrays. Although classical "housekeeping" genes, such as GAPDH, HPRT1, and beta-actin have been widely used in the past, their accuracy as reference genes for lung tissues has not been proven. Results We have conducted a thorough analysis of a panel of 16 candidate reference genes for lung specimens and lung cell lines. Gene expression was measured by quantitative real time RT-PCR and expression stability was analyzed with the softwares GeNorm and NormFinder, mean of |ΔCt| (= |Ct Normal-Ct tumor|) ± SEM, and correlation coefficients among genes. Systematic comparison between candidates led us to the identification of a subset of suitable reference genes for clinical samples: IPO8, ACTB, POLR2A, 18S, and PPIA. Further analysis showed that IPO8 had a very low mean of |ΔCt| (0.70 ± 0.09), with no statistically significant differences between normal and malignant samples and with excellent expression stability. Conclusion Our data show that IPO8 is the most accurate reference gene for clinical lung specimens. In addition, we demonstrate that the commonly used genes GAPDH and HPRT1 are inappropriate to normalize data derived from lung biopsies, although they are suitable as reference genes for lung cell lines. We thus propose IPO8 as a novel reference gene for lung cancer samples. PMID:19014639
LinkEHR-Ed: a multi-reference model archetype editor based on formal semantics.

PubMed

Maldonado, José A; Moner, David; Boscá, Diego; Fernández-Breis, Jesualdo T; Angulo, Carlos; Robles, Montserrat

2009-08-01

To develop a powerful archetype editing framework capable of handling multiple reference models and oriented towards the semantic description and standardization of legacy data. The main prerequisite for implementing tools providing enhanced support for archetypes is the clear specification of archetype semantics. We propose a formalization of the definition section of archetypes based on types over tree-structured data. It covers the specialization of archetypes, the relationship between reference models and archetypes and conformance of data instances to archetypes. LinkEHR-Ed, a visual archetype editor based on the former formalization with advanced processing capabilities that supports multiple reference models, the editing and semantic validation of archetypes, the specification of mappings to data sources, and the automatic generation of data transformation scripts, is developed. LinkEHR-Ed is a useful tool for building, processing and validating archetypes based on any reference model.
DREISS: Using State-Space Models to Infer the Dynamics of Gene Expression Driven by External and Internal Regulatory Networks.

PubMed

Wang, Daifeng; He, Fei; Maslov, Sergei; Gerstein, Mark

2016-10-01

Gene expression is controlled by the combinatorial effects of regulatory factors from different biological subsystems such as general transcription factors (TFs), cellular growth factors and microRNAs. A subsystem's gene expression may be controlled by its internal regulatory factors, exclusively, or by external subsystems, or by both. It is thus useful to distinguish the degree to which a subsystem is regulated internally or externally-e.g., how non-conserved, species-specific TFs affect the expression of conserved, cross-species genes during evolution. We developed a computational method (DREISS, dreiss.gerteinlab.org) for analyzing the Dynamics of gene expression driven by Regulatory networks, both External and Internal based on State Space models. Given a subsystem, the "state" and "control" in the model refer to its own (internal) and another subsystem's (external) gene expression levels. The state at a given time is determined by the state and control at a previous time. Because typical time-series data do not have enough samples to fully estimate the model's parameters, DREISS uses dimensionality reduction, and identifies canonical temporal expression trajectories (e.g., degradation, growth and oscillation) representing the regulatory effects emanating from various subsystems. To demonstrate capabilities of DREISS, we study the regulatory effects of evolutionarily conserved vs. divergent TFs across distant species. In particular, we applied DREISS to the time-series gene expression datasets of C. elegans and D. melanogaster during their embryonic development. We analyzed the expression dynamics of the conserved, orthologous genes (orthologs), seeing the degree to which these can be accounted for by orthologous (internal) versus species-specific (external) TFs. We found that between two species, the orthologs have matched, internally driven expression patterns but very different externally driven ones. This is particularly true for genes with evolutionarily ancient functions (e.g. the ribosomal proteins), in contrast to those with more recently evolved functions (e.g., cell-cell communication). This suggests that despite striking morphological differences, some fundamental embryonic-developmental processes are still controlled by ancient regulatory systems.
Evaluation of stability and validation of reference genes for RT-qPCR expression studies in rice plants under water deficit.

PubMed

Auler, Priscila Ariane; Benitez, Letícia Carvalho; do Amaral, Marcelo Nogueira; Vighi, Isabel Lopes; Dos Santos Rodrigues, Gabriela; da Maia, Luciano Carlos; Braga, Eugenia Jacira Bolacel

2017-05-01

Many studies use strategies that allow for the identification of a large number of genes expressed in response to different stress conditions to which the plant is subjected throughout its cycle. In order to obtain accurate and reliable results in gene expression studies, it is necessary to use reference genes, which must have uniform expression in the majority of cells in the organism studied. RNA isolation of leaves and expression analysis in real-time quantitative polymerase chain reaction (RT-qPCR) were carried out. In this study, nine candidate reference genes were tested, actin 11 (ACT11), ubiquitin conjugated to E2 enzyme (UBC-E2), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), beta tubulin (β-tubulin), eukaryotic initiation factor 4α (eIF-4α), ubiquitin 10 (UBQ10), ubiquitin 5 (UBQ5), aquaporin TIP41 (TIP41-Like) and cyclophilin, in two genotypes of rice, AN Cambará and BRS Querência, with different levels of soil moisture (20%, 10% and recovery) in the vegetative (V5) and reproductive stages (period preceding flowering). Currently, there are different softwares that perform stability analyses and define the most suitable reference genes for a particular study. In this study, we used five different methods: geNorm, BestKeeper, ΔCt method, NormFinder and RefFinder. The results indicate that UBC-E2 and UBQ5 can be used as reference genes in all samples and softwares evaluated. The genes β-tubulin and eIF-4α, traditionally used as reference genes, along with GAPDH, presented lower stability values. The gene expression of basic leucine zipper (bZIP23 and bZIP72) was used to validate the selected reference genes, demonstrating that the use of an inappropriate reference can induce erroneous results.
Validation of reference genes for quantitative expression analysis by real-time RT-PCR in Saccharomyces cerevisiae

PubMed Central

Teste, Marie-Ange; Duquenne, Manon; François, Jean M; Parrou, Jean-Luc

2009-01-01

Background Real-time RT-PCR is the recommended method for quantitative gene expression analysis. A compulsory step is the selection of good reference genes for normalization. A few genes often referred to as HouseKeeping Genes (HSK), such as ACT1, RDN18 or PDA1 are among the most commonly used, as their expression is assumed to remain unchanged over a wide range of conditions. Since this assumption is very unlikely, a geometric averaging of multiple, carefully selected internal control genes is now strongly recommended for normalization to avoid this problem of expression variation of single reference genes. The aim of this work was to search for a set of reference genes for reliable gene expression analysis in Saccharomyces cerevisiae. Results From public microarray datasets, we selected potential reference genes whose expression remained apparently invariable during long-term growth on glucose. Using the algorithm geNorm, ALG9, TAF10, TFC1 and UBC6 turned out to be genes whose expression remained stable, independent of the growth conditions and the strain backgrounds tested in this study. We then showed that the geometric averaging of any subset of three genes among the six most stable genes resulted in very similar normalized data, which contrasted with inconsistent results among various biological samples when the normalization was performed with ACT1. Normalization with multiple selected genes was therefore applied to transcriptional analysis of genes involved in glycogen metabolism. We determined an induction ratio of 100-fold for GPH1 and 20-fold for GSY2 between the exponential phase and the diauxic shift on glucose. There was no induction of these two genes at this transition phase on galactose, although in both cases, the kinetics of glycogen accumulation was similar. In contrast, SGA1 expression was independent of the carbon source and increased by 3-fold in stationary phase. Conclusion In this work, we provided a set of genes that are suitable reference genes for quantitative gene expression analysis by real-time RT-PCR in yeast biological samples covering a large panel of physiological states. In contrast, we invalidated and discourage the use of ACT1 as well as other commonly used reference genes (PDA1, TDH3, RDN18, etc) as internal controls for quantitative gene expression analysis in yeast. PMID:19874630

Validation of reference genes for quantitative expression analysis by real-time RT-PCR in Saccharomyces cerevisiae.

PubMed

Teste, Marie-Ange; Duquenne, Manon; François, Jean M; Parrou, Jean-Luc

2009-10-30

Real-time RT-PCR is the recommended method for quantitative gene expression analysis. A compulsory step is the selection of good reference genes for normalization. A few genes often referred to as HouseKeeping Genes (HSK), such as ACT1, RDN18 or PDA1 are among the most commonly used, as their expression is assumed to remain unchanged over a wide range of conditions. Since this assumption is very unlikely, a geometric averaging of multiple, carefully selected internal control genes is now strongly recommended for normalization to avoid this problem of expression variation of single reference genes. The aim of this work was to search for a set of reference genes for reliable gene expression analysis in Saccharomyces cerevisiae. From public microarray datasets, we selected potential reference genes whose expression remained apparently invariable during long-term growth on glucose. Using the algorithm geNorm, ALG9, TAF10, TFC1 and UBC6 turned out to be genes whose expression remained stable, independent of the growth conditions and the strain backgrounds tested in this study. We then showed that the geometric averaging of any subset of three genes among the six most stable genes resulted in very similar normalized data, which contrasted with inconsistent results among various biological samples when the normalization was performed with ACT1. Normalization with multiple selected genes was therefore applied to transcriptional analysis of genes involved in glycogen metabolism. We determined an induction ratio of 100-fold for GPH1 and 20-fold for GSY2 between the exponential phase and the diauxic shift on glucose. There was no induction of these two genes at this transition phase on galactose, although in both cases, the kinetics of glycogen accumulation was similar. In contrast, SGA1 expression was independent of the carbon source and increased by 3-fold in stationary phase. In this work, we provided a set of genes that are suitable reference genes for quantitative gene expression analysis by real-time RT-PCR in yeast biological samples covering a large panel of physiological states. In contrast, we invalidated and discourage the use of ACT1 as well as other commonly used reference genes (PDA1, TDH3, RDN18, etc) as internal controls for quantitative gene expression analysis in yeast.
Stable Reference Gene Selection for RT-qPCR Analysis in Nonviruliferous and Viruliferous Frankliniella occidentalis

PubMed Central

Pan, Huipeng; Ma, Yabin; Zhang, Deyong; Liu, Yong; Zhang, Zhanhong; Zheng, Changying; Chu, Dong

2015-01-01

Reverse transcriptase-quantitative polymerase chain reaction (RT-qPCR) is a reliable technique for measuring and evaluating gene expression during variable biological processes. To facilitate gene expression studies, normalization of genes of interest relative to stable reference genes is crucial. The western flower thrips Frankliniella occidentalis (Pergande) (Thysanoptera: Thripidae), the main vector of tomato spotted wilt virus (TSWV), is a destructive invasive species. In this study, the expression profiles of 11 candidate reference genes from nonviruliferous and viruliferous F. occidentalis were investigated. Five distinct algorithms, geNorm, NormFinder, BestKeeper, the ΔC t method, and RefFinder, were used to determine the performance of these genes. geNorm, NormFinder, BestKeeper, and RefFinder identified heat shock protein 70 (HSP70), heat shock protein 60 (HSP60), elongation factor 1 α, and ribosomal protein l32 (RPL32) as the most stable reference genes, and the ΔC t method identified HSP60, HSP70, RPL32, and heat shock protein 90 as the most stable reference genes. Additionally, two reference genes were sufficient for reliable normalization in nonviruliferous and viruliferous F. occidentalis. This work provides a foundation for investigating the molecular mechanisms of TSWV and F. occidentalis interactions. PMID:26244556
Reference genes for quantitative real-time PCR analysis in symbiont Entomomyces delphacidicola of Nilaparvata lugens (Stål)

PubMed Central

Wan, Pin-Jun; Tang, Yao-Hua; Yuan, San-Yue; He, Jia-Chun; Wang, Wei-Xia; Lai, Feng-Xiang; Fu, Qiang

2017-01-01

Nilaparvata lugens (Stål) (Hemiptera: Delphacidae) is a major rice pest that harbors an endosymbiont ascomycete fungus, Entomomyces delphacidicola str. NLU (also known as yeast-like symbiont, YLS). Driving by demand of novel population management tactics (e.g. RNAi), the importance of YLS has been studied and revealed, which greatly boosts the interest of molecular level studies related to YLS. The current study focuses on reference genes for RT-qPCR studies related to YLS. Eight previously unreported YLS genes were cloned, and their expressions were evaluated for N. lugens samples of different developmental stages and sexes, and under different nutritional conditions and temperatures. Expression stabilities were analyzed by BestKeeper, geNorm, NormFinder, ΔCt method and RefFinder. Furthermore, the selected reference genes for RT-qPCR of YLS genes were validated using targeted YLS genes that respond to different nutritional conditions (amino acid deprivation) and RNAi. The results suggest that ylsRPS15p/ylsACT are the most suitable reference genes for temporal gene expression profiling, while ylsTUB/ylsACT and ylsRPS15e/ylsGADPH are the most suitable reference gene choices for evaluating nutrition and temperature effects. Validation studies demonstrated the advantage of using endogenous YLS reference genes for YLS studies. PMID:28198810
Predictor-Based Model Reference Adaptive Control

NASA Technical Reports Server (NTRS)

Lavretsky, Eugene; Gadient, Ross; Gregory, Irene M.

2010-01-01

This paper is devoted to the design and analysis of a predictor-based model reference adaptive control. Stable adaptive laws are derived using Lyapunov framework. The proposed architecture is compared with the now classical model reference adaptive control. A simulation example is presented in which numerical evidence indicates that the proposed controller yields improved transient characteristics.
TriAnnot: A Versatile and High Performance Pipeline for the Automated Annotation of Plant Genomes

PubMed Central

Leroy, Philippe; Guilhot, Nicolas; Sakai, Hiroaki; Bernard, Aurélien; Choulet, Frédéric; Theil, Sébastien; Reboux, Sébastien; Amano, Naoki; Flutre, Timothée; Pelegrin, Céline; Ohyanagi, Hajime; Seidel, Michael; Giacomoni, Franck; Reichstadt, Mathieu; Alaux, Michael; Gicquello, Emmanuelle; Legeai, Fabrice; Cerutti, Lorenzo; Numa, Hisataka; Tanaka, Tsuyoshi; Mayer, Klaus; Itoh, Takeshi; Quesneville, Hadi; Feuillet, Catherine

2012-01-01

In support of the international effort to obtain a reference sequence of the bread wheat genome and to provide plant communities dealing with large and complex genomes with a versatile, easy-to-use online automated tool for annotation, we have developed the TriAnnot pipeline. Its modular architecture allows for the annotation and masking of transposable elements, the structural, and functional annotation of protein-coding genes with an evidence-based quality indexing, and the identification of conserved non-coding sequences and molecular markers. The TriAnnot pipeline is parallelized on a 712 CPU computing cluster that can run a 1-Gb sequence annotation in less than 5 days. It is accessible through a web interface for small scale analyses or through a server for large scale annotations. The performance of TriAnnot was evaluated in terms of sensitivity, specificity, and general fitness using curated reference sequence sets from rice and wheat. In less than 8 h, TriAnnot was able to predict more than 83% of the 3,748 CDS from rice chromosome 1 with a fitness of 67.4%. On a set of 12 reference Mb-sized contigs from wheat chromosome 3B, TriAnnot predicted and annotated 93.3% of the genes among which 54% were perfectly identified in accordance with the reference annotation. It also allowed the curation of 12 genes based on new biological evidences, increasing the percentage of perfect gene prediction to 63%. TriAnnot systematically showed a higher fitness than other annotation pipelines that are not improved for wheat. As it is easily adaptable to the annotation of other plant genomes, TriAnnot should become a useful resource for the annotation of large and complex genomes in the future. PMID:22645565
Validation of Endogenous Internal Real-Time PCR Controls in Renal Tissues

PubMed Central

Cui, Xiangqin; Zhou, Juling; Qiu, Jing; Johnson, Martin R.; Mrug, Michal

2009-01-01

Background Endogenous internal controls (‘reference’ or ‘housekeeping’ genes) are widely used in real-time PCR (RT-PCR) analyses. Their use relies on the premise of consistently stable expression across studied experimental conditions. Unfortunately, none of these controls fulfills this premise across a wide range of experimental conditions; consequently, none of them can be recommended for universal use. Methods To determine which endogenous RT-PCR controls are suitable for analyses of renal tissues altered by kidney disease, we studied the expression of 16 commonly used ‘reference genes’ in 7 mildly and 7 severely affected whole kidney tissues from a well-characterized cystic kidney disease model. Expression levels of these 16 genes, determined by TaqMan® RT-PCR analyses and Affymetrix GeneChip® arrays, were normalized and tested for overall variance and equivalence of the means. Results Both statistical approaches and both TaqMan- and GeneChip-based methods converged on 3 out of the 4 top-ranked genes (Ppia, Gapdh and Pgk1) that had the most constant expression levels across the studied phenotypes. Conclusion A combination of the top-ranked genes will provide a suitable endogenous internal control for similar studies of kidney tissues across a wide range of disease severity. PMID:19729889
Selection of Reference Genes for Expression Studies in Diaphorina citri (Hemiptera: Liviidae).

PubMed

Bassan, Meire Menezes; Angelotti-Mendonc A, Je Ssika; Alves, Gustavo Rodrigues; Yamamoto, Pedro Takao; Moura O Filho, Francisco de Assis Alves

2017-12-05

The Asian citrus psyllid, Diaphorina citri Kuwayama (Hemiptera: Liviidae), is considered the main vector of the bacteria associated with huanglongbing, a very serious disease that has threatened the world citrus industry. The absence of efficient control management protocols, including a lack of resistant cultivars, has led to the development of different approaches to study this pathosystem. The production of resistant genotypes relies on D. citri gene expression analyses by RT-qPCR to assess control of the vector population. High-quality, reliable RT-qPCR analyses depend upon proper reference gene selection and validation. However, adequate D. citri reference genes have not yet been identified. In the present study, we evaluated the genes EF 1-α, ACT, GAPDH, RPL7, RPL17, and TUB as candidate reference genes for this insect. Gene expression stability was evaluated using the mathematical algorithms deltaCt, NormFinder, BestKeeper, and geNorm, at five insect developmental stages, grown on two different plant hosts [Citrus sinensis (L.) Osbeck (Sapindales: Rutaceae) and Murraya paniculata (L.) Jack (Sapindales: Rutaceae)]. The final gene ranking was calculated using RefFinder software, and the V-ATPase-A gene was selected for validation. According to our results, two reference genes are recommended when different plant hosts and developmental stages are considered. Considering gene expression studies in D. citri grown on M. paniculata, regardless of the insect developmental stage, GAPDH and RPL7 have the best fit as reference genes in RT-qPCR analyses, whereas GAPDH and EF 1-α are recommended as reference genes in insect studies using C. sinensis. © The Author(s) 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Evaluation of RNA from human trabecular bone and identification of stable reference genes.

PubMed

Cepollaro, Simona; Della Bella, Elena; de Biase, Dario; Visani, Michela; Fini, Milena

2018-06-01

The isolation of good quality RNA from tissues is an essential prerequisite for gene expression analysis to study pathophysiological processes. This study evaluated the RNA isolated from human trabecular bone and defined a set of stable reference genes. After pulverization, RNA was extracted with a phenol/chloroform method and then purified using silica columns. The A260/280 ratio, A260/230 ratio, RIN, and ribosomal ratio were measured to evaluate RNA quality and integrity. Moreover, the expression of six candidates was analyzed by qPCR and different algorithms were applied to assess reference gene stability. A good purity and quality of RNA was achieved according to A260/280 and A260/230 ratios, and RIN values. TBP, YWHAZ, and PGK1 were the most stable reference genes that should be used for gene expression analysis. In summary, the method proposed is suitable for gene expression evaluation in human bone and a set of reliable reference genes has been identified. © 2017 Wiley Periodicals, Inc.
Statistical method to compare massive parallel sequencing pipelines.

PubMed

Elsensohn, M H; Leblay, N; Dimassi, S; Campan-Fournier, A; Labalme, A; Roucher-Boulez, F; Sanlaville, D; Lesca, G; Bardel, C; Roy, P

2017-03-01

Today, sequencing is frequently carried out by Massive Parallel Sequencing (MPS) that cuts drastically sequencing time and expenses. Nevertheless, Sanger sequencing remains the main validation method to confirm the presence of variants. The analysis of MPS data involves the development of several bioinformatic tools, academic or commercial. We present here a statistical method to compare MPS pipelines and test it in a comparison between an academic (BWA-GATK) and a commercial pipeline (TMAP-NextGENe®), with and without reference to a gold standard (here, Sanger sequencing), on a panel of 41 genes in 43 epileptic patients. This method used the number of variants to fit log-linear models for pairwise agreements between pipelines. To assess the heterogeneity of the margins and the odds ratios of agreement, four log-linear models were used: a full model, a homogeneous-margin model, a model with single odds ratio for all patients, and a model with single intercept. Then a log-linear mixed model was fitted considering the biological variability as a random effect. Among the 390,339 base-pairs sequenced, TMAP-NextGENe® and BWA-GATK found, on average, 2253.49 and 1857.14 variants (single nucleotide variants and indels), respectively. Against the gold standard, the pipelines had similar sensitivities (63.47% vs. 63.42%) and close but significantly different specificities (99.57% vs. 99.65%; p < 0.001). Same-trend results were obtained when only single nucleotide variants were considered (99.98% specificity and 76.81% sensitivity for both pipelines). The method allows thus pipeline comparison and selection. It is generalizable to all types of MPS data and all pipelines.
A deep auto-encoder model for gene expression prediction.

PubMed

Xie, Rui; Wen, Jia; Quitadamo, Andrew; Cheng, Jianlin; Shi, Xinghua

2017-11-17

Gene expression is a key intermediate level that genotypes lead to a particular trait. Gene expression is affected by various factors including genotypes of genetic variants. With an aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how good genetic variants will contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance. To demonstrate the usage of this model, we apply MLP-SAE to a real genomic datasets with genotypes and gene expression profiles measured in yeast. Our results show that the MLP-SAE model with dropout outperforms other models including Lasso, Random Forests and the MLP-SAE model without dropout. Using the MLP-SAE model with dropout, we show that gene expression quantifications predicted by the model solely based on genotypes, align well with true gene expression patterns. We provide a deep auto-encoder model for predicting gene expression from SNP genotypes. This study demonstrates that deep learning is appropriate for tackling another genomic problem, i.e., building predictive models to understand genotypes' contribution to gene expression. With the emerging availability of richer genomic data, we anticipate that deep learning models play a bigger role in modeling and interpreting genomics.
Identification of TMEM208 and PQLC2 as reference genes for normalizing mRNA expression in colorectal cancer treated with aspirin

PubMed Central

Zhu, Yuanyuan; Yang, Chao; Weng, Mingjiao; Zhang, Yan; Yang, Chunhui; Jin, Yinji; Yang, Weiwei; He, Yan; Wu, Yiqi; Zhang, Yuhua; Wang, Guangyu; RajkumarEzakiel Redpath, Riju James; Zhang, Lei; Jin, Xiaoming; Liu, Ying; Sun, Yuchun; Ning, Ning; Qiao, Yu; Zhang, Fengmin; Li, Zhiwei; Wang, Tianzhen; Zhang, Yanqiao; Li, Xiaobo

2017-01-01

Numerous evidences indicate that aspirin usage causes a significant reduction in colorectal cancer. However, the molecular mechanisms about aspirin preventing colon cancer are largely unknown. Quantitative reverse transcription polymerase chain reaction (qRT-PCR) is a most frequently used method to identify the target molecules regulated by certain compound. However, this method needs stable internal reference genes to analyze the expression change of the targets. In this study, the transcriptional stabilities of several traditional reference genes were evaluated in colon cancer cells treated with aspirin, and also, the suitable internal reference genes were screened by using a microarray and were further identified by using the geNorm and NormFinder softwares, and then were validated in more cell lines and xenografts. We have showed that three traditional internal reference genes, β-actin, GAPDH and α-tubulin, are not suitable for studying gene transcription in colon cancer cells treated with aspirin, and we have identified and validated TMEM208 and PQLC2 as the ideal internal reference genes for detecting the molecular targets of aspirin in colon cancer in vitro and in vivo. This study reveals stable internal reference genes for studying the target genes of aspirin in colon cancer, which will contribute to identify the molecular mechanism behind aspirin preventing colon cancer. PMID:28184026
Identification of TMEM208 and PQLC2 as reference genes for normalizing mRNA expression in colorectal cancer treated with aspirin.

PubMed

Zhu, Yuanyuan; Yang, Chao; Weng, Mingjiao; Zhang, Yan; Yang, Chunhui; Jin, Yinji; Yang, Weiwei; He, Yan; Wu, Yiqi; Zhang, Yuhua; Wang, Guangyu; RajkumarEzakiel Redpath, Riju James; Zhang, Lei; Jin, Xiaoming; Liu, Ying; Sun, Yuchun; Ning, Ning; Qiao, Yu; Zhang, Fengmin; Li, Zhiwei; Wang, Tianzhen; Zhang, Yanqiao; Li, Xiaobo

2017-04-04

Numerous evidences indicate that aspirin usage causes a significant reduction in colorectal cancer. However, the molecular mechanisms about aspirin preventing colon cancer are largely unknown. Quantitative reverse transcription polymerase chain reaction (qRT-PCR) is a most frequently used method to identify the target molecules regulated by certain compound. However, this method needs stable internal reference genes to analyze the expression change of the targets. In this study, the transcriptional stabilities of several traditional reference genes were evaluated in colon cancer cells treated with aspirin, and also, the suitable internal reference genes were screened by using a microarray and were further identified by using the geNorm and NormFinder softwares, and then were validated in more cell lines and xenografts. We have showed that three traditional internal reference genes, β-actin, GAPDH and α-tubulin, are not suitable for studying gene transcription in colon cancer cells treated with aspirin, and we have identified and validated TMEM208 and PQLC2 as the ideal internal reference genes for detecting the molecular targets of aspirin in colon cancer in vitro and in vivo. This study reveals stable internal reference genes for studying the target genes of aspirin in colon cancer, which will contribute to identify the molecular mechanism behind aspirin preventing colon cancer.
Validation of reference genes for accurate normalization of gene expression for real time-quantitative PCR in strawberry fruits using different cultivars and osmotic stresses.

PubMed

Galli, Vanessa; Borowski, Joyce Moura; Perin, Ellen Cristina; Messias, Rafael da Silva; Labonde, Julia; Pereira, Ivan dos Santos; Silva, Sérgio Delmar Dos Anjos; Rombaldi, Cesar Valmor

2015-01-10

The increasing demand of strawberry (Fragaria×ananassa Duch) fruits is associated mainly with their sensorial characteristics and the content of antioxidant compounds. Nevertheless, the strawberry production has been hampered due to its sensitivity to abiotic stresses. Therefore, to understand the molecular mechanisms highlighting stress response is of great importance to enable genetic engineering approaches aiming to improve strawberry tolerance. However, the study of expression of genes in strawberry requires the use of suitable reference genes. In the present study, seven traditional and novel candidate reference genes were evaluated for transcript normalization in fruits of ten strawberry cultivars and two abiotic stresses, using RefFinder, which integrates the four major currently available software programs: geNorm, NormFinder, BestKeeper and the comparative delta-Ct method. The results indicate that the expression stability is dependent on the experimental conditions. The candidate reference gene DBP (DNA binding protein) was considered the most suitable to normalize expression data in samples of strawberry cultivars and under drought stress condition, and the candidate reference gene HISTH4 (histone H4) was the most stable under osmotic stresses and salt stress. The traditional genes GAPDH (glyceraldehyde-3-phosphate dehydrogenase) and 18S (18S ribosomal RNA) were considered the most unstable genes in all conditions. The expression of phenylalanine ammonia lyase (PAL) and 9-cis epoxycarotenoid dioxygenase (NCED1) genes were used to further confirm the validated candidate reference genes, showing that the use of an inappropriate reference gene may induce erroneous results. This study is the first survey on the stability of reference genes in strawberry cultivars and osmotic stresses and provides guidelines to obtain more accurate RT-qPCR results for future breeding efforts. Copyright © 2014 Elsevier B.V. All rights reserved.
Selection of reliable reference genes for gene expression studies in Trichoderma afroharzianum LTR-2 under oxalic acid stress.

PubMed

Lyu, Yuping; Wu, Xiaoqing; Ren, He; Zhou, Fangyuan; Zhou, Hongzi; Zhang, Xinjian; Yang, Hetong

2017-10-01

An appropriate reference gene is required to get reliable results from gene expression analysis by quantitative real-time reverse transcription PCR (qRT-PCR). In order to identify stable and reliable reference genes in Trichoderma afroharzianum under oxalic acid (OA) stress, six commonly used housekeeping genes, i.e., elongation factor 1, ubiquitin, ubiquitin-conjugating enzyme, glyceraldehyde-3-phosphate dehydrogenase, α-tubulin, actin, from the effective biocontrol isolate T. afroharzianum strain LTR-2 were tested for their expression during growth in liquid culture amended with OA. Four in silico programs (comparative ΔCt, NormFinder, geNorm and BestKeeper) were used to evaluate the expression stabilities of six candidate reference genes. The elongation factor 1 gene EF-1 was identified as the most stably expressed reference gene, and was used as the normalizer to quantify the expression level of the oxalate decarboxylase coding gene OXDC in T. afroharzianum strain LTR-2 under OA stress. The result showed that the expression of OXDC was significantly up-regulated as expected. This study provides an effective method to quantify expression changes of target genes in T. afroharzianum under OA stress. Copyright © 2017 Elsevier B.V. All rights reserved.
Identification of suitable qPCR reference genes in leaves of Brassica oleracea under abiotic stresses.

PubMed

Brulle, Franck; Bernard, Fabien; Vandenbulcke, Franck; Cuny, Damien; Dumez, Sylvain

2014-04-01

Real-time quantitative PCR is nowadays a standard method to study gene expression variations in various samples and experimental conditions. However, to interpret results accurately, data normalization with appropriate reference genes appears to be crucial. The present study describes the identification and the validation of suitable reference genes in Brassica oleracea leaves. Expression stability of eight candidates was tested following drought and cold abiotic stresses by using three different softwares (BestKeeper, NormFinder and geNorm). Four genes (BolC.TUB6, BolC.SAND1, BolC.UBQ2 and BolC.TBP1) emerged as the most stable across the tested conditions. Further gene expression analysis of a drought- and a cold-responsive gene (BolC.DREB2A and BolC.ELIP, respectively), confirmed the stability and the reliability of the identified reference genes when used for normalization in the leaves of B. oleracea. These four genes were finally tested upon a benzene exposure and all appeared to be useful reference genes along this toxicological condition. These results provide a good starting point for future studies involving gene expression measurement on leaves of B. oleracea exposed to environmental modifications.
Selection and Validation of Reference Genes for Quantitative Real-Time Polymerase Chain Reaction Studies in Mossy Maze Polypore, Cerrena unicolor (Higher Basidiomycetes).

PubMed

Yang, Jie; Lin, Qi; Lin, Juan; Ye, Xiuyun

2016-01-01

With its ability to produce ligninolytic enzymes such as laccases, white-rot basidiomycete Cerrena unicolor, a medicinal mushroom, has great potential in biotechnology. Elucidation of the expression profiles of genes encoding ligninolytic enzymes are important for increasing their production. Quantitative real-time polymerase chain reaction (qPCR) is a powerful tool to study transcriptional regulation of genes of interest. To ensure accuracy and reliability of qPCR analysis of C. unicolor, expression levels of seven candidate reference genes were studied at different growth phases, under various induction conditions, and with a range of carbon/nitrogen ratios and carbon and nitrogen sources. The stability of the genes were analyzed with five statistical approaches, namely geNorm, NormFinder, BestKeeper, the ΔCt method, and RefFinder. Our results indicated that the selection of reference genes varied with sample sets. A combination of four reference genes (Cyt-c, ATP6, TEF1, and β-tubulin) were recommended for normalizing gene expression at different growth phases. GAPDH and Cyt-c were the appropriate reference genes under different induction conditions. ATP6 and TEF1 were most stable in fermentation media with various carbon/nitrogen ratios. In the fermentation media with various carbon or nitrogen sources, 18S rRNA and GAPDH were the references of choice. The present study represents the first validation analysis of reference genes in C. unicolor and serves as a foundation for its qPCR analysis.
A Versatile Panel of Reference Gene Assays for the Measurement of Chicken mRNA by Quantitative PCR

PubMed Central

Maier, Helena J.; Van Borm, Steven; Young, John R.; Fife, Mark

2016-01-01

Quantitative real-time PCR assays are widely used for the quantification of mRNA within avian experimental samples. Multiple stably-expressed reference genes, selected for the lowest variation in representative samples, can be used to control random technical variation. Reference gene assays must be reliable, have high amplification specificity and efficiency, and not produce signals from contaminating DNA. Whilst recent research papers identify specific genes that are stable in particular tissues and experimental treatments, here we describe a panel of ten avian gene primer and probe sets that can be used to identify suitable reference genes in many experimental contexts. The panel was tested with TaqMan and SYBR Green systems in two experimental scenarios: a tissue collection and virus infection of cultured fibroblasts. GeNorm and NormFinder algorithms were able to select appropriate reference gene sets in each case. We show the effects of using the selected genes on the detection of statistically significant differences in expression. The results are compared with those obtained using 28s ribosomal RNA, the present most widely accepted reference gene in chicken work, identifying circumstances where its use might provide misleading results. Methods for eliminating DNA contamination of RNA reduced, but did not completely remove, detectable DNA. We therefore attached special importance to testing each qPCR assay for absence of signal using DNA template. The assays and analyses developed here provide a useful resource for selecting reference genes for investigations of avian biology. PMID:27537060
Escherichia coli O-Antigen Gene Clusters of Serogroups O62, O68, O131, O140, O142, and O163: DNA Sequences and Similarity between O62 and O68, and PCR-Based Serogrouping

PubMed Central

Liu, Yanhong; Yan, Xianghe; DebRoy, Chitrita; Fratamico, Pina M.; Needleman, David S.; Li, Robert W.; Wang, Wei; Losada, Liliana; Brinkac, Lauren; Radune, Diana; Toro, Magaly; Hegde, Narasimha; Meng, Jianghong

2015-01-01

The DNA sequence of the O-antigen gene clusters of Escherichia coli serogroups O62, O68, O131, O140, O142, and O163 was determined, and primers based on the wzx (O-antigen flippase) and/or wzy (O-antigen polymerase) genes within the O-antigen gene clusters were designed and used in PCR assays to identify each serogroup. Specificity was tested with E. coli reference strains, field isolates belonging to the target serogroups, and non-E. coli bacteria. The PCR assays were highly specific for the respective serogroups; however, the PCR assay targeting the O62 wzx gene reacted positively with strains belonging to E. coli O68, which was determined by serotyping. Analysis of the O-antigen gene cluster sequences of serogroups O62 and O68 reference strains showed that they were 94% identical at the nucleotide level, although O62 contained an insertion sequence (IS) element located between the rmlA and rmlC genes within the O-antigen gene cluster. A PCR assay targeting the rmlA and rmlC genes flanking the IS element was used to differentiate O62 and O68 serogroups. The PCR assays developed in this study can be used for the detection and identification of E. coli O62/O68, O131, O140, O142, and O163 strains isolated from different sources. PMID:25664526
Rapid detection of pathological mutations and deletions of the haemoglobin beta gene (HBB) by High Resolution Melting (HRM) analysis and Gene Ratio Analysis Copy Enumeration PCR (GRACE-PCR).

PubMed

Turner, Andrew; Sasse, Jurgen; Varadi, Aniko

2016-10-19

Inherited disorders of haemoglobin are the world's most common genetic diseases, resulting in significant morbidity and mortality. The large number of mutations associated with the haemoglobin beta gene (HBB) makes gene scanning by High Resolution Melting (HRM) PCR an attractive diagnostic approach. However, existing HRM-PCR assays are not able to detect all common point mutations and have only a very limited ability to detect larger gene rearrangements. The aim of the current study was to develop a HBB assay, which can be used as a screening test in highly heterogeneous populations, for detection of both point mutations and larger gene rearrangements. The assay is based on a combination of conventional HRM-PCR and a novel Gene Ratio Analysis Copy Enumeration (GRACE) PCR method. HRM-PCR was extensively optimised, which included the use of an unlabelled probe and incorporation of universal bases into primers to prevent interference from common non-pathological polymorphisms. GRACE-PCR was employed to determine HBB gene copy numbers relative to a reference gene using melt curve analysis to detect rearrangements in the HBB gene. The performance of the assay was evaluated by analysing 410 samples. A total of 44 distinct pathological genotypes were detected. In comparison with reference methods, the assay has a sensitivity of 100 % and a specificity of 98 %. We have developed an assay that detects both point mutations and larger rearrangements of the HBB gene. This assay is quick, sensitive, specific and cost effective making it suitable as an initial screening test that can be used for highly heterogeneous cohorts.
Development of Lentivirus-Based Reference Materials for Ebola Virus Nucleic Acid Amplification Technology-Based Assays.

PubMed

Mattiuzzo, Giada; Ashall, James; Doris, Kathryn S; MacLellan-Gibson, Kirsty; Nicolson, Carolyn; Wilkinson, Dianna E; Harvey, Ruth; Almond, Neil; Anderson, Robert; Efstathiou, Stacey; Minor, Philip D; Page, Mark

2015-01-01

The 2013-present Ebola virus outbreak in Western Africa has prompted the production of many diagnostic assays, mostly based on nucleic acid amplification technologies (NAT). The calibration and performance assessment of established assays and those under evaluation requires reference materials that can be used in parallel with the clinical sample to standardise or control for every step of the procedure, from extraction to the final qualitative/quantitative result. We have developed safe and stable Ebola virus RNA reference materials by encapsidating anti sense viral RNA into HIV-1-like particles. The lentiviral particles are replication-deficient and non-infectious due to the lack of HIV-1 genes and Envelope protein. Ebola virus genes were subcloned for encapsidation into two lentiviral preparations, one containing NP-VP35-GP and the other VP40 and L RNA. Each reference material was formulated as a high-titre standard for use as a calibrator for secondary or internal standards, and a 10,000-fold lower titre preparation to serve as an in-run control. The preparations have been freeze-dried to maximise stability. These HIV-Ebola virus RNA reference materials were suitable for use with in-house and commercial quantitative RT-PCR assays and with digital RT-PCR. The HIV-Ebola virus RNA reference materials are stable at up to 37°C for two weeks, allowing the shipment of the material worldwide at ambient temperature. These results support further evaluation of the HIV-Ebola virus RNA reference materials as part of an International collaborative study for the establishment of the 1st International Standard for Ebola virus RNA.

Candidate qRT-PCR reference genes for barley that demonstrate better stability than traditional housekeeping genes

USDA-ARS?s Scientific Manuscript database

Gene transcript expression analysis is a useful tool for correlating gene activity with plant phenotype. For these studies, an appropriate reference gene is necessary to quantify the expression of target genes. Classic housekeeping genes have often been used for this purpose, but may not be consis...
Optimization of algorithm of coding of genetic information of Chlamydia

NASA Astrophysics Data System (ADS)

Feodorova, Valentina A.; Ulyanov, Sergey S.; Zaytsev, Sergey S.; Saltykov, Yury V.; Ulianova, Onega V.

2018-04-01

New method of coding of genetic information using coherent optical fields is developed. Universal technique of transformation of nucleotide sequences of bacterial gene into laser speckle pattern is suggested. Reference speckle patterns of the nucleotide sequences of omp1 gene of typical wild strains of Chlamydia trachomatis of genovars D, E, F, G, J and K and Chlamydia psittaci serovar I as well are generated. Algorithm of coding of gene information into speckle pattern is optimized. Fully developed speckles with Gaussian statistics for gene-based speckles have been used as criterion of optimization.
Identification and validation of reference genes for quantitative real-time PCR studies in long yellow daylily, Hemerocallis citrina Borani

USDA-ARS?s Scientific Manuscript database

Gene expression analysis requires the use of reference genes in the target species. The long yellow daylily is rich in beneficial secondary metabolites and is considered as a functional vegetable. It is widely cultivated and consumed in East Asia. However, reference genes for use in RT-qPCR in this ...
GeneRIF indexing: sentence selection based on machine learning.

PubMed

Jimeno-Yepes, Antonio J; Sticco, J Caitlin; Mork, James G; Aronson, Alan R

2013-05-31

A Gene Reference Into Function (GeneRIF) describes novel functionality of genes. GeneRIFs are available from the National Center for Biotechnology Information (NCBI) Gene database. GeneRIF indexing is performed manually, and the intention of our work is to provide methods to support creating the GeneRIF entries. The creation of GeneRIF entries involves the identification of the genes mentioned in MEDLINE®; citations and the sentences describing a novel function. We have compared several learning algorithms and several features extracted or derived from MEDLINE sentences to determine if a sentence should be selected for GeneRIF indexing. Features are derived from the sentences or using mechanisms to augment the information provided by them: assigning a discourse label using a previously trained model, for example. We show that machine learning approaches with specific feature combinations achieve results close to one of the annotators. We have evaluated different feature sets and learning algorithms. In particular, Naïve Bayes achieves better performance with a selection of features similar to one used in related work, which considers the location of the sentence, the discourse of the sentence and the functional terminology in it. The current performance is at a level similar to human annotation and it shows that machine learning can be used to automate the task of sentence selection for GeneRIF annotation. The current experiments are limited to the human species. We would like to see how the methodology can be extended to other species, specifically the normalization of gene mentions in other species.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.

PubMed

Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D

2013-02-28

Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation

PubMed Central

2013-01-01

Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355
Suitable Reference Genes for Accurate Gene Expression Analysis in Parsley (Petroselinum crispum) for Abiotic Stresses and Hormone Stimuli

PubMed Central

Li, Meng-Yao; Song, Xiong; Wang, Feng; Xiong, Ai-Sheng

2016-01-01

Parsley, one of the most important vegetables in the Apiaceae family, is widely used in the food, medicinal, and cosmetic industries. Recent studies on parsley mainly focus on its chemical composition, and further research involving the analysis of the plant's gene functions and expressions is required. qPCR is a powerful method for detecting very low quantities of target transcript levels and is widely used to study gene expression. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, four software, namely geNorm, NormFinder, BestKeeper, and RefFinder were used to evaluate the expression stabilities of eight candidate reference genes of parsley (GAPDH, ACTIN, eIF-4α, SAND, UBC, TIP41, EF-1α, and TUB) under various conditions, including abiotic stresses (heat, cold, salt, and drought) and hormone stimuli treatments (GA, SA, MeJA, and ABA). Results showed that EF-1α and TUB were the most stable genes for abiotic stresses, whereas EF-1α, GAPDH, and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1α and TUB were the most stable reference genes among all tested samples, and UBC was the least stable one. Expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study can guide the selection of suitable reference genes in gene expression in parsley. PMID:27746803
Suitable Reference Genes for Accurate Gene Expression Analysis in Parsley (Petroselinum crispum) for Abiotic Stresses and Hormone Stimuli.

PubMed

Li, Meng-Yao; Song, Xiong; Wang, Feng; Xiong, Ai-Sheng

2016-01-01

Parsley, one of the most important vegetables in the Apiaceae family, is widely used in the food, medicinal, and cosmetic industries. Recent studies on parsley mainly focus on its chemical composition, and further research involving the analysis of the plant's gene functions and expressions is required. qPCR is a powerful method for detecting very low quantities of target transcript levels and is widely used to study gene expression. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, four software, namely geNorm, NormFinder, BestKeeper, and RefFinder were used to evaluate the expression stabilities of eight candidate reference genes of parsley ( GAPDH, ACTIN, eIF-4 α, SAND, UBC, TIP41, EF-1 α, and TUB ) under various conditions, including abiotic stresses (heat, cold, salt, and drought) and hormone stimuli treatments (GA, SA, MeJA, and ABA). Results showed that EF-1 α and TUB were the most stable genes for abiotic stresses, whereas EF-1 α, GAPDH , and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1 α and TUB were the most stable reference genes among all tested samples, and UBC was the least stable one. Expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study can guide the selection of suitable reference genes in gene expression in parsley.
Identification of a Novel Reference Gene for Apple Transcriptional Profiling under Postharvest Conditions

PubMed Central

Storch, Tatiane Timm; Pegoraro, Camila; Finatto, Taciane; Quecini, Vera; Rombaldi, Cesar Valmor; Girardi, César Luis

2015-01-01

Reverse Transcription quantitative PCR (RT-qPCR) is one of the most important techniques for gene expression profiling due to its high sensibility and reproducibility. However, the reliability of the results is highly dependent on data normalization, performed by comparisons between the expression profiles of the genes of interest against those of constitutively expressed, reference genes. Although the technique is widely used in fruit postharvest experiments, the transcription stability of reference genes has not been thoroughly investigated under these experimental conditions. Thus, we have determined the transcriptional profile, under these conditions, of three genes commonly used as reference—ACTIN (MdACT), PROTEIN DISULPHIDE ISOMERASE (MdPDI) and UBIQUITIN-CONJUGATING ENZYME E2 (MdUBC)—along with two novel candidates—HISTONE 1 (MdH1) and NUCLEOSSOME ASSEMBLY 1 PROTEIN (MdNAP1). The expression profile of the genes was investigated throughout five experiments, with three of them encompassing the postharvest period and the other two, consisting of developmental and spatial phases. The transcriptional stability was comparatively investigated using four distinct software packages: BestKeeper, NormFinder, geNorm and DataAssist. Gene ranking results for transcriptional stability were similar for the investigated software packages, with the exception of BestKeeper. The classic reference gene MdUBC ranked among the most stably transcribed in all investigated experimental conditions. Transcript accumulation profiles for the novel reference candidate gene MdH1 were stable throughout the tested conditions, especially in experiments encompassing the postharvest period. Thus, our results present a novel reference gene for postharvest experiments in apple and reinforce the importance of checking the transcription profile of reference genes under the experimental conditions of interest. PMID:25774904
Selection of Reliable Reference Genes for Gene Expression Studies of a Promising Oilseed Crop, Plukenetia volubilis, by Real-Time Quantitative PCR.

PubMed

Niu, Longjian; Tao, Yan-Bin; Chen, Mao-Sheng; Fu, Qiantang; Li, Chaoqiong; Dong, Yuling; Wang, Xiulan; He, Huiying; Xu, Zeng-Fu

2015-06-03

Real-time quantitative PCR (RT-qPCR) is a reliable and widely used method for gene expression analysis. The accuracy of the determination of a target gene expression level by RT-qPCR demands the use of appropriate reference genes to normalize the mRNA levels among different samples. However, suitable reference genes for RT-qPCR have not been identified in Sacha inchi (Plukenetia volubilis), a promising oilseed crop known for its polyunsaturated fatty acid (PUFA)-rich seeds. In this study, using RT-qPCR, twelve candidate reference genes were examined in seedlings and adult plants, during flower and seed development and for the entire growth cycle of Sacha inchi. Four statistical algorithms (delta cycle threshold (ΔCt), BestKeeper, geNorm, and NormFinder) were used to assess the expression stabilities of the candidate genes. The results showed that ubiquitin-conjugating enzyme (UCE), actin (ACT) and phospholipase A22 (PLA) were the most stable genes in Sacha inchi seedlings. For roots, stems, leaves, flowers, and seeds from adult plants, 30S ribosomal protein S13 (RPS13), cyclophilin (CYC) and elongation factor-1alpha (EF1α) were recommended as reference genes for RT-qPCR. During the development of reproductive organs, PLA, ACT and UCE were the optimal reference genes for flower development, whereas UCE, RPS13 and RNA polymerase II subunit (RPII) were optimal for seed development. Considering the entire growth cycle of Sacha inchi, UCE, ACT and EF1α were sufficient for the purpose of normalization. Our results provide useful guidelines for the selection of reliable reference genes for the normalization of RT-qPCR data for seedlings and adult plants, for reproductive organs, and for the entire growth cycle of Sacha inchi.
Genomic Analysis of Genotype-by-Social Environment Interaction for Drosophila melanogaster Aggressive Behavior.

PubMed

Rohde, Palle Duun; Gaertner, Bryn; Ward, Kirsty; Sørensen, Peter; Mackay, Trudy F C

2017-08-01

Human psychiatric disorders such as schizophrenia, bipolar disorder, and attention-deficit/hyperactivity disorder often include adverse behaviors including increased aggressiveness. Individuals with psychiatric disorders often exhibit social withdrawal, which can further increase the probability of conducting a violent act. Here, we used the inbred, sequenced lines of the Drosophila Genetic Reference Panel (DGRP) to investigate the genetic basis of variation in male aggressive behavior for flies reared in a socialized and socially isolated environment. We identified genetic variation for aggressive behavior, as well as significant genotype-by-social environmental interaction (GSEI); i.e. , variation among DGRP genotypes in the degree to which social isolation affected aggression. We performed genome-wide association (GWA) analyses to identify genetic variants associated with aggression within each environment. We used genomic prediction to partition genetic variants into gene ontology (GO) terms and constituent genes, and identified GO terms and genes with high prediction accuracies in both social environments and for GSEI. The top predictive GO terms significantly increased the proportion of variance explained, compared to prediction models based on all segregating variants. We performed genomic prediction across environments, and identified genes in common between the social environments that turned out to be enriched for genome-wide associated variants. A large proportion of the associated genes have previously been associated with aggressive behavior in Drosophila and mice. Further, many of these genes have human orthologs that have been associated with neurological disorders, indicating partially shared genetic mechanisms underlying aggression in animal models and human psychiatric disorders. Copyright © 2017 by the Genetics Society of America.
Connectivity Mapping for Candidate Therapeutics Identification Using Next Generation Sequencing RNA-Seq Data

PubMed Central

McArt, Darragh G.; Dunne, Philip D.; Blayney, Jaine K.; Salto-Tellez, Manuel; Van Schaeybroeck, Sandra; Hamilton, Peter W.; Zhang, Shu-Dong

2013-01-01

The advent of next generation sequencing technologies (NGS) has expanded the area of genomic research, offering high coverage and increased sensitivity over older microarray platforms. Although the current cost of next generation sequencing is still exceeding that of microarray approaches, the rapid advances in NGS will likely make it the platform of choice for future research in differential gene expression. Connectivity mapping is a procedure for examining the connections among diseases, genes and drugs by differential gene expression initially based on microarray technology, with which a large collection of compound-induced reference gene expression profiles have been accumulated. In this work, we aim to test the feasibility of incorporating NGS RNA-Seq data into the current connectivity mapping framework by utilizing the microarray based reference profiles and the construction of a differentially expressed gene signature from a NGS dataset. This would allow for the establishment of connections between the NGS gene signature and those microarray reference profiles, alleviating the associated incurring cost of re-creating drug profiles with NGS technology. We examined the connectivity mapping approach on a publicly available NGS dataset with androgen stimulation of LNCaP cells in order to extract candidate compounds that could inhibit the proliferative phenotype of LNCaP cells and to elucidate their potential in a laboratory setting. In addition, we also analyzed an independent microarray dataset of similar experimental settings. We found a high level of concordance between the top compounds identified using the gene signatures from the two datasets. The nicotine derivative cotinine was returned as the top candidate among the overlapping compounds with potential to suppress this proliferative phenotype. Subsequent lab experiments validated this connectivity mapping hit, showing that cotinine inhibits cell proliferation in an androgen dependent manner. Thus the results in this study suggest a promising prospect of integrating NGS data with connectivity mapping. PMID:23840550
Identification and evaluation of new reference genes in Gossypium hirsutum for accurate normalization of real-time quantitative RT-PCR data.

PubMed

Artico, Sinara; Nardeli, Sarah M; Brilhante, Osmundo; Grossi-de-Sa, Maria Fátima; Alves-Ferreira, Marcio

2010-03-21

Normalizing through reference genes, or housekeeping genes, can make more accurate and reliable results from reverse transcription real-time quantitative polymerase chain reaction (qPCR). Recent studies have shown that no single housekeeping gene is universal for all experiments. Thus, suitable reference genes should be the first step of any qPCR analysis. Only a few studies on the identification of housekeeping gene have been carried on plants. Therefore qPCR studies on important crops such as cotton has been hampered by the lack of suitable reference genes. By the use of two distinct algorithms, implemented by geNorm and NormFinder, we have assessed the gene expression of nine candidate reference genes in cotton: GhACT4, GhEF1alpha5, GhFBX6, GhPP2A1, GhMZA, GhPTB, GhGAPC2, GhbetaTUB3 and GhUBQ14. The candidate reference genes were evaluated in 23 experimental samples consisting of six distinct plant organs, eight stages of flower development, four stages of fruit development and in flower verticils. The expression of GhPP2A1 and GhUBQ14 genes were the most stable across all samples and also when distinct plants organs are examined. GhACT4 and GhUBQ14 present more stable expression during flower development, GhACT4 and GhFBX6 in the floral verticils and GhMZA and GhPTB during fruit development. Our analysis provided the most suitable combination of reference genes for each experimental set tested as internal control for reliable qPCR data normalization. In addition, to illustrate the use of cotton reference genes we checked the expression of two cotton MADS-box genes in distinct plant and floral organs and also during flower development. We have tested the expression stabilities of nine candidate genes in a set of 23 tissue samples from cotton plants divided into five different experimental sets. As a result of this evaluation, we recommend the use of GhUBQ14 and GhPP2A1 housekeeping genes as superior references for normalization of gene expression measures in different cotton plant organs; GhACT4 and GhUBQ14 for flower development, GhACT4 and GhFBX6 for the floral organs and GhMZA and GhPTB for fruit development. We also provide the primer sequences whose performance in qPCR experiments is demonstrated. These genes will enable more accurate and reliable normalization of qPCR results for gene expression studies in this important crop, the major source of natural fiber and also an important source of edible oil. The use of bona fide reference genes allowed a detailed and accurate characterization of the temporal and spatial expression pattern of two MADS-box genes in cotton.
Identification and evaluation of new reference genes in Gossypium hirsutum for accurate normalization of real-time quantitative RT-PCR data

PubMed Central

2010-01-01

Background Normalizing through reference genes, or housekeeping genes, can make more accurate and reliable results from reverse transcription real-time quantitative polymerase chain reaction (qPCR). Recent studies have shown that no single housekeeping gene is universal for all experiments. Thus, suitable reference genes should be the first step of any qPCR analysis. Only a few studies on the identification of housekeeping gene have been carried on plants. Therefore qPCR studies on important crops such as cotton has been hampered by the lack of suitable reference genes. Results By the use of two distinct algorithms, implemented by geNorm and NormFinder, we have assessed the gene expression of nine candidate reference genes in cotton: GhACT4, GhEF1α5, GhFBX6, GhPP2A1, GhMZA, GhPTB, GhGAPC2, GhβTUB3 and GhUBQ14. The candidate reference genes were evaluated in 23 experimental samples consisting of six distinct plant organs, eight stages of flower development, four stages of fruit development and in flower verticils. The expression of GhPP2A1 and GhUBQ14 genes were the most stable across all samples and also when distinct plants organs are examined. GhACT4 and GhUBQ14 present more stable expression during flower development, GhACT4 and GhFBX6 in the floral verticils and GhMZA and GhPTB during fruit development. Our analysis provided the most suitable combination of reference genes for each experimental set tested as internal control for reliable qPCR data normalization. In addition, to illustrate the use of cotton reference genes we checked the expression of two cotton MADS-box genes in distinct plant and floral organs and also during flower development. Conclusion We have tested the expression stabilities of nine candidate genes in a set of 23 tissue samples from cotton plants divided into five different experimental sets. As a result of this evaluation, we recommend the use of GhUBQ14 and GhPP2A1 housekeeping genes as superior references for normalization of gene expression measures in different cotton plant organs; GhACT4 and GhUBQ14 for flower development, GhACT4 and GhFBX6 for the floral organs and GhMZA and GhPTB for fruit development. We also provide the primer sequences whose performance in qPCR experiments is demonstrated. These genes will enable more accurate and reliable normalization of qPCR results for gene expression studies in this important crop, the major source of natural fiber and also an important source of edible oil. The use of bona fide reference genes allowed a detailed and accurate characterization of the temporal and spatial expression pattern of two MADS-box genes in cotton. PMID:20302670
Evaluation of four endogenous reference genes and their real-time PCR assays for common wheat quantification in GMOs detection.

PubMed

Huang, Huali; Cheng, Fang; Wang, Ruoan; Zhang, Dabing; Yang, Litao

2013-01-01

Proper selection of endogenous reference genes and their real-time PCR assays is quite important in genetically modified organisms (GMOs) detection. To find a suitable endogenous reference gene and its real-time PCR assay for common wheat (Triticum aestivum L.) DNA content or copy number quantification, four previously reported wheat endogenous reference genes and their real-time PCR assays were comprehensively evaluated for the target gene sequence variation and their real-time PCR performance among 37 common wheat lines. Three SNPs were observed in the PKABA1 and ALMT1 genes, and these SNPs significantly decreased the efficiency of real-time PCR amplification. GeNorm analysis of the real-time PCR performance of each gene among common wheat lines showed that the Waxy-D1 assay had the lowest M values with the best stability among all tested lines. All results indicated that the Waxy-D1 gene and its real-time PCR assay were most suitable to be used as an endogenous reference gene for common wheat DNA content quantification. The validated Waxy-D1 gene assay will be useful in establishing accurate and creditable qualitative and quantitative PCR analysis of GM wheat.
Evaluation of Four Endogenous Reference Genes and Their Real-Time PCR Assays for Common Wheat Quantification in GMOs Detection

PubMed Central

Huang, Huali; Cheng, Fang; Wang, Ruoan; Zhang, Dabing; Yang, Litao

2013-01-01

Proper selection of endogenous reference genes and their real-time PCR assays is quite important in genetically modified organisms (GMOs) detection. To find a suitable endogenous reference gene and its real-time PCR assay for common wheat (Triticum aestivum L.) DNA content or copy number quantification, four previously reported wheat endogenous reference genes and their real-time PCR assays were comprehensively evaluated for the target gene sequence variation and their real-time PCR performance among 37 common wheat lines. Three SNPs were observed in the PKABA1 and ALMT1 genes, and these SNPs significantly decreased the efficiency of real-time PCR amplification. GeNorm analysis of the real-time PCR performance of each gene among common wheat lines showed that the Waxy-D1 assay had the lowest M values with the best stability among all tested lines. All results indicated that the Waxy-D1 gene and its real-time PCR assay were most suitable to be used as an endogenous reference gene for common wheat DNA content quantification. The validated Waxy-D1 gene assay will be useful in establishing accurate and creditable qualitative and quantitative PCR analysis of GM wheat. PMID:24098735
Effect of reference genome selection on the performance of computational methods for genome-wide protein-protein interaction prediction.

PubMed

Muley, Vijaykumar Yogesh; Ranjan, Akash

2012-01-01

Recent progress in computational methods for predicting physical and functional protein-protein interactions has provided new insights into the complexity of biological processes. Most of these methods assume that functionally interacting proteins are likely to have a shared evolutionary history. This history can be traced out for the protein pairs of a query genome by correlating different evolutionary aspects of their homologs in multiple genomes known as the reference genomes. These methods include phylogenetic profiling, gene neighborhood and co-occurrence of the orthologous protein coding genes in the same cluster or operon. These are collectively known as genomic context methods. On the other hand a method called mirrortree is based on the similarity of phylogenetic trees between two interacting proteins. Comprehensive performance analyses of these methods have been frequently reported in literature. However, very few studies provide insight into the effect of reference genome selection on detection of meaningful protein interactions. We analyzed the performance of four methods and their variants to understand the effect of reference genome selection on prediction efficacy. We used six sets of reference genomes, sampled in accordance with phylogenetic diversity and relationship between organisms from 565 bacteria. We used Escherichia coli as a model organism and the gold standard datasets of interacting proteins reported in DIP, EcoCyc and KEGG databases to compare the performance of the prediction methods. Higher performance for predicting protein-protein interactions was achievable even with 100-150 bacterial genomes out of 565 genomes. Inclusion of archaeal genomes in the reference genome set improves performance. We find that in order to obtain a good performance, it is better to sample few genomes of related genera of prokaryotes from the large number of available genomes. Moreover, such a sampling allows for selecting 50-100 genomes for comparable accuracy of predictions when computational resources are limited.
Identification of suitable reference genes in bone marrow stromal cells from osteoarthritic donors.

PubMed

Schildberg, Theresa; Rauh, Juliane; Bretschneider, Henriette; Stiehler, Maik

2013-11-01

Bone marrow stromal cells (BMSCs) are key cellular components for musculoskeletal tissue engineering strategies. Furthermore, recent data suggest that BMSCs are involved in the development of Osteoarthritis (OA) being a frequently occurring degenerative joint disease. Reliable reference genes for the molecular evaluation of BMSCs derived from donors exhibiting OA as a primary co-morbidity have not been reported on yet. Hence, the aim of the study was to identify reference genes suitable for comparative gene expression analyses using OA-BMSCs. Passage 1 bone marrow derived BMSCs were isolated from n=13 patients with advanced stage idiopathic hip osteoarthritis and n=15 age-matched healthy donors. The expression of 31 putative reference genes was analyzed by quantitative reverse transcription polymerase chain reaction (qRT-PCR) using a commercially available TaqMan(®) assay. Calculating the coefficient of variation (CV), mRNA expression stability was determined and afterwards validated using geNorm and NormFinder algorithms. Importin 8 (IPO8), TATA box binding protein (TBP), and cancer susceptibility candidate 3 (CASC3) were identified as the most stable reference genes. Notably, commonly used reference genes, e.g. beta-actin (ACTB) and beta-2-microglobulin (B2M) were among the most unstable genes. For normalization of gene expression data of OA-BMSCs the combined use of IPO8, TBP, and CASC3 gene is recommended. © 2013.
Reference genes for quantitative PCR in the adipose tissue of mice with metabolic disease.

PubMed

Almeida-Oliveira, Fernanda; Leandro, João G B; Ausina, Priscila; Sola-Penna, Mauro; Majerowicz, David

2017-04-01

Obesity and diabetes are metabolic diseases and they are increasing in prevalence. The dynamics of gene expression associated with these diseases is fundamental to identifying genes involved in related biological processes. qPCR is a sensitive technique for mRNA quantification and the most commonly used method in gene-expression studies. However, the reliability of these results is directly influenced by data normalization. As reference genes are the major normalization method used, this work aims to identify reference genes for qPCR in adipose tissues of mice with type-I diabetes or obesity. We selected 12 genes that are commonly used as reference genes. The expression of these genes in the adipose tissues of mice was analyzed in the context of three different experimental protocols: 1) untreated animals; 2) high-fat-diet animals; and 3) streptozotocin-treated animals. Gene-expression stability was analyzed using four different algorithms. Our data indicate that TATA-binding protein is stably expressed across adipose tissues in control animals. This gene was also a useful reference when the brown adipose tissues of control and obese mice were analyzed. The mitochondrial ATP synthase F1 complex gene exhibits stable expression in subcutaneous and perigonadal adipose tissue from control and obese mice. Moreover, this gene is the best reference for qPCR normalization in adipose tissue from streptozotocin-treated animals. These results show that there is no perfect stable gene suited for use under all experimental conditions. In conclusion, the selection of appropriate genes is a prerequisite to ensure qPCR reliability and must be performed separately for different experimental protocols. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Identification of suitable reference gene and biomarkers of serum miRNAs for osteoporosis

PubMed Central

Chen, Jian; Li, Kai; Pang, Qianqian; Yang, Chao; Zhang, Hongyu; Wu, Feng; Cao, Hongqing; Liu, Hongju; Wan, Yumin; Xia, Weibo; Wang, Jinfu; Dai, Zhongquan; Li, Yinghui

2016-01-01

Our objective was to identify suitable reference genes in serum miRNA for normalization and screen potential new biomarkers for osteoporosis diagnosis by a systematic study. Two types of osteoporosis models were used like as mechanical unloading and estrogen deficiency. Through a large-scale screening using microarray, qPCR validation and statistical algorithms, we first identified miR-25-3p as a suitable reference gene for both type of osteoporosis, which also showed stability during the differentiation processes of osteoblast and osteoclast. Then 15 serum miRNAs with differential expression in OVX rats were identified by microarray and qPCR validation. We further detected these 15 miRNAs in postmenopausal women and bedrest rhesus monkeys and evaluated their diagnostic value by ROC analysis. Among these miRNAs, miR-30b-5p was significantly down-regulated in postmenopausal women with osteopenia or osteoporosis; miR-103-3p, miR-142-3p, miR-328-3p were only significantly decreased in osteoporosis. They all showed positive correlations with BMD. Except miR328-3p, the other three miRNAs were also declined in the rhesus monkeys after long-duration bedrest. Their AUC values (all >0.75) proved the diagnostic potential. Our results provided a reliable normalization reference gene and verified a group of circulating miRNAs as non-invasive biomarkers in the detection of postmenopausal- and mechanical unloading- osteoporosis. PMID:27821865

Diagnostic accuracy of nucleic acid amplification based assays for tuberculous meningitis: A meta-analysis.

PubMed

Gupta, Renu; Talwar, Puneet; Talwar, Pumanshi; Khurana, Sarbjeet; Kushwaha, Suman; Jalan, Nupur; Thakur, Rajeev

2018-05-25

Numerous in-house and commercial nucleic acid amplification tests (NAAT) have been evaluated using variable reference standards for diagnosis of TBM but their diagnostic potential is still not very clear. We conducted a meta-analysis to assess the diagnostic accuracy of different NAAT based assays for diagnosing TBM against 43 data sets of confirmed TBM (n = 1066) and 61 data sets of suspected TBM (n = 3721) as two reference standards. The summary estimate of the sensitivity and the specificity were obtained using the bivariate model. QUADAS-2 tool was used to perform the Quality assessment for bias and applicability. Publication bias was assessed with Deeks' funnel plot. Studies with confirmed TBM had better summary estimates as compared to studies with clinically suspected TBM irrespective of NAAT and index tests used. Among in-house assays, MPB as the gene target had best summary estimates in both confirmed [sensitivity:90%(83-95), specificity:97-%(87-99), DOR:247 (50-1221), AUC:99%(97-100), PLR:38.8-(6.6-133), NLR:0.11(0.05-0.18), I 2   = 15%] and clinically suspected [sensitivity:69%(47-85), specificity:96%(90-98), DOR:62(16.8-232), AUC:94%(92-97), PLR:16.9(6.5-36.8), NLR:0.33(0.16-0.56), I 2 :15.3%] groups. GeneXpert revealed good diagnostic accuracy only in confirmed TBM group [sensitivity = 57%(38-74), specificity = 98%(89-100), DOR = 62(7-589), AUC = 87%(79-96), PLR = 33.2(3.8-128), NLR = 0.45(0.26-0.68), I 2   = 0%]. This meta-analysis identified potential role of MPB gene among in-house assays and GeneXpert as commercial assay for diagnosing TBM. Copyright © 2018. Published by Elsevier Ltd.
Neural-Thyroid Interaction on Skeletal Isomyosin in Zero Gravity

NASA Technical Reports Server (NTRS)

Baldwin, Kenneth M.

2000-01-01

The primary goal of the project was to develop a ground based model to first study the role of the nerve and of thyroid hormone (T3) in the regulation of body growth and skeletal muscle growth and differentiation in rodents. A primary objective was to test the hypothesis that normal weight bearing activity is essential for the development of antigravity, slow twitch skeletal muscle and the corresponding slow myosin heavy chain (MHC) gene; whereas, T3 was obligatory for general body and muscle growth and the establishment of fast MHC phenotype in typically fast locomoter muscles. These ground based experiments would provide both the efficacy and background for a spaceflight experiment (referred to as the Neurolab Mission) jointly sponsored by the NIH and NASA.
Analyzing gene expression time-courses based on multi-resolution shape mixture model.

PubMed

Li, Ying; He, Ye; Zhang, Yu

2016-11-01

Biological processes actually are a dynamic molecular process over time. Time course gene expression experiments provide opportunities to explore patterns of gene expression change over a time and understand the dynamic behavior of gene expression, which is crucial for study on development and progression of biology and disease. Analysis of the gene expression time-course profiles has not been fully exploited so far. It is still a challenge problem. We propose a novel shape-based mixture model clustering method for gene expression time-course profiles to explore the significant gene groups. Based on multi-resolution fractal features and mixture clustering model, we proposed a multi-resolution shape mixture model algorithm. Multi-resolution fractal features is computed by wavelet decomposition, which explore patterns of change over time of gene expression at different resolution. Our proposed multi-resolution shape mixture model algorithm is a probabilistic framework which offers a more natural and robust way of clustering time-course gene expression. We assessed the performance of our proposed algorithm using yeast time-course gene expression profiles compared with several popular clustering methods for gene expression profiles. The grouped genes identified by different methods are evaluated by enrichment analysis of biological pathways and known protein-protein interactions from experiment evidence. The grouped genes identified by our proposed algorithm have more strong biological significance. A novel multi-resolution shape mixture model algorithm based on multi-resolution fractal features is proposed. Our proposed model provides a novel horizons and an alternative tool for visualization and analysis of time-course gene expression profiles. The R and Matlab program is available upon the request. Copyright © 2016 Elsevier Inc. All rights reserved.
Mutation Scanning in Wheat by Exon Capture and Next-Generation Sequencing.

PubMed

King, Robert; Bird, Nicholas; Ramirez-Gonzalez, Ricardo; Coghill, Jane A; Patil, Archana; Hassani-Pak, Keywan; Uauy, Cristobal; Phillips, Andrew L

2015-01-01

Targeted Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach to identify novel sequence variation in genomes, with the aims of investigating gene function and/or developing useful alleles for breeding. Despite recent advances in wheat genomics, most current TILLING methods are low to medium in throughput, being based on PCR amplification of the target genes. We performed a pilot-scale evaluation of TILLING in wheat by next-generation sequencing through exon capture. An oligonucleotide-based enrichment array covering ~2 Mbp of wheat coding sequence was used to carry out exon capture and sequencing on three mutagenised lines of wheat containing previously-identified mutations in the TaGA20ox1 homoeologous genes. After testing different mapping algorithms and settings, candidate SNPs were identified by mapping to the IWGSC wheat Chromosome Survey Sequences. Where sequence data for all three homoeologues were found in the reference, mutant calls were unambiguous; however, where the reference lacked one or two of the homoeologues, captured reads from these genes were mis-mapped to other homoeologues, resulting either in dilution of the variant allele frequency or assignment of mutations to the wrong homoeologue. Competitive PCR assays were used to validate the putative SNPs and estimate cut-off levels for SNP filtering. At least 464 high-confidence SNPs were detected across the three mutagenized lines, including the three known alleles in TaGA20ox1, indicating a mutation rate of ~35 SNPs per Mb, similar to that estimated by PCR-based TILLING. This demonstrates the feasibility of using exon capture for genome re-sequencing as a method of mutation detection in polyploid wheat, but accurate mutation calling will require an improved genomic reference with more comprehensive coverage of homoeologues.
Use of phylogenetic and phenotypic analyses to identify nonhemolytic streptococci isolated from bacteremic patients.

PubMed

Hoshino, Tomonori; Fujiwara, Taku; Kilian, Mogens

2005-12-01

The aim of this study was to evaluate molecular and phenotypic methods for the identification of nonhemolytic streptococci. A collection of 148 strains consisting of 115 clinical isolates from cases of infective endocarditis, septicemia, and meningitis and 33 reference strains, including type strains of all relevant Streptococcus species, were examined. Identification was performed by phylogenetic analysis of nucleotide sequences of four housekeeping genes, ddl, gdh, rpoB, and sodA; by PCR analysis of the glucosyltransferase (gtf) gene; and by conventional phenotypic characterization and identification using two commercial kits, Rapid ID 32 STREP and STREPTOGRAM and the associated databases. A phylogenetic tree based on concatenated sequences of the four housekeeping genes allowed unequivocal differentiation of recognized species and was used as the reference. Analysis of single gene sequences revealed deviation clustering in eight strains (5.4%) due to homologous recombination with other species. This was particularly evident in S. sanguinis and in members of the anginosus group of streptococci. The rate of correct identification of the strains by both commercial identification kits was below 50% but varied significantly between species. The most significant problems were observed with S. mitis and S. oralis and 11 Streptococcus species described since 1991. Our data indicate that identification based on multilocus sequence analysis is optimal. As a more practical alternative we recommend identification based on sodA sequences with reference to a comprehensive set of sequences that is available for downloading from our server. An analysis of the species distribution of 107 nonhemolytic streptococci from bacteremic patients showed a predominance of S. oralis and S. anginosus with various underlying infections.
Genomic Prediction for Quantitative Traits Is Improved by Mapping Variants to Gene Ontology Categories in Drosophila melanogaster

PubMed Central

Edwards, Stefan M.; Sørensen, Izel F.; Sarup, Pernille; Mackay, Trudy F. C.; Sørensen, Peter

2016-01-01

Predicting individual quantitative trait phenotypes from high-resolution genomic polymorphism data is important for personalized medicine in humans, plant and animal breeding, and adaptive evolution. However, this is difficult for populations of unrelated individuals when the number of causal variants is low relative to the total number of polymorphisms and causal variants individually have small effects on the traits. We hypothesized that mapping molecular polymorphisms to genomic features such as genes and their gene ontology categories could increase the accuracy of genomic prediction models. We developed a genomic feature best linear unbiased prediction (GFBLUP) model that implements this strategy and applied it to three quantitative traits (startle response, starvation resistance, and chill coma recovery) in the unrelated, sequenced inbred lines of the Drosophila melanogaster Genetic Reference Panel. Our results indicate that subsetting markers based on genomic features increases the predictive ability relative to the standard genomic best linear unbiased prediction (GBLUP) model. Both models use all markers, but GFBLUP allows differential weighting of the individual genetic marker relationships, whereas GBLUP weighs the genetic marker relationships equally. Simulation studies show that it is possible to further increase the accuracy of genomic prediction for complex traits using this model, provided the genomic features are enriched for causal variants. Our GFBLUP model using prior information on genomic features enriched for causal variants can increase the accuracy of genomic predictions in populations of unrelated individuals and provides a formal statistical framework for leveraging and evaluating information across multiple experimental studies to provide novel insights into the genetic architecture of complex traits. PMID:27235308
Ultra Low-Dose Radiation: Stress Responses and Impacts Using Rice as a Grass Model

PubMed Central

Rakwal, Randeep; Agrawal, Ganesh Kumar; Shibato, Junko; Imanaka, Tetsuji; Fukutani, Satoshi; Tamogami, Shigeru; Endo, Satoru; Sahoo, Sarata Kumar; Masuo, Yoshinori; Kimura, Shinzo

2009-01-01

We report molecular changes in leaves of rice plants (Oryza sativa L. - reference crop plant and grass model) exposed to ultra low-dose ionizing radiation, first using contaminated soil from the exclusion zone around Chernobyl reactor site. Results revealed induction of stress-related marker genes (Northern blot) and secondary metabolites (LC-MS/MS) in irradiated leaf segments over appropriate control. Second, employing the same in vitro model system, we replicated results of the first experiment using in-house fabricated sources of ultra low-dose gamma (γ) rays and selected marker genes by RT-PCR. Results suggest the usefulness of the rice model in studying ultra low-dose radiation response/s. PMID:19399245
Custom oligonucleotide array-based CGH: a reliable diagnostic tool for detection of exonic copy-number changes in multiple targeted genes

PubMed Central

Vasson, Aurélie; Leroux, Céline; Orhant, Lucie; Boimard, Mathieu; Toussaint, Aurélie; Leroy, Chrystel; Commere, Virginie; Ghiotti, Tiffany; Deburgrave, Nathalie; Saillour, Yoann; Atlan, Isabelle; Fouveaut, Corinne; Beldjord, Cherif; Valleix, Sophie; Leturcq, France; Dodé, Catherine; Bienvenu, Thierry; Chelly, Jamel; Cossée, Mireille

2013-01-01

The frequency of disease-related large rearrangements (referred to as copy-number mutations, CNMs) varies among genes, and search for these mutations has an important place in diagnostic strategies. In recent years, CGH method using custom-designed high-density oligonucleotide-based arrays allowed the development of a powerful tool for detection of alterations at the level of exons and made it possible to provide flexibility through the possibility of modeling chips. The aim of our study was to test custom-designed oligonucleotide CGH array in a diagnostic laboratory setting that analyses several genes involved in various genetic diseases, and to compare it with conventional strategies. To this end, we designed a 12-plex CGH array (135k; 135 000 probes/subarray) (Roche Nimblegen) with exonic and intronic oligonucleotide probes covering 26 genes routinely analyzed in the laboratory. We tested control samples with known CNMs and patients for whom genetic causes underlying their disorders were unknown. The contribution of this technique is undeniable. Indeed, it appeared reproducible, reliable and sensitive enough to detect heterozygous single-exon deletions or duplications, complex rearrangements and somatic mosaicism. In addition, it improves reliability of CNM detection and allows determination of boundaries precisely enough to direct targeted sequencing of breakpoints. All of these points, associated with the possibility of a simultaneous analysis of several genes and scalability ‘homemade' make it a valuable tool as a new diagnostic approach of CNMs. PMID:23340513
Gravitropic mechanisms derived from space experiments and magnetic gradients.

NASA Astrophysics Data System (ADS)

Hasenstein, Karl H.; Park, Myoung Ryoul

2016-07-01

Gravitropism is the result of a complex sequence of events that begins with the movement of dense particles, typically starch-filled amyloplasts in response to reorientation. Although these organelles change positions, it is not clear whether the critical signal is derived from sedimentation or dynamic interactions of amyloplasts with relevant membranes. Substituting gravity by high-gradient magnetic fields (HGMF) provides a localized stimulus for diamagnetic starch that is specific for amyloplasts and comparable to gravity without affecting other organelles. Experiments with Brassica rapa showed induction of root curvature by HGMF when roots moved sufficiently close to the magnetic gradient-inducing foci. The focused and short-range effectiveness of HGMFs provided a gravity-like stimulus and affected related gene expression. Root curvature was sensitive to the mutual alignment between roots and HGMF direction. Unrelated to any HGMF effects, the size of amyloplasts in space-grown roots increased by 30% compared to ground controls and suggests enhanced sensitivity in a gravity-reduced environment. Accompanying gene transcription studies showed greater differences between HGMF-exposed and space controls than between space and ground controls. This observation may lead to the identification of gravitropism-relevant genes. However, space grown roots showed stronger transcription of common reference genes such as actin and ubiquitin in magnetic fields than in non-magnetic conditions. In contrast, α-amylase, glucokinase and PIN encoding genes were transcribed stronger under non-magnetic conditions than under HGMF. The large number of comparisons between space, ground, and HGMF prompted the assessment of transcription differences between root segments, root-shoot junction, and seeds. Because presumed transcription of reference genes varied more than genes of interest, changes in gene expression cannot be based on reference genes. The data provide an example of complex and different responses to microgravity conditions, induced curvature, ground controls, clinorotation, and magnetic field exposure.
A Seasonal Time-Series Model Based on Gene Expression Programming for Predicting Financial Distress

PubMed Central

2018-01-01

The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i) the proposed model is different from the previous models lacking the concept of time series; (ii) the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii) the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies. PMID:29765399
A Seasonal Time-Series Model Based on Gene Expression Programming for Predicting Financial Distress.

PubMed

Cheng, Ching-Hsue; Chan, Chia-Pang; Yang, Jun-He

2018-01-01

The issue of financial distress prediction plays an important and challenging research topic in the financial field. Currently, there have been many methods for predicting firm bankruptcy and financial crisis, including the artificial intelligence and the traditional statistical methods, and the past studies have shown that the prediction result of the artificial intelligence method is better than the traditional statistical method. Financial statements are quarterly reports; hence, the financial crisis of companies is seasonal time-series data, and the attribute data affecting the financial distress of companies is nonlinear and nonstationary time-series data with fluctuations. Therefore, this study employed the nonlinear attribute selection method to build a nonlinear financial distress prediction model: that is, this paper proposed a novel seasonal time-series gene expression programming model for predicting the financial distress of companies. The proposed model has several advantages including the following: (i) the proposed model is different from the previous models lacking the concept of time series; (ii) the proposed integrated attribute selection method can find the core attributes and reduce high dimensional data; and (iii) the proposed model can generate the rules and mathematical formulas of financial distress for providing references to the investors and decision makers. The result shows that the proposed method is better than the listing classifiers under three criteria; hence, the proposed model has competitive advantages in predicting the financial distress of companies.
nGASP--the nematode genome annotation assessment project.

PubMed

Coghlan, Avril; Fiedler, Tristan J; McKay, Sheldon J; Flicek, Paul; Harris, Todd W; Blasiar, Darin; Stein, Lincoln D

2008-12-19

While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets across 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with unusually many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs posed the greatest difficulty for gene-finders. This experiment establishes a baseline of gene prediction accuracy in Caenorhabditis genomes, and has guided the choice of gene-finders for the annotation of newly sequenced genomes of Caenorhabditis and other nematode species. We have created new gene sets for C. briggsae, C. remanei, C. brenneri, C. japonica, and Brugia malayi using some of the best-performing gene-finders.
Identification of reference genes for RT-qPCR in the Antarctic moss Sanionia uncinata under abiotic stress conditions

PubMed Central

Park, Mira; Hong, Soon Gyu; Park, Hyun; Lee, Byeong-ha

2018-01-01

Sanionia uncinata is a dominant moss species in the maritime Antarctic. Due to its high adaptability to harsh environments, this extremophile plant has been considered a good target for studying the molecular adaptation mechanisms of plants to a variety of environmental stresses. Despite the importance of S. uncinata as a representative Antarctic plant species for the identification and characterization of genes associated with abiotic stress tolerance, suitable reference genes, which are critical for RT-qPCR analyses, have not yet been identified. In this report, 11 traditionally used and 6 novel candidate reference genes were selected from transcriptome data of S. uncinata and the expression stability of these genes was evaluated under various abiotic stress conditions using three statistical algorithms; geNorm, NormFinder, and BestKeeper. The stability ranking analysis selected the best reference genes depending on the stress conditions. Among the 17 candidates, the most stable references were POB1 and UFD2 for cold stress, POB1 and AKB for drought treatment, and UFD2 and AKB for the field samples from a different water contents in Antarctica. Overall, novel genes POB1 and AKB were the most reliable references across all samples, irrespective of experimental conditions. In addition, 6 novel candidate genes including AKB, POB1 and UFD2, were more stable than the housekeeping genes traditionally used for internal controls, indicating that transcriptome data can be useful for identifying novel robust normalizers. The reference genes validated in this study will be useful for improving the accuracy of RT-qPCR analysis for gene expression studies of S. uncinata in Antarctica and for further functional genomic analysis of bryophytes. PMID:29920565
Identification and validation of reference genes for quantification of target gene expression with quantitative real-time PCR for tall fescue under four abiotic stresses.

PubMed

Yang, Zhimin; Chen, Yu; Hu, Baoyun; Tan, Zhiqun; Huang, Bingru

2015-01-01

Tall fescue (Festuca arundinacea Schreb.) is widely utilized as a major forage and turfgrass species in the temperate regions of the world and is a valuable plant material for studying molecular mechanisms of grass stress tolerance due to its superior drought and heat tolerance among cool-season species. Selection of suitable reference genes for quantification of target gene expression is important for the discovery of molecular mechanisms underlying improved growth traits and stress tolerance. The stability of nine potential reference genes (ACT, TUB, EF1a, GAPDH, SAND, CACS, F-box, PEPKR1 and TIP41) was evaluated using four programs, GeNorm, NormFinder, BestKeeper, and RefFinder. The combinations of SAND and TUB or TIP41 and TUB were most stably expressed in salt-treated roots or leaves. The combinations of GAPDH with TIP41 or TUB were stable in roots and leaves under drought stress. TIP41 and PEPKR1 exhibited stable expression in cold-treated roots, and the combination of F-box, TIP41 and TUB was also stable in cold-treated leaves. CACS and TUB were the two most stable reference genes in heat-stressed roots. TIP41 combined with TUB and ACT was stably expressed in heat-stressed leaves. Finally, quantitative real-time polymerase chain reaction (qRT-PCR) assays of the target gene FaWRKY1 using the identified most stable reference genes confirmed the reliability of selected reference genes. The selection of suitable reference genes in tall fescue will allow for more accurate identification of stress-tolerance genes and molecular mechanisms conferring stress tolerance in this stress-tolerant species.
Selection of reference genes for qRT-PCR analysis of gene expression in sea cucumber Apostichopus japonicus during aestivation

NASA Astrophysics Data System (ADS)

Zhao, Ye; Chen, Muyan; Wang, Tianming; Sun, Lina; Xu, Dongxue; Yang, Hongsheng

2014-11-01

Quantitative real-time reverse transcription-polymerase chain reaction (qRT-PCR) is a technique that is widely used for gene expression analysis, and its accuracy depends on the expression stability of the internal reference genes used as normalization factors. However, many applications of qRT-PCR used housekeeping genes as internal controls without validation. In this study, the expression stability of eight candidate reference genes in three tissues (intestine, respiratory tree, and muscle) of the sea cucumber Apostichopus japonicus was assessed during normal growth and aestivation using the geNorm, NormFinder, delta CT, and RefFinder algorithms. The results indicate that the reference genes exhibited significantly different expression patterns among the three tissues during aestivation. In general, the β-tubulin (TUBB) gene was relatively stable in the intestine and respiratory tree tissues. The optimal reference gene combination for intestine was 40S ribosomal protein S18 (RPS18), TUBB, and NADH dehydrogenase (NADH); for respiratory tree, it was β-actin (ACTB), TUBB, and succinate dehydrogenase cytochrome B small subunit (SDHC); and for muscle it was α-tubulin (TUBA) and NADH dehydrogenase [ubiquinone] 1 α subcomplex subunit 13 (NDUFA13). These combinations of internal control genes should be considered for use in further studies of gene expression in A. japonicus during aestivation.
Transcription of putative tonoplast transporters in response to glyphosate and paraquat stress in Conyza bonariensis and Conyza canadensis and selection of reference genes for qRT-PCR.

PubMed

Moretti, Marcelo L; Alárcon-Reverte, Rocio; Pearce, Stephen; Morran, Sarah; Hanson, Bradley D

2017-01-01

Herbicide resistance is a challenge for modern agriculture further complicated by cases of resistance to multiple herbicides. Conyza bonariensis and Conyza canadensis are invasive weeds of field crops, orchards, and non-cropped areas in many parts of the world. In California, USA, Conyza populations resistant to the herbicides glyphosate and paraquat have recently been described. Although the mechanism conferring resistance to glyphosate and paraquat in these species was not elucidated, reduced translocation of these herbicides was observed under experimental conditions in both species. Glyphosate and paraquat resistance associated with reduced translocation are hypothesized to be a result of sequestration of herbicides into the vacuole, with the possible involvement of over-expression of genes encoding tonoplast transporters of ABC-transporter families in cases of glyphosate resistance or cationic amino acid transporters (CAT) in cases of paraquat resistance. However, gene expression in response to herbicide treatment has not been studied in glyphosate and paraquat resistant populations. In the current study, we evaluated the transcript levels of genes possibly involved in resistance using real-time PCR. First, we evaluated eight candidate reference genes following herbicide treatment and selected three genes that exhibited stable expression profiles; ACTIN, HEAT-SHOCK-PROTEIN-70, and CYCLOPHILIN. The reference genes identified here can be used for further studies related to plant-herbicide interactions. We used these reference genes to assay the transcript levels of EPSPS, ABC transporters, and CAT in response to herbicide treatment in susceptible and resistant Conyza spp. lines. No transcription changes were observed in EPSPS or CAT genes after glyphosate or paraquat treatment, suggesting that these genes are not involved in the resistance mechanism. Transcription of the two ABC transporter genes increased following glyphosate treatment in all Conyza spp. lines. Transcription of ABC transporters also increased after paraquat treatment in all three lines of C. bonariensis. However, in C. canadensis, paraquat treatment increased transcription of only one ABC transporter gene in the susceptible line. The increase in transcription of ABC transporters after herbicide treatment is likely a stress response based on similar response observed across all Conyza lines regardless of resistance or sensitivity to glyphosate or paraquat, thus these genes do not appear to be directly involved in the mechanism of resistance in Conyza spp.
Transcription of putative tonoplast transporters in response to glyphosate and paraquat stress in Conyza bonariensis and Conyza canadensis and selection of reference genes for qRT-PCR

PubMed Central

Alárcon-Reverte, Rocio; Pearce, Stephen; Morran, Sarah; Hanson, Bradley D.

2017-01-01

Herbicide resistance is a challenge for modern agriculture further complicated by cases of resistance to multiple herbicides. Conyza bonariensis and Conyza canadensis are invasive weeds of field crops, orchards, and non-cropped areas in many parts of the world. In California, USA, Conyza populations resistant to the herbicides glyphosate and paraquat have recently been described. Although the mechanism conferring resistance to glyphosate and paraquat in these species was not elucidated, reduced translocation of these herbicides was observed under experimental conditions in both species. Glyphosate and paraquat resistance associated with reduced translocation are hypothesized to be a result of sequestration of herbicides into the vacuole, with the possible involvement of over-expression of genes encoding tonoplast transporters of ABC-transporter families in cases of glyphosate resistance or cationic amino acid transporters (CAT) in cases of paraquat resistance. However, gene expression in response to herbicide treatment has not been studied in glyphosate and paraquat resistant populations. In the current study, we evaluated the transcript levels of genes possibly involved in resistance using real-time PCR. First, we evaluated eight candidate reference genes following herbicide treatment and selected three genes that exhibited stable expression profiles; ACTIN, HEAT-SHOCK-PROTEIN-70, and CYCLOPHILIN. The reference genes identified here can be used for further studies related to plant-herbicide interactions. We used these reference genes to assay the transcript levels of EPSPS, ABC transporters, and CAT in response to herbicide treatment in susceptible and resistant Conyza spp. lines. No transcription changes were observed in EPSPS or CAT genes after glyphosate or paraquat treatment, suggesting that these genes are not involved in the resistance mechanism. Transcription of the two ABC transporter genes increased following glyphosate treatment in all Conyza spp. lines. Transcription of ABC transporters also increased after paraquat treatment in all three lines of C. bonariensis. However, in C. canadensis, paraquat treatment increased transcription of only one ABC transporter gene in the susceptible line. The increase in transcription of ABC transporters after herbicide treatment is likely a stress response based on similar response observed across all Conyza lines regardless of resistance or sensitivity to glyphosate or paraquat, thus these genes do not appear to be directly involved in the mechanism of resistance in Conyza spp. PMID:28700644
Selection of reference genes for expression analysis of Kumamoto and Portuguese oysters and their hybrid

NASA Astrophysics Data System (ADS)

Yan, Lulu; Su, Jiaqi; Wang, Zhaoping; Yan, Xiwu; Yu, Ruihai

2017-12-01

Quantitative real-time polymerase chain reaction (qRT-PCR) is a rapid and reliable technique which has been widely used to quantifying gene transcripts (expression analysis). It is also employed for studying heterosis, hybridization breeding and hybrid tolerability of oysters, an ecologically and economically important taxonomic group. For these studies, selection of a suitable set of housekeeping genes as references is crucial for correct interpretation of qRT-PCR data. To identify suitable reference genes for oysters during low temperature and low salinity stresses, we analyzed twelve genes from the gill tissue of Crassostrea sikamea (SS), Crassostrea angulata (AA) and their hybrid (SA), which included three ribosomal genes, 28S ribosomal protein S5 ( RPS5), ribosomal protein L35 ( RPL35), and 60S ribosomal protein L29 ( RPL29); three structural genes, tubulin gamma ( TUBγ), annexin A6 and A7 ( AA6 and AA7); three metabolic pathway genes, ornithine decarboxylase ( OD), glyceraldehyde-3-phosphate dehydrogenase ( GAPDH) and glutathione S-transferase P1 ( GSP); two transcription factors, elongation factor 1 alpha and beta ( EF1α and EF1β); and one protein synthesis gene (ubiquitin ( UBQ). Primers specific for these genes were successfully developed for the three groups of oysters. Three different algorithms, geNorm, NormFinder and BestKeeper, were used to evaluate the expression stability of these candidate genes. BestKeeper program was found to be the most reliable. Based on our analysis, we found that the expression of RPL35 and EF1α was stable under low salinity stress, and the expression of OD, GAPDH and EF1α was stable under low temperature stress in hybrid (SA) oyster; the expression of RPS5 and GAPDH was stable under low salinity stress, and the expression of RPS5, UBQ, GAPDH was stable under low temperature stress in SS oyster; the expression of RPS5, GAPDH, EF1β and AA7 was stable under low salinity stress, and the expression of RPL35, EF1α, GAPDH and EF1β was stable under low temperature stress in AA oyster. Furthermore, to evaluate their suitability, the reference genes were used to quantify six target genes. In conclusion, we have successfully developed primers appropriate for the expression analysis in SS, SA and AA.
Effect of carbon monoxide on gene expression in cerebrocortical astrocytes: Validation of reference genes for quantitative real-time PCR.

PubMed

Oliveira, Sara R; Vieira, Helena L A; Duarte, Carlos B

2015-09-15

Quantitative real-time reverse transcription-polymerase chain reaction (qRT-PCR) is a widely used technique to characterize changes in gene expression in complex cellular and tissue processes, such as cytoprotection or inflammation. The accurate assessment of changes in gene expression depends on the selection of adequate internal reference gene(s). Carbon monoxide (CO) affects several metabolic pathways and de novo protein synthesis is crucial in the cellular responses to this gasotransmitter. Herein a selection of commonly used reference genes was analyzed to identify the most suitable internal control genes to evaluate the effect of CO on gene expression in cultured cerebrocortical astrocytes. The cells were exposed to CO by treatment with CORM-A1 (CO releasing molecule A1) and four different algorithms (geNorm, NormFinder, Delta Ct and BestKeeper) were applied to evaluate the stability of eight putative reference genes. Our results indicate that Gapdh (glyceraldehyde-3-phosphate dehydrogenase) together with Ppia (peptidylpropyl isomerase A) is the most suitable gene pair for normalization of qRT-PCR results under the experimental conditions used. Pgk1 (phosphoglycerate kinase 1), Hprt1 (hypoxanthine guanine phosphoribosyl transferase I), Sdha (Succinate Dehydrogenase Complex, Subunit A), Tbp (TATA box binding protein), Actg1 (actin gamma 1) and Rn18s (18S rRNA) genes presented less stable expression profiles in cultured cortical astrocytes exposed to CORM-A1 for up to 60 min. For validation, we analyzed the effect of CO on the expression of Bdnf and bcl-2. Different results were obtained, depending on the reference genes used. A significant increase in the expression of both genes was found when the results were normalized with Gapdh and Ppia, in contrast with the results obtained when the other genes were used as reference. These findings highlight the need for a proper and accurate selection of the reference genes used in the quantification of qRT-PCR results in studies on the effect of CO in gene expression. Copyright © 2015 Elsevier Inc. All rights reserved.
ROCker: accurate detection and quantification of target genes in short-read metagenomic data sets by modeling sliding-window bitscores

DOE PAGES

Orellana, Luis H.; Rodriguez-R, Luis M.; Konstantinidis, Konstantinos T.

2016-10-07

Functional annotation of metagenomic and metatranscriptomic data sets relies on similarity searches based on e-value thresholds resulting in an unknown number of false positive and negative matches. To overcome these limitations, we introduce ROCker, aimed at identifying position-specific, most-discriminant thresholds in sliding windows along the sequence of a target protein, accounting for non-discriminative domains shared by unrelated proteins. ROCker employs the receiver operating characteristic (ROC) curve to minimize false discovery rate (FDR) and calculate the best thresholds based on how simulated shotgun metagenomic reads of known composition map onto well-curated reference protein sequences and thus, differs from HMM profiles andmore » related methods. We showcase ROCker using ammonia monooxygenase (amoA) and nitrous oxide reductase (nosZ) genes, mediating oxidation of ammonia and the reduction of the potent greenhouse gas, N 2O, to inert N 2, respectively. ROCker typically showed 60-fold lower FDR when compared to the common practice of using fixed e-values. Previously uncounted ‘atypical’ nosZ genes were found to be two times more abundant, on average, than their typical counterparts in most soil metagenomes and the abundance of bacterial amoA was quantified against the highly-related particulate methane monooxygenase (pmoA). Therefore, ROCker can reliably detect and quantify target genes in short-read metagenomes.« less

ROCker: accurate detection and quantification of target genes in short-read metagenomic data sets by modeling sliding-window bitscores

DOE Office of Scientific and Technical Information (OSTI.GOV)

Orellana, Luis H.; Rodriguez-R, Luis M.; Konstantinidis, Konstantinos T.

Functional annotation of metagenomic and metatranscriptomic data sets relies on similarity searches based on e-value thresholds resulting in an unknown number of false positive and negative matches. To overcome these limitations, we introduce ROCker, aimed at identifying position-specific, most-discriminant thresholds in sliding windows along the sequence of a target protein, accounting for non-discriminative domains shared by unrelated proteins. ROCker employs the receiver operating characteristic (ROC) curve to minimize false discovery rate (FDR) and calculate the best thresholds based on how simulated shotgun metagenomic reads of known composition map onto well-curated reference protein sequences and thus, differs from HMM profiles andmore » related methods. We showcase ROCker using ammonia monooxygenase (amoA) and nitrous oxide reductase (nosZ) genes, mediating oxidation of ammonia and the reduction of the potent greenhouse gas, N 2O, to inert N 2, respectively. ROCker typically showed 60-fold lower FDR when compared to the common practice of using fixed e-values. Previously uncounted ‘atypical’ nosZ genes were found to be two times more abundant, on average, than their typical counterparts in most soil metagenomes and the abundance of bacterial amoA was quantified against the highly-related particulate methane monooxygenase (pmoA). Therefore, ROCker can reliably detect and quantify target genes in short-read metagenomes.« less
ROCker: accurate detection and quantification of target genes in short-read metagenomic data sets by modeling sliding-window bitscores

PubMed Central

2017-01-01

Abstract Functional annotation of metagenomic and metatranscriptomic data sets relies on similarity searches based on e-value thresholds resulting in an unknown number of false positive and negative matches. To overcome these limitations, we introduce ROCker, aimed at identifying position-specific, most-discriminant thresholds in sliding windows along the sequence of a target protein, accounting for non-discriminative domains shared by unrelated proteins. ROCker employs the receiver operating characteristic (ROC) curve to minimize false discovery rate (FDR) and calculate the best thresholds based on how simulated shotgun metagenomic reads of known composition map onto well-curated reference protein sequences and thus, differs from HMM profiles and related methods. We showcase ROCker using ammonia monooxygenase (amoA) and nitrous oxide reductase (nosZ) genes, mediating oxidation of ammonia and the reduction of the potent greenhouse gas, N2O, to inert N2, respectively. ROCker typically showed 60-fold lower FDR when compared to the common practice of using fixed e-values. Previously uncounted ‘atypical’ nosZ genes were found to be two times more abundant, on average, than their typical counterparts in most soil metagenomes and the abundance of bacterial amoA was quantified against the highly-related particulate methane monooxygenase (pmoA). Therefore, ROCker can reliably detect and quantify target genes in short-read metagenomes. PMID:28180325
Novel primers for complete mitochondrial cytochrome b genesequencing in mammals

USGS Publications Warehouse

Naidu, Ashwin; Fitak, Robert R.; Munguia-Vega, Adrian; Culver, Melanie

2011-01-01

Sequence-based species identification relies on the extent and integrity of sequence data available in online databases such as GenBank. When identifying species from a sample of unknown origin, partial DNA sequences obtained from the sample are aligned against existing sequences in databases. When the sequence from the matching species is not present in the database, high-scoring alignments with closely related sequences might produce unreliable results on species identity. For species identification in mammals, the cytochrome b (cyt b) gene has been identified to be highly informative; thus, large amounts of reference sequence data from the cyt b gene are much needed. To enhance availability of cyt b gene sequence data on a large number of mammalian species in GenBank and other such publicly accessible online databases, we identified a primer pair for complete cyt b gene sequencing in mammals. Using this primer pair, we successfully PCR amplified and sequenced the complete cyt b gene from 40 of 44 mammalian species representing 10 orders of mammals. We submitted 40 complete, correctly annotated, cyt b protein coding sequences to GenBank. To our knowledge, this is the first single primer pair to amplify the complete cyt b gene in a broad range of mammalian species. This primer pair can be used for the addition of new cyt b gene sequences and to enhance data available on species represented in GenBank. The availability of novel and complete gene sequences as high-quality reference data can improve the reliability of sequence-based species identification.
Genetics Home Reference: Rothmund-Thomson syndrome

MedlinePlus

... syndromes are also characterized by radial ray defects, skeletal abnormalities, and slow growth. All of these conditions can be caused by mutations in the same gene. Based on these similarities, researchers are investigating whether ...
Evaluation of reference genes for quantitative real-time RT-PCR analysis of gene expression in Nile tilapia (Oreochromis niloticus).

PubMed

Yang, Chang Geng; Wang, Xian Li; Tian, Juan; Liu, Wei; Wu, Fan; Jiang, Ming; Wen, Hua

2013-09-15

Quantitative real-time reverse-transcriptase polymerase chain reaction (RT-qPCR) has been used frequently to study gene expression related to fish immunology. In such studies, a stable reference gene should be selected to correct the expression of the target gene. In this study, seven candidate reference genes (glyceraldehyde-3-phosphate dehydrogenase (GADPH), ubiquitin-conjugating enzyme (UBCE), 18S ribosomal RNA (18S rRNA), beta-2-microglobulin (B2M), elongation factor 1 alpha (EF1A), tubulin alpha chain-like (TUBA) and beta actin (ACTB)), were selected to analyze their stability and normalization in seven tissues (liver, spleen, kidney, brain, heart, muscle and intestine) of Nile tilapia (Oreochromis niloticus) challenged with Streptococcus agalactiae or Streptococcus iniae, respectively. The results showed that all the candidate reference genes exhibited tissue-dependent transcriptional variations. With PBS injection as a control, UBCE was the most stable and suitable single reference gene in the intestine, liver, brain, kidney, and spleen after S. iniae infection, and in the liver, kidney, and spleen after S. agalactiae infection. EF1A was the most suitable in heart and muscle after S. iniae or S. agalactiae infection. GADPH was the most suitable gene in intestine and brain after S. agalactiae infection. In normal conditions, UBCE and 18S rRNA were the most stably expressed genes across the various tissues. These results showed that for RT-qPCR analysis of tilapia, selecting two or more reference genes may be more suitable for cross-tissue analysis of gene expression. Copyright © 2013 Elsevier B.V. All rights reserved.
In-depth analysis of internal control genes for quantitative real-time PCR in Brassica oleracea var. botrytis.

PubMed

Sheng, X G; Zhao, Z Q; Yu, H F; Wang, J S; Zheng, C F; Gu, H H

2016-07-15

Quantitative reverse-transcription PCR (qRT-PCR) is a versatile technique for the analysis of gene expression. The selection of stable reference genes is essential for the application of this technique. Cauliflower (Brassica oleracea L. var. botrytis) is a commonly consumed vegetable that is rich in vitamin, calcium, and iron. Thus far, to our knowledge, there have been no reports on the validation of suitable reference genes for the data normalization of qRT-PCR in cauliflower. In the present study, we analyzed 12 candidate housekeeping genes in cauliflower subjected to different abiotic stresses, hormone treatment conditions, and accessions. geNorm and NormFinder algorithms were used to assess the expression stability of these genes. ACT2 and TIP41 were selected as suitable reference genes across all experimental samples in this study. When different accessions were compared, ACT2 and UNK3 were found to be the most suitable reference genes. In the hormone and abiotic stress treatments, ACT2, TIP41, and UNK2 were the most stably expressed. Our study also provided guidelines for selecting the best reference genes under various experimental conditions.
Evaluation of Reference Genes for Gene Expression Analysis Using Quantitative RT-PCR in Azospirillum brasilense

PubMed Central

McMillan, Mary; Pereg, Lily

2014-01-01

Azospirillum brasilense is a nitrogen fixing bacterium that has been shown to have various beneficial effects on plant growth and yield. Under normal conditions A. brasilense exists in a motile flagellated form, which, under starvation or stress conditions, can undergo differentiation into an encapsulated, cyst-like form. Quantitative RT-PCR can be used to analyse changes in gene expression during this differentiation process. The accuracy of quantification of mRNA levels by qRT-PCR relies on the normalisation of data against stably expressed reference genes. No suitable set of reference genes has yet been described for A. brasilense. Here we evaluated the expression of ten candidate reference genes (16S rRNA, gapB, glyA, gyrA, proC, pykA, recA, recF, rpoD, and tpiA) in wild-type and mutant A. brasilense strains under different culture conditions, including conditions that induce differentiation. Analysis with the software programs BestKeeper, NormFinder and GeNorm indicated that gyrA, glyA and recA are the most stably expressed reference genes in A. brasilense. The results also suggested that the use of two reference genes (gyrA and glyA) is sufficient for effective normalisation of qRT-PCR data. PMID:24841066
Evaluation of reference genes for gene expression analysis using quantitative RT-PCR in Azospirillum brasilense.

PubMed

McMillan, Mary; Pereg, Lily

2014-01-01

Azospirillum brasilense is a nitrogen fixing bacterium that has been shown to have various beneficial effects on plant growth and yield. Under normal conditions A. brasilense exists in a motile flagellated form, which, under starvation or stress conditions, can undergo differentiation into an encapsulated, cyst-like form. Quantitative RT-PCR can be used to analyse changes in gene expression during this differentiation process. The accuracy of quantification of mRNA levels by qRT-PCR relies on the normalisation of data against stably expressed reference genes. No suitable set of reference genes has yet been described for A. brasilense. Here we evaluated the expression of ten candidate reference genes (16S rRNA, gapB, glyA, gyrA, proC, pykA, recA, recF, rpoD, and tpiA) in wild-type and mutant A. brasilense strains under different culture conditions, including conditions that induce differentiation. Analysis with the software programs BestKeeper, NormFinder and GeNorm indicated that gyrA, glyA and recA are the most stably expressed reference genes in A. brasilense. The results also suggested that the use of two reference genes (gyrA and glyA) is sufficient for effective normalisation of qRT-PCR data.
Different distribution patterns of ten virulence genes in Legionella reference strains and strains isolated from environmental water and patients.

PubMed

Zhan, Xiao-Yong; Hu, Chao-Hui; Zhu, Qing-Yi

2016-04-01

Virulence genes are distinct regions of DNA which are present in the genome of pathogenic bacteria and absent in nonpathogenic strains of the same or related species. Virulence genes are frequently associated with bacterial pathogenicity in genus Legionella. In the present study, an assay was performed to detect ten virulence genes, including iraA, iraB, lvrA, lvrB, lvhD, cpxR, cpxA, dotA, icmC and icmD in different pathogenicity islands of 47 Legionella reference strains, 235 environmental strains isolated from water, and 4 clinical strains isolated from the lung tissue of pneumonia patients. The distribution frequencies of these genes in reference or/and environmental L. pneumophila strains were much higher than those in reference non-L. pneumophila or/and environmental non-L. pneumophila strains, respectively. L. pneumophila clinical strains also maintained higher frequencies of these genes compared to four other types of Legionella strains. Distribution frequencies of these genes in reference L. pneumophila strains were similar to those in environmental L. pneumophila strains. In contrast, environmental non-L. pneumophila maintained higher frequencies of these genes compared to those found in reference non-L. pneumophila strains. This study illustrates the association of virulence genes with Legionella pathogenicity and reveals the possible virulence evolution of non-L. pneumophia strains isolated from environmental water.
Selection of internal reference genes for normalization of quantitative reverse transcription polymerase chain reaction (qRT-PCR) analysis in the canine brain and other organs.

PubMed

Park, Sang-Je; Huh, Jae-Won; Kim, Young-Hyun; Lee, Sang-Rae; Kim, Sang-Hyun; Kim, Sun-Uk; Kim, Heui-Soo; Kim, Min Kyu; Chang, Kyu-Tae

2013-05-01

Quantitative reverse transcription polymerase chain reaction (qRT-PCR) is a specific and sensitive technique for quantifying gene expression. To analyze qRT-PCR data accurately, suitable reference genes that show consistent expression patterns across different tissues and experimental conditions should be selected. The objective of this study was to obtain the most stable reference genes in dogs, using samples from 13 different brain tissues and 10 other organs. 16 well-known candidate reference genes were analyzed by the geNorm, NormFinder, and BestKeeper programs. Brain tissues were derived from several different anatomical regions, including the forebrain, cerebrum, diencephalon, hindbrain, and metencephalon, and grouped accordingly. Combination of the three different analyses clearly indicated that the ideal reference genes are ribosomal protien S5 (RPS5) in whole brain, RPL8 and RPS5 in whole body tissues, RPS5 and RPS19 in the forebrain and cerebrum, RPL32 and RPS19 in the diencephalon, GAPDH and RPS19 in the hindbrain, and MRPS7 and RPL13A in the metencephalon. These genes were identified as ideal for the normalization of qRT-PCR results in the respective tissues. These findings indicate more suitable and stable reference genes for future studies of canine gene expression.
Identification of Reliable Reference Genes for Quantification of MicroRNAs in Serum Samples of Sulfur Mustard-Exposed Veterans.

PubMed

Gharbi, Sedigheh; Shamsara, Mehdi; Khateri, Shahriar; Soroush, Mohammad Reza; Ghorbanmehr, Nassim; Tavallaei, Mahmood; Nourani, Mohammad Reza; Mowla, Seyed Javad

2015-01-01

In spite of accumulating information about pathological aspects of sulfur mustard (SM), the precise mechanism responsible for its effects is not well understood. Circulating microRNAs (miRNAs) are promising biomarkers for disease diagnosis and prognosis. Accurate normalization using appropriate reference genes, is a critical step in miRNA expression studies. In this study, we aimed to identify appropriate reference gene for microRNA quantification in serum samples of SM victims. In this case and control experimental study, using quantitative real-time polymerase chain reaction (qRT-PCR), we evaluated the suitability of a panel of small RNAs including SNORD38B, SNORD49A, U6, 5S rRNA, miR-423-3p, miR-191, miR-16 and miR-103 in sera of 28 SM-exposed veterans of Iran-Iraq war (1980-1988) and 15 matched control volunteers. Different statistical algorithms including geNorm, Normfinder, best-keeper and comparative delta-quantification cycle (Cq) method were employed to find the least variable reference gene. miR-423-3p was identified as the most stably expressed reference gene, and miR- 103 and miR-16 ranked after that. We demonstrate that non-miRNA reference genes have the least stabil- ity in serum samples and that some house-keeping miRNAs may be used as more reliable reference genes for miRNAs in serum. In addition, using the geometric mean of two reference genes could increase the reliability of the normalizers.
Comparative transcriptomics of 5 high-altitude vertebrates and their low-altitude relatives

PubMed Central

Tang, Qianzi; Zhou, Xuming; Jin, Long; Guan, Jiuqiang; Liu, Rui; Li, Jing; Long, Kereng; Tian, Shilin; Che, Tiandong; Hu, Silu; Liang, Yan; Yang, Xuemei; Tao, Xuan; Zhong, Zhijun; Wang, Guosong; Chen, Xiaohui; Li, Diyan; Ma, Jideng; Wang, Xun; Mai, Miaomiao; Jiang, An’an; Luo, Xiaolin; Lv, Xuebin; Gladyshev, Vadim N; Li, Xuewei

2017-01-01

Abstract Background Species living at high altitude are subject to strong selective pressures due to inhospitable environments (e.g., hypoxia, low temperature, high solar radiation, and lack of biological production), making these species valuable models for comparative analyses of local adaptation. Studies that have examined high-altitude adaptation have identified a vast array of rapidly evolving genes that characterize the dramatic phenotypic changes in high-altitude animals. However, how high-altitude environment shapes gene expression programs remains largely unknown. Findings We generated a total of 910 Gb of high-quality RNA-seq data for 180 samples derived from 6 tissues of 5 agriculturally important high-altitude vertebrates (Tibetan chicken, Tibetan pig, Tibetan sheep, Tibetan goat, and yak) and their cross-fertile relatives living in geographically neighboring low-altitude regions. Of these, ∼75% reads could be aligned to their respective reference genomes, and on average ∼60% of annotated protein coding genes in each organism showed FPKM expression values greater than 0.5. We observed a general concordance in topological relationships between the nucleotide alignments and gene expression–based trees. Tissue and species accounted for markedly more variance than altitude based on either the expression or the alternative splicing patterns. Cross-species clustering analyses showed a tissue-dominated pattern of gene expression and a species-dominated pattern for alternative splicing. We also identified numerous differentially expressed genes that could potentially be involved in phenotypic divergence shaped by high-altitude adaptation. Conclusions These data serve as a valuable resource for examining the convergence and divergence of gene expression changes between species as they adapt or acclimatize to high-altitude environments. PMID:29149296
Comparative transcriptomics of 5 high-altitude vertebrates and their low-altitude relatives.

PubMed

Tang, Qianzi; Gu, Yiren; Zhou, Xuming; Jin, Long; Guan, Jiuqiang; Liu, Rui; Li, Jing; Long, Kereng; Tian, Shilin; Che, Tiandong; Hu, Silu; Liang, Yan; Yang, Xuemei; Tao, Xuan; Zhong, Zhijun; Wang, Guosong; Chen, Xiaohui; Li, Diyan; Ma, Jideng; Wang, Xun; Mai, Miaomiao; Jiang, An'an; Luo, Xiaolin; Lv, Xuebin; Gladyshev, Vadim N; Li, Xuewei; Li, Mingzhou

2017-12-01

Species living at high altitude are subject to strong selective pressures due to inhospitable environments (e.g., hypoxia, low temperature, high solar radiation, and lack of biological production), making these species valuable models for comparative analyses of local adaptation. Studies that have examined high-altitude adaptation have identified a vast array of rapidly evolving genes that characterize the dramatic phenotypic changes in high-altitude animals. However, how high-altitude environment shapes gene expression programs remains largely unknown. We generated a total of 910 Gb of high-quality RNA-seq data for 180 samples derived from 6 tissues of 5 agriculturally important high-altitude vertebrates (Tibetan chicken, Tibetan pig, Tibetan sheep, Tibetan goat, and yak) and their cross-fertile relatives living in geographically neighboring low-altitude regions. Of these, ∼75% reads could be aligned to their respective reference genomes, and on average ∼60% of annotated protein coding genes in each organism showed FPKM expression values greater than 0.5. We observed a general concordance in topological relationships between the nucleotide alignments and gene expression-based trees. Tissue and species accounted for markedly more variance than altitude based on either the expression or the alternative splicing patterns. Cross-species clustering analyses showed a tissue-dominated pattern of gene expression and a species-dominated pattern for alternative splicing. We also identified numerous differentially expressed genes that could potentially be involved in phenotypic divergence shaped by high-altitude adaptation. These data serve as a valuable resource for examining the convergence and divergence of gene expression changes between species as they adapt or acclimatize to high-altitude environments. © The Authors 2017. Published by Oxford University Press.
Identification of optimal reference genes for RT-qPCR in the rat hypothalamus and intestine for the study of obesity.

PubMed

Li, B; Matter, E K; Hoppert, H T; Grayson, B E; Seeley, R J; Sandoval, D A

2014-02-01

Obesity has a complicated metabolic pathology, and defining the underlying mechanisms of obesity requires integrative studies with molecular end points. Real-time quantitative PCR (RT-qPCR) is a powerful tool that has been widely utilized. However, the importance of using carefully validated reference genes in RT-qPCR seems to have been overlooked in obesity-related research. The objective of this study was to select a set of reference genes with stable expressions to be used for RT-qPCR normalization in rats under fasted vs re-fed and chow vs high-fat diet (HFD) conditions. Male long-Evans rats were treated under four conditions: chow/fasted, chow/re-fed, HFD/fasted and HFD/re-fed. Expression stabilities of 13 candidate reference genes were evaluated in the rat hypothalamus, duodenum, jejunum and ileum using the ReFinder software program. The optimal number of reference genes needed for RT-qPCR analyses was determined using geNorm. Using geNorm analysis, we found that it was sufficient to use the two most stably expressed genes as references in RT-qPCR analyses for each tissue under specific experimental conditions. B2M and RPLP0 in the hypothalamus, RPS18 and HMBS in the duodenum, RPLP2 and RPLP0 in the jejunum and RPS18 and YWHAZ in the ileum were the most suitable pairs for a normalization study when the four aforementioned experimental conditions were considered. Our study demonstrates that gene expression levels of reference genes commonly used in obesity-related studies, such as ACTB or RPS18, are altered by changes in acute or chronic energy status. These findings underline the importance of using reference genes that are stable in expression across experimental conditions when studying the rat hypothalamus and intestine, because these tissues have an integral role in the regulation of energy homeostasis. It is our hope that this study will raise awareness among obesity researchers on the essential need for reference gene validation in gene expression studies.
Selection and validation of reference genes for quantitative gene expression analyses in various tissues and seeds at different developmental stages in Bixa orellana L.

PubMed

Moreira, Viviane S; Soares, Virgínia L F; Silva, Raner J S; Sousa, Aurizangela O; Otoni, Wagner C; Costa, Marcio G C

2018-05-01

Bixa orellana L., popularly known as annatto, produces several secondary metabolites of pharmaceutical and industrial interest, including bixin, whose molecular basis of biosynthesis remain to be determined. Gene expression analysis by quantitative real-time PCR (qPCR) is an important tool to advance such knowledge. However, correct interpretation of qPCR data requires the use of suitable reference genes in order to reduce experimental variations. In the present study, we have selected four different candidates for reference genes in B. orellana , coding for 40S ribosomal protein S9 (RPS9), histone H4 (H4), 60S ribosomal protein L38 (RPL38) and 18S ribosomal RNA (18SrRNA). Their expression stabilities in different tissues (e.g. flower buds, flowers, leaves and seeds at different developmental stages) were analyzed using five statistical tools (NormFinder, geNorm, BestKeeper, ΔCt method and RefFinder). The results indicated that RPL38 is the most stable gene in different tissues and stages of seed development and 18SrRNA is the most unstable among the analyzed genes. In order to validate the candidate reference genes, we have analyzed the relative expression of a target gene coding for carotenoid cleavage dioxygenase 1 (CCD1) using the stable RPL38 and the least stable gene, 18SrRNA , for normalization of the qPCR data. The results demonstrated significant differences in the interpretation of the CCD1 gene expression data, depending on the reference gene used, reinforcing the importance of the correct selection of reference genes for normalization.
Validation of reference genes aiming accurate normalization of qRT-PCR data in Dendrocalamus latiflorus Munro.

PubMed

Liu, Mingying; Jiang, Jing; Han, Xiaojiao; Qiao, Guirong; Zhuo, Renying

2014-01-01

Dendrocalamus latiflorus Munro distributes widely in subtropical areas and plays vital roles as valuable natural resources. The transcriptome sequencing for D. latiflorus Munro has been performed and numerous genes especially those predicted to be unique to D. latiflorus Munro were revealed. qRT-PCR has become a feasible approach to uncover gene expression profiling, and the accuracy and reliability of the results obtained depends upon the proper selection of stable reference genes for accurate normalization. Therefore, a set of suitable internal controls should be validated for D. latiflorus Munro. In this report, twelve candidate reference genes were selected and the assessment of gene expression stability was performed in ten tissue samples and four leaf samples from seedlings and anther-regenerated plants of different ploidy. The PCR amplification efficiency was estimated, and the candidate genes were ranked according to their expression stability using three software packages: geNorm, NormFinder and Bestkeeper. GAPDH and EF1α were characterized to be the most stable genes among different tissues or in all the sample pools, while CYP showed low expression stability. RPL3 had the optimal performance among four leaf samples. The application of verified reference genes was illustrated by analyzing ferritin and laccase expression profiles among different experimental sets. The analysis revealed the biological variation in ferritin and laccase transcript expression among the tissues studied and the individual plants. geNorm, NormFinder, and BestKeeper analyses recommended different suitable reference gene(s) for normalization according to the experimental sets. GAPDH and EF1α had the highest expression stability across different tissues and RPL3 for the other sample set. This study emphasizes the importance of validating superior reference genes for qRT-PCR analysis to accurately normalize gene expression of D. latiflorus Munro.
Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes.

PubMed

Nielsen, H Bjørn; Almeida, Mathieu; Juncker, Agnieszka Sierakowska; Rasmussen, Simon; Li, Junhua; Sunagawa, Shinichi; Plichta, Damian R; Gautier, Laurent; Pedersen, Anders G; Le Chatelier, Emmanuelle; Pelletier, Eric; Bonde, Ida; Nielsen, Trine; Manichanh, Chaysavanh; Arumugam, Manimozhiyan; Batto, Jean-Michel; Quintanilha Dos Santos, Marcelo B; Blom, Nikolaj; Borruel, Natalia; Burgdorf, Kristoffer S; Boumezbeur, Fouad; Casellas, Francesc; Doré, Joël; Dworzynski, Piotr; Guarner, Francisco; Hansen, Torben; Hildebrand, Falk; Kaas, Rolf S; Kennedy, Sean; Kristiansen, Karsten; Kultima, Jens Roat; Léonard, Pierre; Levenez, Florence; Lund, Ole; Moumen, Bouziane; Le Paslier, Denis; Pons, Nicolas; Pedersen, Oluf; Prifti, Edi; Qin, Junjie; Raes, Jeroen; Sørensen, Søren; Tap, Julien; Tims, Sebastian; Ussery, David W; Yamada, Takuji; Renault, Pierre; Sicheritz-Ponten, Thomas; Bork, Peer; Wang, Jun; Brunak, Søren; Ehrlich, S Dusko

2014-08-01

Most current approaches for analyzing metagenomic data rely on comparisons to reference genomes, but the microbial diversity of many environments extends far beyond what is covered by reference databases. De novo segregation of complex metagenomic data into specific biological entities, such as particular bacterial strains or viruses, remains a largely unsolved problem. Here we present a method, based on binning co-abundant genes across a series of metagenomic samples, that enables comprehensive discovery of new microbial organisms, viruses and co-inherited genetic entities and aids assembly of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify affiliations between MGS and hundreds of viruses or genetic entities. Our method provides the means for comprehensive profiling of the diversity within complex metagenomic samples.
Performance comparison of two microarray platforms to assess differential gene expression in human monocyte and macrophage cells

PubMed Central

Maouche, Seraya; Poirier, Odette; Godefroy, Tiphaine; Olaso, Robert; Gut, Ivo; Collet, Jean-Phillipe; Montalescot, Gilles; Cambien, François

2008-01-01

Background In this study we assessed the respective ability of Affymetrix and Illumina microarray methodologies to answer a relevant biological question, namely the change in gene expression between resting monocytes and macrophages derived from these monocytes. Five RNA samples for each type of cell were hybridized to the two platforms in parallel. In addition, a reference list of differentially expressed genes (DEG) was generated from a larger number of hybridizations (mRNA from 86 individuals) using the RNG/MRC two-color platform. Results Our results show an important overlap of the Illumina and Affymetrix DEG lists. In addition, more than 70% of the genes in these lists were also present in the reference list. Overall the two platforms had very similar performance in terms of biological significance, evaluated by the presence in the DEG lists of an excess of genes belonging to Gene Ontology (GO) categories relevant for the biology of monocytes and macrophages. Our results support the conclusion of the MicroArray Quality Control (MAQC) project that the criteria used to constitute the DEG lists strongly influence the degree of concordance among platforms. However the importance of prioritizing genes by magnitude of effect (fold change) rather than statistical significance (p-value) to enhance cross-platform reproducibility recommended by the MAQC authors was not supported by our data. Conclusion Functional analysis based on GO enrichment demonstrates that the 2 compared technologies delivered very similar results and identified most of the relevant GO categories enriched in the reference list. PMID:18578872
Genomic resources for gene discovery, functional genome annotation, and evolutionary studies of maize and its close relatives.

PubMed

Wang, Chao; Shi, Xue; Liu, Lin; Li, Haiyan; Ammiraju, Jetty S S; Kudrna, David A; Xiong, Wentao; Wang, Hao; Dai, Zhaozhao; Zheng, Yonglian; Lai, Jinsheng; Jin, Weiwei; Messing, Joachim; Bennetzen, Jeffrey L; Wing, Rod A; Luo, Meizhong

2013-11-01

Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics.
Evaluation of Candidate Reference Genes for Quantitative Gene Expression Analysis in Spodoptera exigu a after Long-time Exposure to Cadmium.

PubMed

Płachetka-Bożek, Anna; Augustyniak, Maria

2017-08-21

Studies on the transcriptional control of gene expression play an important role in many areas of biology. Reference genes, which are often referred to as housekeeping genes, such as GAPDH, G3PDH, EF2, RpL7A, RpL10, TUBα and Actin, have traditionally been assumed to be stably expressed in all conditions, and they are frequently used to normalize mRNA levels between different samples in qPCR analysis. However, it is known that the expression of these genes is influenced by numerous factors, such as experimental conditions. The difference in gene expression underlies a range of biological processes, including development, reproduction and behavior. The aim of this study was to show the problems associated with using reference genes in the qPCR technique, in a study on inbred strains of Spodoptera exigua selected toward cadmium resistance. We present and discuss our results and observations, and give some recommendations concerning the use and limitations of housekeeping genes as internal standards, especially in research on insects. Our results suggest that holometabolism and poikilothermia, as well as time since metamorphosis and the level of exposure to the selective factor (cadmium in this case), have a significant effect on the expression of reference genes.

Gramene 2016: comparative plant genomics and pathway resources

PubMed Central

Tello-Ruiz, Marcela K.; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M.; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A.; Huerta, Laura; Keays, Maria; Tang, Y. Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J.; Jaiswal, Pankaj; Ware, Doreen

2016-01-01

Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. PMID:26553803
Functional analysis and transcriptional output of the Göttingen minipig genome.

PubMed

Heckel, Tobias; Schmucki, Roland; Berrera, Marco; Ringshandl, Stephan; Badi, Laura; Steiner, Guido; Ravon, Morgane; Küng, Erich; Kuhn, Bernd; Kratochwil, Nicole A; Schmitt, Georg; Kiialainen, Anna; Nowaczyk, Corinne; Daff, Hamina; Khan, Azinwi Phina; Lekolool, Isaac; Pelle, Roger; Okoth, Edward; Bishop, Richard; Daubenberger, Claudia; Ebeling, Martin; Certa, Ulrich

2015-11-14

In the past decade the Göttingen minipig has gained increasing recognition as animal model in pharmaceutical and safety research because it recapitulates many aspects of human physiology and metabolism. Genome-based comparison of drug targets together with quantitative tissue expression analysis allows rational prediction of pharmacology and cross-reactivity of human drugs in animal models thereby improving drug attrition which is an important challenge in the process of drug development. Here we present a new chromosome level based version of the Göttingen minipig genome together with a comparative transcriptional analysis of tissues with pharmaceutical relevance as basis for translational research. We relied on mapping and assembly of WGS (whole-genome-shotgun sequencing) derived reads to the reference genome of the Duroc pig and predict 19,228 human orthologous protein-coding genes. Genome-based prediction of the sequence of human drug targets enables the prediction of drug cross-reactivity based on conservation of binding sites. We further support the finding that the genome of Sus scrofa contains about ten-times less pseudogenized genes compared to other vertebrates. Among the functional human orthologs of these minipig pseudogenes we found HEPN1, a putative tumor suppressor gene. The genomes of Sus scrofa, the Tibetan boar, the African Bushpig, and the Warthog show sequence conservation of all inactivating HEPN1 mutations suggesting disruption before the evolutionary split of these pig species. We identify 133 Sus scrofa specific, conserved long non-coding RNAs (lncRNAs) in the minipig genome and show that these transcripts are highly conserved in the African pigs and the Tibetan boar suggesting functional significance. Using a new minipig specific microarray we show high conservation of gene expression signatures in 13 tissues with biomedical relevance between humans and adult minipigs. We underline this relationship for minipig and human liver where we could demonstrate similar expression levels for most phase I drug-metabolizing enzymes. Higher expression levels and metabolic activities were found for FMO1, AKR/CRs and for phase II drug metabolizing enzymes in minipig as compared to human. The variability of gene expression in equivalent human and minipig tissues is considerably higher in minipig organs, which is important for study design in case a human target belongs to this variable category in the minipig. The first analysis of gene expression in multiple tissues during development from young to adult shows that the majority of transcriptional programs are concluded four weeks after birth. This finding is in line with the advanced state of human postnatal organ development at comparative age categories and further supports the minipig as model for pediatric drug safety studies. Genome based assessment of sequence conservation combined with gene expression data in several tissues improves the translational value of the minipig for human drug development. The genome and gene expression data presented here are important resources for researchers using the minipig as model for biomedical research or commercial breeding. Potential impact of our data for comparative genomics, translational research, and experimental medicine are discussed.
The use of laser microdissection in the identification of suitable reference genes for normalization of quantitative real-time PCR in human FFPE epithelial ovarian tissue samples.

PubMed

Cai, Jing; Li, Tao; Huang, Bangxing; Cheng, Henghui; Ding, Hui; Dong, Weihong; Xiao, Man; Liu, Ling; Wang, Zehua

2014-01-01

Quantitative real-time PCR (qPCR) is a powerful and reproducible method of gene expression analysis in which expression levels are quantified by normalization against reference genes. Therefore, to investigate the potential biomarkers and therapeutic targets for epithelial ovarian cancer by qPCR, it is critical to identify stable reference genes. In this study, twelve housekeeping genes (ACTB, GAPDH, 18S rRNA, GUSB, PPIA, PBGD, PUM1, TBP, HRPT1, RPLP0, RPL13A, and B2M) were analyzed in 50 ovarian samples from normal, benign, borderline, and malignant tissues. For reliable results, laser microdissection (LMD), an effective technique used to prepare homogeneous starting material, was utilized to precisely excise target tissues or cells. One-way analysis of variance (ANOVA) and nonparametric (Kruskal-Wallis) tests were used to compare the expression differences. NormFinder and geNorm software were employed to further validate the suitability and stability of the candidate genes. Results showed that epithelial cells occupied a small percentage of the normal ovary indeed. The expression of ACTB, PPIA, RPL13A, RPLP0, and TBP were stable independent of the disease progression. In addition, NormFinder and geNorm identified the most stable combination (ACTB, PPIA, RPLP0, and TBP) and the relatively unstable reference gene GAPDH from the twelve commonly used housekeeping genes. Our results highlight the use of homogeneous ovarian tissues and multiple-reference normalization strategy, e.g. the combination of ACTB, PPIA, RPLP0, and TBP, for qPCR in epithelial ovarian tissues, whereas GAPDH, the most commonly used reference gene, is not recommended, especially as a single reference gene.
Differential in vivo gene expression of major Leptospira proteins in resistant or susceptible animal models.

PubMed

Matsui, Mariko; Soupé, Marie-Estelle; Becam, Jérôme; Goarant, Cyrille

2012-09-01

Transcripts of Leptospira 16S rRNA, FlaB, LigB, LipL21, LipL32, LipL36, LipL41, and OmpL37 were quantified in the blood of susceptible (hamsters) and resistant (mice) animal models of leptospirosis. We first validated adequate reference genes and then evaluated expression patterns in vivo compared to in vitro cultures. LipL32 expression was downregulated in vivo and differentially regulated in resistant and susceptible animals. FlaB expression was also repressed in mice but not in hamsters. In contrast, LigB and OmpL37 were upregulated in vivo. Thus, we demonstrated that a virulent strain of Leptospira differentially adapts its gene expression in the blood of infected animals.
Identification and validation of quantitative real-time reverse transcription PCR reference genes for gene expression analysis in teak (Tectona grandis L.f.)

PubMed Central

2014-01-01

Background Teak (Tectona grandis L.f.) is currently the preferred choice of the timber trade for fabrication of woody products due to its extraordinary qualities and is widely grown around the world. Gene expression studies are essential to explore wood formation of vascular plants, and quantitative real-time reverse transcription PCR (qRT-PCR) is a sensitive technique employed for quantifying gene expression levels. One or more appropriate reference genes are crucial to accurately compare mRNA transcripts through different tissues/organs and experimental conditions. Despite being the focus of some genetic studies, a lack of molecular information has hindered genetic exploration of teak. To date, qRT-PCR reference genes have not been identified and validated for teak. Results Identification and cloning of nine commonly used qRT-PCR reference genes from teak, including ribosomal protein 60s (rp60s), clathrin adaptor complexes medium subunit family (Cac), actin (Act), histone 3 (His3), sand family (Sand), β-Tubulin (Β-Tub), ubiquitin (Ubq), elongation factor 1-α (Ef-1α), and glyceraldehyde-3-phosphate dehydrogenase (GAPDH). Expression profiles of these genes were evaluated by qRT-PCR in six tissue and organ samples (leaf, flower, seedling, root, stem and branch secondary xylem) of teak. Appropriate gene cloning and sequencing, primer specificity and amplification efficiency was verified for each gene. Their stability as reference genes was validated by NormFinder, BestKeeper, geNorm and Delta Ct programs. Results obtained from all programs showed that TgUbq and TgEf-1α are the most stable genes to use as qRT-PCR reference genes and TgAct is the most unstable gene in teak. The relative expression of the teak cinnamyl alcohol dehydrogenase (TgCAD) gene in lignified tissues at different ages was assessed by qRT-PCR, using TgUbq and TgEf-1α as internal controls. These analyses exposed a consistent expression pattern with both reference genes. Conclusion This study proposes a first broad collection of teak tissue and organ mRNA expression data for nine selected candidate qRT-PCR reference genes. NormFinder, Bestkeeper, geNorm and Delta Ct analyses suggested that TgUbq and TgEf-1α have the highest expression stability and provided similar results when evaluating TgCAD gene expression, while the commonly used Act should be avoided. PMID:25048176
Early gene expression profiles of patients with chronic hepatitis C treated with pegylated interferon-alfa and ribavirin.

PubMed

Younossi, Zobair M; Baranova, Ancha; Afendy, Arian; Collantes, Rochelle; Stepanova, Maria; Manyam, Ganiraju; Bakshi, Anita; Sigua, Christopher L; Chan, Joanne P; Iverson, Ayuko A; Santini, Christopher D; Chang, Sheng-Yung P

2009-03-01

Responsiveness to hepatitis C virus (HCV) therapy depends on viral and host factors. Our aim was to assess sustained virologic response (SVR)-associated early gene expression in patients with HCV receiving pegylated interferon-alpha2a (PEG-IFN-alpha2a) or PEG-IFN-alpha2b and ribavirin with the duration based on genotypes. Blood samples were collected into PAXgene tubes prior to treatment as well as 1, 7, 28, and 56 days after treatment. From the peripheral blood cells, total RNA was extracted, quantified, and used for one-step reverse transcription polymerase chain reaction to profile 154 messenger RNAs. Expression levels of messenger RNAs were normalized with six "housekeeping" genes and a reference RNA. Multiple regression and stepwise selection were performed to assess differences in gene expression at different time points, and predictive performance was evaluated for each model. A total of 68 patients were enrolled in the study and treated with combination therapy. The results of gene expression showed that SVR could be predicted by the gene expression of signal transducer and activator of transcription-6 (STAT-6) and suppressor of cytokine signaling-1 in the pretreatment samples. After 24 hours, SVR was predicted by the expression of interferon-dependent genes, and this dependence continued to be prominent throughout the treatment. Early gene expression during anti-HCV therapy may elucidate important molecular pathways that may be influencing the probability of achieving virologic response.
Reference genes for gene expression studies in wheat flag leaves grown under different farming conditions

PubMed Central

2011-01-01

Background Internal control genes with highly uniform expression throughout the experimental conditions are required for accurate gene expression analysis as no universal reference genes exists. In this study, the expression stability of 24 candidate genes from Triticum aestivum cv. Cubus flag leaves grown under organic and conventional farming systems was evaluated in two locations in order to select suitable genes that can be used for normalization of real-time quantitative reverse-transcription PCR (RT-qPCR) reactions. The genes were selected among the most common used reference genes as well as genes encoding proteins involved in several metabolic pathways. Findings Individual genes displayed different expression rates across all samples assayed. Applying geNorm, a set of three potential reference genes were suitable for normalization of RT-qPCR reactions in winter wheat flag leaves cv. Cubus: TaFNRII (ferredoxin-NADP(H) oxidoreductase; AJ457980.1), ACT2 (actin 2; TC234027), and rrn26 (a putative homologue to RNA 26S gene; AL827977.1). In addition of these three genes that were also top-ranked by NormFinder, two extra genes: CYP18-2 (Cyclophilin A, AY456122.1) and TaWIN1 (14-3-3 like protein, AB042193) were most consistently stably expressed. Furthermore, we showed that TaFNRII, ACT2, and CYP18-2 are suitable for gene expression normalization in other two winter wheat varieties (Tommi and Centenaire) grown under three treatments (organic, conventional and no nitrogen) and a different environment than the one tested with cv. Cubus. Conclusions This study provides a new set of reference genes which should improve the accuracy of gene expression analyses when using wheat flag leaves as those related to the improvement of nitrogen use efficiency for cereal production. PMID:21951810
Determining ACTB, ATP5B and RPL32 as optimal reference genes for quantitative RT-PCR studies of cryopreserved stallion semen.

PubMed

Pérez-Rico, A; Crespo, F; Sanmartín, M L; De Santiago, A; Vega-Pla, J L

2014-10-01

Equine germplasm bank management involves not only the conservation and use of semen doses, in addition it can also be a resource to study stallion semen quality and after thawing semen properties for reproductive purposes. A possible criterion to measure quality may be based on differential gene expression of loci involved during spermatogenesis and sperm quality maturation. The rapid degradation of sperm after thawing affects the integrity and availability of RNA. In this study we have analyzed genes expressed in equine cryopreserved sperm, which provided an adequate amplification, specificity, and stability to be used as future reference genes in expression studies. Live spermatozoa were selected from cryopreserved semen straws derived from 20 stallions, through a discontinuous concentration gradient. RNA purification followed a combination of the organic and column extraction methods together with a deoxyribonuclease treatment. The selective amplification of nine candidate genes was undertaken using reverse transcription and real-time polymerase chain reaction (qPCR) carried out in a one-step mode (qRT-PCR). Specificities were tested by melting curves, agarose gel electrophoresis and sequencing. In addition, gene stabilities were also calculated. Results indicated that five out of the nine candidate genes amplified properly (β-Actin, ATP synthase subunit beta, Protamine 1, L32 ribosomal protein and Ubiquitin B), of which β-Actin and the L32 Ribosomal protein showed the highest stability thus being the most suitable to be considered as reference genes for equine cryopreserved sperm studies, followed by the ATP synthase subunit beta and Ubiquitin B. Copyright © 2014 Elsevier B.V. All rights reserved.
Genetics Home Reference: L1 syndrome

MedlinePlus

... X-linked hydrocephalus: evidence for closely related clinical entities of unknown molecular bases. Acta Neuropathol. 2013 Sep; ... F. Three cases with L1 syndrome and two novel mutations in the L1CAM gene. Eur J Pediatr. ...
Comprehensive evaluation of candidate reference genes for gene expression studies in Lysiphlebia japonica (Hymenoptera: Aphidiidae) using RT-qPCR.

PubMed

Gao, Xue-Ke; Zhang, Shuai; Luo, Jun-Yu; Wang, Chun-Yi; Lü, Li-Min; Zhang, Li-Juan; Zhu, Xiang-Zhen; Wang, Li; Lu, Hui; Cui, Jin-Jie

2017-12-30

Lysiphlebia japonica (Ashmead) is a predominant parasitoid of cotton-melon aphids in the fields of northern China with a proven ability to effectively control cotton aphid populations in early summer. For accurate normalization of gene expression in L. japonica using quantitative reverse transcriptase-polymerase chain reaction (RT-qPCR), reference genes with stable gene expression patterns are essential. However, no appropriate reference genes is L. japonica have been investigated to date. In the present study, 12 selected housekeeping genes from L. japonica were cloned. We evaluated the stability of these genes under various experimental treatments by RT-qPCR using four independent (geNorm, NormFinder, BestKeeper and Delta Ct) and one comparative (RefFinder) algorithm. We identified genes showing the most stable levels of expression: DIMT, 18S rRNA, and RPL13 during different stages; AK, RPL13, and TBP among sexes; EF1A, PPI, and RPL27 in different tissues, and EF1A, RPL13, and PPI in adults fed on different diets. Moreover, the expression profile of a target gene (odorant receptor 1, OR1) studied during the developmental stages confirms the reliability of the chosen selected reference genes. This study provides for the first time a comprehensive list of suitable reference genes for gene expression studies in L. japonica and will benefit subsequent genomics and functional genomics research on this natural enemy. Copyright © 2017. Published by Elsevier B.V.
GAPDH, β-actin and β2-microglobulin, as three common reference genes, are not reliable for gene expression studies in equine adipose- and marrow-derived mesenchymal stem cells.

PubMed

Nazari, Fatemeh; Parham, Abbas; Maleki, Adham Fani

2015-01-01

Quantitative real time reverse transcription PCR (qRT-PCR) is one of the most important techniques for gene-expression analysis in molecular based studies. Selecting a proper internal control gene for normalizing data is a crucial step in gene expression analysis via this method. The expression levels of reference genes should be remained constant among cells in different tissues. However, it seems that the location of cells in different tissues might influence their expression. The purpose of this study was to determine whether the source of mesenchymal stem cells (MSCs) has any effect on expression level of three common reference genes (GAPDH, β-actin and β2-microglobulin) in equine marrow- and adipose- derived undifferentiated MSCs and consequently their reliability for comparative qRT-PCR. Adipose tissue (AT) and bone marrow (BM) samples were harvested from 3 mares. MSCs were isolated and cultured until passage 3 (P3). Total RNA of P3 cells was extracted for cDNA synthesis. The generated cDNAs were analyzed by quantitative real-time PCR. The PCR reactions were ended with a melting curve analysis to verify the specificity of amplicon. The expression levels of GAPDH were significantly different between AT- and BM- derived MSCs (p < 0.05). Differences in expression level of β-actin (P < 0.001) and B2M (P < 0.006.) between MSCs derived from AT and BM were substantially higher than GAPDH. In addition, the fold change in expression levels of GAPDH, β-actin and B2M in AT-derived MSCs compared to BM-derived MSCs were 2.38, 6.76 and 7.76, respectively. This study demonstrated that GAPDH and especially β-actin and B2M express in different levels in equine AT- and BM- derived MSCs. Thus they cannot be considered as reliable reference genes for comparative quantitative gene expression analysis in MSCs derived from equine bone marrow and adipose tissue.
Selection of Reliable Reference Genes for Gene Expression Studies of a Promising Oilseed Crop, Plukenetia volubilis, by Real-Time Quantitative PCR

PubMed Central

Niu, Longjian; Tao, Yan-Bin; Chen, Mao-Sheng; Fu, Qiantang; Li, Chaoqiong; Dong, Yuling; Wang, Xiulan; He, Huiying; Xu, Zeng-Fu

2015-01-01

Real-time quantitative PCR (RT-qPCR) is a reliable and widely used method for gene expression analysis. The accuracy of the determination of a target gene expression level by RT-qPCR demands the use of appropriate reference genes to normalize the mRNA levels among different samples. However, suitable reference genes for RT-qPCR have not been identified in Sacha inchi (Plukenetia volubilis), a promising oilseed crop known for its polyunsaturated fatty acid (PUFA)-rich seeds. In this study, using RT-qPCR, twelve candidate reference genes were examined in seedlings and adult plants, during flower and seed development and for the entire growth cycle of Sacha inchi. Four statistical algorithms (delta cycle threshold (ΔCt), BestKeeper, geNorm, and NormFinder) were used to assess the expression stabilities of the candidate genes. The results showed that ubiquitin-conjugating enzyme (UCE), actin (ACT) and phospholipase A22 (PLA) were the most stable genes in Sacha inchi seedlings. For roots, stems, leaves, flowers, and seeds from adult plants, 30S ribosomal protein S13 (RPS13), cyclophilin (CYC) and elongation factor-1alpha (EF1α) were recommended as reference genes for RT-qPCR. During the development of reproductive organs, PLA, ACT and UCE were the optimal reference genes for flower development, whereas UCE, RPS13 and RNA polymerase II subunit (RPII) were optimal for seed development. Considering the entire growth cycle of Sacha inchi, UCE, ACT and EF1α were sufficient for the purpose of normalization. Our results provide useful guidelines for the selection of reliable reference genes for the normalization of RT-qPCR data for seedlings and adult plants, for reproductive organs, and for the entire growth cycle of Sacha inchi. PMID:26047338
A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly

PubMed Central

2013-01-01

Background The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes. Results We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters. Conclusions These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies. PMID:23496952
A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly.

PubMed

Francis, Warren R; Christianson, Lynne M; Kiko, Rainer; Powers, Meghan L; Shaner, Nathan C; Haddock, Steven H D

2013-03-12

The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes. We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters. These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies.
Selection of reference genes for quantitative real-time RT-PCR assays in different morphological forms of dimorphic zygomycetous fungus Benjaminiella poitrasii.

PubMed

Pathan, Ejaj K; Ghormade, Vandana; Deshpande, Mukund V

2017-01-01

Benjaminiella poitrasii, a dimorphic non-pathogenic zygomycetous fungus, exhibits a morphological yeast (Y) to hypha (H) reversible transition in the vegetative phase, sporangiospores (S) in the asexual phase and zygospores (Z) in the sexual phase. To study the gene expression across these diverse morphological forms, suitable reference genes are required. In the present study, 13 genes viz. ACT, 18S rRNA, eEF1α, eEF-Tu,eIF-1A, Tub-α, Tub-b, Ubc, GAPDH, Try, WS-21, NADGDH and NADPGDH were evaluated for their potential as a reference, particularly for studying gene expression during the Y-H reversible transition and also for other asexual and sexual life stages of B. poitrasii. Analysis of RT-qPCR data using geNorm, normFinder and BestKeeper software revealed that genes such as Ubc, 18S rRNA and WS-21 were expressed at constant levels in each given subset of RNA samples from all the morphological phases of B. poitrasii. Therefore, these reference genes can be used to elucidate the role of morpho-genes in B. poitrasii. Further, use of the two most stably expressed genes (Ubc and WS-21) to normalize the expression of the ornithine decarboxylase gene (Bpodc) in different morphological forms of B. poitrasii, generated more reliable results, indicating that our selection of reference genes was appropriate.
RNA-sequence data normalization through in silico prediction of reference genes: the bacterial response to DNA damage as case study.

PubMed

Berghoff, Bork A; Karlsson, Torgny; Källman, Thomas; Wagner, E Gerhart H; Grabherr, Manfred G

2017-01-01

Measuring how gene expression changes in the course of an experiment assesses how an organism responds on a molecular level. Sequencing of RNA molecules, and their subsequent quantification, aims to assess global gene expression changes on the RNA level (transcriptome). While advances in high-throughput RNA-sequencing (RNA-seq) technologies allow for inexpensive data generation, accurate post-processing and normalization across samples is required to eliminate any systematic noise introduced by the biochemical and/or technical processes. Existing methods thus either normalize on selected known reference genes that are invariant in expression across the experiment, assume that the majority of genes are invariant, or that the effects of up- and down-regulated genes cancel each other out during the normalization. Here, we present a novel method, moose 2 , which predicts invariant genes in silico through a dynamic programming (DP) scheme and applies a quadratic normalization based on this subset. The method allows for specifying a set of known or experimentally validated invariant genes, which guides the DP. We experimentally verified the predictions of this method in the bacterium Escherichia coli , and show how moose 2 is able to (i) estimate the expression value distances between RNA-seq samples, (ii) reduce the variation of expression values across all samples, and (iii) to subsequently reveal new functional groups of genes during the late stages of DNA damage. We further applied the method to three eukaryotic data sets, on which its performance compares favourably to other methods. The software is implemented in C++ and is publicly available from http://grabherr.github.io/moose2/. The proposed RNA-seq normalization method, moose 2 , is a valuable alternative to existing methods, with two major advantages: (i) in silico prediction of invariant genes provides a list of potential reference genes for downstream analyses, and (ii) non-linear artefacts in RNA-seq data are handled adequately to minimize variations between replicates.
DREISS: Using State-Space Models to Infer the Dynamics of Gene Expression Driven by External and Internal Regulatory Networks

PubMed Central

Gerstein, Mark

2016-01-01

Gene expression is controlled by the combinatorial effects of regulatory factors from different biological subsystems such as general transcription factors (TFs), cellular growth factors and microRNAs. A subsystem’s gene expression may be controlled by its internal regulatory factors, exclusively, or by external subsystems, or by both. It is thus useful to distinguish the degree to which a subsystem is regulated internally or externally–e.g., how non-conserved, species-specific TFs affect the expression of conserved, cross-species genes during evolution. We developed a computational method (DREISS, dreiss.gerteinlab.org) for analyzing the Dynamics of gene expression driven by Regulatory networks, both External and Internal based on State Space models. Given a subsystem, the “state” and “control” in the model refer to its own (internal) and another subsystem’s (external) gene expression levels. The state at a given time is determined by the state and control at a previous time. Because typical time-series data do not have enough samples to fully estimate the model’s parameters, DREISS uses dimensionality reduction, and identifies canonical temporal expression trajectories (e.g., degradation, growth and oscillation) representing the regulatory effects emanating from various subsystems. To demonstrate capabilities of DREISS, we study the regulatory effects of evolutionarily conserved vs. divergent TFs across distant species. In particular, we applied DREISS to the time-series gene expression datasets of C. elegans and D. melanogaster during their embryonic development. We analyzed the expression dynamics of the conserved, orthologous genes (orthologs), seeing the degree to which these can be accounted for by orthologous (internal) versus species-specific (external) TFs. We found that between two species, the orthologs have matched, internally driven expression patterns but very different externally driven ones. This is particularly true for genes with evolutionarily ancient functions (e.g. the ribosomal proteins), in contrast to those with more recently evolved functions (e.g., cell-cell communication). This suggests that despite striking morphological differences, some fundamental embryonic-developmental processes are still controlled by ancient regulatory systems. PMID:27760135
The Arabidopsis Information Resource: Making and Mining the ‘Gold Standard’ Annotated Reference Plant Genome

PubMed Central

Berardini, Tanya Z.; Reiser, Leonore; Li, Donghui; Mezheritsky, Yarik; Muller, Robert; Strait, Emily; Huala, Eva

2015-01-01

The Arabidopsis Information Resource (TAIR) is a continuously updated, online database of genetic and molecular biology data for the model plant Arabidopsis thaliana that provides a global research community with centralized access to data for over 30,000 Arabidopsis genes. TAIR’s biocurators systematically extract, organize, and interconnect experimental data from the literature along with computational predictions, community submissions, and high throughput datasets to present a high quality and comprehensive picture of Arabidopsis gene function. TAIR provides tools for data visualization and analysis, and enables ordering of seed and DNA stocks, protein chips and other experimental resources. TAIR actively engages with its users who contribute expertise and data that augments the work of the curatorial staff. TAIR’s focus in an extensive and evolving ecosystem of online resources for plant biology is on the critically important role of extracting experimentally-based research findings from the literature and making that information computationally accessible. In response to the loss of government grant funding, the TAIR team founded a nonprofit entity, Phoenix Bioinformatics, with the aim of developing sustainable funding models for biological databases, using TAIR as a test case. Phoenix has successfully transitioned TAIR to subscription-based funding while still keeping its data relatively open and accessible. PMID:26201819
A common base method for analysis of qPCR data and the application of simple blocking in qPCR experiments.

PubMed

Ganger, Michael T; Dietz, Geoffrey D; Ewing, Sarah J

2017-12-01

qPCR has established itself as the technique of choice for the quantification of gene expression. Procedures for conducting qPCR have received significant attention; however, more rigorous approaches to the statistical analysis of qPCR data are needed. Here we develop a mathematical model, termed the Common Base Method, for analysis of qPCR data based on threshold cycle values (C q ) and efficiencies of reactions (E). The Common Base Method keeps all calculations in the logscale as long as possible by working with log 10 (E) ∙ C q , which we call the efficiency-weighted C q value; subsequent statistical analyses are then applied in the logscale. We show how efficiency-weighted C q values may be analyzed using a simple paired or unpaired experimental design and develop blocking methods to help reduce unexplained variation. The Common Base Method has several advantages. It allows for the incorporation of well-specific efficiencies and multiple reference genes. The method does not necessitate the pairing of samples that must be performed using traditional analysis methods in order to calculate relative expression ratios. Our method is also simple enough to be implemented in any spreadsheet or statistical software without additional scripts or proprietary components.
Characterization of reference genes for RT-qPCR in the desert moss Syntrichia caninervis in response to abiotic stress and desiccation/rehydration

PubMed Central

Li, Xiaoshuang; Zhang, Daoyuan; Li, Haiyan; Gao, Bei; Yang, Honglan; Zhang, Yuanming; Wood, Andrew J.

2015-01-01

Syntrichia caninervis is the dominant bryophyte of the biological soil crusts found in the Gurbantunggut desert. The extreme desert environment is characterized by prolonged drought, temperature extremes, high radiation and frequent cycles of hydration and dehydration. S. caninervis is an ideal organism for the identification and characterization of genes related to abiotic stress tolerance. Reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR) expression analysis is a powerful analytical technique that requires the use of stable reference genes. Using available S. caninervis transcriptome data, we selected 15 candidate reference genes and analyzed their relative expression stabilities in S. caninervis gametophores exposed to a range of abiotic stresses or a hydration-desiccation-rehydration cycle. The programs geNorm, NormFinder, and RefFinder were used to assess and rank the expression stability of the 15 candidate genes. The stability ranking results of reference genes under each specific experimental condition showed high consistency using different algorithms. For abiotic stress treatments, the combination of two genes (α-TUB2 and CDPK) were sufficient for accurate normalization. For the hydration-desiccation-rehydration process, the combination of two genes (α-TUB1 and CDPK) were sufficient for accurate normalization. 18S was among the least stable genes in all of the experimental sets and was unsuitable as reference gene in S. caninervis. This is the first systematic investigation and comparison of reference gene selection for RT-qPCR work in S. caninervis. This research will facilitate gene expression studies in S. caninervis, related moss species from the Syntrichia complex and other mosses. PMID:25699066

Some links on this page may take you to non-federal websites. Their policies may differ from this site.