gene expression features: Topics by Science.gov

Sample records for gene expression features

Constrained clusters of gene expression profiles with pathological features.

PubMed

Sese, Jun; Kurokawa, Yukinori; Monden, Morito; Kato, Kikuya; Morishita, Shinichi

2004-11-22

Gene expression profiles should be useful in distinguishing variations in disease, since they reflect accurately the status of cells. The primary clustering of gene expression reveals the genotypes that are responsible for the proximity of members within each cluster, while further clustering elucidates the pathological features of the individual members of each cluster. However, since the first clustering process and the second classification step, in which the features are associated with clusters, are performed independently, the initial set of clusters may omit genes that are associated with pathologically meaningful features. Therefore, it is important to devise a way of identifying gene expression clusters that are associated with pathological features. We present the novel technique of 'itemset constrained clustering' (IC-Clustering), which computes the optimal cluster that maximizes the interclass variance of gene expression between groups, which are divided according to the restriction that only divisions that can be expressed using common features are allowed. This constraint automatically labels each cluster with a set of pathological features which characterize that cluster. When applied to liver cancer datasets, IC-Clustering revealed informative gene expression clusters, which could be annotated with various pathological features, such as 'tumor' and 'man', or 'except tumor' and 'normal liver function'. In contrast, the k-means method overlooked these clusters.
A Review of Feature Extraction Software for Microarray Gene Expression Data

PubMed Central

Tan, Ching Siang; Ting, Wai Soon; Mohamad, Mohd Saberi; Chan, Weng Howe; Deris, Safaai; Ali Shah, Zuraini

2014-01-01

When gene expression data are too large to be processed, they are transformed into a reduced representation set of genes. Transforming large-scale gene expression data into a set of genes is called feature extraction. If the genes extracted are carefully chosen, this gene set can extract the relevant information from the large-scale gene expression data, allowing further analysis by using this reduced representation instead of the full size data. In this paper, we review numerous software applications that can be used for feature extraction. The software reviewed is mainly for Principal Component Analysis (PCA), Independent Component Analysis (ICA), Partial Least Squares (PLS), and Local Linear Embedding (LLE). A summary and sources of the software are provided in the last section for each feature extraction method. PMID:25250315
iPcc: a novel feature extraction method for accurate disease class discovery and prediction

PubMed Central

Ren, Xianwen; Wang, Yong; Zhang, Xiang-Sun; Jin, Qi

2013-01-01

Gene expression profiling has gradually become a routine procedure for disease diagnosis and classification. In the past decade, many computational methods have been proposed, resulting in great improvements on various levels, including feature selection and algorithms for classification and clustering. In this study, we present iPcc, a novel method from the feature extraction perspective to further propel gene expression profiling technologies from bench to bedside. We define ‘correlation feature space’ for samples based on the gene expression profiles by iterative employment of Pearson’s correlation coefficient. Numerical experiments on both simulated and real gene expression data sets demonstrate that iPcc can greatly highlight the latent patterns underlying noisy gene expression data and thus greatly improve the robustness and accuracy of the algorithms currently available for disease diagnosis and classification based on gene expression profiles. PMID:23761440
A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes

PubMed Central

Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong

2015-01-01

In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data. PMID:26201006
A P-Norm Robust Feature Extraction Method for Identifying Differentially Expressed Genes.

PubMed

Liu, Jian; Liu, Jin-Xing; Gao, Ying-Lian; Kong, Xiang-Zhen; Wang, Xue-Song; Wang, Dong

2015-01-01

In current molecular biology, it becomes more and more important to identify differentially expressed genes closely correlated with a key biological process from gene expression data. In this paper, based on the Schatten p-norm and Lp-norm, a novel p-norm robust feature extraction method is proposed to identify the differentially expressed genes. In our method, the Schatten p-norm is used as the regularization function to obtain a low-rank matrix and the Lp-norm is taken as the error function to improve the robustness to outliers in the gene expression data. The results on simulation data show that our method can obtain higher identification accuracies than the competitive methods. Numerous experiments on real gene expression data sets demonstrate that our method can identify more differentially expressed genes than the others. Moreover, we confirmed that the identified genes are closely correlated with the corresponding gene expression data.
A Filter Feature Selection Method Based on MFA Score and Redundancy Excluding and It's Application to Tumor Gene Expression Data Analysis.

PubMed

Li, Jiangeng; Su, Lei; Pang, Zenan

2015-12-01

Feature selection techniques have been widely applied to tumor gene expression data analysis in recent years. A filter feature selection method named marginal Fisher analysis score (MFA score) which is based on graph embedding has been proposed, and it has been widely used mainly because it is superior to Fisher score. Considering the heavy redundancy in gene expression data, we proposed a new filter feature selection technique in this paper. It is named MFA score+ and is based on MFA score and redundancy excluding. We applied it to an artificial dataset and eight tumor gene expression datasets to select important features and then used support vector machine as the classifier to classify the samples. Compared with MFA score, t test and Fisher score, it achieved higher classification accuracy.
A novel feature extraction approach for microarray data based on multi-algorithm fusion

PubMed Central

Jiang, Zhu; Xu, Rong

2015-01-01

Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions. PMID:25780277
A novel feature extraction approach for microarray data based on multi-algorithm fusion.

PubMed

Jiang, Zhu; Xu, Rong

2015-01-01

Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions.
Analyzing gene expression time-courses based on multi-resolution shape mixture model.

PubMed

Li, Ying; He, Ye; Zhang, Yu

2016-11-01

Biological processes actually are a dynamic molecular process over time. Time course gene expression experiments provide opportunities to explore patterns of gene expression change over a time and understand the dynamic behavior of gene expression, which is crucial for study on development and progression of biology and disease. Analysis of the gene expression time-course profiles has not been fully exploited so far. It is still a challenge problem. We propose a novel shape-based mixture model clustering method for gene expression time-course profiles to explore the significant gene groups. Based on multi-resolution fractal features and mixture clustering model, we proposed a multi-resolution shape mixture model algorithm. Multi-resolution fractal features is computed by wavelet decomposition, which explore patterns of change over time of gene expression at different resolution. Our proposed multi-resolution shape mixture model algorithm is a probabilistic framework which offers a more natural and robust way of clustering time-course gene expression. We assessed the performance of our proposed algorithm using yeast time-course gene expression profiles compared with several popular clustering methods for gene expression profiles. The grouped genes identified by different methods are evaluated by enrichment analysis of biological pathways and known protein-protein interactions from experiment evidence. The grouped genes identified by our proposed algorithm have more strong biological significance. A novel multi-resolution shape mixture model algorithm based on multi-resolution fractal features is proposed. Our proposed model provides a novel horizons and an alternative tool for visualization and analysis of time-course gene expression profiles. The R and Matlab program is available upon the request. Copyright © 2016 Elsevier Inc. All rights reserved.
Expression Quantitative Trait Locus Mapping across Water Availability Environments Reveals Contrasting Associations with Genomic Features in Arabidopsis[C][W][OPEN

PubMed Central

Lowry, David B.; Logan, Tierney L.; Santuari, Luca; Hardtke, Christian S.; Richards, James H.; DeRose-Wilson, Leah J.; McKay, John K.; Sen, Saunak; Juenger, Thomas E.

2013-01-01

The regulation of gene expression is crucial for an organism’s development and response to stress, and an understanding of the evolution of gene expression is of fundamental importance to basic and applied biology. To improve this understanding, we conducted expression quantitative trait locus (eQTL) mapping in the Tsu-1 (Tsushima, Japan) × Kas-1 (Kashmir, India) recombinant inbred line population of Arabidopsis thaliana across soil drying treatments. We then used genome resequencing data to evaluate whether genomic features (promoter polymorphism, recombination rate, gene length, and gene density) are associated with genes responding to the environment (E) or with genes with genetic variation (G) in gene expression in the form of eQTLs. We identified thousands of genes that responded to soil drying and hundreds of main-effect eQTLs. However, we identified very few statistically significant eQTLs that interacted with the soil drying treatment (GxE eQTL). Analysis of genome resequencing data revealed associations of several genomic features with G and E genes. In general, E genes had lower promoter diversity and local recombination rates. By contrast, genes with eQTLs (G) had significantly greater promoter diversity and were located in genomic regions with higher recombination. These results suggest that genomic architecture may play an important a role in the evolution of gene expression. PMID:24045022
Integrative analysis of gene expression and DNA methylation using unsupervised feature extraction for detecting candidate cancer biomarkers.

PubMed

Moon, Myungjin; Nakai, Kenta

2018-04-01

Currently, cancer biomarker discovery is one of the important research topics worldwide. In particular, detecting significant genes related to cancer is an important task for early diagnosis and treatment of cancer. Conventional studies mostly focus on genes that are differentially expressed in different states of cancer; however, noise in gene expression datasets and insufficient information in limited datasets impede precise analysis of novel candidate biomarkers. In this study, we propose an integrative analysis of gene expression and DNA methylation using normalization and unsupervised feature extractions to identify candidate biomarkers of cancer using renal cell carcinoma RNA-seq datasets. Gene expression and DNA methylation datasets are normalized by Box-Cox transformation and integrated into a one-dimensional dataset that retains the major characteristics of the original datasets by unsupervised feature extraction methods, and differentially expressed genes are selected from the integrated dataset. Use of the integrated dataset demonstrated improved performance as compared with conventional approaches that utilize gene expression or DNA methylation datasets alone. Validation based on the literature showed that a considerable number of top-ranked genes from the integrated dataset have known relationships with cancer, implying that novel candidate biomarkers can also be acquired from the proposed analysis method. Furthermore, we expect that the proposed method can be expanded for applications involving various types of multi-omics datasets.
Unconventional function of an Achaete-Scute homolog as a terminal selector of nociceptive neuron identity

PubMed Central

Masoudi, Neda; Tavazoie, Saeed; Glenwinkel, Lori; Ryu, Leesun; Kim, Kyuhyung

2018-01-01

Proneural genes are among the most early-acting genes in nervous system development, instructing blast cells to commit to a neuronal fate. Drosophila Atonal and Achaete-Scute complex (AS-C) genes, as well as their vertebrate orthologs, are basic helix-loop-helix (bHLH) transcription factors with such proneural activity. We show here that a C. elegans AS-C homolog, hlh-4, functions in a fundamentally different manner. In the embryonic, larval, and adult nervous systems, hlh-4 is expressed exclusively in a single nociceptive neuron class, ADL, and its expression in ADL is maintained via transcriptional autoregulation throughout the life of the animal. However, in hlh-4 null mutants, the ADL neuron is generated and still appears neuronal in overall morphology and expression of panneuronal and pansensory features. Rather than acting as a proneural gene, we find that hlh-4 is required for the ADL neuron to function properly, to adopt its correct morphology, to express its unusually large repertoire of olfactory receptor–encoding genes, and to express other known features of terminal ADL identity, including neurotransmitter phenotype, neuropeptides, ion channels, and electrical synapse proteins. hlh-4 is sufficient to induce ADL identity features upon ectopic expression in other neuron types. The expression of ADL terminal identity features is directly controlled by HLH-4 via a phylogenetically conserved E-box motif, which, through bioinformatic analysis, we find to constitute a predictive feature of ADL-expressed terminal identity markers. The lineage that produces the ADL neuron was previously shown to require the conventional, transient proneural activity of another AS-C homolog, hlh-14, demonstrating sequential activities of distinct AS-C-type bHLH genes in neuronal specification. Taken together, we have defined here an unconventional function of an AS-C-type bHLH gene as a terminal selector of neuronal identity and we speculate that such function could be reflective of an ancestral function of an “ur-” bHLH gene. PMID:29672507
An Optimal Mean Based Block Robust Feature Extraction Method to Identify Colorectal Cancer Genes with Integrated Data.

PubMed

Liu, Jian; Cheng, Yuhu; Wang, Xuesong; Zhang, Lin; Liu, Hui

2017-08-17

It is urgent to diagnose colorectal cancer in the early stage. Some feature genes which are important to colorectal cancer development have been identified. However, for the early stage of colorectal cancer, less is known about the identity of specific cancer genes that are associated with advanced clinical stage. In this paper, we conducted a feature extraction method named Optimal Mean based Block Robust Feature Extraction method (OMBRFE) to identify feature genes associated with advanced colorectal cancer in clinical stage by using the integrated colorectal cancer data. Firstly, based on the optimal mean and L 2,1 -norm, a novel feature extraction method called Optimal Mean based Robust Feature Extraction method (OMRFE) is proposed to identify feature genes. Then the OMBRFE method which introduces the block ideology into OMRFE method is put forward to process the colorectal cancer integrated data which includes multiple genomic data: copy number alterations, somatic mutations, methylation expression alteration, as well as gene expression changes. Experimental results demonstrate that the OMBRFE is more effective than previous methods in identifying the feature genes. Moreover, genes identified by OMBRFE are verified to be closely associated with advanced colorectal cancer in clinical stage.
Distributional fold change test – a statistical approach for detecting differential expression in microarray experiments

PubMed Central

2012-01-01

Background Because of the large volume of data and the intrinsic variation of data intensity observed in microarray experiments, different statistical methods have been used to systematically extract biological information and to quantify the associated uncertainty. The simplest method to identify differentially expressed genes is to evaluate the ratio of average intensities in two different conditions and consider all genes that differ by more than an arbitrary cut-off value to be differentially expressed. This filtering approach is not a statistical test and there is no associated value that can indicate the level of confidence in the designation of genes as differentially expressed or not differentially expressed. At the same time the fold change by itself provide valuable information and it is important to find unambiguous ways of using this information in expression data treatment. Results A new method of finding differentially expressed genes, called distributional fold change (DFC) test is introduced. The method is based on an analysis of the intensity distribution of all microarray probe sets mapped to a three dimensional feature space composed of average expression level, average difference of gene expression and total variance. The proposed method allows one to rank each feature based on the signal-to-noise ratio and to ascertain for each feature the confidence level and power for being differentially expressed. The performance of the new method was evaluated using the total and partial area under receiver operating curves and tested on 11 data sets from Gene Omnibus Database with independently verified differentially expressed genes and compared with the t-test and shrinkage t-test. Overall the DFC test performed the best – on average it had higher sensitivity and partial AUC and its elevation was most prominent in the low range of differentially expressed features, typical for formalin-fixed paraffin-embedded sample sets. Conclusions The distributional fold change test is an effective method for finding and ranking differentially expressed probesets on microarrays. The application of this test is advantageous to data sets using formalin-fixed paraffin-embedded samples or other systems where degradation effects diminish the applicability of correlation adjusted methods to the whole feature set. PMID:23122055
TANDEM: a two-stage approach to maximize interpretability of drug response models based on multiple molecular data types.

PubMed

Aben, Nanne; Vis, Daniel J; Michaut, Magali; Wessels, Lodewyk F A

2016-09-01

Clinical response to anti-cancer drugs varies between patients. A large portion of this variation can be explained by differences in molecular features, such as mutation status, copy number alterations, methylation and gene expression profiles. We show that the classic approach for combining these molecular features (Elastic Net regression on all molecular features simultaneously) results in models that are almost exclusively based on gene expression. The gene expression features selected by the classic approach are difficult to interpret as they often represent poorly studied combinations of genes, activated by aberrations in upstream signaling pathways. To utilize all data types in a more balanced way, we developed TANDEM, a two-stage approach in which the first stage explains response using upstream features (mutations, copy number, methylation and cancer type) and the second stage explains the remainder using downstream features (gene expression). Applying TANDEM to 934 cell lines profiled across 265 drugs (GDSC1000), we show that the resulting models are more interpretable, while retaining the same predictive performance as the classic approach. Using the more balanced contributions per data type as determined with TANDEM, we find that response to MAPK pathway inhibitors is largely predicted by mutation data, while predicting response to DNA damaging agents requires gene expression data, in particular SLFN11 expression. TANDEM is available as an R package on CRAN (for more information, see http://ccb.nki.nl/software/tandem). m.michaut@nki.nl or l.wessels@nki.nl Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Hybrid Binary Imperialist Competition Algorithm and Tabu Search Approach for Feature Selection Using Gene Expression Data.

PubMed

Wang, Shuaiqun; Aorigele; Kong, Wei; Zeng, Weiming; Hong, Xiaomin

2016-01-01

Gene expression data composed of thousands of genes play an important role in classification platforms and disease diagnosis. Hence, it is vital to select a small subset of salient features over a large number of gene expression data. Lately, many researchers devote themselves to feature selection using diverse computational intelligence methods. However, in the progress of selecting informative genes, many computational methods face difficulties in selecting small subsets for cancer classification due to the huge number of genes (high dimension) compared to the small number of samples, noisy genes, and irrelevant genes. In this paper, we propose a new hybrid algorithm HICATS incorporating imperialist competition algorithm (ICA) which performs global search and tabu search (TS) that conducts fine-tuned search. In order to verify the performance of the proposed algorithm HICATS, we have tested it on 10 well-known benchmark gene expression classification datasets with dimensions varying from 2308 to 12600. The performance of our proposed method proved to be superior to other related works including the conventional version of binary optimization algorithm in terms of classification accuracy and the number of selected genes.
Hybrid Binary Imperialist Competition Algorithm and Tabu Search Approach for Feature Selection Using Gene Expression Data

PubMed Central

Aorigele; Zeng, Weiming; Hong, Xiaomin

2016-01-01

Gene expression data composed of thousands of genes play an important role in classification platforms and disease diagnosis. Hence, it is vital to select a small subset of salient features over a large number of gene expression data. Lately, many researchers devote themselves to feature selection using diverse computational intelligence methods. However, in the progress of selecting informative genes, many computational methods face difficulties in selecting small subsets for cancer classification due to the huge number of genes (high dimension) compared to the small number of samples, noisy genes, and irrelevant genes. In this paper, we propose a new hybrid algorithm HICATS incorporating imperialist competition algorithm (ICA) which performs global search and tabu search (TS) that conducts fine-tuned search. In order to verify the performance of the proposed algorithm HICATS, we have tested it on 10 well-known benchmark gene expression classification datasets with dimensions varying from 2308 to 12600. The performance of our proposed method proved to be superior to other related works including the conventional version of binary optimization algorithm in terms of classification accuracy and the number of selected genes. PMID:27579323
Comprehensive Transcriptome Analysis of Sex-Biased Expressed Genes Reveals Discrete Biological and Physiological Features of Male and Female Schistosoma japonicum.

PubMed

Cai, Pengfei; Liu, Shuai; Piao, Xianyu; Hou, Nan; Gobert, Geoffrey N; McManus, Donald P; Chen, Qijun

2016-04-01

Schistosomiasis is a chronic and debilitating disease caused by blood flukes (digenetic trematodes) of the genus Schistosoma. Schistosomes are sexually dimorphic and exhibit dramatic morphological changes during a complex lifecycle which requires subtle gene regulatory mechanisms to fulfil these complex biological processes. In the current study, a 41,982 features custom DNA microarray, which represents the most comprehensive probe coverage for any schistosome transcriptome study, was designed based on public domain and local databases to explore differential gene expression in S. japonicum. We found that approximately 1/10 of the total annotated genes in the S. japonicum genome are differentially expressed between adult males and females. In general, genes associated with the cytoskeleton, and motor and neuronal activities were readily expressed in male adult worms, whereas genes involved in amino acid metabolism, nucleotide biosynthesis, gluconeogenesis, glycosylation, cell cycle processes, DNA synthesis and genome fidelity and stability were enriched in females. Further, miRNAs target sites within these gene sets were predicted, which provides a scenario whereby the miRNAs potentially regulate these sex-biased expressed genes. The study significantly expands the expressional and regulatory characteristics of gender-biased expressed genes in schistosomes with high accuracy. The data provide a better appreciation of the biological and physiological features of male and female schistosome parasites, which may lead to novel vaccine targets and the development of new therapeutic interventions.
Classification of early-stage non-small cell lung cancer by weighing gene expression profiles with connectivity information.

PubMed

Zhang, Ao; Tian, Suyan

2018-05-01

Pathway-based feature selection algorithms, which utilize biological information contained in pathways to guide which features/genes should be selected, have evolved quickly and become widespread in the field of bioinformatics. Based on how the pathway information is incorporated, we classify pathway-based feature selection algorithms into three major categories-penalty, stepwise forward, and weighting. Compared to the first two categories, the weighting methods have been underutilized even though they are usually the simplest ones. In this article, we constructed three different genes' connectivity information-based weights for each gene and then conducted feature selection upon the resulting weighted gene expression profiles. Using both simulations and a real-world application, we have demonstrated that when the data-driven connectivity information constructed from the data of specific disease under study is considered, the resulting weighted gene expression profiles slightly outperform the original expression profiles. In summary, a big challenge faced by the weighting method is how to estimate pathway knowledge-based weights more accurately and precisely. Only until the issue is conquered successfully will wide utilization of the weighting methods be impossible. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sample-space-based feature extraction and class preserving projection for gene expression data.

PubMed

Wang, Wenjun

2013-01-01

In order to overcome the problems of high computational complexity and serious matrix singularity for feature extraction using Principal Component Analysis (PCA) and Fisher's Linear Discrinimant Analysis (LDA) in high-dimensional data, sample-space-based feature extraction is presented, which transforms the computation procedure of feature extraction from gene space to sample space by representing the optimal transformation vector with the weighted sum of samples. The technique is used in the implementation of PCA, LDA, Class Preserving Projection (CPP) which is a new method for discriminant feature extraction proposed, and the experimental results on gene expression data demonstrate the effectiveness of the method.

Circular RNA Expression: Its Potential Regulation and Function.

PubMed

Salzman, Julia

2016-05-01

In 2012, a new feature of eukaryotic gene expression emerged: ubiquitous expression of circular RNA (circRNA) from genes traditionally thought to express messenger or linear noncoding (nc)RNA only. CircRNAs are covalently closed, circular RNA molecules that typically comprise exonic sequences and are spliced at canonical splice sites. This feature of gene expression was first recognized in humans and mouse, but it quickly emerged that it was common across essentially all eukaryotes studied by molecular biologists. CircRNA abundance, and even which alternatively spliced circRNA isoforms are expressed, varies by cell type and can exceed the abundance of the traditional linear mRNA or ncRNA transcript. CircRNAs are enriched in the brain and increase in abundance during fetal development. Together, these features raise fundamental questions regarding the regulation of circRNA in cis and in trans, and its function. Copyright © 2016. Published by Elsevier Ltd.
Lex-SVM: exploring the potential of exon expression profiling for disease classification.

PubMed

Yuan, Xiongying; Zhao, Yi; Liu, Changning; Bu, Dongbo

2011-04-01

Exon expression profiling technologies, including exon arrays and RNA-Seq, measure the abundance of every exon in a gene. Compared with gene expression profiling technologies like 3' array, exon expression profiling technologies could detect alterations in both transcription and alternative splicing, therefore they are expected to be more sensitive in diagnosis. However, exon expression profiling also brings higher dimension, more redundancy, and significant correlation among features. Ignoring the correlation structure among exons of a gene, a popular classification method like L1-SVM selects exons individually from each gene and thus is vulnerable to noise. To overcome this limitation, we present in this paper a new variant of SVM named Lex-SVM to incorporate correlation structure among exons and known splicing patterns to promote classification performance. Specifically, we construct a new norm, ex-norm, including our prior knowledge on exon correlation structure to regularize the coefficients of a linear SVM. Lex-SVM can be solved efficiently using standard linear programming techniques. The advantage of Lex-SVM is that it can select features group-wisely, force features in a subgroup to take equal weihts and exclude the features that contradict the majority in the subgroup. Experimental results suggest that on exon expression profile, Lex-SVM is more accurate than existing methods. Lex-SVM also generates a more compact model and selects genes more consistently in cross-validation. Unlike L1-SVM selecting only one exon in a gene, Lex-SVM assigns equal weights to as many exons in a gene as possible, lending itself easier for further interpretation.
Transcriptional profiling of the parr–smolt transformation in Atlantic salmon

USGS Publications Warehouse

Robertson, Laura S.; McCormick, Stephen D.

2012-01-01

The parr–smolt transformation in Atlantic salmon (Salmo salar) is a complex developmental process that culminates in the ability to migrate to and live in seawater. We used GRASP 16K cDNA microarrays to identify genes that are differentially expressed in the liver, gill, hypothalamus, pituitary, and olfactory rosettes of smolts compared to parr. Smolts had higher levels of gill Na+/K+-ATPase activity, plasma cortisol and plasma thyroid hormones relative to parr. Across all five tissues, stringent microarray analyses identified 48 features that were differentially expressed in smolts compared to parr. Using a less stringent method we found 477 features that were differentially expressed at least 1.2-fold in smolts, including 172 features in the gill. Smolts had higher mRNA levels of genes involved in transcription, protein biosynthesis and folding, electron transport, oxygen transport, and sensory perception and lower mRNA levels for genes involved in proteolysis. Quantitative RT-PCR was used to confirm differential expression in select genes identified by microarray analyses and to quantify expression of other genes known to be involved in smolting. This study expands our understanding of the molecular processes that underlie smolting in Atlantic salmon and identifies genes for further investigation.
Comparison of gene expression responses to hypoxia in viviparous (Xiphophorus) and oviparous (Oryzias) fishes using a medaka microarray.

PubMed

Boswell, Mikki G; Wells, Melissa C; Kirk, Lyndsey M; Ju, Zhenlin; Zhang, Ziping; Booth, Rachell E; Walter, Ronald B

2009-03-01

Gene expression profiling using DNA microarray technology is a useful tool for assessing gene transcript level responses after an organism is exposed to environmental stress. Herein, we detail results from studies using an 8 k medaka (Oryzias latipes) microarray to assess modulated gene expression patterns upon hypoxia exposure of the live-bearing aquaria fish, Xiphophorus maculatus. To assess the reproducibility and reliability of using the medaka array in cross-genus hybridization, a two-factor ANOVA analysis of gene expression was employed. The data show the tissue source of the RNA used for array hybridization contributed more to the observed response of modulated gene targets than did the species source of the RNA. In addition, hierarchical clustering via heat map analyses of groupings of tissues and species (Xiphophorus and medaka) suggests that hypoxia induced similar responses in the same tissues from these two diverse aquatic model organisms. Our Xiphophorus results indicate 206 brain, 37 liver, and 925 gill gene targets exhibit hypoxia induced expression changes. Analysis of the Xiphophorus data to determine those features exhibiting a significant (p<0.05)+/-3 fold change produced only two gene targets within brain tissue and 80 features within gill tissue. Of these 82 characterized features, 39 were identified via homology searching (cut-off E-value of 1 x 10(-5)) and placed into one or more biological process gene ontology groups. Among these 39 genes, metabolic energy changes and manipulation was the most affected biological pathway (13 genes).
[Establishment of an iRFP and luciferase dual-color fluorescence-traced hepatocellular carcinoma transplantation model in nude mice].

PubMed

Li, Hongjun; Yang, Tianhua; Huang, Yanping; Liu, Mingzhu; Qin, Zhongqiang; Chu, Fei; Li, Zhenghong; Li, Yonghai

2017-11-01

Objective To establish a hepatocellular carcinoma xenograft model in nude mice which could stably express gene and be monitored dynamically. Methods We first constructed the lentiviral particles containing luciferase (Luc) and near-infrared fluorescent protein (iRFP) and puromycin resistance gene, and then transduced them into the HepG2 hepatoma cells. The cell line stably expressing Luc and iRFP genes were screened and inoculated into nude mice to establish xenograft tumor model. Tumor growth was monitored using in vivo imaging system. HE staining and immunohistochemistry were used to evaluate the pathological features and tumorigenic ability. Results HepG2 cells stably expressing iRFP and Luc were obtained; with the engineered cell line, xenograft model was successfully established with the features of proper tumor developing time and high rate of tumor formation as well as typical pathological features as showed by HE staining and immunohistochemistry. Conclusion Hepatocellular carcinoma model in nude mice with the features of stable gene expression and dynamical monitoring has been established successfully with the HepG2-iRFP-Luc cell line.
Cell-type specific features of circular RNA expression.

PubMed

Salzman, Julia; Chen, Raymond E; Olsen, Mari N; Wang, Peter L; Brown, Patrick O

2013-01-01

Thousands of loci in the human and mouse genomes give rise to circular RNA transcripts; at many of these loci, the predominant RNA isoform is a circle. Using an improved computational approach for circular RNA identification, we found widespread circular RNA expression in Drosophila melanogaster and estimate that in humans, circular RNA may account for 1% as many molecules as poly(A) RNA. Analysis of data from the ENCODE consortium revealed that the repertoire of genes expressing circular RNA, the ratio of circular to linear transcripts for each gene, and even the pattern of splice isoforms of circular RNAs from each gene were cell-type specific. These results suggest that biogenesis of circular RNA is an integral, conserved, and regulated feature of the gene expression program.
Application of machine learning on brain cancer multiclass classification

NASA Astrophysics Data System (ADS)

Panca, V.; Rustam, Z.

2017-07-01

Classification of brain cancer is a problem of multiclass classification. One approach to solve this problem is by first transforming it into several binary problems. The microarray gene expression dataset has the two main characteristics of medical data: extremely many features (genes) and only a few number of samples. The application of machine learning on microarray gene expression dataset mainly consists of two steps: feature selection and classification. In this paper, the features are selected using a method based on support vector machine recursive feature elimination (SVM-RFE) principle which is improved to solve multiclass classification, called multiple multiclass SVM-RFE. Instead of using only the selected features on a single classifier, this method combines the result of multiple classifiers. The features are divided into subsets and SVM-RFE is used on each subset. Then, the selected features on each subset are put on separate classifiers. This method enhances the feature selection ability of each single SVM-RFE. Twin support vector machine (TWSVM) is used as the method of the classifier to reduce computational complexity. While ordinary SVM finds single optimum hyperplane, the main objective Twin SVM is to find two non-parallel optimum hyperplanes. The experiment on the brain cancer microarray gene expression dataset shows this method could classify 71,4% of the overall test data correctly, using 100 and 1000 genes selected from multiple multiclass SVM-RFE feature selection method. Furthermore, the per class results show that this method could classify data of normal and MD class with 100% accuracy.
Partial least squares based gene expression analysis in estrogen receptor positive and negative breast tumors.

PubMed

Ma, W; Zhang, T-F; Lu, P; Lu, S H

2014-01-01

Breast cancer is categorized into two broad groups: estrogen receptor positive (ER+) and ER negative (ER-) groups. Previous study proposed that under trastuzumab-based neoadjuvant chemotherapy, tumor initiating cell (TIC) featured ER- tumors response better than ER+ tumors. Exploration of the molecular difference of these two groups may help developing new therapeutic strategies, especially for ER- patients. With gene expression profile from the Gene Expression Omnibus (GEO) database, we performed partial least squares (PLS) based analysis, which is more sensitive than common variance/regression analysis. We acquired 512 differentially expressed genes. Four pathways were found to be enriched with differentially expressed genes, involving immune system, metabolism and genetic information processing process. Network analysis identified five hub genes with degrees higher than 10, including APP, ESR1, SMAD3, HDAC2, and PRKAA1. Our findings provide new understanding for the molecular difference between TIC featured ER- and ER+ breast tumors with the hope offer supports for therapeutic studies.
Analysis of multiplex gene expression maps obtained by voxelation.

PubMed

An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

2009-04-29

Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental results confirm the hypothesis that genes with similar gene expression maps might have similar gene functions. The voxelation data takes into account the location information of gene expression level in mouse brain, which is novel in related research. The proposed approach can potentially be used to predict gene functions and provide helpful suggestions to biologists.
Human-specific features of spatial gene expression and regulation in eight brain regions.

PubMed

Xu, Chuan; Li, Qian; Efimova, Olga; He, Liu; Tatsumoto, Shoji; Stepanova, Vita; Oishi, Takao; Udono, Toshifumi; Yamaguchi, Katsushi; Shigenobu, Shuji; Kakita, Akiyoshi; Nawa, Hiroyuki; Khaitovich, Philipp; Go, Yasuhiro

2018-06-13

Molecular maps of the human brain alone do not inform us of the features unique to humans. Yet, the identification of these features is important for understanding both the evolution and nature of human cognition. Here, we approached this question by analyzing gene expression and H3K27ac chromatin modification data collected in eight brain regions of humans, chimpanzees, gorillas, a gibbon and macaques. An analysis of spatial transcriptome trajectories across eight brain regions in four primate species revealed 1,851 genes showing human-specific transcriptome differences in one or multiple brain regions, in contrast to 240 chimpanzee-specific ones. More than half of these human-specific differences represented elevated expression of genes enriched in neuronal and astrocytic markers in the human hippocampus, while the rest were enriched in microglial markers and displayed human-specific expression in several frontal cortical regions and the cerebellum. An analysis of the predicted regulatory interactions driving these differences revealed the role of transcription factors in species-specific transcriptome changes, while epigenetic modifications were linked to spatial expression differences conserved across species. Published by Cold Spring Harbor Laboratory Press.
Methods for monitoring multiple gene expression

DOEpatents

Berka, Randy; Bachkirova, Elena; Rey, Michael

2013-10-01

The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.
Methods for monitoring multiple gene expression

DOEpatents

Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA

2012-05-01

The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.
Methods for monitoring multiple gene expression

DOEpatents

Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA

2008-06-01

The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.
Single-feature polymorphism discovery in the barley transcriptome

PubMed Central

Rostoks, Nils; Borevitz, Justin O; Hedley, Peter E; Russell, Joanne; Mudie, Sharon; Morris, Jenny; Cardle, Linda; Marshall, David F; Waugh, Robbie

2005-01-01

A probe-level model for analysis of GeneChip gene-expression data is presented which identified more than 10,000 single-feature polymorphisms (SFP) between two barley genotypes. The method has good sensitivity, as 67% of known single-nucleotide polymorphisms (SNP) were called as SFPs. This method is applicable to all oligonucleotide microarray data, accounts for SNP effects in gene-expression data and represents an efficient and versatile approach for highly parallel marker identification in large genomes. PMID:15960806
Modularity and evolutionary constraints in a baculovirus gene regulatory network

PubMed Central

2013-01-01

Background The structure of regulatory networks remains an open question in our understanding of complex biological systems. Interactions during complete viral life cycles present unique opportunities to understand how host-parasite network take shape and behave. The Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is a large double-stranded DNA virus, whose genome may encode for 152 open reading frames (ORFs). Here we present the analysis of the ordered cascade of the AgMNPV gene expression. Results We observed an earlier onset of the expression than previously reported for other baculoviruses, especially for genes involved in DNA replication. Most ORFs were expressed at higher levels in a more permissive host cell line. Genes with more than one copy in the genome had distinct expression profiles, which could indicate the acquisition of new functionalities. The transcription gene regulatory network (GRN) for 149 ORFs had a modular topology comprising five communities of highly interconnected nodes that separated key genes that are functionally related on different communities, possibly maximizing redundancy and GRN robustness by compartmentalization of important functions. Core conserved functions showed expression synchronicity, distinct GRN features and significantly less genetic diversity, consistent with evolutionary constraints imposed in key elements of biological systems. This reduced genetic diversity also had a positive correlation with the importance of the gene in our estimated GRN, supporting a relationship between phylogenetic data of baculovirus genes and network features inferred from expression data. We also observed that gene arrangement in overlapping transcripts was conserved among related baculoviruses, suggesting a principle of genome organization. Conclusions Albeit with a reduced number of nodes (149), the AgMNPV GRN had a topology and key characteristics similar to those observed in complex cellular organisms, which indicates that modularity may be a general feature of biological gene regulatory networks. PMID:24006890
Cell-Type Specific Features of Circular RNA Expression

PubMed Central

Salzman, Julia; Chen, Raymond E.; Olsen, Mari N.; Wang, Peter L.; Brown, Patrick O.

2013-01-01

Thousands of loci in the human and mouse genomes give rise to circular RNA transcripts; at many of these loci, the predominant RNA isoform is a circle. Using an improved computational approach for circular RNA identification, we found widespread circular RNA expression in Drosophila melanogaster and estimate that in humans, circular RNA may account for 1% as many molecules as poly(A) RNA. Analysis of data from the ENCODE consortium revealed that the repertoire of genes expressing circular RNA, the ratio of circular to linear transcripts for each gene, and even the pattern of splice isoforms of circular RNAs from each gene were cell-type specific. These results suggest that biogenesis of circular RNA is an integral, conserved, and regulated feature of the gene expression program. PMID:24039610
PathMAPA: a tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for Arabidopsis.

PubMed

Pan, Deyun; Sun, Ning; Cheung, Kei-Hoi; Guan, Zhong; Ma, Ligeng; Holford, Matthew; Deng, Xingwang; Zhao, Hongyu

2003-11-07

To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s).
PathMAPA: a tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for Arabidopsis

PubMed Central

Pan, Deyun; Sun, Ning; Cheung, Kei-Hoi; Guan, Zhong; Ma, Ligeng; Holford, Matthew; Deng, Xingwang; Zhao, Hongyu

2003-01-01

Background To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. Results We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. Conclusion PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s). PMID:14604444
An epigenetic state associated with areas of gene duplication

PubMed Central

Gimelbrant, Alexander A.; Chess, Andrew

2006-01-01

Asynchronous DNA replication is an epigenetically determined feature found in all cases of monoallelic expression, including genomic imprinting, X-inactivation, and random monoallelic expression of autosomal genes such as immunoglobulins and olfactory receptor genes. Most genes of the latter class were identified in experiments focused on genes functioning in the chemosensory and immune systems. We performed an unbiased survey of asynchronous replication in the mouse genome, excluding known asynchronously replicated genes. Fully 10% (eight of 80) of the genes tested exhibited asynchronous replication. A common feature of the newly identified asynchronously replicated areas is their proximity to areas of tandem gene duplication. Testing of other clustered areas supported the idea that such regions are enriched with asynchronously replicated genes. PMID:16687731
Robust Learning of High-dimensional Biological Networks with Bayesian Networks

NASA Astrophysics Data System (ADS)

Nägele, Andreas; Dejori, Mathäus; Stetter, Martin

Structure learning of Bayesian networks applied to gene expression data has become a potentially useful method to estimate interactions between genes. However, the NP-hardness of Bayesian network structure learning renders the reconstruction of the full genetic network with thousands of genes unfeasible. Consequently, the maximal network size is usually restricted dramatically to a small set of genes (corresponding with variables in the Bayesian network). Although this feature reduction step makes structure learning computationally tractable, on the downside, the learned structure might be adversely affected due to the introduction of missing genes. Additionally, gene expression data are usually very sparse with respect to the number of samples, i.e., the number of genes is much greater than the number of different observations. Given these problems, learning robust network features from microarray data is a challenging task. This chapter presents several approaches tackling the robustness issue in order to obtain a more reliable estimation of learned network features.

TU-CD-BRB-12: Radiogenomics of MRI-Guided Prostate Cancer Biopsy Habitats

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stoyanova, R; Lynne, C; Abraham, S

2015-06-15

Purpose: Diagnostic prostate biopsies are subject to sampling bias. We hypothesize that quantitative imaging with multiparametric (MP)-MRI can more accurately direct targeted biopsies to index lesions associated with highest risk clinical and genomic features. Methods: Regionally distinct prostate habitats were delineated on MP-MRI (T2-weighted, perfusion and diffusion imaging). Directed biopsies were performed on 17 habitats from 6 patients using MRI-ultrasound fusion. Biopsy location was characterized with 52 radiographic features. Transcriptome-wide analysis of 1.4 million RNA probes was performed on RNA from each habitat. Genomics features with insignificant expression values (<0.25) and interquartile range <0.5 were filtered, leaving total of 212more » genes. Correlation between imaging features, genes and a 22 feature genomic classifier (GC), developed as a prognostic assay for metastasis after radical prostatectomy was investigated. Results: High quality genomic data was derived from 17 (100%) biopsies. Using the 212 ‘unbiased’ genes, the samples clustered by patient origin in unsupervised analysis. When only prostate cancer related genomic features were used, hierarchical clustering revealed samples clustered by needle-biopsy Gleason score (GS). Similarly, principal component analysis of the imaging features, found the primary source of variance segregated the samples into high (≥7) and low (6) GS. Pearson’s correlation analysis of genes with significant expression showed two main patterns of gene expression clustering prostate peripheral and transitional zone MRI features. Two-way hierarchical clustering of GC with radiomics features resulted in the expected groupings of high and low expressed genes in this metastasis signature. Conclusions: MP-MRI-targeted diagnostic biopsies can potentially improve risk stratification by directing pathological and genomic analysis to clinically significant index lesions. As determinant lesions are more reliably identified, targeting with radiotherapy should improve outcome. This is the first demonstration of a link between quantitative imaging features (radiomics) with genomic features in MRI-directed prostate biopsies. The research was supported by NIH- NCI R01 CA 189295 and R01 CA 189295; E Davicioni is partial owner of GenomeDx Biosciences, Inc. M Takhar, N Erho, L Lam, C Buerki and E Davicioni are current employees at GenomeDx Biosciences, Inc.« less
Transcriptomic correlates of neuron electrophysiological diversity

PubMed Central

Li, Brenna; Crichlow, Cindy-Lee; Mancarci, B. Ogan; Pavlidis, Paul

2017-01-01

How neuronal diversity emerges from complex patterns of gene expression remains poorly understood. Here we present an approach to understand electrophysiological diversity through gene expression by integrating pooled- and single-cell transcriptomics with intracellular electrophysiology. Using neuroinformatics methods, we compiled a brain-wide dataset of 34 neuron types with paired gene expression and intrinsic electrophysiological features from publically accessible sources, the largest such collection to date. We identified 420 genes whose expression levels significantly correlated with variability in one or more of 11 physiological parameters. We next trained statistical models to infer cellular features from multivariate gene expression patterns. Such models were predictive of gene-electrophysiological relationships in an independent collection of 12 visual cortex cell types from the Allen Institute, suggesting that these correlations might reflect general principles relating expression patterns to phenotypic diversity across very different cell types. Many associations reported here have the potential to provide new insights into how neurons generate functional diversity, and correlations of ion channel genes like Gabrd and Scn1a (Nav1.1) with resting potential and spiking frequency are consistent with known causal mechanisms. Our work highlights the promise and inherent challenges in using cell type-specific transcriptomics to understand the mechanistic origins of neuronal diversity. PMID:29069078
Gene Expression Network Reconstruction by Convex Feature Selection when Incorporating Genetic Perturbations

PubMed Central

Logsdon, Benjamin A.; Mezey, Jason

2010-01-01

Cellular gene expression measurements contain regulatory information that can be used to discover novel network relationships. Here, we present a new algorithm for network reconstruction powered by the adaptive lasso, a theoretically and empirically well-behaved method for selecting the regulatory features of a network. Any algorithms designed for network discovery that make use of directed probabilistic graphs require perturbations, produced by either experiments or naturally occurring genetic variation, to successfully infer unique regulatory relationships from gene expression data. Our approach makes use of appropriately selected cis-expression Quantitative Trait Loci (cis-eQTL), which provide a sufficient set of independent perturbations for maximum network resolution. We compare the performance of our network reconstruction algorithm to four other approaches: the PC-algorithm, QTLnet, the QDG algorithm, and the NEO algorithm, all of which have been used to reconstruct directed networks among phenotypes leveraging QTL. We show that the adaptive lasso can outperform these algorithms for networks of ten genes and ten cis-eQTL, and is competitive with the QDG algorithm for networks with thirty genes and thirty cis-eQTL, with rich topologies and hundreds of samples. Using this novel approach, we identify unique sets of directed relationships in Saccharomyces cerevisiae when analyzing genome-wide gene expression data for an intercross between a wild strain and a lab strain. We recover novel putative network relationships between a tyrosine biosynthesis gene (TYR1), and genes involved in endocytosis (RCY1), the spindle checkpoint (BUB2), sulfonate catabolism (JLP1), and cell-cell communication (PRM7). Our algorithm provides a synthesis of feature selection methods and graphical model theory that has the potential to reveal new directed regulatory relationships from the analysis of population level genetic and gene expression data. PMID:21152011
A Feature Selection Algorithm to Compute Gene Centric Methylation from Probe Level Methylation Data.

PubMed

Baur, Brittany; Bozdag, Serdar

2016-01-01

DNA methylation is an important epigenetic event that effects gene expression during development and various diseases such as cancer. Understanding the mechanism of action of DNA methylation is important for downstream analysis. In the Illumina Infinium HumanMethylation 450K array, there are tens of probes associated with each gene. Given methylation intensities of all these probes, it is necessary to compute which of these probes are most representative of the gene centric methylation level. In this study, we developed a feature selection algorithm based on sequential forward selection that utilized different classification methods to compute gene centric DNA methylation using probe level DNA methylation data. We compared our algorithm to other feature selection algorithms such as support vector machines with recursive feature elimination, genetic algorithms and ReliefF. We evaluated all methods based on the predictive power of selected probes on their mRNA expression levels and found that a K-Nearest Neighbors classification using the sequential forward selection algorithm performed better than other algorithms based on all metrics. We also observed that transcriptional activities of certain genes were more sensitive to DNA methylation changes than transcriptional activities of other genes. Our algorithm was able to predict the expression of those genes with high accuracy using only DNA methylation data. Our results also showed that those DNA methylation-sensitive genes were enriched in Gene Ontology terms related to the regulation of various biological processes.
A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.

PubMed

Chen, Zhenyu; Li, Jianping; Wei, Liwei

2007-10-01

Recently, gene expression profiling using microarray techniques has been shown as a promising tool to improve the diagnosis and treatment of cancer. Gene expression data contain high level of noise and the overwhelming number of genes relative to the number of available samples. It brings out a great challenge for machine learning and statistic techniques. Support vector machine (SVM) has been successfully used to classify gene expression data of cancer tissue. In the medical field, it is crucial to deliver the user a transparent decision process. How to explain the computed solutions and present the extracted knowledge becomes a main obstacle for SVM. A multiple kernel support vector machine (MK-SVM) scheme, consisting of feature selection, rule extraction and prediction modeling is proposed to improve the explanation capacity of SVM. In this scheme, we show that the feature selection problem can be translated into an ordinary multiple parameters learning problem. And a shrinkage approach: 1-norm based linear programming is proposed to obtain the sparse parameters and the corresponding selected features. We propose a novel rule extraction approach using the information provided by the separating hyperplane and support vectors to improve the generalization capacity and comprehensibility of rules and reduce the computational complexity. Two public gene expression datasets: leukemia dataset and colon tumor dataset are used to demonstrate the performance of this approach. Using the small number of selected genes, MK-SVM achieves encouraging classification accuracy: more than 90% for both two datasets. Moreover, very simple rules with linguist labels are extracted. The rule sets have high diagnostic power because of their good classification performance.
Identifying Epigenetic Biomarkers using Maximal Relevance and Minimal Redundancy Based Feature Selection for Multi-Omics Data.

PubMed

Mallik, Saurav; Bhadra, Tapas; Maulik, Ujjwal

2017-01-01

Epigenetic Biomarker discovery is an important task in bioinformatics. In this article, we develop a new framework of identifying statistically significant epigenetic biomarkers using maximal-relevance and minimal-redundancy criterion based feature (gene) selection for multi-omics dataset. Firstly, we determine the genes that have both expression as well as methylation values, and follow normal distribution. Similarly, we identify the genes which consist of both expression and methylation values, but do not follow normal distribution. For each case, we utilize a gene-selection method that provides maximal-relevant, but variable-weighted minimum-redundant genes as top ranked genes. For statistical validation, we apply t-test on both the expression and methylation data consisting of only the normally distributed top ranked genes to determine how many of them are both differentially expressed andmethylated. Similarly, we utilize Limma package for performing non-parametric Empirical Bayes test on both expression and methylation data comprising only the non-normally distributed top ranked genes to identify how many of them are both differentially expressed and methylated. We finally report the top-ranking significant gene-markerswith biological validation. Moreover, our framework improves positive predictive rate and reduces false positive rate in marker identification. In addition, we provide a comparative analysis of our gene-selection method as well as othermethods based on classificationperformances obtained using several well-known classifiers.
Predicting features of breast cancer with gene expression patterns.

PubMed

Lu, Xuesong; Lu, Xin; Wang, Zhigang C; Iglehart, J Dirk; Zhang, Xuegong; Richardson, Andrea L

2008-03-01

Data from gene expression arrays hold an enormous amount of biological information. We sought to determine if global gene expression in primary breast cancers contained information about biologic, histologic, and anatomic features of the disease in individual patients. Microarray data from the tumors of 129 patients were analyzed for the ability to predict biomarkers [estrogen receptor (ER) and HER2], histologic features [grade and lymphatic-vascular invasion (LVI)], and stage parameters (tumor size and lymph node metastasis). Multiple statistical predictors were used and the prediction accuracy was determined by cross-validation error rate; multidimensional scaling (MDS) allowed visualization of the predicted states under study. Models built from gene expression data accurately predict ER and HER2 status, and divide tumor grade into high-grade and low-grade clusters; intermediate-grade tumors are not a unique group. In contrast, gene expression data is inaccurate at predicting tumor size, lymph node status or LVI. The best model for prediction of nodal status included tumor size, LVI status and pathologically defined tumor subtype (based on combinations of ER, HER2, and grade); the addition of microarray-based prediction to this model failed to improve the prediction accuracy. Global gene expression supports a binary division of ER, HER2, and grade, clearly separating tumors into two categories; intermediate values for these bio-indicators do not define intermediate tumor subsets. Results are consistent with a model of regional metastasis that depends on inherent biologic differences in metastatic propensity between breast cancer subtypes, upon which time and chance then operate.
Initial leukemic gene expression profiles of patients with poor in vivo prednisone response are similar to those of blasts persisting under prednisone treatment in childhood acute lymphoblastic leukemia.

PubMed

Cario, Gunnar; Fetz, Andrea; Bretscher, Christian; Möricke, Anja; Schrauder, Andre; Stanulla, Martin; Schrappe, Martin

2008-09-01

Response to initial glucocorticoid (GC) treatment is a strong prognostic factor in childhood acute lymphoblastic leukemia (ALL). Patients with a poor prednisone response (PPR) have a poor event-free survival as compared to those with a good prednisone response (PGR). Causes of prednisone resistance are still not well understood. We hypothesized that GC resistance is an intrinsic feature of ALL cells which is reflected in the gene expression pattern and analyzed genome-wide gene expression using microarrays. A case-control study was performed comparing gene expression profiles from initial ALL samples of 20 patients with PPR and those of 20 patients with PGR. Differential gene expression of a subset of genes was confirmed by real-time quantitative polymerase chain reaction analysis and validation was performed in a second independent patient sample (n=20). We identified 121 genes that clearly distinguished prednisone-resistant from sensitive ALL samples (FDR<5%, fold change>or=1.5). Differential gene expression of 21 of these genes could be validated in a second independent set. Of importance, there was a remarkable concordance of genes identified by comparing expression signatures of PPR and PGR cells at diagnosis and those previously described to be up- or downregulated in leukemic cells persisting under GC treatment. Thus, GC resistance seems at least in part to be an intrinsic feature of leukemic cells. Leukemic cells of patients with PPR are characterized by gene expression pattern which are similar to those of resistant cells persisting under glucocorticoid treatment.
Genes@Work: an efficient algorithm for pattern discovery and multivariate feature selection in gene expression data.

PubMed

Lepre, Jorge; Rice, J Jeremy; Tu, Yuhai; Stolovitzky, Gustavo

2004-05-01

Despite the growing literature devoted to finding differentially expressed genes in assays probing different tissues types, little attention has been paid to the combinatorial nature of feature selection inherent to large, high-dimensional gene expression datasets. New flexible data analysis approaches capable of searching relevant subgroups of genes and experiments are needed to understand multivariate associations of gene expression patterns with observed phenotypes. We present in detail a deterministic algorithm to discover patterns of multivariate gene associations in gene expression data. The patterns discovered are differential with respect to a control dataset. The algorithm is exhaustive and efficient, reporting all existent patterns that fit a given input parameter set while avoiding enumeration of the entire pattern space. The value of the pattern discovery approach is demonstrated by finding a set of genes that differentiate between two types of lymphoma. Moreover, these genes are found to behave consistently in an independent dataset produced in a different laboratory using different arrays, thus validating the genes selected using our algorithm. We show that the genes deemed significant in terms of their multivariate statistics will be missed using other methods. Our set of pattern discovery algorithms including a user interface is distributed as a package called Genes@Work. This package is freely available to non-commercial users and can be downloaded from our website (http://www.research.ibm.com/FunGen).
Functional clustering of time series gene expression data by Granger causality

PubMed Central

2012-01-01

Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425
A microarray study of gene and protein regulation in human and rat brain following middle cerebral artery occlusion

PubMed Central

Mitsios, Nick; Saka, Mohamad; Krupinski, Jerzy; Pennucci, Roberta; Sanfeliu, Coral; Wang, Qiuyu; Rubio, Francisco; Gaffney, John; Kumar, Pat; Kumar, Shant; Sullivan, Matthew; Slevin, Mark

2007-01-01

Background Altered gene expression is an important feature of ischemic cerebral injury and affects proteins of many functional classes. We have used microarrays to investigate the changes in gene expression at various times after middle cerebral artery occlusion in human and rat brain. Results Our results demonstrated a significant difference in the number of genes affected and the time-course of expression between the two cases. The total number of deregulated genes in the rat was 335 versus 126 in the human, while, of 393 overlapping genes between the two array sets, 184 were changed only in the rat and 36 in the human with a total of 41 genes deregulated in both cases. Interestingly, the mean fold changes were much higher in the human. The expression of novel genes, including p21-activated kinase 1 (PAK1), matrix metalloproteinase 11 (MMP11) and integrase interactor 1, was further analyzed by RT-PCR, Western blotting and immunohistochemistry. Strong neuronal staining was seen for PAK1 and MMP11. Conclusion Our findings confirmed previous studies reporting that gene expression screening can detect known and unknown transcriptional features of stroke and highlight the importance of research using human brain tissue in the search for novel therapeutic agents. PMID:17997827
Identifying spatially similar gene expression patterns in early stage fruit fly embryo images: binary feature versus invariant moment digital representations

PubMed Central

Gurunathan, Rajalakshmi; Van Emden, Bernard; Panchanathan, Sethuraman; Kumar, Sudhir

2004-01-01

Background Modern developmental biology relies heavily on the analysis of embryonic gene expression patterns. Investigators manually inspect hundreds or thousands of expression patterns to identify those that are spatially similar and to ultimately infer potential gene interactions. However, the rapid accumulation of gene expression pattern data over the last two decades, facilitated by high-throughput techniques, has produced a need for the development of efficient approaches for direct comparison of images, rather than their textual descriptions, to identify spatially similar expression patterns. Results The effectiveness of the Binary Feature Vector (BFV) and Invariant Moment Vector (IMV) based digital representations of the gene expression patterns in finding biologically meaningful patterns was compared for a small (226 images) and a large (1819 images) dataset. For each dataset, an ordered list of images, with respect to a query image, was generated to identify overlapping and similar gene expression patterns, in a manner comparable to what a developmental biologist might do. The results showed that the BFV representation consistently outperforms the IMV representation in finding biologically meaningful matches when spatial overlap of the gene expression pattern and the genes involved are considered. Furthermore, we explored the value of conducting image-content based searches in a dataset where individual expression components (or domains) of multi-domain expression patterns were also included separately. We found that this technique improves performance of both IMV and BFV based searches. Conclusions We conclude that the BFV representation consistently produces a more extensive and better list of biologically useful patterns than the IMV representation. The high quality of results obtained scales well as the search database becomes larger, which encourages efforts to build automated image query and retrieval systems for spatial gene expression patterns. PMID:15603586
Tensor decomposition-based and principal-component-analysis-based unsupervised feature extraction applied to the gene expression and methylation profiles in the brains of social insects with multiple castes.

PubMed

Taguchi, Y-H

2018-05-08

Even though coexistence of multiple phenotypes sharing the same genomic background is interesting, it remains incompletely understood. Epigenomic profiles may represent key factors, with unknown contributions to the development of multiple phenotypes, and social-insect castes are a good model for elucidation of the underlying mechanisms. Nonetheless, previous studies have failed to identify genes associated with aberrant gene expression and methylation profiles because of the lack of suitable methodology that can address this problem properly. A recently proposed principal component analysis (PCA)-based and tensor decomposition (TD)-based unsupervised feature extraction (FE) can solve this problem because these two approaches can deal with gene expression and methylation profiles even when a small number of samples is available. PCA-based and TD-based unsupervised FE methods were applied to the analysis of gene expression and methylation profiles in the brains of two social insects, Polistes canadensis and Dinoponera quadriceps. Genes associated with differential expression and methylation between castes were identified, and analysis of enrichment of Gene Ontology terms confirmed reliability of the obtained sets of genes from the biological standpoint. Biologically relevant genes, shown to be associated with significant differential gene expression and methylation between castes, were identified here for the first time. The identification of these genes may help understand the mechanisms underlying epigenetic control of development of multiple phenotypes under the same genomic conditions.
Transcriptomic features associated with energy production in the muscles of Pacific bluefin tuna and Pacific cod.

PubMed

Shibata, Mami; Mekuchi, Miyuki; Mori, Kazuki; Muta, Shigeru; Chowdhury, Vishwajit Sur; Nakamura, Yoji; Ojima, Nobuhiko; Saitoh, Kenji; Kobayashi, Takanori; Wada, Tokio; Inouye, Kiyoshi; Kuhara, Satoru; Tashiro, Kosuke

2016-06-01

Bluefin tuna are high-performance swimmers and top predators in the open ocean. Their swimming is grounded by unique features including an exceptional glycolytic potential in white muscle, which is supported by high enzymatic activities. Here we performed high-throughput RNA sequencing (RNA-Seq) in muscles of the Pacific bluefin tuna (Thunnus orientalis) and Pacific cod (Gadus macrocephalus) and conducted a comparative transcriptomic analysis of genes related to energy production. We found that the total expression of glycolytic genes was much higher in the white muscle of tuna than in the other muscles, and that the expression of only six genes for glycolytic enzymes accounted for 83.4% of the total. These expression patterns were in good agreement with the patterns of enzyme activity previously reported. The findings suggest that the mRNA expression of glycolytic genes may contribute directly to the enzymatic activities in the muscles of tuna.
The Model-Based Study of the Effectiveness of Reporting Lists of Small Feature Sets Using RNA-Seq Data.

PubMed

Kim, Eunji; Ivanov, Ivan; Hua, Jianping; Lampe, Johanna W; Hullar, Meredith Aj; Chapkin, Robert S; Dougherty, Edward R

2017-01-01

Ranking feature sets for phenotype classification based on gene expression is a challenging issue in cancer bioinformatics. When the number of samples is small, all feature selection algorithms are known to be unreliable, producing significant error, and error estimators suffer from different degrees of imprecision. The problem is compounded by the fact that the accuracy of classification depends on the manner in which the phenomena are transformed into data by the measurement technology. Because next-generation sequencing technologies amount to a nonlinear transformation of the actual gene or RNA concentrations, they can potentially produce less discriminative data relative to the actual gene expression levels. In this study, we compare the performance of ranking feature sets derived from a model of RNA-Seq data with that of a multivariate normal model of gene concentrations using 3 measures: (1) ranking power, (2) length of extensions, and (3) Bayes features. This is the model-based study to examine the effectiveness of reporting lists of small feature sets using RNA-Seq data and the effects of different model parameters and error estimators. The results demonstrate that the general trends of the parameter effects on the ranking power of the underlying gene concentrations are preserved in the RNA-Seq data, whereas the power of finding a good feature set becomes weaker when gene concentrations are transformed by the sequencing machine.
Distinct gene expression profiles determine molecular treatment response in childhood acute lymphoblastic leukemia.

PubMed

Cario, Gunnar; Stanulla, Martin; Fine, Bernard M; Teuffel, Oliver; Neuhoff, Nils V; Schrauder, André; Flohr, Thomas; Schäfer, Beat W; Bartram, Claus R; Welte, Karl; Schlegelberger, Brigitte; Schrappe, Martin

2005-01-15

Treatment resistance, as indicated by the presence of high levels of minimal residual disease (MRD) after induction therapy and induction consolidation, is associated with a poor prognosis in childhood acute lymphoblastic leukemia (ALL). We hypothesized that treatment resistance is an intrinsic feature of ALL cells reflected in the gene expression pattern and that resistance to chemotherapy can be predicted before treatment. To test these hypotheses, gene expression signatures of ALL samples with high MRD load were compared with those of samples without measurable MRD during treatment. We identified 54 genes that clearly distinguished resistant from sensitive ALL samples. Genes with low expression in resistant samples were predominantly associated with cell-cycle progression and apoptosis, suggesting that impaired cell proliferation and apoptosis are involved in treatment resistance. Prediction analysis using randomly selected samples as a training set and the remaining samples as a test set revealed an accuracy of 84%. We conclude that resistance to chemotherapy seems at least in part to be an intrinsic feature of ALL cells. Because treatment response could be predicted with high accuracy, gene expression profiling could become a clinically relevant tool for treatment stratification in the early course of childhood ALL.
Healthy Nordic diet downregulates the expression of genes involved in inflammation in subcutaneous adipose tissue in individuals with features of the metabolic syndrome.

PubMed

Kolehmainen, Marjukka; Ulven, Stine M; Paananen, Jussi; de Mello, Vanessa; Schwab, Ursula; Carlberg, Carsten; Myhrstad, Mari; Pihlajamäki, Jussi; Dungner, Elisabeth; Sjölin, Eva; Gunnarsdottir, Ingibjörg; Cloetens, Lieselotte; Landin-Olsson, Mona; Akesson, Björn; Rosqvist, Fredrik; Hukkanen, Janne; Herzig, Karl-Heinz; Dragsted, Lars O; Savolainen, Markku J; Brader, Lea; Hermansen, Kjeld; Risérus, Ulf; Thorsdottir, Inga; Poutanen, Kaisa S; Uusitupa, Matti; Arner, Peter; Dahlman, Ingrid

2015-01-01

Previously, a healthy Nordic diet (ND) has been shown to have beneficial health effects close to those of Mediterranean diets. The objective was to explore whether the ND has an impact on gene expression in abdominal subcutaneous adipose tissue (SAT) and whether changes in gene expression are associated with clinical and biochemical effects. Obese adults with features of the metabolic syndrome underwent an 18- to 24-wk randomized intervention study comparing the ND with the control diet (CD) (the SYSDIET study, carried out within Nordic Centre of Excellence of the Systems Biology in Controlled Dietary Interventions and Cohort Studies). The present study included participants from 3 Nordic SYSDIET centers [Kuopio (n = 20), Lund (n = 18), and Oulu (n = 18)] with a maximum weight change of ±4 kg, highly sensitive C-reactive protein concentration <10 mg/L at the beginning and the end of the intervention, and baseline body mass index (in kg/m²) <38. SAT biopsy specimens were obtained before and after the intervention and subjected to global transcriptome analysis with Gene 1.1 ST Arrays (Affymetrix). Altogether, 128 genes were differentially expressed in SAT between the ND and CD (nominal P < 0.01; false discovery rate, 25%). These genes were overrepresented in pathways related to immune response (adjusted P = 0.0076), resulting mainly from slightly decreased expression in the ND and increased expression in the CD. Immune-related pathways included leukocyte trafficking and macrophage recruitment (e.g., interferon regulatory factor 1, CD97), adaptive immune response (interleukin32, interleukin 6 receptor), and reactive oxygen species (neutrophil cytosolic factor 1). Interestingly, the regulatory region of the 128 genes was overrepresented for binding sites for the nuclear transcription factor κB. A healthy Nordic diet reduces inflammatory gene expression in SAT compared with a control diet independently of body weight change in individuals with features of the metabolic syndrome. © 2015 American Society for Nutrition.
Analysis of temporal transcription expression profiles reveal links between protein function and developmental stages of Drosophila melanogaster.

PubMed

Wan, Cen; Lees, Jonathan G; Minneci, Federico; Orengo, Christine A; Jones, David T

2017-10-01

Accurate gene or protein function prediction is a key challenge in the post-genome era. Most current methods perform well on molecular function prediction, but struggle to provide useful annotations relating to biological process functions due to the limited power of sequence-based features in that functional domain. In this work, we systematically evaluate the predictive power of temporal transcription expression profiles for protein function prediction in Drosophila melanogaster. Our results show significantly better performance on predicting protein function when transcription expression profile-based features are integrated with sequence-derived features, compared with the sequence-derived features alone. We also observe that the combination of expression-based and sequence-based features leads to further improvement of accuracy on predicting all three domains of gene function. Based on the optimal feature combinations, we then propose a novel multi-classifier-based function prediction method for Drosophila melanogaster proteins, FFPred-fly+. Interpreting our machine learning models also allows us to identify some of the underlying links between biological processes and developmental stages of Drosophila melanogaster.
Conservation of Pax gene expression in ectodermal placodes of the lamprey

NASA Technical Reports Server (NTRS)

McCauley, David W.; Bronner-Fraser, Marianne

2002-01-01

Ectodermal placodes contribute to the cranial ganglia and sense organs of the head and, together with neural crest cells, represent defining features of the vertebrate embryo. The identity of different placodes appears to be specified in part by the expression of different Pax genes, with Pax-3/7 class genes being expressed in the trigeminal placode of mice, chick, frogs and fish, and Pax-2/5/8 class genes expressed in the otic placode. Here, we present the cloning and expression pattern of lamprey Pax-7 and Pax-2, which mark the trigeminal and otic placodes, respectively, as well as other structures characteristic of vertebrate Pax genes. These results suggest conservation of Pax genes and placodal structures in basal and derived vertebrates.
The Cross-Entropy Based Multi-Filter Ensemble Method for Gene Selection.

PubMed

Sun, Yingqiang; Lu, Chengbo; Li, Xiaobo

2018-05-17

The gene expression profile has the characteristics of a high dimension, low sample, and continuous type, and it is a great challenge to use gene expression profile data for the classification of tumor samples. This paper proposes a cross-entropy based multi-filter ensemble (CEMFE) method for microarray data classification. Firstly, multiple filters are used to select the microarray data in order to obtain a plurality of the pre-selected feature subsets with a different classification ability. The top N genes with the highest rank of each subset are integrated so as to form a new data set. Secondly, the cross-entropy algorithm is used to remove the redundant data in the data set. Finally, the wrapper method, which is based on forward feature selection, is used to select the best feature subset. The experimental results show that the proposed method is more efficient than other gene selection methods and that it can achieve a higher classification accuracy under fewer characteristic genes.

Transcriptome analysis reveals differential gene expression in intramuscular adipose tissues of Jinhua and Landrace pigs.

PubMed

Miao, Zhiguo; Wei, Panpeng; Khan, Muhammad Akram; Zhang, Jinzhou; Guo, Liping; Liu, Dongyang; Zhang, Xiaojian; Bai, Yueyu; Wang, Shan

2018-05-01

Meat is a rich source of protein, fatty acids and carbohydrates for human needs. In addition to necessary nutrients, high fat contents in pork increase the tenderness and juiciness of the meat, featuring diverse application in various dishes. This study investigated the transcriptomic profiles of intramuscular adipose tissues in Jinhua and Landrace pigs by employing advanced RNA sequencing. Results showed significant interesting to note that there were significant differences in the expression of genes. 1,632 genes showed significant differential expression, 837 genes were up-regulated and 195 genes were down-regulated. Variations in genes responsible for cell aggregation, extracellular matrix formation, cellular lipid catabolic process, and fatty acid binding strongly supported that both pig breeds feature variable fat and muscle metabolism. Certain differentially expressed genes are included in the pathway of mitogen-activated protein kinase signaling pathway, Ras signaling pathway and insulin pathway. Results from real-time quantitative polymerase chain reaction also validated the differential expression of 17 mRNAs between meats of the two pig breeds. Overall, these findings reveal significant differences in fat and protein metabolism of intramuscular adipose tissues of two pig breeds at the transcriptomic level and suggest diversification at the genetic level between breeds of the same species.
Novel gene sets improve set-level classification of prokaryotic gene expression data.

PubMed

Holec, Matěj; Kuželka, Ondřej; Železný, Filip

2015-10-28

Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.
ARNetMiT R Package: association rules based gene co-expression networks of miRNA targets.

PubMed

Özgür Cingiz, M; Biricik, G; Diri, B

2017-03-31

miRNAs are key regulators that bind to target genes to suppress their gene expression level. The relations between miRNA-target genes enable users to derive co-expressed genes that may be involved in similar biological processes and functions in cells. We hypothesize that target genes of miRNAs are co-expressed, when they are regulated by multiple miRNAs. With the usage of these co-expressed genes, we can theoretically construct co-expression networks (GCNs) related to 152 diseases. In this study, we introduce ARNetMiT that utilize a hash based association rule algorithm in a novel way to infer the GCNs on miRNA-target genes data. We also present R package of ARNetMiT, which infers and visualizes GCNs of diseases that are selected by users. Our approach assumes miRNAs as transactions and target genes as their items. Support and confidence values are used to prune association rules on miRNA-target genes data to construct support based GCNs (sGCNs) along with support and confidence based GCNs (scGCNs). We use overlap analysis and the topological features for the performance analysis of GCNs. We also infer GCNs with popular GNI algorithms for comparison with the GCNs of ARNetMiT. Overlap analysis results show that ARNetMiT outperforms the compared GNI algorithms. We see that using high confidence values in scGCNs increase the ratio of the overlapped gene-gene interactions between the compared methods. According to the evaluation of the topological features of ARNetMiT based GCNs, the degrees of nodes have power-law distribution. The hub genes discovered by ARNetMiT based GCNs are consistent with the literature.
NIMEFI: gene regulatory network inference using multiple ensemble feature importance algorithms.

PubMed

Ruyssinck, Joeri; Huynh-Thu, Vân Anh; Geurts, Pierre; Dhaene, Tom; Demeester, Piet; Saeys, Yvan

2014-01-01

One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available.
FOXI2: a possible gene contributing to ectodermal dysplasia.

PubMed

Kurban, Mazen; Zeineddine, Savo Bou; Hamie, Lamiaa; Safi, Remi; Abbas, Ossama; Kibbi, Abdul Ghani; Bitar, Fadi; Nemer, Georges

2017-12-01

Cardio-facio-cutaneous syndrome (CFC), Noonan syndrome (NS), and Costello syndrome are a group of diseases that belong to the RASopathies. The syndromes share clinical features making diagnosis a challenge. To investigate the phenotype and genotype of a 10-year-old Iraqi girl with overlapping features of CFC, NS, and Costello syndromes, with additional features of ectodermal dysplasia. DNA was examined by exome sequencing and protein expression by immunohistochemistry. Exome sequencing identified a mutation in the SOS1 gene and a de novo deletion in the FOXI2 gene which was neither present in the international databases, nor in 400 chromosomes from the same population. Based on immunohistochemical staining, FOXI2 was identified in the basal cell layer of the skin and overlapped with the expression of P63, a major player in ectodermal dysplasia. We therefore suggest screening for FOXI2 mutation in the setting of ectodermal features that are not associated with genes known to contribute to ectodermal dysplasia.
Case-based retrieval framework for gene expression data.

PubMed

Anaissi, Ali; Goyal, Madhu; Catchpoole, Daniel R; Braytee, Ali; Kennedy, Paul J

2015-01-01

The process of retrieving similar cases in a case-based reasoning system is considered a big challenge for gene expression data sets. The huge number of gene expression values generated by microarray technology leads to complex data sets and similarity measures for high-dimensional data are problematic. Hence, gene expression similarity measurements require numerous machine-learning and data-mining techniques, such as feature selection and dimensionality reduction, to be incorporated into the retrieval process. This article proposes a case-based retrieval framework that uses a k-nearest-neighbor classifier with a weighted-feature-based similarity to retrieve previously treated patients based on their gene expression profiles. The herein-proposed methodology is validated on several data sets: a childhood leukemia data set collected from The Children's Hospital at Westmead, as well as the Colon cancer, the National Cancer Institute (NCI), and the Prostate cancer data sets. Results obtained by the proposed framework in retrieving patients of the data sets who are similar to new patients are as follows: 96% accuracy on the childhood leukemia data set, 95% on the NCI data set, 93% on the Colon cancer data set, and 98% on the Prostate cancer data set. The designed case-based retrieval framework is an appropriate choice for retrieving previous patients who are similar to a new patient, on the basis of their gene expression data, for better diagnosis and treatment of childhood leukemia. Moreover, this framework can be applied to other gene expression data sets using some or all of its steps.
Asthma-COPD overlap. Clinical relevance of genomic signatures of type 2 inflammation in chronic obstructive pulmonary disease.

PubMed

Christenson, Stephanie A; Steiling, Katrina; van den Berge, Maarten; Hijazi, Kahkeshan; Hiemstra, Pieter S; Postma, Dirkje S; Lenburg, Marc E; Spira, Avrum; Woodruff, Prescott G

2015-04-01

Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease and likely includes a subgroup that is biologically comparable to asthma. Studying asthma-associated gene expression changes in COPD could add insight into COPD pathogenesis and reveal biomarkers that predict a favorable response to corticosteroids. To determine whether asthma-associated gene signatures are increased in COPD and associated with asthma-related features. We compared disease-associated airway epithelial gene expression alterations in an asthma cohort (n = 105) and two COPD cohorts (n = 237, 171). The T helper type 2 (Th2) signature (T2S) score, a gene expression metric induced in Th2-high asthma, was evaluated in these COPD cohorts. The T2S score was correlated with asthma-related features and response to corticosteroids in COPD in a randomized, placebo-controlled trial, the Groningen and Leiden Universities study of Corticosteroids in Obstructive Lung Disease (GLUCOLD; n = 89). The 200 genes most differentially expressed in asthma versus healthy control subjects were enriched among genes associated with more severe airflow obstruction in these COPD cohorts (P < 0.001), suggesting significant gene expression overlap. A higher T2S score was associated with decreased lung function (P < 0.001), but not asthma history, in both COPD cohorts. Higher T2S scores correlated with increased airway wall eosinophil counts (P = 0.003), blood eosinophil percentage (P = 0.03), bronchodilator reversibility (P = 0.01), and improvement in hyperinflation after corticosteroid treatment (P = 0.019) in GLUCOLD. These data identify airway gene expression alterations that can co-occur in asthma and COPD. The association of the T2S score with increased severity and "asthma-like" features (including a favorable corticosteroid response) in COPD suggests that Th2 inflammation is important in a COPD subset that cannot be identified by clinical history of asthma.
Asthma–COPD Overlap. Clinical Relevance of Genomic Signatures of Type 2 Inflammation in Chronic Obstructive Pulmonary Disease

PubMed Central

Steiling, Katrina; van den Berge, Maarten; Hijazi, Kahkeshan; Hiemstra, Pieter S.; Postma, Dirkje S.; Lenburg, Marc E.; Spira, Avrum; Woodruff, Prescott G.

2015-01-01

Rationale: Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease and likely includes a subgroup that is biologically comparable to asthma. Studying asthma-associated gene expression changes in COPD could add insight into COPD pathogenesis and reveal biomarkers that predict a favorable response to corticosteroids. Objectives: To determine whether asthma-associated gene signatures are increased in COPD and associated with asthma-related features. Methods: We compared disease-associated airway epithelial gene expression alterations in an asthma cohort (n = 105) and two COPD cohorts (n = 237, 171). The T helper type 2 (Th2) signature (T2S) score, a gene expression metric induced in Th2-high asthma, was evaluated in these COPD cohorts. The T2S score was correlated with asthma-related features and response to corticosteroids in COPD in a randomized, placebo-controlled trial, the Groningen and Leiden Universities study of Corticosteroids in Obstructive Lung Disease (GLUCOLD; n = 89). Measurements and Main Results: The 200 genes most differentially expressed in asthma versus healthy control subjects were enriched among genes associated with more severe airflow obstruction in these COPD cohorts (P < 0.001), suggesting significant gene expression overlap. A higher T2S score was associated with decreased lung function (P < 0.001), but not asthma history, in both COPD cohorts. Higher T2S scores correlated with increased airway wall eosinophil counts (P = 0.003), blood eosinophil percentage (P = 0.03), bronchodilator reversibility (P = 0.01), and improvement in hyperinflation after corticosteroid treatment (P = 0.019) in GLUCOLD. Conclusions: These data identify airway gene expression alterations that can co-occur in asthma and COPD. The association of the T2S score with increased severity and “asthma-like” features (including a favorable corticosteroid response) in COPD suggests that Th2 inflammation is important in a COPD subset that cannot be identified by clinical history of asthma. PMID:25611785
Pathway activity inference for multiclass disease classification through a mathematical programming optimisation framework.

PubMed

Yang, Lingjian; Ainali, Chrysanthi; Tsoka, Sophia; Papageorgiou, Lazaros G

2014-12-05

Applying machine learning methods on microarray gene expression profiles for disease classification problems is a popular method to derive biomarkers, i.e. sets of genes that can predict disease state or outcome. Traditional approaches where expression of genes were treated independently suffer from low prediction accuracy and difficulty of biological interpretation. Current research efforts focus on integrating information on protein interactions through biochemical pathway datasets with expression profiles to propose pathway-based classifiers that can enhance disease diagnosis and prognosis. As most of the pathway activity inference methods in literature are either unsupervised or applied on two-class datasets, there is good scope to address such limitations by proposing novel methodologies. A supervised multiclass pathway activity inference method using optimisation techniques is reported. For each pathway expression dataset, patterns of its constituent genes are summarised into one composite feature, termed pathway activity, and a novel mathematical programming model is proposed to infer this feature as a weighted linear summation of expression of its constituent genes. Gene weights are determined by the optimisation model, in a way that the resulting pathway activity has the optimal discriminative power with regards to disease phenotypes. Classification is then performed on the resulting low-dimensional pathway activity profile. The model was evaluated through a variety of published gene expression profiles that cover different types of disease. We show that not only does it improve classification accuracy, but it can also perform well in multiclass disease datasets, a limitation of other approaches from the literature. Desirable features of the model include the ability to control the maximum number of genes that may participate in determining pathway activity, which may be pre-specified by the user. Overall, this work highlights the potential of building pathway-based multi-phenotype classifiers for accurate disease diagnosis and prognosis problems.
A multigene locus containing the Manx and bobcat genes is required for development of chordate features in the ascidian tadpole larva.

PubMed

Swalla, B J; Just, M A; Pederson, E L; Jeffery, W R

1999-04-01

The Manx gene is required for the development of the tail and other chordate features in the ascidian tadpole larva. To determine the structure of the Manx gene, we isolated and sequenced genomic clones from the tailed ascidian Molgula oculata. The Manx gene contains 9 exons and encodes both major and minor Manx mRNAs, which differ in the length of their 5' untranslated regions. The coding region of the single-copy bobcat gene, which encodes a DEAD-box RNA helicase, is embedded within the first Manx intron. The organization of the bobcat and Manx transcription units was determined by comparing genomic and cDNA clones. The Manx-bobcat gene locus has an unusual organization in which a non-coding first exon is alternatively spliced at the 5' end of two different mRNAs. The bobcat and Manx genes are expressed coordinately during oogenesis and embryogenesis, but not during spermatogenesis, in which bobcat mRNA accumulates independently of Manx mRNA. Similar to Manx, zygotic bobcat transcripts accumulate in the embryonic primordia responsible for generating chordate features, including the dorsal neural tube and notochord, are downregulated during embryogenesis in the tailless species Molgula occulta and are upregulated in M. occulta X M. oculata hybrids, which restore these chordate features. Antisense experiments indicate that zygotic bobcat expression is required for development of the same suite of chordate features as Manx. The results show that the Manx-bobcat gene complex has a role in the development of chordate features in ascidian tadpole larvae.
Disruption of DNA methylation-dependent long gene repression in Rett syndrome

PubMed Central

Gabel, Harrison W.; Kinde, Benyam Z.; Stroud, Hume; Gilbert, Caitlin S.; Harmin, David A.; Kastan, Nathaniel R.; Hemberg, Martin; Ebert, Daniel H.; Greenberg, Michael E.

2015-01-01

Disruption of the MECP2 gene leads to Rett syndrome (RTT), a severe neurological disorder with features of autism1. MECP2 encodes a methyl-DNA-binding protein2 that has been proposed to function as a transcriptional repressor, but despite numerous studies examining neuronal gene expression in Mecp2 mutants, no clear model has emerged for how MeCP2 regulates transcription3–9. Here we identify a genome-wide length-dependent increase in gene expression in MeCP2 mutant mouse models and human RTT brains. We present evidence that MeCP2 represses gene expression by binding to methylated CA sites within long genes, and that in neurons lacking MeCP2, decreasing the expression of long genes attenuates RTT-associated cellular deficits. In addition, we find that long genes as a population are enriched for neuronal functions and selectively expressed in the brain. These findings suggest that mutations in MeCP2 may cause neurological dysfunction by specifically disrupting long gene expression in the brain. PMID:25762136
Rationally designed, heterologous S. cerevisiae transcripts expose novel expression determinants

PubMed Central

Ben-Yehezkel, Tuval; Atar, Shimshi; Zur, Hadas; Diament, Alon; Goz, Eli; Marx, Tzipy; Cohen, Rafael; Dana, Alexandra; Feldman, Anna; Shapiro, Ehud; Tuller, Tamir

2015-01-01

Deducing generic causal relations between RNA transcript features and protein expression profiles from endogenous gene expression data remains a major unsolved problem in biology. The analysis of gene expression from heterologous genes contributes significantly to solving this problem, but has been heavily biased toward the study of the effect of 5′ transcript regions and to prokaryotes. Here, we employ a synthetic biology driven approach that systematically differentiates the effect of different regions of the transcript on gene expression up to 240 nucleotides into the ORF. This enabled us to discover new causal effects between features in previously unexplored regions of transcripts, and gene expression in natural regimes. We rationally designed, constructed, and analyzed 383 gene variants of the viral HRSVgp04 gene ORF, with multiple synonymous mutations at key positions along the transcript in the eukaryote S. cerevisiae. Our results show that a few silent mutations at the 5′UTR can have a dramatic effect of up to 15 fold change on protein levels, and that even synonymous mutations in positions more than 120 nucleotides downstream from the ORF 5′end can modulate protein levels up to 160%–300%. We demonstrate that the correlation between protein levels and folding energy increases with the significance of the level of selection of the latter in endogenous genes, reinforcing the notion that selection for folding strength in different parts of the ORF is related to translation regulation. Our measured protein abundance correlates notably(correlation up to r = 0.62 (p=0.0013)) with mean relative codon decoding times, based on ribosomal densities (Ribo-Seq) in endogenous genes, supporting the conjecture that translation elongation and adaptation to the tRNA pool can modify protein levels in a causal/direct manner. This report provides an improved understanding of transcript evolution, design principles of gene expression regulation, and suggests simple rules for engineering synthetic gene expression in eukaryotes. PMID:26176266
Rationally designed, heterologous S. cerevisiae transcripts expose novel expression determinants.

PubMed

Ben-Yehezkel, Tuval; Atar, Shimshi; Zur, Hadas; Diament, Alon; Goz, Eli; Marx, Tzipy; Cohen, Rafael; Dana, Alexandra; Feldman, Anna; Shapiro, Ehud; Tuller, Tamir

2015-01-01

Deducing generic causal relations between RNA transcript features and protein expression profiles from endogenous gene expression data remains a major unsolved problem in biology. The analysis of gene expression from heterologous genes contributes significantly to solving this problem, but has been heavily biased toward the study of the effect of 5' transcript regions and to prokaryotes. Here, we employ a synthetic biology driven approach that systematically differentiates the effect of different regions of the transcript on gene expression up to 240 nucleotides into the ORF. This enabled us to discover new causal effects between features in previously unexplored regions of transcripts, and gene expression in natural regimes. We rationally designed, constructed, and analyzed 383 gene variants of the viral HRSVgp04 gene ORF, with multiple synonymous mutations at key positions along the transcript in the eukaryote S. cerevisiae. Our results show that a few silent mutations at the 5'UTR can have a dramatic effect of up to 15 fold change on protein levels, and that even synonymous mutations in positions more than 120 nucleotides downstream from the ORF 5'end can modulate protein levels up to 160%-300%. We demonstrate that the correlation between protein levels and folding energy increases with the significance of the level of selection of the latter in endogenous genes, reinforcing the notion that selection for folding strength in different parts of the ORF is related to translation regulation. Our measured protein abundance correlates notably(correlation up to r = 0.62 (p=0.0013)) with mean relative codon decoding times, based on ribosomal densities (Ribo-Seq) in endogenous genes, supporting the conjecture that translation elongation and adaptation to the tRNA pool can modify protein levels in a causal/direct manner. This report provides an improved understanding of transcript evolution, design principles of gene expression regulation, and suggests simple rules for engineering synthetic gene expression in eukaryotes.
DDAH2 mRNA expression is inversely associated with some cardiovascular risk-related features in healthy young adults.

PubMed

Puchau, Blanca; Hermsdorff, Helen Hermana M; Zulet, M Angeles; Martínez, J Alfredo

2009-01-01

The purpose of this study was to evaluate whether the mRNA expression profiles of three genes (PRMT1, DDAH2 and NOS3) are related to ADMA metabolism and signalling, and the potential relationships with anthropometrical, biochemical, lifestyle and inflammatory indicators in healthy young adults. An emphasis on the putative effect of different mRNA expression on cardiovascular risk-related features was paid. Anthropometrical measurements as well as lifestyle features were analyzed in 120 healthy young adults. Fasting blood samples were collected for the measurement of glucose and lipid profiles as well as the concentrations of selected inflammatory markers. Profiles of mRNA expression were assessed for PRMT1, DDAH2 and NOS3 genes from peripheral blood mononuclear cells. Regarding inflammatory biomarkers, DDAH2 was inversely associated with IL-6 and TNF-alpha. Moreover, subjects in the highest quintile of DDAH2 mRNA expression showed a reduced risk to have higher values of waist circumference, and to be more prone to show higher values of HDL-c. Interestingly, DDAH2 gene expression seemed to be related with some anthropometrical, biochemical, lifestyle and inflammatory indicators linked to cardiovascular risk in apparently healthy young adults, emerging as a potential disease marker.
Seq-ing answers: uncovering the unexpected in global gene regulation.

PubMed

Otto, George Maxwell; Brar, Gloria Ann

2018-04-19

The development of techniques for measuring gene expression globally has greatly expanded our understanding of gene regulatory mechanisms in depth and scale. We can now quantify every intermediate and transition in the canonical pathway of gene expression-from DNA to mRNA to protein-genome-wide. Employing such measurements in parallel can produce rich datasets, but extracting the most information requires careful experimental design and analysis. Here, we argue for the value of genome-wide studies that measure multiple outputs of gene expression over many timepoints during the course of a natural developmental process. We discuss our findings from a highly parallel gene expression dataset of meiotic differentiation, and those of others, to illustrate how leveraging these features can provide new and surprising insight into fundamental mechanisms of gene regulation.
Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods.

PubMed

Tuo, Youlin; An, Ning; Zhang, Ming

2018-03-01

The aim of the present study was to investigate the feature genes in metastatic breast cancer samples. A total of 5 expression profiles of metastatic breast cancer samples were downloaded from the Gene Expression Omnibus database, which were then analyzed using the MetaQC and MetaDE packages in R language. The feature genes between metastasis and non‑metastasis samples were screened under the threshold of P<0.05. Based on the protein‑protein interactions (PPIs) in the Biological General Repository for Interaction Datasets, Human Protein Reference Database and Biomolecular Interaction Network Database, the PPI network of the feature genes was constructed. The feature genes identified by topological characteristics were then used for support vector machine (SVM) classifier training and verification. The accuracy of the SVM classifier was then evaluated using another independent dataset from The Cancer Genome Atlas database. Finally, function and pathway enrichment analyses for genes in the SVM classifier were performed. A total of 541 feature genes were identified between metastatic and non‑metastatic samples. The top 10 genes with the highest betweenness centrality values in the PPI network of feature genes were Nuclear RNA Export Factor 1, cyclin‑dependent kinase 2 (CDK2), myelocytomatosis proto‑oncogene protein (MYC), Cullin 5, SHC Adaptor Protein 1, Clathrin heavy chain, Nucleolin, WD repeat domain 1, proteasome 26S subunit non‑ATPase 2 and telomeric repeat binding factor 2. The cyclin‑dependent kinase inhibitor 1A (CDKN1A), E2F transcription factor 1 (E2F1), and MYC interacted with CDK2. The SVM classifier constructed by the top 30 feature genes was able to distinguish metastatic samples from non‑metastatic samples [correct rate, specificity, positive predictive value and negative predictive value >0.89; sensitivity >0.84; area under the receiver operating characteristic curve (AUROC) >0.96]. The verification of the SVM classifier in an independent dataset (35 metastatic samples and 143 non‑metastatic samples) revealed an accuracy of 94.38% and AUROC of 0.958. Cell cycle associated functions and pathways were the most significant terms of the 30 feature genes. A SVM classifier was constructed to assess the possibility of breast cancer metastasis, which presented high accuracy in several independent datasets. CDK2, CDKN1A, E2F1 and MYC were indicated as the potential feature genes in metastatic breast cancer.
Chromosome position effects on gene expression in Escherichia coli K-12

PubMed Central

Bryant, Jack A.; Sellars, Laura E.; Busby, Stephen J. W.; Lee, David J.

2014-01-01

In eukaryotes, the location of a gene on the chromosome is known to affect its expression, but such position effects are poorly understood in bacteria. Here, using Escherichia coli K-12, we demonstrate that expression of a reporter gene cassette, comprised of the model E. coli lac promoter driving expression of gfp, varies by ∼300-fold depending on its precise position on the chromosome. At some positions, expression was more than 3-fold higher than at the natural lac promoter locus, whereas at several other locations, the reporter cassette was completely silenced: effectively overriding local lac promoter control. These effects were not due to differences in gene copy number, caused by partially replicated genomes. Rather, the differences in gene expression occur predominantly at the level of transcription and are mediated by several different features that are involved in chromosome organization. Taken together, our findings identify a tier of gene regulation above local promoter control and highlight the importance of chromosome position effects on gene expression profiles in bacteria. PMID:25209233
Features of cues and processes during chloroplast-mediated retrograde signaling in the alga Chlamydomonas

USDA-ARS?s Scientific Manuscript database

Retrograde signalling is a selective process defined by cues generated in chloroplast/mitochondria which traverse membranes and end up regulating nuclear gene expression and protein synthesis. The coding and encoding of organellar message(s) that alter nuclear gene expression and/or cellular metabo...
HOXB9 Expression Correlates with Histological Grade and Prognosis in LSCC

PubMed Central

2017-01-01

The purpose of this study was to investigate the HOX gene expression profile in laryngeal squamous cell carcinoma (LSCC) and assess whether some genes are associated with the clinicopathological features and prognosis in LSCC patients. The HOX gene levels were tested by microarray and validated by qRT-PCR in paired cancerous and adjacent noncancerous LSCC tissue samples. The microarray testing data of 39 HOX genes revealed 15 HOX genes that were at least 2-fold upregulated and 2 that were downregulated. After qRT-PCR evaluation, the three most upregulated genes (HOXB9, HOXB13, and HOXD13) were selected for tissue microarray (TMA) analysis. The correlations between the HOXB9, HOXB13, and HOXD13 expression levels and both clinicopathological features and prognosis were analyzed. Three HOX gene expression levels were markedly increased in LSCC tissues compared with adjacent noncancerous tissues (P < 0.001). HOXB9 was found to correlate with histological grade (P < 0.01) and prognosis (P < 0.01) in LSCC. In conclusion, this study revealed that HOXB9, HOXB13, and HOXD13 were upregulated and may play important roles in LSCC. Moreover, HOXB9 may serve as a novel marker of poor prognosis and a potential therapeutic target in LSCC patients. PMID:28808656
Correct Hox gene expression established independently of position in Caenorhabditis elegans.

PubMed

Cowing, D; Kenyon, C

1996-07-25

The Hox genes are expressed in a conserved sequence of spatial domains along the anteroposterior (A/P) body axes of many organisms. In Drosophila, position-specific signals located along the A/P axis establish the pattern of Hox gene expression. In the nematode Caenorhabditis elegans, it is not known how the pattern of Hox gene expression is established. C. elegans uses lineal control mechanisms and local cell interactions to specify early blastomere identities. However, many cells expressing the same Hox gene are unrelated by lineage, suggesting that, as in Drosophila, domains of Hox gene expression may be defined by cell-extrinsic A/P positional signals. To test this, we have investigated whether posterior mesodermal and ectodermal cells will express their normal posterior Hox gene when they are mispositioned in the anterior. Surprisingly, we find that correct Hox gene expression does not depend on cell position, but is highly correlated with cell lineage. Thus, although the most striking feature of Hox gene expression is its positional specificity, in C. elegans the pattern is achieved, at least in part, by a lineage-specific control system that operates without regard to A/P position.

Developmental stage related patterns of codon usage and genomic GC content: searching for evolutionary fingerprints with models of stem cell differentiation

PubMed Central

2007-01-01

Background The usage of synonymous codons shows considerable variation among mammalian genes. How and why this usage is non-random are fundamental biological questions and remain controversial. It is also important to explore whether mammalian genes that are selectively expressed at different developmental stages bear different molecular features. Results In two models of mouse stem cell differentiation, we established correlations between codon usage and the patterns of gene expression. We found that the optimal codons exhibited variation (AT- or GC-ending codons) in different cell types within the developmental hierarchy. We also found that genes that were enriched (developmental-pivotal genes) or specifically expressed (developmental-specific genes) at different developmental stages had different patterns of codon usage and local genomic GC (GCg) content. Moreover, at the same developmental stage, developmental-specific genes generally used more GC-ending codons and had higher GCg content compared with developmental-pivotal genes. Further analyses suggest that the model of translational selection might be consistent with the developmental stage-related patterns of codon usage, especially for the AT-ending optimal codons. In addition, our data show that after human-mouse divergence, the influence of selective constraints is still detectable. Conclusion Our findings suggest that developmental stage-related patterns of gene expression are correlated with codon usage (GC3) and GCg content in stem cell hierarchies. Moreover, this paper provides evidence for the influence of natural selection at synonymous sites in the mouse genome and novel clues for linking the molecular features of genes to their patterns of expression during mammalian ontogenesis. PMID:17349061
Alterations in gene expression of proprotein convertases in human lung cancer have a limited number of scenarios.

PubMed

Demidyuk, Ilya V; Shubin, Andrey V; Gasanov, Eugene V; Kurinov, Alexander M; Demkin, Vladimir V; Vinogradova, Tatyana V; Zinovyeva, Marina V; Sass, Alexander V; Zborovskaya, Irina B; Kostrov, Sergey V

2013-01-01

Proprotein convertases (PCs) is a protein family which includes nine highly specific subtilisin-like serine endopeptidases in mammals. The system of PCs is involved in carcinogenesis and levels of PC mRNAs alter in cancer, which suggests expression status of PCs as a possible marker for cancer typing and prognosis. The goal of this work was to assess the information value of expression profiling of PC genes. Quantitative polymerase chain reaction was used for the first time to analyze mRNA levels of all PC genes as well as matrix metalloproteinase genes MMP2 and MMP14, which are substrates of PCs, in 30 matched pairs of samples of human lung cancer tumor and adjacent tissues without pathology. Significant changes in the expression of PCs have been revealed in tumor tissues: increased FURIN mRNA level (p<0.00005) and decreased mRNA levels of PCSK2 (p<0.007), PCSK5 (p<0.0002), PCSK7 (p<0.002), PCSK9 (p<0.00008), and MBTPS1 (p<0.00004) as well as a tendency to increase in the level of PCSK1 mRNA. Four distinct groups of samples have been identified by cluster analysis of the expression patterns of PC genes in tumor vs. normal tissue. Three of these groups covering 80% of samples feature a strong elevation in the expression of a single gene in cancer: FURIN, PCSK1, or PCSK6. Thus, the changes in the expression of PC genes have a limited number of scenarios, which may reflect different pathways of tumor development and cryptic features of tumors. This finding allows to consider the mRNAs of PC genes as potentially important tumor markers.
Structural features based genome-wide characterization and prediction of nucleosome organization

PubMed Central

2012-01-01

Background Nucleosome distribution along chromatin dictates genomic DNA accessibility and thus profoundly influences gene expression. However, the underlying mechanism of nucleosome formation remains elusive. Here, taking a structural perspective, we systematically explored nucleosome formation potential of genomic sequences and the effect on chromatin organization and gene expression in S. cerevisiae. Results We analyzed twelve structural features related to flexibility, curvature and energy of DNA sequences. The results showed that some structural features such as DNA denaturation, DNA-bending stiffness, Stacking energy, Z-DNA, Propeller twist and free energy, were highly correlated with in vitro and in vivo nucleosome occupancy. Specifically, they can be classified into two classes, one positively and the other negatively correlated with nucleosome occupancy. These two kinds of structural features facilitated nucleosome binding in centromere regions and repressed nucleosome formation in the promoter regions of protein-coding genes to mediate transcriptional regulation. Based on these analyses, we integrated all twelve structural features in a model to predict more accurately nucleosome occupancy in vivo than the existing methods that mainly depend on sequence compositional features. Furthermore, we developed a novel approach, named DLaNe, that located nucleosomes by detecting peaks of structural profiles, and built a meta predictor to integrate information from different structural features. As a comparison, we also constructed a hidden Markov model (HMM) to locate nucleosomes based on the profiles of these structural features. The result showed that the meta DLaNe and HMM-based method performed better than the existing methods, demonstrating the power of these structural features in predicting nucleosome positions. Conclusions Our analysis revealed that DNA structures significantly contribute to nucleosome organization and influence chromatin structure and gene expression regulation. The results indicated that our proposed methods are effective in predicting nucleosome occupancy and positions and that these structural features are highly predictive of nucleosome organization. The implementation of our DLaNe method based on structural features is available online. PMID:22449207
A new approach to enhance the performance of decision tree for classifying gene expression data.

PubMed

Hassan, Md; Kotagiri, Ramamohanarao

2013-12-20

Gene expression data classification is a challenging task due to the large dimensionality and very small number of samples. Decision tree is one of the popular machine learning approaches to address such classification problems. However, the existing decision tree algorithms use a single gene feature at each node to split the data into its child nodes and hence might suffer from poor performance specially when classifying gene expression dataset. By using a new decision tree algorithm where, each node of the tree consists of more than one gene, we enhance the classification performance of traditional decision tree classifiers. Our method selects suitable genes that are combined using a linear function to form a derived composite feature. To determine the structure of the tree we use the area under the Receiver Operating Characteristics curve (AUC). Experimental analysis demonstrates higher classification accuracy using the new decision tree compared to the other existing decision trees in literature. We experimentally compare the effect of our scheme against other well known decision tree techniques. Experiments show that our algorithm can substantially boost the classification performance of the decision tree.
Identification, characterization and expression analysis of lineage-specific genes within sweet orange (Citrus sinensis).

PubMed

Xu, Yuantao; Wu, Guizhi; Hao, Baohai; Chen, Lingling; Deng, Xiuxin; Xu, Qiang

2015-11-23

With the availability of rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Two set of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcriptomes. With the two sets of genes, gene structure and gene expression pattern were investigated. On average, both the CSGs and orphan genes have fewer exons, shorter gene length and higher GC content when compared with those evolutionarily conserved genes (ECs). Expression profiling indicated that most of the LSGs expressed in various tissues of sweet orange and some of them exhibited distinct temporal and spatial expression patterns. Particularly, the orphan genes were preferentially expressed in callus, which is an important pluripotent tissue of citrus. Besides, part of the CSGs and orphan genes expressed responsive to abiotic stress, indicating their potential functions during interaction with environment. This study identified and characterized two sets of LSGs in citrus, dissected their sequence features and expression patterns, and provided valuable clues for future functional analysis of the LSGs in sweet orange.
ABC gene expression profiles have clinical importance and possibly form a new hallmark of cancer.

PubMed

Dvorak, Pavel; Pesta, Martin; Soucek, Pavel

2017-05-01

Adenosine triphosphate-binding cassette proteins constitute a large family of active transporters through extracellular and intracellular membranes. Increased drug efflux based on adenosine triphosphate-binding cassette protein activity is related to the development of cancer cell chemoresistance. Several articles have focused on adenosine triphosphate-binding cassette gene expression profiles (signatures), based on the expression of all 49 human adenosine triphosphate-binding cassette genes, in individual tumor types and reported connections to established clinicopathological features. The aim of this study was to test our theory about the existence of adenosine triphosphate-binding cassette gene expression profiles common to multiple types of tumors, which may modify tumor progression and provide clinically relevant information. Such general adenosine triphosphate-binding cassette profiles could constitute a new attribute of carcinogenesis. Our combined cohort consisted of tissues from 151 cancer patients-breast, colorectal, and pancreatic carcinomas. Standard protocols for RNA isolation and quantitative real-time polymerase chain reaction were followed. Gene expression data from individual tumor types as well as a merged tumor dataset were analyzed by bioinformatics tools. Several general adenosine triphosphate-binding cassette profiles, with differences in gene functions, were established and shown to have significant relations to clinicopathological features such as tumor size, histological grade, or clinical stage. Genes ABCC7, A3, A8, A12, and C8 prevailed among the most upregulated or downregulated ones. In conclusion, the results supported our theory about general adenosine triphosphate-binding cassette gene expression profiles and their importance for cancer on clinical as well as research levels. The presence of ABCC7 (official symbol CFTR) among the genes with key roles in the profiles supports the emerging evidence about its crucial role in various cancers. Graphical abstract.
Widespread Enhancer Activity from Core Promoters.

PubMed

Medina-Rivera, Alejandra; Santiago-Algarra, David; Puthier, Denis; Spicuglia, Salvatore

2018-06-01

Gene expression in higher eukaryotes is precisely regulated in time and space through the interplay between promoters and gene-distal regulatory regions, known as enhancers. The original definition of enhancers implies the ability to activate gene expression remotely, while promoters entail the capability to locally induce gene expression. Despite the conventional distinction between them, promoters and enhancers share many genomic and epigenomic features. One intriguing finding in the gene regulation field comes from the observation that many core promoter regions display enhancer activity. Recent high-throughput reporter assays along with clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9-related approaches have indicated that this phenomenon is common and might have a strong impact on our global understanding of genome organisation and gene expression regulation. Copyright © 2018 Elsevier Ltd. All rights reserved.
MHC class I-associated peptides derive from selective regions of the human genome.

PubMed

Pearson, Hillary; Daouda, Tariq; Granados, Diana Paola; Durette, Chantal; Bonneil, Eric; Courcelles, Mathieu; Rodenbrock, Anja; Laverdure, Jean-Philippe; Côté, Caroline; Mader, Sylvie; Lemieux, Sébastien; Thibault, Pierre; Perreault, Claude

2016-12-01

MHC class I-associated peptides (MAPs) define the immune self for CD8+ T lymphocytes and are key targets of cancer immunosurveillance. Here, the goals of our work were to determine whether the entire set of protein-coding genes could generate MAPs and whether specific features influence the ability of discrete genes to generate MAPs. Using proteogenomics, we have identified 25,270 MAPs isolated from the B lymphocytes of 18 individuals who collectively expressed 27 high-frequency HLA-A,B allotypes. The entire MAP repertoire presented by these 27 allotypes covered only 10% of the exomic sequences expressed in B lymphocytes. Indeed, 41% of expressed protein-coding genes generated no MAPs, while 59% of genes generated up to 64 MAPs, often derived from adjacent regions and presented by different allotypes. We next identified several features of transcripts and proteins associated with efficient MAP production. From these data, we built a logistic regression model that predicts with good accuracy whether a gene generates MAPs. Our results show preferential selection of MAPs from a limited repertoire of proteins with distinctive features. The notion that the MHC class I immunopeptidome presents only a small fraction of the protein-coding genome for monitoring by the immune system has profound implications in autoimmunity and cancer immunology.
MHC class I–associated peptides derive from selective regions of the human genome

PubMed Central

Pearson, Hillary; Granados, Diana Paola; Durette, Chantal; Bonneil, Eric; Courcelles, Mathieu; Rodenbrock, Anja; Laverdure, Jean-Philippe; Côté, Caroline; Thibault, Pierre

2016-01-01

MHC class I–associated peptides (MAPs) define the immune self for CD8+ T lymphocytes and are key targets of cancer immunosurveillance. Here, the goals of our work were to determine whether the entire set of protein-coding genes could generate MAPs and whether specific features influence the ability of discrete genes to generate MAPs. Using proteogenomics, we have identified 25,270 MAPs isolated from the B lymphocytes of 18 individuals who collectively expressed 27 high-frequency HLA-A,B allotypes. The entire MAP repertoire presented by these 27 allotypes covered only 10% of the exomic sequences expressed in B lymphocytes. Indeed, 41% of expressed protein-coding genes generated no MAPs, while 59% of genes generated up to 64 MAPs, often derived from adjacent regions and presented by different allotypes. We next identified several features of transcripts and proteins associated with efficient MAP production. From these data, we built a logistic regression model that predicts with good accuracy whether a gene generates MAPs. Our results show preferential selection of MAPs from a limited repertoire of proteins with distinctive features. The notion that the MHC class I immunopeptidome presents only a small fraction of the protein-coding genome for monitoring by the immune system has profound implications in autoimmunity and cancer immunology. PMID:27841757
Short interspersed element (SINE) depletion and long interspersed element (LINE) abundance are not features universally required for imprinting.

PubMed

Cowley, Michael; de Burca, Anna; McCole, Ruth B; Chahal, Mandeep; Saadat, Ghazal; Oakey, Rebecca J; Schulz, Reiner

2011-04-20

Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.
Analysis of host response to bacterial infection using error model based gene expression microarray experiments

PubMed Central

Stekel, Dov J.; Sarti, Donatella; Trevino, Victor; Zhang, Lihong; Salmon, Mike; Buckley, Chris D.; Stevens, Mark; Pallen, Mark J.; Penn, Charles; Falciani, Francesco

2005-01-01

A key step in the analysis of microarray data is the selection of genes that are differentially expressed. Ideally, such experiments should be properly replicated in order to infer both technical and biological variability, and the data should be subjected to rigorous hypothesis tests to identify the differentially expressed genes. However, in microarray experiments involving the analysis of very large numbers of biological samples, replication is not always practical. Therefore, there is a need for a method to select differentially expressed genes in a rational way from insufficiently replicated data. In this paper, we describe a simple method that uses bootstrapping to generate an error model from a replicated pilot study that can be used to identify differentially expressed genes in subsequent large-scale studies on the same platform, but in which there may be no replicated arrays. The method builds a stratified error model that includes array-to-array variability, feature-to-feature variability and the dependence of error on signal intensity. We apply this model to the characterization of the host response in a model of bacterial infection of human intestinal epithelial cells. We demonstrate the effectiveness of error model based microarray experiments and propose this as a general strategy for a microarray-based screening of large collections of biological samples. PMID:15800204
Base composition and expression level of human genes.

PubMed

Arhondakis, Stilianos; Auletta, Fabio; Torelli, Giuseppe; D'Onofrio, Giuseppe

2004-01-21

It is well known that the gene distribution is non-uniform in the human genome, reaching the highest concentration in the GC-rich isochores. Also the amino acid frequencies, and the hydrophobicity, of the corresponding encoded proteins are affected by the high GC level of the genes localized in the GC-rich isochores. It was hypothesized that the gene expression level as well is higher in GC-rich compared to GC-poor isochores [Mol. Biol. Evol. 10 (1993) 186]. Several features of human genes and proteins, namely expression level, coding and non-coding lengths, and hydrophobicity were investigated in the present paper. The results support the hypothesis reported above, since all the parameters so far studied converge to the same conclusion, that the average expression level of the GC-rich genes is significantly higher than that of the GC-poor genes.
A high-resolution transcriptome map of cell cycle reveals novel connections between periodic genes and cancer

PubMed Central

Dominguez, Daniel; Tsai, Yi-Hsuan; Gomez, Nicholas; Jha, Deepak Kumar; Davis, Ian; Wang, Zefeng

2016-01-01

Progression through the cell cycle is largely dependent on waves of periodic gene expression, and the regulatory networks for these transcriptome dynamics have emerged as critical points of vulnerability in various aspects of tumor biology. Through RNA-sequencing of human cells during two continuous cell cycles (>2.3 billion paired reads), we identified over 1 000 mRNAs, non-coding RNAs and pseudogenes with periodic expression. Periodic transcripts are enriched in functions related to DNA metabolism, mitosis, and DNA damage response, indicating these genes likely represent putative cell cycle regulators. Using our set of periodic genes, we developed a new approach termed “mitotic trait” that can classify primary tumors and normal tissues by their transcriptome similarity to different cell cycle stages. By analyzing >4 000 tumor samples in The Cancer Genome Atlas (TCGA) and other expression data sets, we found that mitotic trait significantly correlates with genetic alterations, tumor subtype and, notably, patient survival. We further defined a core set of 67 genes with robust periodic expression in multiple cell types. Proteins encoded by these genes function as major hubs of protein-protein interaction and are mostly required for cell cycle progression. The core genes also have unique chromatin features including increased levels of CTCF/RAD21 binding and H3K36me3. Loss of these features in uterine and kidney cancers is associated with altered expression of the core 67 genes. Our study suggests new chromatin-associated mechanisms for periodic gene regulation and offers a predictor of cancer patient outcomes. PMID:27364684
NIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms

PubMed Central

Ruyssinck, Joeri; Huynh-Thu, Vân Anh; Geurts, Pierre; Dhaene, Tom; Demeester, Piet; Saeys, Yvan

2014-01-01

One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available. PMID:24667482
eQTL Mapping Using RNA-seq Data

PubMed Central

Hu, Yijuan

2012-01-01

As RNA-seq is replacing gene expression microarrays to assess genome-wide transcription abundance, gene expression Quantitative Trait Locus (eQTL) studies using RNA-seq have emerged. RNA-seq delivers two novel features that are important for eQTL studies. First, it provides information on allele-specific expression (ASE), which is not available from gene expression microarrays. Second, it generates unprecedentedly rich data to study RNA-isoform expression. In this paper, we review current methods for eQTL mapping using ASE and discuss some future directions. We also review existing works that use RNA-seq data to study RNA-isoform expression and we discuss the gaps between these works and isoform-specific eQTL mapping. PMID:23667399
Analysis of gene expression in a developmental context emphasizes distinct biological leitmotifs in human cancers

PubMed Central

Naxerova, Kamila; Bult, Carol J; Peaston, Anne; Fancher, Karen; Knowles, Barbara B; Kasif, Simon; Kohane, Isaac S

2008-01-01

Background In recent years, the molecular underpinnings of the long-observed resemblance between neoplastic and immature tissue have begun to emerge. Genome-wide transcriptional profiling has revealed similar gene expression signatures in several tumor types and early developmental stages of their tissue of origin. However, it remains unclear whether such a relationship is a universal feature of malignancy, whether heterogeneities exist in the developmental component of different tumor types and to which degree the resemblance between cancer and development is a tissue-specific phenomenon. Results We defined a developmental landscape by summarizing the main features of ten developmental time courses and projected gene expression from a variety of human tumor types onto this landscape. This comparison demonstrates a clear imprint of developmental gene expression in a wide range of tumors and with respect to different, even non-cognate developmental backgrounds. Our analysis reveals three classes of cancers with developmentally distinct transcriptional patterns. We characterize the biological processes dominating these classes and validate the class distinction with respect to a new time series of murine embryonic lung development. Finally, we identify a set of genes that are upregulated in most cancers and we show that this signature is active in early development. Conclusion This systematic and quantitative overview of the relationship between the neoplastic and developmental transcriptome spanning dozens of tissues provides a reliable outline of global trends in cancer gene expression, reveals potentially clinically relevant differences in the gene expression of different cancer types and represents a reference framework for interpretation of smaller-scale functional studies. PMID:18611264
Biological and Clinical Significance of MAD2L1 and BUB1, Genes Frequently Appearing in Expression Signatures for Breast Cancer Prognosis

PubMed Central

Wang, Zhanwei; Katsaros, Dionyssios; Shen, Yi; Fu, Yuanyuan; Canuto, Emilie Marion; Benedetto, Chiara; Lu, Lingeng; Chu, Wen-Ming; Risch, Harvey A.; Yu, Herbert

2015-01-01

To investigate the biologic relevance and clinical implication of genes involved in multiple gene expression signatures for breast cancer prognosis, we identified 16 published gene expression signatures, and selected two genes, MAD2L1 and BUB1. These genes appeared in 5 signatures and were involved in cell-cycle regulation. We analyzed the expression of these genes in relation to tumor features and disease outcomes. In vitro experiments were also performed in two breast cancer cell lines, MDA-MB-231 and MDA-MB-468, to assess cell proliferation, migration and invasion after knocking down the expression of these genes. High expression of these genes was found to be associated with aggressive tumors and poor disease-free survival of 203 breast cancer patients in our study, and the association with survival was confirmed in an online database consisting of 914 patients. In vitro experiments demonstrated that lowering the expression of these genes by siRNAs reduced tumor cell growth and inhibited cell migration and invasion. Our investigation suggests that MAD2L1 and BUB1 may play important roles in breast cancer progression, and measuring the expression of these genes may assist the prediction of breast cancer prognosis. PMID:26287798
Coupling between nucleotide excision repair and gene expression.

PubMed

Cambindo Botto, Adrián E; Muñoz, Juan C; Muñoz, Manuel J

2018-05-17

Gene expression and DNA repair are fundamental processes for life. During the last decade, accumulating experimental evidence point towards different modes of coupling between these processes. Here we discuss the molecular mechanisms by which RNAPII-dependent transcription affects repair by the Nucleotide Excision Repair system (NER) and how NER activity, through the generation of single stranded DNA intermediates and activation of the DNA damage response kinase ATR, drives gene expression in a genotoxic scenario. Since NER-dependent repair is compromised in Xeroderma Pigmentosum (XP) patients, and having in mind that these patients present a high degree of clinical heterogeneity, we speculate that some of the clinical features of XP patients can be explained by misregulation of gene expression.
Meta-Analysis of DNA Tumor-Viral Integration Site Selection Indicates a Role for Repeats, Gene Expression and Epigenetics

PubMed Central

Doolittle-Hall, Janet M.; Cunningham Glasspoole, Danielle L.; Seaman, William T.; Webster-Cyriaque, Jennifer

2015-01-01

Oncoviruses cause tremendous global cancer burden. For several DNA tumor viruses, human genome integration is consistently associated with cancer development. However, genomic features associated with tumor viral integration are poorly understood. We sought to define genomic determinants for 1897 loci prone to hosting human papillomavirus (HPV), hepatitis B virus (HBV) or Merkel cell polyomavirus (MCPyV). These were compared to HIV, whose enzyme-mediated integration is well understood. A comprehensive catalog of integration sites was constructed from the literature and experimentally-determined HPV integration sites. Features were scored in eight categories (genes, expression, open chromatin, histone modifications, methylation, protein binding, chromatin segmentation and repeats) and compared to random loci. Random forest models determined loci classification and feature selection. HPV and HBV integrants were not fragile site associated. MCPyV preferred integration near sensory perception genes. Unique signatures of integration-associated predictive genomic features were detected. Importantly, repeats, actively-transcribed regions and histone modifications were common tumor viral integration signatures. PMID:26569308
Altered Cytokine Gene Expression in Peripheral Blood Monocytes across the Menstrual Cycle in Primary Dysmenorrhea: A Case-Control Study

PubMed Central

Ma, Hongyue; Hong, Min; Duan, Jinao; Liu, Pei; Fan, Xinsheng; Shang, Erxin; Su, Shulan; Guo, Jianming; Qian, Dawei; Tang, Yuping

2013-01-01

Primary dysmenorrhea is one of the most common gynecological complaints in young women, but potential peripheral immunologic features underlying this condition remain undefined. In this paper, we compared 84 common cytokine gene expression profiles of peripheral blood mononuclear cells (PBMCs) from six primary dysmenorrheic young women and three unaffected controls on the seventh day before (secretory phase), and the first (menstrual phase) and the fifth (regenerative phase) days of menstruation, using a real-time PCR array assay combined with pattern recognition and gene function annotation methods. Comparisons between dysmenorrhea and normal control groups identified 11 (nine increased and two decreased), 14 (five increased and nine decreased), and 15 (seven increased and eight decreased) genes with ≥2-fold difference in expression (P<0.05) in the three phases of menstruation, respectively. In the menstrual phase, genes encoding pro-inflammatory cytokines (IL1B, TNF, IL6, and IL8) were up-regulated, and genes encoding TGF-β superfamily members (BMP4, BMP6, GDF5, GDF11, LEFTY2, NODAL, and MSTN) were down-regulated. Functional annotation revealed an excessive inflammatory response and insufficient TGF-β superfamily member signals with anti-inflammatory consequences, which may directly contribute to menstrual pain. In the secretory and regenerative phases, increased expression of pro-inflammatory cytokines and decreased expression of growth factors were also observed. These factors may be involved in the regulation of decidualization, endometrium breakdown and repair, and indirectly exacerbate primary dysmenorrhea. This first study of cytokine gene expression profiles in PBMCs from young primary dysmenorrheic women demonstrates a shift in the balance between expression patterns of pro-inflammatory cytokines and TGF-β superfamily members across the whole menstrual cycle, underlying the peripheral immunologic features of primary dysmenorrhea. PMID:23390521

Evaluation of correlation between CT image features and ERCC1 protein expression in assessing lung cancer prognosis

NASA Astrophysics Data System (ADS)

Tan, Maxine; Emaminejad, Nastaran; Qian, Wei; Sun, Shenshen; Kang, Yan; Guan, Yubao; Lure, Fleming; Zheng, Bin

2014-03-01

Stage I non-small-cell lung cancers (NSCLC) usually have favorable prognosis. However, high percentage of NSCLC patients have cancer relapse after surgery. Accurately predicting cancer prognosis is important to optimally treat and manage the patients to minimize the risk of cancer relapse. Studies have shown that an excision repair crosscomplementing 1 (ERCC1) gene was a potentially useful genetic biomarker to predict prognosis of NSCLC patients. Meanwhile, studies also found that chronic obstructive pulmonary disease (COPD) was highly associated with lung cancer prognosis. In this study, we investigated and evaluated the correlations between COPD image features and ERCC1 gene expression. A database involving 106 NSCLC patients was used. Each patient had a thoracic CT examination and ERCC1 genetic test. We applied a computer-aided detection scheme to segment and quantify COPD image features. A logistic regression method and a multilayer perceptron network were applied to analyze the correlation between the computed COPD image features and ERCC1 protein expression. A multilayer perceptron network (MPN) was also developed to test performance of using COPD-related image features to predict ERCC1 protein expression. A nine feature based logistic regression analysis showed the average COPD feature values in the low and high ERCC1 protein expression groups are significantly different (p < 0.01). Using a five-fold cross validation method, the MPN yielded an area under ROC curve (AUC = 0.669±0.053) in classifying between the low and high ERCC1 expression cases. The study indicates that CT phenotype features are associated with the genetic tests, which may provide supplementary information to help improve accuracy in assessing prognosis of NSCLC patients.
In situ Expression of Functional Genes Reveals Nitrogen Cycling at High Temperatures in Terrestrial Hydrothermal Systems

NASA Astrophysics Data System (ADS)

Loiacono, S. T.; Meyer-Dombard, D. R.

2011-12-01

An essential element for life, nitrogen occurs in all living organisms and is critical for the synthesis of amino acids, proteins, nucleic acids, and other forms of biomass. Thus, nitrogen cycling likely plays a vital role in microbial metabolic processes as well as nutrient availability. For microorganisms in "extreme" environments, this means developing adaptations that allow them to survive in harsh conditions and still perform the metabolisms essential to sustain life. Recent studies have screened biofilms and thermal sediments of Yellowstone National Park (YNP) thermal features for the presence of nifH genes, which code for a key enzyme in the nitrogen fixation process [1-4]. Furthermore, analysis of nitrogen isotopes in biofilms across a temperature and chemical gradient revealed that nitrogen fixation likely varies across the chemosynthetic/photosynthetic ecotone [5]. Although research has evaluated and confirmed the presence of nifH genes in various thermophilic microbial communities, the existence of a gene in the DNA of an organism does not verify its use. Instead, other methods, such as culturing, isotope tracer assays, and gene expression studies are required to provide direct evidence of biological nitrogen fixation. Culturing and isotope tracer approaches have successfully revealed high-temperature biological nitrogen fixation in both marine hydrothermal vent microbial communities [6] and in acidic, terrestrial hydrothermal sediment [3]. Transcriptomics-based techniques (using mRNA extracted from samples to confirm in situ expression of targeted genes) have been much more limited in number, and only a few studies have, to date, investigated in situ expression of the nifH gene in thermophilic microbial communities [2, 7]. This study explores the presence and expression of nifH genes in several features of the Lower Geyser Basin (LGB) of YNP. Nucleic acids from chemosynthetic and photosynthetic microbial communities were extracted and then amplified using (reverse-transcription) polymerase chain reaction to identify the presence and expression of nifH genes, and resultant (RT-)PCR product was cloned and sequenced. Results reveal high-temperature in situ expression of nifH in select LGB features [7] which is, to the authors' knowledge, the first direct evidence of nifH transcription in the chemosynthetic zones of terrestrial hydrothermal systems. Results also indicate the presence of novel nifH sequences and allow phylogenetic comparison of nifH genes along geochemical gradients within individual hot spring features and between various thermal features in the LGB. Collectively, these results provide evidence for microbial adaptations that have led to the ability to support basic metabolic processes under "extreme" conditions. [1] Hall et al., 2008. AEM 74: 4910-4922. [2] Steunou et al., 2008. The ISME Journal 2: 364-378. [3] Hamilton et al., 2011. Microb Ecol DOI 10.1007/s00248-011-9824-9. [4] Raymond et al., 2008. EOS Trans AGU. Abstract B14A-03. [5] Havig et al., 2010. J Geophys Res-Biogeo 116: G01005. [6] Mehta & Baross, 2006. Science 314: 1783-1786. [7] Loiacono et al., 2011. Submitted FEMS Microbiol Ecol.
Genomic Features That Predict Allelic Imbalance in Humans Suggest Patterns of Constraint on Gene Expression Variation

PubMed Central

Fédrigo, Olivier; Haygood, Ralph; Mukherjee, Sayan; Wray, Gregory A.

2009-01-01

Variation in gene expression is an important contributor to phenotypic diversity within and between species. Although this variation often has a genetic component, identification of the genetic variants driving this relationship remains challenging. In particular, measurements of gene expression usually do not reveal whether the genetic basis for any observed variation lies in cis or in trans to the gene, a distinction that has direct relevance to the physical location of the underlying genetic variant, and which may also impact its evolutionary trajectory. Allelic imbalance measurements identify cis-acting genetic effects by assaying the relative contribution of the two alleles of a cis-regulatory region to gene expression within individuals. Identification of patterns that predict commonly imbalanced genes could therefore serve as a useful tool and also shed light on the evolution of cis-regulatory variation itself. Here, we show that sequence motifs, polymorphism levels, and divergence levels around a gene can be used to predict commonly imbalanced genes in a human data set. Reduction of this feature set to four factors revealed that only one factor significantly differentiated between commonly imbalanced and nonimbalanced genes. We demonstrate that these results are consistent between the original data set and a second published data set in humans obtained using different technical and statistical methods. Finally, we show that variation in the single allelic imbalance-associated factor is partially explained by the density of genes in the region of a target gene (allelic imbalance is less probable for genes in gene-dense regions), and, to a lesser extent, the evenness of expression of the gene across tissues and the magnitude of negative selection on putative regulatory regions of the gene. These results suggest that the genomic distribution of functional cis-regulatory variants in the human genome is nonrandom, perhaps due to local differences in evolutionary constraint. PMID:19506001
Comprehensive Expression Map of Transcription Regulators in the Adult Zebrafish Telencephalon Reveals Distinct Neurogenic Niches

PubMed Central

Diotel, Nicolas; Rodriguez Viales, Rebecca; Armant, Olivier; März, Martin; Ferg, Marco; Rastegar, Sepand; Strähle, Uwe

2015-01-01

The zebrafish has become a model to study adult vertebrate neurogenesis. In particular, the adult telencephalon has been an intensely studied structure in the zebrafish brain. Differential expression of transcriptional regulators (TRs) is a key feature of development and tissue homeostasis. Here we report an expression map of 1,202 TR genes in the telencephalon of adult zebrafish. Our results are summarized in a database with search and clustering functions to identify genes expressed in particular regions of the telencephalon. We classified 562 genes into 13 distinct patterns, including genes expressed in the proliferative zone. The remaining 640 genes displayed unique and complex patterns of expression and could thus not be grouped into distinct classes. The neurogenic ventricular regions express overlapping but distinct sets of TR genes, suggesting regional differences in the neurogenic niches in the telencephalon. In summary, the small telencephalon of the zebrafish shows a remarkable complexity in TR gene expression. The adult zebrafish telencephalon has become a model to study neurogenesis. We established the expression pattern of more than 1200 transcription regulators (TR) in the adult telencephalon. The neurogenic regions express overlapping but distinct sets of TR genes suggesting regional differences in the neurogenic potential. J. Comp. Neurol. 523:1202–1221, 2015. © 2015 Wiley Periodicals, Inc. PMID:25556858
Comprehensive expression map of transcription regulators in the adult zebrafish telencephalon reveals distinct neurogenic niches.

PubMed

Diotel, Nicolas; Rodriguez Viales, Rebecca; Armant, Olivier; März, Martin; Ferg, Marco; Rastegar, Sepand; Strähle, Uwe

2015-06-01

The zebrafish has become a model to study adult vertebrate neurogenesis. In particular, the adult telencephalon has been an intensely studied structure in the zebrafish brain. Differential expression of transcriptional regulators (TRs) is a key feature of development and tissue homeostasis. Here we report an expression map of 1,202 TR genes in the telencephalon of adult zebrafish. Our results are summarized in a database with search and clustering functions to identify genes expressed in particular regions of the telencephalon. We classified 562 genes into 13 distinct patterns, including genes expressed in the proliferative zone. The remaining 640 genes displayed unique and complex patterns of expression and could thus not be grouped into distinct classes. The neurogenic ventricular regions express overlapping but distinct sets of TR genes, suggesting regional differences in the neurogenic niches in the telencephalon. In summary, the small telencephalon of the zebrafish shows a remarkable complexity in TR gene expression. The adult zebrafish telencephalon has become a model to study neurogenesis. We established the expression pattern of more than 1200 transcription regulators (TR) in the adult telencephalon. The neurogenic regions express overlapping but distinct sets of TR genes suggesting regional differences in the neurogenic potential. © 2015 Wiley Periodicals, Inc.
Prader-Willi Syndrome: Intellectual Abilities and Behavioural Features by Genetic Subtype

ERIC Educational Resources Information Center

Milner, Katja M.; Craig, Ellen E.; Thompson, Russell J.; Veltman, Marijcke W. M.; Simon Thomas, N.; Roberts, Sian; Bellamy, Margaret; Curran, Sarah R.; Sporikou, Caroline M. J.; Bolton, Patrick F.

2005-01-01

Background: Studies of chromosome 15 abnormality have implicated over-expression of paternally imprinted genes in the 15q11-13 region in the aetiology of autism. To test this hypothesis we compared individuals with Prader-Willi syndrome (PWS) due to uniparental disomy (UPD--where paternally imprinted genes are over-expressed) to individuals with…
Regulatory logic of pan-neuronal gene expression in C. elegans

PubMed Central

Stefanakis, Nikolaos; Carrera, Ines; Hobert, Oliver

2015-01-01

While neuronal cell types display an astounding degree of phenotypic diversity, most if not all neuron types share a core panel of terminal features. However, little is known about how pan-neuronal expression patterns are genetically programmed. Through an extensive analysis of the cis-regulatory control regions of a battery of pan-neuronal C.elegans genes, including genes involved in synaptic vesicle biology and neuropeptide signaling, we define a common organizational principle in the regulation of pan-neuronal genes in the form of a surprisingly complex array of seemingly redundant, parallel-acting cis-regulatory modules that direct expression to broad, overlapping domains throughout the nervous system. These parallel-acting cis-regulatory modules are responsive to a multitude of distinct trans-acting factors. Neuronal gene expression programs therefore fall into two fundamentally distinct classes. Neuron type-specific genes are generally controlled by discrete and non-redundantly acting regulatory inputs, while pan-neuronal gene expression is controlled by diverse, coincident and seemingly redundant regulatory inputs. PMID:26291158
Histone Deacetylase Inhibitor Induces the Expression of Select Epithelial Genes in Mouse Utricle Sensory Epithelia-Derived Progenitor Cells

PubMed Central

Wang, Jue

2014-01-01

Abstract Mouse utricle sensory epithelial cell–derived progenitor cells (MUCs), which have hair cell progenitor and mesenchymal features via epithelial-to-mesenchymal transition (EMT) as previously described, provide a potential approach for hair cell regeneration via cell transplantation. In this study, we treated MUCs with trichostatin A (TSA) to determine whether histone deacetylase inhibitor is able to stimulate the expression of epithelial genes in MUCs, an essential step for guiding mesenchymal-like MUCs to become sensory epithelial cells. After 72 h of TSA treatment, MUCs acquired epithelial-like features, which were indicated by increased expression of epithelial markers such as Cdh1, Krt18, and Dsp. Additionally, TSA decreased the expression of mesenchymal markers, including Zeb1, Zeb2, Snai1, and Snai2, and prosensory genes Lfng, Six1, and Dlx5. Moreover, the expression of the hair cell genes Atoh1 and Myo6 was increased in TSA-treated MUCs. We also observed significantly decreased expression of Hdac2 and Hdac3 in TSA-treated MUCs. However, no remarkable change was detected in protein expression using immunofluorescence, indicating that TSA-induced HDAC inhibition may contribute to the initial stage of the mesenchymal-to-epithelial phenotypic change. In the future, more work is needed to induce hair cell regeneration using inner ear tissue–derived progenitors to achieve an entire mesenchymal-to-epithelial transition. PMID:24945595
Regulation of gene expression by the BLM helicase correlates with the presence of G-quadruplex DNA motifs

PubMed Central

Nguyen, Giang Huong; Tang, Weiliang; Robles, Ana I.; Beyer, Richard P.; Gray, Lucas T.; Welsh, Judith A.; Schetter, Aaron J.; Kumamoto, Kensuke; Wang, Xin Wei; Hickson, Ian D.; Maizels, Nancy; Monnat, Raymond J.; Harris, Curtis C.

2014-01-01

Bloom syndrome is a rare autosomal recessive disorder characterized by genetic instability and cancer predisposition, and caused by mutations in the gene encoding the Bloom syndrome, RecQ helicase-like (BLM) protein. To determine whether altered gene expression might be responsible for pathological features of Bloom syndrome, we analyzed mRNA and microRNA (miRNA) expression in fibroblasts from individuals with Bloom syndrome and in BLM-depleted control fibroblasts. We identified mRNA and miRNA expression differences in Bloom syndrome patient and BLM-depleted cells. Differentially expressed mRNAs are connected with cell proliferation, survival, and molecular mechanisms of cancer, and differentially expressed miRNAs target genes involved in cancer and in immune function. These and additional altered functions or pathways may contribute to the proportional dwarfism, elevated cancer risk, immune dysfunction, and other features observed in Bloom syndrome individuals. BLM binds to G-quadruplex (G4) DNA, and G4 motifs were enriched at transcription start sites (TSS) and especially within first introns (false discovery rate ≤ 0.001) of differentially expressed mRNAs in Bloom syndrome compared with normal cells, suggesting that G-quadruplex structures formed at these motifs are physiologic targets for BLM. These results identify a network of mRNAs and miRNAs that may drive the pathogenesis of Bloom syndrome. PMID:24958861
Regulation of gene expression by the BLM helicase correlates with the presence of G-quadruplex DNA motifs.

PubMed

Nguyen, Giang Huong; Tang, Weiliang; Robles, Ana I; Beyer, Richard P; Gray, Lucas T; Welsh, Judith A; Schetter, Aaron J; Kumamoto, Kensuke; Wang, Xin Wei; Hickson, Ian D; Maizels, Nancy; Monnat, Raymond J; Harris, Curtis C

2014-07-08

Bloom syndrome is a rare autosomal recessive disorder characterized by genetic instability and cancer predisposition, and caused by mutations in the gene encoding the Bloom syndrome, RecQ helicase-like (BLM) protein. To determine whether altered gene expression might be responsible for pathological features of Bloom syndrome, we analyzed mRNA and microRNA (miRNA) expression in fibroblasts from individuals with Bloom syndrome and in BLM-depleted control fibroblasts. We identified mRNA and miRNA expression differences in Bloom syndrome patient and BLM-depleted cells. Differentially expressed mRNAs are connected with cell proliferation, survival, and molecular mechanisms of cancer, and differentially expressed miRNAs target genes involved in cancer and in immune function. These and additional altered functions or pathways may contribute to the proportional dwarfism, elevated cancer risk, immune dysfunction, and other features observed in Bloom syndrome individuals. BLM binds to G-quadruplex (G4) DNA, and G4 motifs were enriched at transcription start sites (TSS) and especially within first introns (false discovery rate ≤ 0.001) of differentially expressed mRNAs in Bloom syndrome compared with normal cells, suggesting that G-quadruplex structures formed at these motifs are physiologic targets for BLM. These results identify a network of mRNAs and miRNAs that may drive the pathogenesis of Bloom syndrome.
Amplified 7q21-22 gene MCM7 and its intronic miR-25 suppress COL1A2 associated genes to sustain intestinal gastric cancer features.

PubMed

Tamilzhalagan, Sembulingam; Rathinam, Dhanasekaran; Ganesan, Kumaresan

2017-06-01

Frequent amplification of 7q21-22 genomic region is known in gastric cancer. Multiple genes including SHFM1, MCM7, and COL1A2 were reported to be the potential cancer candidate genes of this 20 Mb amplicon. This amplicon has two polycistrionic miRNA clusters and in the present study, miR-106b-25 cluster located in intron-13 of MCM7 was identified to express in gastric tumors. Among the 7q21-22 candidate genes, SHFM1 and MCM7 are expressed in intestinal type gastric tumors, whereas COL1A2 is expressed in diffuse type gastric tumors. Across gastric tumors, miR-25 was identified to co-express with MCM7 and SHFM1. On the other hand, negative correlation was observed between miR-25 and COL1A2 expression. miR-25 originating from MCM7 was found capable of selectively targeting the adjacent gene COL1A2. Silencing of miR-25 was found capable of elevating the expression of COL1A2 and inhibiting E-cadherin expression, revealing the diffuse type gastric cancer suppressive role conferred by miR-25. miR-25 was also found to suppress p53, and activate c-Src revealing its intestinal type gastric cancer associated oncogenic functions. Genome-wide expression profiling upon miR-25 silencing reveals that miR-25 is capable of suppressing 40 genes which are co-expressed with COL1A2, involved in epithelial to mesenchymal transition and angiogenesis which are the typical diffuse type gastric cancer features. The results clearly demonstrate 7q21-22 amplification, MCM7, and its intronic miR-25 are the major molecular switches involved in the complex oncogenic circuits of gastric cancer. © 2017 Wiley Periodicals, Inc.
A study of metaheuristic algorithms for high dimensional feature selection on microarray data

NASA Astrophysics Data System (ADS)

Dankolo, Muhammad Nasiru; Radzi, Nor Haizan Mohamed; Sallehuddin, Roselina; Mustaffa, Noorfa Haszlinna

2017-11-01

Microarray systems enable experts to examine gene profile at molecular level using machine learning algorithms. It increases the potentials of classification and diagnosis of many diseases at gene expression level. Though, numerous difficulties may affect the efficiency of machine learning algorithms which includes vast number of genes features comprised in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection is necessary to be performed in the data pre-processing. Many feature selection algorithms are developed and applied on microarray which including the metaheuristic optimization algorithms. This paper discusses the application of the metaheuristics algorithms for feature selection in microarray dataset. This study reveals that, the algorithms have yield an interesting result with limited resources thereby saving computational expenses of machine learning algorithms.
Regulatory Features for Odorant Receptor Genes in the Mouse Genome.

PubMed

Degl'Innocenti, Andrea; D'Errico, Anna

2017-01-01

The odorant receptor genes, seven transmembrane receptor genes constituting the vastest mammalian gene multifamily, are expressed monogenically and monoallelicaly in each sensory neuron in the olfactory epithelium. This characteristic, often referred to as the one neuron-one receptor rule, is driven by mostly uncharacterized molecular dynamics, generally named odorant receptor gene choice . Much attention has been paid by the scientific community to the identification of sequences regulating the expression of odorant receptor genes within their loci , where related genes are usually arranged in genomic clusters. A number of studies identified transcription factor binding sites on odorant receptor promoter sequences. Similar binding sites were also found on a number of enhancers that regulate in cis their transcription, but have been proposed to form interchromosomal networks. Odorant receptor gene choice seems to occur via the local removal of strongly repressive epigenetic markings, put in place during the maturation of the sensory neuron on each odorant receptor locus . Here we review the fast-changing state of art for the study of regulatory features for odorant receptor genes.
Gene Profiling in Experimental Models of Eye Growth: Clues to Myopia Pathogenesis

PubMed Central

Stone, Richard A.; Khurana, Tejvir S.

2010-01-01

To understand the complex regulatory pathways that underlie the development of refractive errors, expression profiling has evaluated gene expression in ocular tissues of well-characterized experimental models that alter postnatal eye growth and induce refractive errors. Derived from a variety of platforms (e.g. differential display, spotted microarrays or Affymetrix GeneChips), gene expression patterns are now being identified in species that include chicken, mouse and primate. Reconciling available results is hindered by varied experimental designs and analytical/statistical features. Continued application of these methods offers promise to provide the much-needed mechanistic framework to develop therapies to normalize refractive development in children. PMID:20363242
Dynamic CRM occupancy reflects a temporal map of developmental progression.

PubMed

Wilczyński, Bartek; Furlong, Eileen E M

2010-06-22

Development is driven by tightly coordinated spatio-temporal patterns of gene expression, which are initiated through the action of transcription factors (TFs) binding to cis-regulatory modules (CRMs). Although many studies have investigated how spatial patterns arise, precise temporal control of gene expression is less well understood. Here, we show that dynamic changes in the timing of CRM occupancy is a prevalent feature common to all TFs examined in a developmental ChIP time course to date. CRMs exhibit complex binding patterns that cannot be explained by the sequence motifs or expression of the TFs themselves. The temporal changes in TF binding are highly correlated with dynamic patterns of target gene expression, which in turn reflect transitions in cellular function during different stages of development. Thus, it is not only the timing of a TF's expression, but also its temporal occupancy in refined time windows, which determines temporal gene expression. Systematic measurement of dynamic CRM occupancy may therefore serve as a powerful method to decode dynamic changes in gene expression driving developmental progression.
Mining featured biomarkers associated with prostatic carcinoma based on bioinformatics.

PubMed

Piao, Guanying; Wu, Jiarui

2013-11-01

To analyze the differentially expressed genes and identify featured biomarkers from prostatic carcinoma. The software "Significance Analysis of Microarray" (SAM) was used to identify the differentially coexpressed genes (DCGs). The DCGs existed in two datasets were analyzed by GO (Gene Ontology) functional annotation. A total of 389 DCGs were obtained. By GO analysis, we found these DCGs were closely related with the acinus development, TGF-β receptor and signal transduction pathways. Furthermore, five featured biomarkers were discovered by interaction analysis. These important signal pathways and oncogenes may provide potential therapeutic targets for prostatic carcinoma.
Identification and expression analyses of two genes encoding putative low-affinity nitrate transporters from Nicotiana plumbaginifolia.

PubMed

Fraisier, V; Dorbe, M F; Daniel-Vedele, F

2001-01-01

Higher plants have both high- and low-affinity nitrate uptake systems (HATS and LATS respectively). Here we report the isolation and characterization of two genes, NpNRT1.1 and NpNRT1.2, from Nicotiana plumbaginifolia whose structural features suggest that they both belong to the NRT1 gene family, which is involved in the LATS. Amino acid sequence alignment showed that the N. plumbaginifolia proteins have greater similarity to their corresponding tomato homologues than to each other. Genomic Southern blot analysis indicates that there are probably more than two members of this family in N. plumbaginifolia. Northern blot analysis shows that NpNRT1.2 expression is restricted strictly to roots, whereas NpNRT1.1, in addition to roots, is expressed at a basal level in all other plant organs. Likewise, differential expression in response to external treatments with various N sources was observed for these two genes: NpNRT1.1 can be considered as a constitutively expressed gene whereas NpNRT1.2 expression is dependent strictly on high nitrate concentrations. Finally, over-expression of a gene involved in the HATS does not lead to any modification of LATS gene expression.
Evolutionary Approach for Relative Gene Expression Algorithms

PubMed Central

Czajkowski, Marcin

2014-01-01

A Relative Expression Analysis (RXA) uses ordering relationships in a small collection of genes and is successfully applied to classiffication using microarray data. As checking all possible subsets of genes is computationally infeasible, the RXA algorithms require feature selection and multiple restrictive assumptions. Our main contribution is a specialized evolutionary algorithm (EA) for top-scoring pairs called EvoTSP which allows finding more advanced gene relations. We managed to unify the major variants of relative expression algorithms through EA and introduce weights to the top-scoring pairs. Experimental validation of EvoTSP on public available microarray datasets showed that the proposed solution significantly outperforms in terms of accuracy other relative expression algorithms and allows exploring much larger solution space. PMID:24790574
Disruption of an Evolutionarily Novel Synaptic Expression Pattern in Autism

PubMed Central

Jiang, Xi; Hu, Haiyang; Guijarro, Patricia; Mitchell, Amanda; Ely, John J.; Sherwood, Chet C.; Hof, Patrick R.; Qiu, Zilong; Pääbo, Svante; Akbarian, Schahram; Khaitovich, Philipp

2016-01-01

Cognitive defects in autism spectrum disorder (ASD) include socialization and communication: key behavioral capacities that separate humans from other species. Here, we analyze gene expression in the prefrontal cortex of 63 autism patients and control individuals, as well as 62 chimpanzees and macaques, from natal to adult age. We show that among all aberrant expression changes seen in ASD brains, a single aberrant expression pattern overrepresented in genes involved synaptic-related pathways is enriched in nucleotide variants linked to autism. Furthermore, only this pattern contains an excess of developmental expression features unique to humans, thus resulting in the disruption of human-specific developmental programs in autism. Several members of the early growth response (EGR) transcription factor family can be implicated in regulation of this aberrant developmental change. Our study draws a connection between the genetic risk architecture of autism and molecular features of cortical development unique to humans. PMID:27685936
Properties of genes essential for mouse development

PubMed Central

Kabir, Mitra; Barradas, Ana; Tzotzos, George T.; Hentges, Kathryn E.

2017-01-01

Essential genes are those that are critical for life. In the specific case of the mouse, they are the set of genes whose deletion means that a mouse is unable to survive after birth. As such, they are the key minimal set of genes needed for all the steps of development to produce an organism capable of life ex utero. We explored a wide range of sequence and functional features to characterise essential (lethal) and non-essential (viable) genes in mice. Experimental data curated manually identified 1301 essential genes and 3451 viable genes. Very many sequence features show highly significant differences between essential and viable mouse genes. Essential genes generally encode complex proteins, with multiple domains and many introns. These genes tend to be: long, highly expressed, old and evolutionarily conserved. These genes tend to encode ligases, transferases, phosphorylated proteins, intracellular proteins, nuclear proteins, and hubs in protein-protein interaction networks. They are involved with regulating protein-protein interactions, gene expression and metabolic processes, cell morphogenesis, cell division, cell proliferation, DNA replication, cell differentiation, DNA repair and transcription, cell differentiation and embryonic development. Viable genes tend to encode: membrane proteins or secreted proteins, and are associated with functions such as cellular communication, apoptosis, behaviour and immune response, as well as housekeeping and tissue specific functions. Viable genes are linked to transport, ion channels, signal transduction, calcium binding and lipid binding, consistent with their location in membranes and involvement with cell-cell communication. From the analysis of the composite features of essential and viable genes, we conclude that essential genes tend to be required for intracellular functions, and viable genes tend to be involved with extracellular functions and cell-cell communication. Knowledge of the features that are over-represented in essential genes allows for a deeper understanding of the functions and processes implemented during mammalian development. PMID:28562614

Analysis of the GRNs Inference by Using Tsallis Entropy and a Feature Selection Approach

NASA Astrophysics Data System (ADS)

Lopes, Fabrício M.; de Oliveira, Evaldo A.; Cesar, Roberto M.

An important problem in the bioinformatics field is to understand how genes are regulated and interact through gene networks. This knowledge can be helpful for many applications, such as disease treatment design and drugs creation purposes. For this reason, it is very important to uncover the functional relationship among genes and then to construct the gene regulatory network (GRN) from temporal expression data. However, this task usually involves data with a large number of variables and small number of observations. In this way, there is a strong motivation to use pattern recognition and dimensionality reduction approaches. In particular, feature selection is specially important in order to select the most important predictor genes that can explain some phenomena associated with the target genes. This work presents a first study about the sensibility of entropy methods regarding the entropy functional form, applied to the problem of topology recovery of GRNs. The generalized entropy proposed by Tsallis is used to study this sensibility. The inference process is based on a feature selection approach, which is applied to simulated temporal expression data generated by an artificial gene network (AGN) model. The inferred GRNs are validated in terms of global network measures. Some interesting conclusions can be drawn from the experimental results, as reported for the first time in the present paper.
Gene Expression Ratios Lead to Accurate and Translatable Predictors of DR5 Agonism across Multiple Tumor Lineages.

PubMed

Reddy, Anupama; Growney, Joseph D; Wilson, Nick S; Emery, Caroline M; Johnson, Jennifer A; Ward, Rebecca; Monaco, Kelli A; Korn, Joshua; Monahan, John E; Stump, Mark D; Mapa, Felipa A; Wilson, Christopher J; Steiger, Janine; Ledell, Jebediah; Rickles, Richard J; Myer, Vic E; Ettenberg, Seth A; Schlegel, Robert; Sellers, William R; Huet, Heather A; Lehár, Joseph

2015-01-01

Death Receptor 5 (DR5) agonists demonstrate anti-tumor activity in preclinical models but have yet to demonstrate robust clinical responses. A key limitation may be the lack of patient selection strategies to identify those most likely to respond to treatment. To overcome this limitation, we screened a DR5 agonist Nanobody across >600 cell lines representing 21 tumor lineages and assessed molecular features associated with response. High expression of DR5 and Casp8 were significantly associated with sensitivity, but their expression thresholds were difficult to translate due to low dynamic ranges. To address the translational challenge of establishing thresholds of gene expression, we developed a classifier based on ratios of genes that predicted response across lineages. The ratio classifier outperformed the DR5+Casp8 classifier, as well as standard approaches for feature selection and classification using genes, instead of ratios. This classifier was independently validated using 11 primary patient-derived pancreatic xenograft models showing perfect predictions as well as a striking linearity between prediction probability and anti-tumor response. A network analysis of the genes in the ratio classifier captured important biological relationships mediating drug response, specifically identifying key positive and negative regulators of DR5 mediated apoptosis, including DR5, CASP8, BID, cFLIP, XIAP and PEA15. Importantly, the ratio classifier shows translatability across gene expression platforms (from Affymetrix microarrays to RNA-seq) and across model systems (in vitro to in vivo). Our approach of using gene expression ratios presents a robust and novel method for constructing translatable biomarkers of compound response, which can also probe the underlying biology of treatment response.
Gene Expression Ratios Lead to Accurate and Translatable Predictors of DR5 Agonism across Multiple Tumor Lineages

PubMed Central

Reddy, Anupama; Growney, Joseph D.; Wilson, Nick S.; Emery, Caroline M.; Johnson, Jennifer A.; Ward, Rebecca; Monaco, Kelli A.; Korn, Joshua; Monahan, John E.; Stump, Mark D.; Mapa, Felipa A.; Wilson, Christopher J.; Steiger, Janine; Ledell, Jebediah; Rickles, Richard J.; Myer, Vic E.; Ettenberg, Seth A.; Schlegel, Robert; Sellers, William R.

2015-01-01

Death Receptor 5 (DR5) agonists demonstrate anti-tumor activity in preclinical models but have yet to demonstrate robust clinical responses. A key limitation may be the lack of patient selection strategies to identify those most likely to respond to treatment. To overcome this limitation, we screened a DR5 agonist Nanobody across >600 cell lines representing 21 tumor lineages and assessed molecular features associated with response. High expression of DR5 and Casp8 were significantly associated with sensitivity, but their expression thresholds were difficult to translate due to low dynamic ranges. To address the translational challenge of establishing thresholds of gene expression, we developed a classifier based on ratios of genes that predicted response across lineages. The ratio classifier outperformed the DR5+Casp8 classifier, as well as standard approaches for feature selection and classification using genes, instead of ratios. This classifier was independently validated using 11 primary patient-derived pancreatic xenograft models showing perfect predictions as well as a striking linearity between prediction probability and anti-tumor response. A network analysis of the genes in the ratio classifier captured important biological relationships mediating drug response, specifically identifying key positive and negative regulators of DR5 mediated apoptosis, including DR5, CASP8, BID, cFLIP, XIAP and PEA15. Importantly, the ratio classifier shows translatability across gene expression platforms (from Affymetrix microarrays to RNA-seq) and across model systems (in vitro to in vivo). Our approach of using gene expression ratios presents a robust and novel method for constructing translatable biomarkers of compound response, which can also probe the underlying biology of treatment response. PMID:26378449
Heterogeneous gene expression signatures correspond to distinct lung pathologies and biomarkers of disease severity in idiopathic pulmonary fibrosis.

PubMed

DePianto, Daryle J; Chandriani, Sanjay; Abbas, Alexander R; Jia, Guiquan; N'Diaye, Elsa N; Caplazi, Patrick; Kauder, Steven E; Biswas, Sabyasachi; Karnik, Satyajit K; Ha, Connie; Modrusan, Zora; Matthay, Michael A; Kukreja, Jasleen; Collard, Harold R; Egen, Jackson G; Wolters, Paul J; Arron, Joseph R

2015-01-01

There is microscopic spatial and temporal heterogeneity of pathological changes in idiopathic pulmonary fibrosis (IPF) lung tissue, which may relate to heterogeneity in pathophysiological mediators of disease and clinical progression. We assessed relationships between gene expression patterns, pathological features, and systemic biomarkers to identify biomarkers that reflect the aggregate disease burden in patients with IPF. Gene expression microarrays (N=40 IPF; 8 controls) and immunohistochemical analyses (N=22 IPF; 8 controls) of lung biopsies. Clinical characterisation and blood biomarker levels of MMP3 and CXCL13 in a separate cohort of patients with IPF (N=80). 2940 genes were significantly differentially expressed between IPF and control samples (|fold change| >1.5, p<0.05). Two clusters of co-regulated genes related to bronchiolar epithelium or lymphoid aggregates exhibited substantial heterogeneity within the IPF population. Gene expression in bronchiolar and lymphoid clusters corresponded to the extent of bronchiolisation and lymphoid aggregates determined by immunohistochemistry in adjacent tissue sections. Elevated serum levels of MMP3, encoded in the bronchiolar cluster, and CXCL13, encoded in the lymphoid cluster, corresponded to disease severity and shortened survival time (p<10(-7) for MMP3 and p<10(-5) for CXCL13; Cox proportional hazards model). Microscopic pathological heterogeneity in IPF lung tissue corresponds to specific gene expression patterns related to bronchiolisation and lymphoid aggregates. MMP3 and CXCL13 are systemic biomarkers that reflect the aggregate burden of these pathological features across total lung tissue. These biomarkers may have clinical utility as prognostic and/or surrogate biomarkers of disease activity in interventional studies in IPF. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Expression Atlas: gene and protein expression across multiple studies and organisms

PubMed Central

Tang, Y Amy; Bazant, Wojciech; Burke, Melissa; Fuentes, Alfonso Muñoz-Pomer; George, Nancy; Koskinen, Satu; Mohammed, Suhaib; Geniza, Matthew; Preece, Justin; Jarnuczak, Andrew F; Huber, Wolfgang; Stegle, Oliver; Brazma, Alvis; Petryszak, Robert

2018-01-01

Abstract Expression Atlas (http://www.ebi.ac.uk/gxa) is an added value database that provides information about gene and protein expression in different species and contexts, such as tissue, developmental stage, disease or cell type. The available public and controlled access data sets from different sources are curated and re-analysed using standardized, open source pipelines and made available for queries, download and visualization. As of August 2017, Expression Atlas holds data from 3,126 studies across 33 different species, including 731 from plants. Data from large-scale RNA sequencing studies including Blueprint, PCAWG, ENCODE, GTEx and HipSci can be visualized next to each other. In Expression Atlas, users can query genes or gene-sets of interest and explore their expression across or within species, tissues, developmental stages in a constitutive or differential context, representing the effects of diseases, conditions or experimental interventions. All processed data matrices are available for direct download in tab-delimited format or as R-data. In addition to the web interface, data sets can now be searched and downloaded through the Expression Atlas R package. Novel features and visualizations include the on-the-fly analysis of gene set overlaps and the option to view gene co-expression in experiments investigating constitutive gene expression across tissues or other conditions. PMID:29165655
ReliefSeq: A Gene-Wise Adaptive-K Nearest-Neighbor Feature Selection Tool for Finding Gene-Gene Interactions and Main Effects in mRNA-Seq Gene Expression Data

PubMed Central

McKinney, Brett A.; White, Bill C.; Grill, Diane E.; Li, Peter W.; Kennedy, Richard B.; Poland, Gregory A.; Oberg, Ann L.

2013-01-01

Relief-F is a nonparametric, nearest-neighbor machine learning method that has been successfully used to identify relevant variables that may interact in complex multivariate models to explain phenotypic variation. While several tools have been developed for assessing differential expression in sequence-based transcriptomics, the detection of statistical interactions between transcripts has received less attention in the area of RNA-seq analysis. We describe a new extension and assessment of Relief-F for feature selection in RNA-seq data. The ReliefSeq implementation adapts the number of nearest neighbors (k) for each gene to optimize the Relief-F test statistics (importance scores) for finding both main effects and interactions. We compare this gene-wise adaptive-k (gwak) Relief-F method with standard RNA-seq feature selection tools, such as DESeq and edgeR, and with the popular machine learning method Random Forests. We demonstrate performance on a panel of simulated data that have a range of distributional properties reflected in real mRNA-seq data including multiple transcripts with varying sizes of main effects and interaction effects. For simulated main effects, gwak-Relief-F feature selection performs comparably to standard tools DESeq and edgeR for ranking relevant transcripts. For gene-gene interactions, gwak-Relief-F outperforms all comparison methods at ranking relevant genes in all but the highest fold change/highest signal situations where it performs similarly. The gwak-Relief-F algorithm outperforms Random Forests for detecting relevant genes in all simulation experiments. In addition, Relief-F is comparable to the other methods based on computational time. We also apply ReliefSeq to an RNA-Seq study of smallpox vaccine to identify gene expression changes between vaccinia virus-stimulated and unstimulated samples. ReliefSeq is an attractive tool for inclusion in the suite of tools used for analysis of mRNA-Seq data; it has power to detect both main effects and interaction effects. Software Availability: http://insilico.utulsa.edu/ReliefSeq.php. PMID:24339943
DNA microarray analysis of Methanosarcina mazei Gö1 reveals adaptation to different methanogenic substrates.

PubMed

Hovey, Raymond; Lentes, Sabine; Ehrenreich, Armin; Salmon, Kirsty; Saba, Karla; Gottschalk, Gerhard; Gunsalus, Robert P; Deppenmeier, Uwe

2005-05-01

Methansarcina mazei Gö1 DNA arrays were constructed and used to evaluate the genomic expression patterns of cells grown on either of two alternative methanogenic substrates, acetate or methanol, as sole carbon and energy source. Analysis of differential transcription across the genome revealed two functionally grouped sets of genes that parallel the central biochemical pathways in, and reflect many known features of, acetate and methanol metabolism. These include the acetate-induced genes encoding acetate activating enzymes, acetyl-CoA synthase/CO dehydrogenase, and carbonic anhydrase. Interestingly, additional genes expressed at significantly higher levels during growth on acetate included two energy-conserving complexes (the Ech hydrogenase, and the A1A0-type ATP synthase). Many previously unknown features included the induction by acetate of genes coding for ferredoxins and flavoproteins, an aldehyde:ferredoxin oxidoreductase, enzymes for the synthesis of aromatic amino acids, and components of iron, cobalt and oligopeptide uptake systems. In contrast, methanol-grown cells exhibited elevated expression of genes assigned to the methylotrophic pathway of methanogenesis. Expression of genes for components of the translation apparatus was also elevated in cells grown in the methanol medium relative to acetate, and was correlated with the faster growth rate observed on the former substrate. These experiments provide the first comprehensive insight into substrate-dependent gene expression in a methanogenic archaeon. This genome-wide approach, coupled with the complementary molecular and biochemical tools, should greatly accelerate the exploration of Methanosarcina cell physiology, given the present modest level of our knowledge of these large archaeal genomes.
minepath.org: a free interactive pathway analysis web server.

PubMed

Koumakis, Lefteris; Roussos, Panos; Potamias, George

2017-07-03

( www.minepath.org ) is a web-based platform that elaborates on, and radically extends the identification of differentially expressed sub-paths in molecular pathways. Besides the network topology, the underlying MinePath algorithmic processes exploit exact gene-gene molecular relationships (e.g. activation, inhibition) and are able to identify differentially expressed pathway parts. Each pathway is decomposed into all its constituent sub-paths, which in turn are matched with corresponding gene expression profiles. The highly ranked, and phenotype inclined sub-paths are kept. Apart from the pathway analysis algorithm, the fundamental innovation of the MinePath web-server concerns its advanced visualization and interactive capabilities. To our knowledge, this is the first pathway analysis server that introduces and offers visualization of the underlying and active pathway regulatory mechanisms instead of genes. Other features include live interaction, immediate visualization of functional sub-paths per phenotype and dynamic linked annotations for the engaged genes and molecular relations. The user can download not only the results but also the corresponding web viewer framework of the performed analysis. This feature provides the flexibility to immediately publish results without publishing source/expression data, and get all the functionality of a web based pathway analysis viewer. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Base excision repair imbalance in colorectal cancer has prognostic value and modulates response to chemotherapy

PubMed Central

Leguisamo, Natalia M.; Gloria, Helena C.; Kalil, Antonio N.; Martins, Talita V.; Azambuja, Daniel B.

2017-01-01

Colorectal cancer (CRC) is prevalent worldwide, and treatment often involves surgery and genotoxic chemotherapy. DNA repair mechanisms, such as base excision repair (BER) and mismatch repair (MMR), may not only influence tumour characteristics and prognosis but also dictate chemotherapy response. Defective MMR contributes to chemoresistance in colorectal cancer. Moreover, BER affects cellular survival by repairing genotoxic base damage in a process that itself can disrupt metabolism. In this study, we characterized BER and MMR gene expression in colorectal tumours and the association between this repair profile with patients’ clinical and pathological features. In addition, we exploited the possible mechanisms underlying the association between altered DNA repair, metabolism and response to chemotherapy. Seventy pairs of sporadic colorectal tumour samples and adjacent non-tumour mucosal specimens were assessed for BER and MMR gene and protein expression and their association with pathological and clinical features. MMR-deficient colon cancer cells (HCT116) transiently overexpressing MPG or XRCC1 were treated with 5-FU or TMZ and evaluated for viability and metabolic intermediate levels. Increase in BER gene and protein expression is associated with more aggressive tumour features and poor pathological outcomes in CRC. However, tumours with reduced MMR gene expression also displayed low MPG, OGG1 and PARP1 expression. Imbalancing BER by overexpression of MPG, but not XRCC1, sensitises MMR-deficient colon cancer cells to 5-FU and TMZ and leads to ATP depletion and lactate accumulation. MPG overexpression alters DNA repair and metabolism and is a potential strategy to overcome 5-FU chemotherapeutic resistance in MMR-deficient CRC. PMID:28903334
Arabidopsis female gametophyte gene expression map reveals similarities between plant and animal gametes.

PubMed

Wuest, Samuel E; Vijverberg, Kitty; Schmidt, Anja; Weiss, Manuel; Gheyselinck, Jacqueline; Lohr, Miriam; Wellmer, Frank; Rahnenführer, Jörg; von Mering, Christian; Grossniklaus, Ueli

2010-03-23

The development of multicellular organisms is controlled by differential gene expression whereby cells adopt distinct fates. A spatially resolved view of gene expression allows the elucidation of transcriptional networks that are linked to cellular identity and function. The haploid female gametophyte of flowering plants is a highly reduced organism: at maturity, it often consists of as few as three cell types derived from a common precursor [1, 2]. However, because of its inaccessibility and small size, we know little about the molecular basis of cell specification and differentiation in the female gametophyte. Here we report expression profiles of all cell types in the mature Arabidopsis female gametophyte. Differentially expressed posttranscriptional regulatory modules and metabolic pathways characterize the distinct cell types. Several transcription factor families are overrepresented in the female gametophyte in comparison to other plant tissues, e.g., type I MADS domain, RWP-RK, and reproductive meristem transcription factors. PAZ/Piwi-domain encoding genes are upregulated in the egg, indicating a role of epigenetic regulation through small RNA pathways-a feature paralleled in the germline of animals [3]. A comparison of human and Arabidopsis egg cells for enrichment of functional groups identified several similarities that may represent a consequence of coevolution or ancestral gametic features. 2010 Elsevier Ltd. All rights reserved.
RNA-Seq Analysis of Abdominal Fat in Genetically Fat and Lean Chickens Highlights a Divergence in Expression of Genes Controlling Adiposity, Hemostasis, and Lipid Metabolism

PubMed Central

Resnyk, Christopher W.; Chen, Chuming; Huang, Hongzhan; Wu, Cathy H.; Simon, Jean; Le Bihan-Duval, Elisabeth; Duclos, Michel J.; Cogburn, Larry A.

2015-01-01

Genetic selection for enhanced growth rate in meat-type chickens (Gallus domesticus) is usually accompanied by excessive adiposity, which has negative impacts on both feed efficiency and carcass quality. Enhanced visceral fatness and several unique features of avian metabolism (i.e., fasting hyperglycemia and insulin insensitivity) mimic overt symptoms of obesity and related metabolic disorders in humans. Elucidation of the genetic and endocrine factors that contribute to excessive visceral fatness in chickens could also advance our understanding of human metabolic diseases. Here, RNA sequencing was used to examine differential gene expression in abdominal fat of genetically fat and lean chickens, which exhibit a 2.8-fold divergence in visceral fatness at 7 wk. Ingenuity Pathway Analysis revealed that many of 1687 differentially expressed genes are associated with hemostasis, endocrine function and metabolic syndrome in mammals. Among the highest expressed genes in abdominal fat, across both genotypes, were 25 differentially expressed genes associated with de novo synthesis and metabolism of lipids. Over-expression of numerous adipogenic and lipogenic genes in the FL chickens suggests that in situ lipogenesis in chickens could make a more substantial contribution to expansion of visceral fat mass than previously recognized. Distinguishing features of the abdominal fat transcriptome in lean chickens were high abundance of multiple hemostatic and vasoactive factors, transporters, and ectopic expression of several hormones/receptors, which could control local vasomotor tone and proteolytic processing of adipokines, hemostatic factors and novel endocrine factors. Over-expression of several thrombogenic genes in abdominal fat of lean chickens is quite opposite to the pro-thrombotic state found in obese humans. Clearly, divergent genetic selection for an extreme (2.5–2.8-fold) difference in visceral fatness provokes a number of novel regulatory responses that govern growth and metabolism of visceral fat in this unique avian model of juvenile-onset obesity and glucose-insulin imbalance. PMID:26445145
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders.

PubMed

Forero, Diego A; Prada, Carlos F; Perry, George

2016-01-01

In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD.
Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders

PubMed Central

Forero, Diego A.; Prada, Carlos F.; Perry, George

2016-01-01

Background: In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. Objective: To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. Methods: A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. Results: We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. Conclusion: These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD. PMID:27990183
Genetic biomarkers for brain hemisphere differentiation in Parkinson's Disease

NASA Astrophysics Data System (ADS)

Hourani, Mou'ath; Mendes, Alexandre; Berretta, Regina; Moscato, Pablo

2007-11-01

This work presents a study on the genetic profile of the left and right hemispheres of the brain of a mouse model of Parkinson's disease (PD). The goal is to characterize, in a genetic basis, PD as a disease that affects these two brain regions in different ways. Using the same whole-genome microarray expression data introduced by Brown et al. (2002) [1], we could find significant differences in the expression of some key genes, well-known to be involved in the mechanisms of dopamine production control and PD. The problem of selecting such genes was modeled as the MIN (α,β)—FEATURE SET problem [2]; a similar approach to that employed previously to find biomarkers for different types of cancer using gene expression microarray data [3]. The Feature Selection method produced a series of genetic signatures for PD, with distinct expression profiles in the Parkinson's model and control mice experiments. In addition, a close examination of the genes composing those signatures shows that many of them belong to genetic pathways or have ontology annotations considered to be involved in the onset and development of PD. Such elements could provide new clues on which mechanisms are implicated in hemisphere differentiation in PD.
A deep auto-encoder model for gene expression prediction.

PubMed

Xie, Rui; Wen, Jia; Quitadamo, Andrew; Cheng, Jianlin; Shi, Xinghua

2017-11-17

Gene expression is a key intermediate level that genotypes lead to a particular trait. Gene expression is affected by various factors including genotypes of genetic variants. With an aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how good genetic variants will contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance. To demonstrate the usage of this model, we apply MLP-SAE to a real genomic datasets with genotypes and gene expression profiles measured in yeast. Our results show that the MLP-SAE model with dropout outperforms other models including Lasso, Random Forests and the MLP-SAE model without dropout. Using the MLP-SAE model with dropout, we show that gene expression quantifications predicted by the model solely based on genotypes, align well with true gene expression patterns. We provide a deep auto-encoder model for predicting gene expression from SNP genotypes. This study demonstrates that deep learning is appropriate for tackling another genomic problem, i.e., building predictive models to understand genotypes' contribution to gene expression. With the emerging availability of richer genomic data, we anticipate that deep learning models play a bigger role in modeling and interpreting genomics.
Genome-wide characterization of mammalian promoters with distal enhancer functions.

PubMed

Dao, Lan T M; Galindo-Albarrán, Ariel O; Castro-Mondragon, Jaime A; Andrieu-Soler, Charlotte; Medina-Rivera, Alejandra; Souaid, Charbel; Charbonnier, Guillaume; Griffon, Aurélien; Vanhille, Laurent; Stephen, Tharshana; Alomairi, Jaafar; Martin, David; Torres, Magali; Fernandez, Nicolas; Soler, Eric; van Helden, Jacques; Puthier, Denis; Spicuglia, Salvatore

2017-07-01

Gene expression in mammals is precisely regulated by the combination of promoters and gene-distal regulatory regions, known as enhancers. Several studies have suggested that some promoters might have enhancer functions. However, the extent of this type of promoters and whether they actually function to regulate the expression of distal genes have remained elusive. Here, by exploiting a high-throughput enhancer reporter assay, we unravel a set of mammalian promoters displaying enhancer activity. These promoters have distinct genomic and epigenomic features and frequently interact with other gene promoters. Extensive CRISPR-Cas9 genomic manipulation demonstrated the involvement of these promoters in the cis regulation of expression of distal genes in their natural loci. Our results have important implications for the understanding of complex gene regulation in normal development and disease.
Genome-wide Hi-C analysis reveals extensive hierarchical chromatin interactions in rice.

PubMed

Dong, Qianli; Li, Ning; Li, Xiaochong; Yuan, Zan; Xie, Dejian; Wang, Xiaofei; Li, Jianing; Yu, Yanan; Wang, Jinbin; Ding, Baoxu; Zhang, Zhibin; Li, Changping; Bian, Yao; Zhang, Ai; Wu, Ying; Liu, Bao; Gong, Lei

2018-06-01

The non-random spatial packing of chromosomes in the nucleus plays a critical role in orchestrating gene expression and genome function. Here, we present a Hi-C analysis of the chromatin interaction patterns in rice (Oryza sativa L.) at hierarchical architectural levels. We confirm that rice chromosomes occupy their own territories with certain preferential inter-chromosomal associations. Moderate compartment delimitation and extensive TADs (Topologically Associated Domains) were determined to be associated with heterogeneous genomic compositions and epigenetic marks in the rice genome. We found subtle features including chromatin loops, gene loops, and off-/near-diagonal intensive interaction regions. Gene chromatin loops associated with H3K27me3 could be positively involved in gene expression. In addition to insulated enhancing effects for neighbor gene expression, the identified rice gene loops could bi-directionally (+/-) affect the expression of looped genes themselves. Finally, web-interleaved off-diagonal IHIs/KEEs (Interactive Heterochromatic Islands or KNOT ENGAGED ELEMENTs) could trap transposable elements (TEs) via the enrichment of silencing epigenetic marks. In parallel, the near-diagonal FIREs (Frequently Interacting Regions) could positively affect the expression of involved genes. Our results suggest that the chromatin packing pattern in rice is generally similar to that in Arabidopsis thaliana but with clear differences at specific structural levels. We conclude that genomic composition, epigenetic modification, and transcriptional activity could act in combination to shape global and local chromatin packing in rice. Our results confirm recent observations in rice and A. thaliana but also provide additional insights into the patterns and features of chromatin organization in higher plants. © 2018 The Authors. The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.
Gene expression profiles of breast biopsies from healthy women identify a group with claudin-low features.

PubMed

Haakensen, Vilde D; Lingjaerde, Ole Christian; Lüders, Torben; Riis, Margit; Prat, Aleix; Troester, Melissa A; Holmen, Marit M; Frantzen, Jan Ole; Romundstad, Linda; Navjord, Dina; Bukholm, Ida K; Johannesen, Tom B; Perou, Charles M; Ursin, Giske; Kristensen, Vessela N; Børresen-Dale, Anne-Lise; Helland, Aslaug

2011-11-01

Increased understanding of the variability in normal breast biology will enable us to identify mechanisms of breast cancer initiation and the origin of different subtypes, and to better predict breast cancer risk. Gene expression patterns in breast biopsies from 79 healthy women referred to breast diagnostic centers in Norway were explored by unsupervised hierarchical clustering and supervised analyses, such as gene set enrichment analysis and gene ontology analysis and comparison with previously published genelists and independent datasets. Unsupervised hierarchical clustering identified two separate clusters of normal breast tissue based on gene-expression profiling, regardless of clustering algorithm and gene filtering used. Comparison of the expression profile of the two clusters with several published gene lists describing breast cells revealed that the samples in cluster 1 share characteristics with stromal cells and stem cells, and to a certain degree with mesenchymal cells and myoepithelial cells. The samples in cluster 1 also share many features with the newly identified claudin-low breast cancer intrinsic subtype, which also shows characteristics of stromal and stem cells. More women belonging to cluster 1 have a family history of breast cancer and there is a slight overrepresentation of nulliparous women in cluster 1. Similar findings were seen in a separate dataset consisting of histologically normal tissue from both breasts harboring breast cancer and from mammoplasty reductions. This is the first study to explore the variability of gene expression patterns in whole biopsies from normal breasts and identified distinct subtypes of normal breast tissue. Further studies are needed to determine the specific cell contribution to the variation in the biology of normal breasts, how the clusters identified relate to breast cancer risk and their possible link to the origin of the different molecular subtypes of breast cancer.
Hidden among the crowd: differential DNA methylation-expression correlations in cancer occur at important oncogenic pathways.

PubMed

Mosquera Orgueira, Adrián

2015-01-01

DNA methylation is a frequent epigenetic mechanism that participates in transcriptional repression. Variations in DNA methylation with respect to gene expression are constant, and, for unknown reasons, some genes with highly methylated promoters are sometimes overexpressed. In this study we have analyzed the expression and methylation patterns of thousands of genes in five groups of cancer and normal tissue samples in order to determine local and genome-wide differences. We observed significant changes in global methylation-expression correlation in all the neoplasms, which suggests that differential correlation events are frequent in cancer. A focused analysis in the breast cancer cohort identified 1662 genes whose correlation varies significantly between normal and cancerous breast, but whose DNA methylation and gene expression patterns do not change substantially. These genes were enriched in cancer-related pathways and repressive chromatin features across various model cell lines, such as PRC2 binding and H3K27me3 marks. Substantial changes in methylation-expression correlation indicate that these genes are subject to epigenetic remodeling, where the differential activity of other factors break the expected relationship between both variables. Our findings suggest a complex regulatory landscape where a redistribution of local and large-scale chromatin repressive domains at differentially correlated genes (DCGs) creates epigenetic hotspots that modulate cancer-specific gene expression.
Zipf's Law in Gene Expression

NASA Astrophysics Data System (ADS)

Furusawa, Chikara; Kaneko, Kunihiko

2003-02-01

Using data from gene expression databases on various organisms and tissues, including yeast, nematodes, human normal and cancer tissues, and embryonic stem cells, we found that the abundances of expressed genes exhibit a power-law distribution with an exponent close to -1; i.e., they obey Zipf’s law. Furthermore, by simulations of a simple model with an intracellular reaction network, we found that Zipf’s law of chemical abundance is a universal feature of cells where such a network optimizes the efficiency and faithfulness of self-reproduction. These findings provide novel insights into the nature of the organization of reaction dynamics in living cells.

Cloud-scale genomic signals processing classification analysis for gene expression microarray data.

PubMed

Harvey, Benjamin; Soo-Yeon Ji

2014-01-01

As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
Feature genes predicting the FLT3/ITD mutation in acute myeloid leukemia

PubMed Central

LI, CHENGLONG; ZHU, BIAO; CHEN, JIAO; HUANG, XIAOBING

2016-01-01

In the present study, gene expression profiles of acute myeloid leukemia (AML) samples were analyzed to identify feature genes with the capacity to predict the mutation status of FLT3/ITD. Two machine learning models, namely the support vector machine (SVM) and random forest (RF) methods, were used for classification. Four datasets were downloaded from the European Bioinformatics Institute, two of which (containing 371 samples, including 281 FLT3/ITD mutation-negative and 90 mutation-positive samples) were randomly defined as the training group, while the other two datasets (containing 488 samples, including 350 FLT3/ITD mutation-negative and 138 mutation-positive samples) were defined as the test group. Differentially expressed genes (DEGs) were identified by significance analysis of the micro-array data by using the training samples. The classification efficiency of the SCM and RF methods was evaluated using the following parameters: Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and the area under the receiver operating characteristic curve. Functional enrichment analysis was performed for the feature genes with DAVID. A total of 585 DEGs were identified in the training group, of which 580 were upregulated and five were downregulated. The classification accuracy rates of the two methods for the training group, the test group and the combined group using the 585 feature genes were >90%. For the SVM and RF methods, the rates of correct determination, specificity and PPV were >90%, while the sensitivity and NPV were >80%. The SVM method produced a slightly better classification effect than the RF method. A total of 13 biological pathways were overrepresented by the feature genes, mainly involving energy metabolism, chromatin organization and translation. The feature genes identified in the present study may be used to predict the mutation status of FLT3/ITD in patients with AML. PMID:27177049
Feature genes predicting the FLT3/ITD mutation in acute myeloid leukemia.

PubMed

Li, Chenglong; Zhu, Biao; Chen, Jiao; Huang, Xiaobing

2016-07-01

In the present study, gene expression profiles of acute myeloid leukemia (AML) samples were analyzed to identify feature genes with the capacity to predict the mutation status of FLT3/ITD. Two machine learning models, namely the support vector machine (SVM) and random forest (RF) methods, were used for classification. Four datasets were downloaded from the European Bioinformatics Institute, two of which (containing 371 samples, including 281 FLT3/ITD mutation-negative and 90 mutation‑positive samples) were randomly defined as the training group, while the other two datasets (containing 488 samples, including 350 FLT3/ITD mutation-negative and 138 mutation-positive samples) were defined as the test group. Differentially expressed genes (DEGs) were identified by significance analysis of the microarray data by using the training samples. The classification efficiency of the SCM and RF methods was evaluated using the following parameters: Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and the area under the receiver operating characteristic curve. Functional enrichment analysis was performed for the feature genes with DAVID. A total of 585 DEGs were identified in the training group, of which 580 were upregulated and five were downregulated. The classification accuracy rates of the two methods for the training group, the test group and the combined group using the 585 feature genes were >90%. For the SVM and RF methods, the rates of correct determination, specificity and PPV were >90%, while the sensitivity and NPV were >80%. The SVM method produced a slightly better classification effect than the RF method. A total of 13 biological pathways were overrepresented by the feature genes, mainly involving energy metabolism, chromatin organization and translation. The feature genes identified in the present study may be used to predict the mutation status of FLT3/ITD in patients with AML.
SoFoCles: feature filtering for microarray classification based on gene ontology.

PubMed

Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A

2010-02-01

Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.
Functional and topological characteristics of mammalian regulatory domains

PubMed Central

Symmons, Orsolya; Uslu, Veli Vural; Tsujimura, Taro; Ruf, Sandra; Nassari, Sonya; Schwarzer, Wibke; Ettwiller, Laurence; Spitz, François

2014-01-01

Long-range regulatory interactions play an important role in shaping gene-expression programs. However, the genomic features that organize these activities are still poorly characterized. We conducted a large operational analysis to chart the distribution of gene regulatory activities along the mouse genome, using hundreds of insertions of a regulatory sensor. We found that enhancers distribute their activities along broad regions and not in a gene-centric manner, defining large regulatory domains. Remarkably, these domains correlate strongly with the recently described TADs, which partition the genome into distinct self-interacting blocks. Different features, including specific repeats and CTCF-binding sites, correlate with the transition zones separating regulatory domains, and may help to further organize promiscuously distributed regulatory influences within large domains. These findings support a model of genomic organization where TADs confine regulatory activities to specific but large regulatory domains, contributing to the establishment of specific gene expression profiles. PMID:24398455
Gene Selection and Cancer Classification: A Rough Sets Based Approach

NASA Astrophysics Data System (ADS)

Sun, Lijun; Miao, Duoqian; Zhang, Hongyun

Indentification of informative gene subsets responsible for discerning between available samples of gene expression data is an important task in bioinformatics. Reducts, from rough sets theory, corresponding to a minimal set of essential genes for discerning samples, is an efficient tool for gene selection. Due to the compuational complexty of the existing reduct algoritms, feature ranking is usually used to narrow down gene space as the first step and top ranked genes are selected . In this paper,we define a novel certierion based on the expression level difference btween classes and contribution to classification of the gene for scoring genes and present a algorithm for generating all possible reduct from informative genes.The algorithm takes the whole attribute sets into account and find short reduct with a significant reduction in computational complexity. An exploration of this approach on benchmark gene expression data sets demonstrates that this approach is successful for selecting high discriminative genes and the classification accuracy is impressive.
Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes

PubMed Central

Richardson, Casey R.; Luo, Qing-Jun; Gontcharova, Viktoria; Jiang, Ying-Wen; Samanta, Manoj; Youn, Eunseog; Rock, Christopher D.

2010-01-01

Background MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery. Principal Findings We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis ‘orphan’ hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the “ancient” (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for “new” rapidly-evolving MIRNA genes. Conclusions Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other kingdoms, which can provide insight into antisense transcription, miRNA evolution, and post-transcriptional gene regulation. PMID:20520764
Mutations in a novel gene, NHS, cause the pleiotropic effects of Nance-Horan syndrome, including severe congenital cataract, dental anomalies, and mental retardation.

PubMed

Burdon, Kathryn P; McKay, James D; Sale, Michèle M; Russell-Eggitt, Isabelle M; Mackey, David A; Wirth, M Gabriela; Elder, James E; Nicoll, Alan; Clarke, Michael P; FitzGerald, Liesel M; Stankovich, James M; Shaw, Marie A; Sharma, Shiwani; Gajovic, Srecko; Gruss, Peter; Ross, Shelley; Thomas, Paul; Voss, Anne K; Thomas, Tim; Gécz, Jozef; Craig, Jamie E

2003-11-01

Nance-Horan syndrome (NHS) is an X-linked disorder characterized by congenital cataracts, dental anomalies, dysmorphic features, and, in some cases, mental retardation. NHS has been mapped to a 1.3-Mb interval on Xp22.13. We have confirmed the same localization in the original, extended Australian family with NHS and have identified protein-truncating mutations in a novel gene, which we have called "NHS," in five families. The NHS gene encompasses approximately 650 kb of genomic DNA, coding for a 1,630-amino acid putative nuclear protein. NHS orthologs were found in other vertebrates, but no sequence similarity to known genes was identified. The murine developmental expression profile of the NHS gene was studied using in situ hybridization and a mouse line containing a lacZ reporter-gene insertion in the Nhs locus. We found a complex pattern of temporally and spatially regulated expression, which, together with the pleiotropic features of NHS, suggests that this gene has key functions in the regulation of eye, tooth, brain, and craniofacial development.
Mutations in a Novel Gene, NHS, Cause the Pleiotropic Effects of Nance-Horan Syndrome, Including Severe Congenital Cataract, Dental Anomalies, and Mental Retardation

PubMed Central

Burdon, Kathryn P.; McKay, James D.; Sale, Michèle M.; Russell-Eggitt, Isabelle M.; Mackey, David A.; Wirth, M. Gabriela; Elder, James E.; Nicoll, Alan; Clarke, Michael P.; FitzGerald, Liesel M.; Stankovich, James M.; Shaw, Marie A.; Sharma, Shiwani; Gajovic, Srecko; Gruss, Peter; Ross, Shelley; Thomas, Paul; Voss, Anne K.; Thomas, Tim; Gécz, Jozef; Craig, Jamie E.

2003-01-01

Nance-Horan syndrome (NHS) is an X-linked disorder characterized by congenital cataracts, dental anomalies, dysmorphic features, and, in some cases, mental retardation. NHS has been mapped to a 1.3-Mb interval on Xp22.13. We have confirmed the same localization in the original, extended Australian family with NHS and have identified protein-truncating mutations in a novel gene, which we have called “NHS,” in five families. The NHS gene encompasses ∼650 kb of genomic DNA, coding for a 1,630–amino acid putative nuclear protein. NHS orthologs were found in other vertebrates, but no sequence similarity to known genes was identified. The murine developmental expression profile of the NHS gene was studied using in situ hybridization and a mouse line containing a lacZ reporter-gene insertion in the Nhs locus. We found a complex pattern of temporally and spatially regulated expression, which, together with the pleiotropic features of NHS, suggests that this gene has key functions in the regulation of eye, tooth, brain, and craniofacial development. PMID:14564667
Identification and Validation of Selected Universal Stress Protein Domain Containing Drought-Responsive Genes in Pigeonpea (Cajanus cajan L.)

PubMed Central

Sinha, Pallavi; Pazhamala, Lekha T.; Singh, Vikas K.; Saxena, Rachit K.; Krishnamurthy, L.; Azam, Sarwar; Khan, Aamir W.; Varshney, Rajeev K.

2016-01-01

Pigeonpea is a resilient crop, which is relatively more drought tolerant than many other legume crops. To understand the molecular mechanisms of this unique feature of pigeonpea, 51 genes were selected using the Hidden Markov Models (HMM) those codes for proteins having close similarity to universal stress protein domain. Validation of these genes was conducted on three pigeonpea genotypes (ICPL 151, ICPL 8755, and ICPL 227) having different levels of drought tolerance. Gene expression analysis using qRT-PCR revealed 6, 8, and 18 genes to be ≥2-fold differentially expressed in ICPL 151, ICPL 8755, and ICPL 227, respectively. A total of 10 differentially expressed genes showed ≥2-fold up-regulation in the more drought tolerant genotype, which encoded four different classes of proteins. These include plant U-box protein (four genes), universal stress protein A-like protein (four genes), cation/H(+) antiporter protein (one gene) and an uncharacterized protein (one gene). Genes C.cajan_29830 and C.cajan_33874 belonging to uspA, were found significantly expressed in all the three genotypes with ≥2-fold expression variations. Expression profiling of these two genes on the four other legume crops revealed their specific role in pigeonpea. Therefore, these genes seem to be promising candidates for conferring drought tolerance specifically to pigeonpea. PMID:26779199
Identification of aberrant gene expression associated with aberrant promoter methylation in primordial germ cells between E13 and E16 rat F3 generation vinclozolin lineage.

PubMed

Taguchi, Y-h

2015-01-01

Transgenerational epigenetics (TGE) are currently considered important in disease, but the mechanisms involved are not yet fully understood. TGE abnormalities expected to cause disease are likely to be initiated during development and to be mediated by aberrant gene expression associated with aberrant promoter methylation that is heritable between generations. However, because methylation is removed and then re-established during development, it is not easy to identify promoter methylation abnormalities by comparing normal lineages with those expected to exhibit TGE abnormalities. This study applied the recently proposed principal component analysis (PCA)-based unsupervised feature extraction to previously reported and publically available gene expression/promoter methylation profiles of rat primordial germ cells, between E13 and E16 of the F3 generation vinclozolin lineage that are expected to exhibit TGE abnormalities, to identify multiple genes that exhibited aberrant gene expression/promoter methylation during development. The biological feasibility of the identified genes were tested via enrichment analyses of various biological concepts including pathway analysis, gene ontology terms and protein-protein interactions. All validations suggested superiority of the proposed method over three conventional and popular supervised methods that employed t test, limma and significance analysis of microarrays, respectively. The identified genes were globally related to tumors, the prostate, kidney, testis and the immune system and were previously reported to be related to various diseases caused by TGE. Among the genes reported by PCA-based unsupervised feature extraction, we propose that chemokine signaling pathways and leucine rich repeat proteins are key factors that initiate transgenerational epigenetic-mediated diseases, because multiple genes included in these two categories were identified in this study.
Identification of aberrant gene expression associated with aberrant promoter methylation in primordial germ cells between E13 and E16 rat F3 generation vinclozolin lineage

PubMed Central

2015-01-01

Background Transgenerational epigenetics (TGE) are currently considered important in disease, but the mechanisms involved are not yet fully understood. TGE abnormalities expected to cause disease are likely to be initiated during development and to be mediated by aberrant gene expression associated with aberrant promoter methylation that is heritable between generations. However, because methylation is removed and then re-established during development, it is not easy to identify promoter methylation abnormalities by comparing normal lineages with those expected to exhibit TGE abnormalities. Methods This study applied the recently proposed principal component analysis (PCA)-based unsupervised feature extraction to previously reported and publically available gene expression/promoter methylation profiles of rat primordial germ cells, between E13 and E16 of the F3 generation vinclozolin lineage that are expected to exhibit TGE abnormalities, to identify multiple genes that exhibited aberrant gene expression/promoter methylation during development. Results The biological feasibility of the identified genes were tested via enrichment analyses of various biological concepts including pathway analysis, gene ontology terms and protein-protein interactions. All validations suggested superiority of the proposed method over three conventional and popular supervised methods that employed t test, limma and significance analysis of microarrays, respectively. The identified genes were globally related to tumors, the prostate, kidney, testis and the immune system and were previously reported to be related to various diseases caused by TGE. Conclusions Among the genes reported by PCA-based unsupervised feature extraction, we propose that chemokine signaling pathways and leucine rich repeat proteins are key factors that initiate transgenerational epigenetic-mediated diseases, because multiple genes included in these two categories were identified in this study. PMID:26677731
Gene expression profiling gut microbiota in different races of humans

NASA Astrophysics Data System (ADS)

Chen, Lei; Zhang, Yu-Hang; Huang, Tao; Cai, Yu-Dong

2016-03-01

The gut microbiome is shaped and modified by the polymorphisms of microorganisms in the intestinal tract. Its composition shows strong individual specificity and may play a crucial role in the human digestive system and metabolism. Several factors can affect the composition of the gut microbiome, such as eating habits, living environment, and antibiotic usage. Thus, various races are characterized by different gut microbiome characteristics. In this present study, we studied the gut microbiomes of three different races, including individuals of Asian, European and American races. The gut microbiome and the expression levels of gut microbiome genes were analyzed in these individuals. Advanced feature selection methods (minimum redundancy maximum relevance and incremental feature selection) and four machine-learning algorithms (random forest, nearest neighbor algorithm, sequential minimal optimization, Dagging) were employed to capture key differentially expressed genes. As a result, sequential minimal optimization was found to yield the best performance using the 454 genes, which could effectively distinguish the gut microbiomes of different races. Our analyses of extracted genes support the widely accepted hypotheses that eating habits, living environments and metabolic levels in different races can influence the characteristics of the gut microbiome.
Gene expression profiling gut microbiota in different races of humans

PubMed Central

Chen, Lei; Zhang, Yu-Hang; Huang, Tao; Cai, Yu-Dong

2016-01-01

The gut microbiome is shaped and modified by the polymorphisms of microorganisms in the intestinal tract. Its composition shows strong individual specificity and may play a crucial role in the human digestive system and metabolism. Several factors can affect the composition of the gut microbiome, such as eating habits, living environment, and antibiotic usage. Thus, various races are characterized by different gut microbiome characteristics. In this present study, we studied the gut microbiomes of three different races, including individuals of Asian, European and American races. The gut microbiome and the expression levels of gut microbiome genes were analyzed in these individuals. Advanced feature selection methods (minimum redundancy maximum relevance and incremental feature selection) and four machine-learning algorithms (random forest, nearest neighbor algorithm, sequential minimal optimization, Dagging) were employed to capture key differentially expressed genes. As a result, sequential minimal optimization was found to yield the best performance using the 454 genes, which could effectively distinguish the gut microbiomes of different races. Our analyses of extracted genes support the widely accepted hypotheses that eating habits, living environments and metabolic levels in different races can influence the characteristics of the gut microbiome. PMID:26975620
Tumor necrosis is an important hallmark of aggressive endometrial cancer and associates with hypoxia, angiogenesis and inflammation responses

PubMed Central

Stefansson, Ingunn M.; Birkeland, Even; Bø, Trond Hellem; Øyan, Anne M.; Trovik, Jone; Kalland, Karl-Henning; Jonassen, Inge; Salvesen, Helga B.; Wik, Elisabeth; Akslen, Lars A.

2015-01-01

Aims Tumor necrosis is associated with aggressive features of endometrial cancer and poor prognosis. Here, we investigated gene expression patterns and potential treatment targets related to presence of tumor necrosis in primary endometrial cancer lesions. Methods and Results By DNA microarray analysis, expression of genes related to tumor necrosis reflected multiple tumor-microenvironment interactions like tissue hypoxia, angiogenesis and inflammation pathways. A tumor necrosis signature of 38 genes and a related patient cluster (Cluster I, 67% of the cases) were associated with features of aggressive tumors such as type II cancers, estrogen receptor negative tumors and vascular invasion. Further, the tumor necrosis signature was increased in tumor cells grown in hypoxic conditions in vitro. Multiple genes with increased expression are known to be activated by HIF1A and NF-kB. Conclusions Our findings indicate that the presence of tumor necrosis within primary tumors is associated with hypoxia, angiogenesis and inflammation responses. HIF1A, NF-kB and PI3K/mTOR might be potential treatment targets in aggressive endometrial cancers with presence of tumor necrosis. PMID:26485755
Tumor necrosis is an important hallmark of aggressive endometrial cancer and associates with hypoxia, angiogenesis and inflammation responses.

PubMed

Bredholt, Geir; Mannelqvist, Monica; Stefansson, Ingunn M; Birkeland, Even; Bø, Trond Hellem; Øyan, Anne M; Trovik, Jone; Kalland, Karl-Henning; Jonassen, Inge; Salvesen, Helga B; Wik, Elisabeth; Akslen, Lars A

2015-11-24

Tumor necrosis is associated with aggressive features of endometrial cancer and poor prognosis. Here, we investigated gene expression patterns and potential treatment targets related to presence of tumor necrosis in primary endometrial cancer lesions. By DNA microarray analysis, expression of genes related to tumor necrosis reflected multiple tumor-microenvironment interactions like tissue hypoxia, angiogenesis and inflammation pathways. A tumor necrosis signature of 38 genes and a related patient cluster (Cluster I, 67% of the cases) were associated with features of aggressive tumors such as type II cancers, estrogen receptor negative tumors and vascular invasion. Further, the tumor necrosis signature was increased in tumor cells grown in hypoxic conditions in vitro. Multiple genes with increased expression are known to be activated by HIF1A and NF-kB. Our findings indicate that the presence of tumor necrosis within primary tumors is associated with hypoxia, angiogenesis and inflammation responses. HIF1A, NF-kB and PI3K/mTOR might be potential treatment targets in aggressive endometrial cancers with presence of tumor necrosis.
Feature Genes Selection Using Supervised Locally Linear Embedding and Correlation Coefficient for Microarray Classification

PubMed Central

Wang, Yun; Huang, Fangzhou

2018-01-01

The selection of feature genes with high recognition ability from the gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC2), is proposed which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experiment results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible. PMID:29666661
Feature Genes Selection Using Supervised Locally Linear Embedding and Correlation Coefficient for Microarray Classification.

PubMed

Xu, Jiucheng; Mu, Huiyu; Wang, Yun; Huang, Fangzhou

2018-01-01

The selection of feature genes with high recognition ability from the gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC 2 ), is proposed which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experiment results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible.
Transcriptional Modulation of Genes Encoding Structural Characteristics of Differentiating Enterocytes During Development of a Polarized Epithelium In Vitro

PubMed Central

Halbleib, Jennifer M.; Sääf, Annika M.

2007-01-01

Although there is considerable evidence implicating posttranslational mechanisms in the development of epithelial cell polarity, little is known about the patterns of gene expression and transcriptional regulation during this process. We characterized the temporal program of gene expression during cell–cell adhesion–initiated polarization of human Caco-2 cells in tissue culture, which develop structural and functional polarity similar to that of enterocytes in vivo. A distinctive switch in gene expression patterns occurred upon formation of cell–cell contacts between neighboring cells. Expression of genes involved in cell proliferation was down-regulated concomitant with induction of genes necessary for functional specialization of polarized epithelial cells. Transcriptional up-regulation of these latter genes correlated with formation of important structural and functional features in enterocyte differentiation and establishment of structural and functional cell polarity; components of the apical microvilli were induced as the brush border formed during polarization; as barrier function was established, expression of tight junction transmembrane proteins peaked; transcripts encoding components of the apical, but not the basal-lateral trafficking machinery were increased during polarization. Coordinated expression of genes encoding components of functional cell structures were often observed indicating temporal control of expression and assembly of multiprotein complexes. PMID:17699590
Cell cycle-regulated oscillator coordinates core histone gene transcription through histone acetylation

PubMed Central

Kurat, Christoph F.; Lambert, Jean-Philippe; Petschnigg, Julia; Friesen, Helena; Pawson, Tony; Rosebrock, Adam; Gingras, Anne-Claude; Fillingham, Jeffrey; Andrews, Brenda

2014-01-01

DNA replication occurs during the synthetic (S) phase of the eukaryotic cell cycle and features a dramatic induction of histone gene expression for concomitant chromatin assembly. Ectopic production of core histones outside of S phase is toxic, underscoring the critical importance of regulatory pathways that ensure proper expression of histone genes. Several regulators of histone gene expression in the budding yeast Saccharomyces cerevisiae are known, yet the key oscillator responsible for restricting gene expression to S phase has remained elusive. Here, we show that suppressor of Ty (Spt)10, a putative histone acetyltransferase, and its binding partner Spt21 are key determinants of S-phase–specific histone gene expression. We show that Spt21 abundance is restricted to S phase in part by anaphase promoting complex Cdc20-homologue 1 (APCCdh1) and that it is recruited to histone gene promoters in S phase by Spt10. There, Spt21-Spt10 enables the recruitment of a cascade of regulators, including histone chaperones and the histone-acetyltransferase general control nonderepressible (Gcn) 5, which we hypothesize lead to histone acetylation and consequent transcription activation. PMID:25228766

htsint: a Python library for sequencing pipelines that combines data through gene set generation.

PubMed

Richards, Adam J; Herrel, Anthony; Bonneaud, Camille

2015-09-24

Sequencing technologies provide a wealth of details in terms of genes, expression, splice variants, polymorphisms, and other features. A standard for sequencing analysis pipelines is to put genomic or transcriptomic features into a context of known functional information, but the relationships between ontology terms are often ignored. For RNA-Seq, considering genes and their genetic variants at the group level enables a convenient way to both integrate annotation data and detect small coordinated changes between experimental conditions, a known caveat of gene level analyses. We introduce the high throughput data integration tool, htsint, as an extension to the commonly used gene set enrichment frameworks. The central aim of htsint is to compile annotation information from one or more taxa in order to calculate functional distances among all genes in a specified gene space. Spectral clustering is then used to partition the genes, thereby generating functional modules. The gene space can range from a targeted list of genes, like a specific pathway, all the way to an ensemble of genomes. Given a collection of gene sets and a count matrix of transcriptomic features (e.g. expression, polymorphisms), the gene sets produced by htsint can be tested for 'enrichment' or conditional differences using one of a number of commonly available packages. The database and bundled tools to generate functional modules were designed with sequencing pipelines in mind, but the toolkit nature of htsint allows it to also be used in other areas of genomics. The software is freely available as a Python library through GitHub at https://github.com/ajrichards/htsint.
Developmental gene regulatory network architecture across 500 million years of echinoderm evolution

NASA Technical Reports Server (NTRS)

Hinman, Veronica F.; Nguyen, Albert T.; Cameron, R. Andrew; Davidson, Eric H.

2003-01-01

Evolutionary change in morphological features must depend on architectural reorganization of developmental gene regulatory networks (GRNs), just as true conservation of morphological features must imply retention of ancestral developmental GRN features. Key elements of the provisional GRN for embryonic endomesoderm development in the sea urchin are here compared with those operating in embryos of a distantly related echinoderm, a starfish. These animals diverged from their common ancestor 520-480 million years ago. Their endomesodermal fate maps are similar, except that sea urchins generate a skeletogenic cell lineage that produces a prominent skeleton lacking entirely in starfish larvae. A relevant set of regulatory genes was isolated from the starfish Asterina miniata, their expression patterns determined, and effects on the other genes of perturbing the expression of each were demonstrated. A three-gene feedback loop that is a fundamental feature of the sea urchin GRN for endoderm specification is found in almost identical form in the starfish: a detailed element of GRN architecture has been retained since the Cambrian Period in both echinoderm lineages. The significance of this retention is highlighted by the observation of numerous specific differences in the GRN connections as well. A regulatory gene used to drive skeletogenesis in the sea urchin is used entirely differently in the starfish, where it responds to endomesodermal inputs that do not affect it in the sea urchin embryo. Evolutionary changes in the GRNs since divergence are limited sharply to certain cis-regulatory elements, whereas others have persisted unaltered.
Noise in gene expression is coupled to growth rate.

PubMed

Keren, Leeat; van Dijk, David; Weingarten-Gabbay, Shira; Davidi, Dan; Jona, Ghil; Weinberger, Adina; Milo, Ron; Segal, Eran

2015-12-01

Genetically identical cells exposed to the same environment display variability in gene expression (noise), with important consequences for the fidelity of cellular regulation and biological function. Although population average gene expression is tightly coupled to growth rate, the effects of changes in environmental conditions on expression variability are not known. Here, we measure the single-cell expression distributions of approximately 900 Saccharomyces cerevisiae promoters across four environmental conditions using flow cytometry, and find that gene expression noise is tightly coupled to the environment and is generally higher at lower growth rates. Nutrient-poor conditions, which support lower growth rates, display elevated levels of noise for most promoters, regardless of their specific expression values. We present a simple model of noise in expression that results from having an asynchronous population, with cells at different cell-cycle stages, and with different partitioning of the cells between the stages at different growth rates. This model predicts non-monotonic global changes in noise at different growth rates as well as overall higher variability in expression for cell-cycle-regulated genes in all conditions. The consistency between this model and our data, as well as with noise measurements of cells growing in a chemostat at well-defined growth rates, suggests that cell-cycle heterogeneity is a major contributor to gene expression noise. Finally, we identify gene and promoter features that play a role in gene expression noise across conditions. Our results show the existence of growth-related global changes in gene expression noise and suggest their potential phenotypic implications. © 2015 Keren et al.; Published by Cold Spring Harbor Laboratory Press.
Noise in gene expression is coupled to growth rate

PubMed Central

Keren, Leeat; van Dijk, David; Weingarten-Gabbay, Shira; Davidi, Dan; Jona, Ghil; Weinberger, Adina; Milo, Ron; Segal, Eran

2015-01-01

Genetically identical cells exposed to the same environment display variability in gene expression (noise), with important consequences for the fidelity of cellular regulation and biological function. Although population average gene expression is tightly coupled to growth rate, the effects of changes in environmental conditions on expression variability are not known. Here, we measure the single-cell expression distributions of approximately 900 Saccharomyces cerevisiae promoters across four environmental conditions using flow cytometry, and find that gene expression noise is tightly coupled to the environment and is generally higher at lower growth rates. Nutrient-poor conditions, which support lower growth rates, display elevated levels of noise for most promoters, regardless of their specific expression values. We present a simple model of noise in expression that results from having an asynchronous population, with cells at different cell-cycle stages, and with different partitioning of the cells between the stages at different growth rates. This model predicts non-monotonic global changes in noise at different growth rates as well as overall higher variability in expression for cell-cycle–regulated genes in all conditions. The consistency between this model and our data, as well as with noise measurements of cells growing in a chemostat at well-defined growth rates, suggests that cell-cycle heterogeneity is a major contributor to gene expression noise. Finally, we identify gene and promoter features that play a role in gene expression noise across conditions. Our results show the existence of growth-related global changes in gene expression noise and suggest their potential phenotypic implications. PMID:26355006
Expression profiling of mouse subplate reveals a dynamic gene network and disease association with autism and schizophrenia

PubMed Central

Hoerder-Suabedissen, Anna; Oeschger, Franziska M.; Krishnan, Michelle L.; Belgard, T. Grant; Wang, Wei Zhi; Lee, Sheena; Webber, Caleb; Petretto, Enrico; Edwards, A. David; Molnár, Zoltán

2013-01-01

The subplate zone is a highly dynamic transient sector of the developing cerebral cortex that contains some of the earliest generated neurons and the first functional synapses of the cerebral cortex. Subplate cells have important functions in early establishment and maturation of thalamocortical connections, as well as in the development of inhibitory cortical circuits in sensory areas. So far no role has been identified for cells in the subplate in the mature brain and disease association of the subplate-specific genes has not been analyzed systematically. Here we present gene expression evidence for distinct roles of the mouse subplate across development as well as unique molecular markers to extend the repertoire of subplate labels. Performing systematic comparisons between different ages (embryonic days 15 and 18, postnatal day 8, and adult), we reveal the dynamic and constant features of the markers labeling subplate cells during embryonic and early postnatal development and in the adult. This can be visualized using the online database of subplate gene expression at https://molnar.dpag.ox.ac.uk/subplate/. We also identify embryonic similarities in gene expression between the ventricular zones, intermediate zone, and subplate, and distinct postnatal similarities between subplate, layer 5, and layers 2/3. The genes expressed in a subplate-specific manner at some point during development show a statistically significant enrichment for association with autism spectrum disorders and schizophrenia. Our report emphasizes the importance of the study of transient features of the developing brain to better understand neurodevelopmental disorders. PMID:23401504
dictyExpress: a Dictyostelium discoideum gene expression database with an explorative data analysis web-based interface.

PubMed

Rot, Gregor; Parikh, Anup; Curk, Tomaz; Kuspa, Adam; Shaulsky, Gad; Zupan, Blaz

2009-08-25

Bioinformatics often leverages on recent advancements in computer science to support biologists in their scientific discovery process. Such efforts include the development of easy-to-use web interfaces to biomedical databases. Recent advancements in interactive web technologies require us to rethink the standard submit-and-wait paradigm, and craft bioinformatics web applications that share analytical and interactive power with their desktop relatives, while retaining simplicity and availability. We have developed dictyExpress, a web application that features a graphical, highly interactive explorative interface to our database that consists of more than 1000 Dictyostelium discoideum gene expression experiments. In dictyExpress, the user can select experiments and genes, perform gene clustering, view gene expression profiles across time, view gene co-expression networks, perform analyses of Gene Ontology term enrichment, and simultaneously display expression profiles for a selected gene in various experiments. Most importantly, these tasks are achieved through web applications whose components are seamlessly interlinked and immediately respond to events triggered by the user, thus providing a powerful explorative data analysis environment. dictyExpress is a precursor for a new generation of web-based bioinformatics applications with simple but powerful interactive interfaces that resemble that of the modern desktop. While dictyExpress serves mainly the Dictyostelium research community, it is relatively easy to adapt it to other datasets. We propose that the design ideas behind dictyExpress will influence the development of similar applications for other model organisms.
dictyExpress: a Dictyostelium discoideum gene expression database with an explorative data analysis web-based interface

PubMed Central

Rot, Gregor; Parikh, Anup; Curk, Tomaz; Kuspa, Adam; Shaulsky, Gad; Zupan, Blaz

2009-01-01

Background Bioinformatics often leverages on recent advancements in computer science to support biologists in their scientific discovery process. Such efforts include the development of easy-to-use web interfaces to biomedical databases. Recent advancements in interactive web technologies require us to rethink the standard submit-and-wait paradigm, and craft bioinformatics web applications that share analytical and interactive power with their desktop relatives, while retaining simplicity and availability. Results We have developed dictyExpress, a web application that features a graphical, highly interactive explorative interface to our database that consists of more than 1000 Dictyostelium discoideum gene expression experiments. In dictyExpress, the user can select experiments and genes, perform gene clustering, view gene expression profiles across time, view gene co-expression networks, perform analyses of Gene Ontology term enrichment, and simultaneously display expression profiles for a selected gene in various experiments. Most importantly, these tasks are achieved through web applications whose components are seamlessly interlinked and immediately respond to events triggered by the user, thus providing a powerful explorative data analysis environment. Conclusion dictyExpress is a precursor for a new generation of web-based bioinformatics applications with simple but powerful interactive interfaces that resemble that of the modern desktop. While dictyExpress serves mainly the Dictyostelium research community, it is relatively easy to adapt it to other datasets. We propose that the design ideas behind dictyExpress will influence the development of similar applications for other model organisms. PMID:19706156
Translocations and mutations involving the nucleophosmin (NPM1) gene in lymphomas and leukemias.

PubMed

Falini, Brunangelo; Nicoletti, Ildo; Bolli, Niccolò; Martelli, Maria Paola; Liso, Arcangelo; Gorello, Paolo; Mandelli, Franco; Mecucci, Cristina; Martelli, Massimo Fabrizio

2007-04-01

Nucleophosmin (NPM) is a ubiquitously expressed nucleolar phoshoprotein which shuttles continuously between the nucleus and cytoplasm. Many findings have revealed a complex scenario of NPM functions and interactions, pointing to proliferative and growth-suppressive roles of this molecule. The gene NPM1 that encodes for nucleophosmin (NPM1) is translocated or mutated in various lymphomas and leukemias, forming fusion proteins (NPM-ALK, NPM-RARalpha, NPM-MLF1) or NPM mutant products. Here, we review the structure and functions of NPM, as well as the biological, clinical and pathological features of human hematologic malignancies with NPM1 gene alterations. NPM-ALK indentifies a new category of T/Null lymphomas with distinctive molecular and clinico-pathological features, that is going to be included as a novel disease entity (ALK+ anaplastic large cell lymphoma) in the new WHO classification of lymphoid neoplasms. NPM1 mutations occur specifically in about 30% of adult de novo AML and cause aberrant cytoplasmic expression of NPM (hence the term NPMc+ AML). NPMc+ AML associates with normal karyotpe, and shows wide morphological spectrum, multilineage involvement, a unique gene expression signature, a high frequency of FLT3-internal tandem duplications, and distinctive clinical and prognostic features. The availability of specific antibodies and molecular techniques for the detection of NPM1 gene alterations has an enormous impact in the biological study diagnosis, prognostic stratification, and monitoring of minimal residual disease of various lymphomas and leukemias. The discovery of NPM1 gene alterations also represents the rationale basis for development of molecular targeted drugs.
A 3,000-loci transcription map of chromosome 3B unravels the structural and functional features of gene islands in hexaploid wheat.

PubMed

Rustenholz, Camille; Choulet, Frédéric; Laugier, Christel; Safár, Jan; Simková, Hana; Dolezel, Jaroslav; Magni, Federica; Scalabrin, Simone; Cattonaro, Federica; Vautrin, Sonia; Bellec, Arnaud; Bergès, Hélène; Feuillet, Catherine; Paux, Etienne

2011-12-01

To improve our understanding of the organization and regulation of the wheat (Triticum aestivum) gene space, we established a transcription map of a wheat chromosome (3B) by hybridizing a newly developed wheat expression microarray with bacterial artificial chromosome pools from a new version of the 3B physical map as well as with cDNA probes derived from 15 RNA samples. Mapping data for almost 3,000 genes showed that the gene space spans the whole chromosome 3B with a 2-fold increase of gene density toward the telomeres due to an increase in the number of genes in islands. Comparative analyses with rice (Oryza sativa) and Brachypodium distachyon revealed that these gene islands are composed mainly of genes likely originating from interchromosomal gene duplications. Gene Ontology and expression profile analyses for the 3,000 genes located along the chromosome revealed that the gene islands are enriched significantly in genes sharing the same function or expression profile, thereby suggesting that genes in islands acquired shared regulation during evolution. Only a small fraction of these clusters of cofunctional and coexpressed genes was conserved with rice and B. distachyon, indicating a recent origin. Finally, genes with the same expression profiles in remote islands (coregulation islands) were identified suggesting long-distance regulation of gene expression along the chromosomes in wheat.
The AMT1 family genes from Malus robusta display differential transcription features and ammonium transport abilities.

PubMed

Li, Hui; Yang, Qing-Song; Liu, Wei; Lin, Jing; Chang, You-Hong

2017-10-01

Ammonium is an important nitrogen sources for plant growth. In this study, we report on the gene characterization of the ammonium transporter AMT1 subfamily in the apple rootstock Malus robusta Rehd. Thirteen AMT genes were comprehensively evaluated from the apple genome (version 1.0). Then the gene features and expression patterns of five AMT1 members from M. robusta were analyzed. These genes fell into four clusters in the AMT phylogenetic tree: clade I (MrAMT1;1 and MrAMT1;3), clade II (MrAMT1;4), clade III (MrAMT1;2), and clade IV (MrAMT1;5). All the AMT1s, apart from MrAMT1;4, were expressed in vegetative organs and strongly responded to nitrogen concentration changes. For example, MrAMT1;2 and MrAMT1;3 had high transcript accumulation levels in the leaves and roots, respectively. Finally, the functions of these AMT1s were studied in detail by heterologous expression in yeast. These genes allowed strain 31019b to assimilate nitrogen, but their 15 NH 4 + uptake kinetics varied. These results revealed the functional roles of AMT1 during ammonium absorption in the AMT-defective mutant yeast system.
Growth-rate dependent global effects on gene expression in bacteria

PubMed Central

Klumpp, Stefan; Zhang, Zhongge; Hwa, Terence

2010-01-01

Summary Bacterial gene expression depends not only on specific regulations but also directly on bacterial growth, because important global parameters such as the abundance of RNA polymerases and ribosomes are all growth-rate dependent. Understanding these global effects is necessary for a quantitative understanding of gene regulation and for the robust design of synthetic genetic circuits. The observed growth-rate dependence of constitutive gene expression can be explained by a simple model using the measured growth-rate dependence of the relevant cellular parameters. More complex growth dependences for genetic circuits involving activators, repressors and feedback control were analyzed, and salient features were verified experimentally using synthetic circuits. The results suggest a novel feedback mechanism mediated by general growth-dependent effects and not requiring explicit gene regulation, if the expressed protein affects cell growth. This mechanism can lead to growth bistability and promote the acquisition of important physiological functions such as antibiotic resistance and tolerance (persistence). PMID:20064380
Discretization of Gene Expression Data Unmasks Molecular Subgroups Recurring in Different Human Cancer Types.

PubMed

Beleut, Manfred; Soeldner, Robert; Egorov, Mark; Guenther, Rolf; Dehler, Silvia; Morys-Wortmann, Corinna; Moch, Holger; Henco, Karsten; Schraml, Peter

2016-01-01

Despite the individually different molecular alterations in tumors, the malignancy associated biological traits are strikingly similar. Results of a previous study using renal cell carcinoma (RCC) as a model pointed towards cancer-related features, which could be visualized as three groups by microarray based gene expression analysis. In this study, we used a mathematic model to verify the presence of these groups in RCC as well as in other cancer types. We developed an algorithm for gene-expression deviation profiling for analyzing gene expression data of a total of 8397 patients with 13 different cancer types and normal tissues. We revealed three common Cancer Transcriptomic Profiles (CTPs) which recurred in all investigated tumors. Additionally, CTPs remained robust regardless of the functions or numbers of genes analyzed. CTPs may represent common genetic fingerprints, which potentially reflect the closely related biological traits of human cancers.
GECKO: a complete large-scale gene expression analysis platform.

PubMed

Theilhaber, Joachim; Ulyanov, Anatoly; Malanthara, Anish; Cole, Jack; Xu, Dapeng; Nahf, Robert; Heuer, Michael; Brockel, Christoph; Bushnell, Steven

2004-12-10

Gecko (Gene Expression: Computation and Knowledge Organization) is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a data base, a computational engine implementing approximately 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph), in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (approximately 100 users) and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. The Gecko system is being made publicly available as free software http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.
Pediatric acute myeloid leukemia with NPM1 mutations is characterized by a gene expression profile with dysregulated HOX gene expression distinct from MLL-rearranged leukemias.

PubMed

Mullighan, C G; Kennedy, A; Zhou, X; Radtke, I; Phillips, L A; Shurtleff, S A; Downing, J R

2007-09-01

Somatic mutations in nucleophosmin (NPM1) occur in approximately 35% of adult acute myeloid leukemia (AML). To assess the frequency of NPM1 mutations in pediatric AML, we sequenced NPM1 in the diagnostic blasts from 93 pediatric AML patients. Six cases harbored NPM1 mutations, with each case lacking common cytogenetic abnormalities. To explore the phenotype of the AMLs with NPM1 mutations, gene expression profiles were obtained using Affymetrix U133A microarrays. NPM1 mutations were associated with increased expression of multiple homeobox genes including HOXA9, A10, B2, B6 and MEIS1. As dysregulated homeobox gene expression is also a feature of MLL-rearranged leukemia, the gene expression signatures of NPM1-mutated and MLL-rearranged leukemias were compared. Significant differences were identified between these leukemia subtypes including the expression of different HOX genes, with NPM1-mutated AML showing higher levels of expression of HOXB2, B3, B6 and D4. These results confirm recent reports of perturbed HOX expression in NPM1-mutated adult AML, and provide the first evidence that the NPM1-mutated signature is distinct from MLL-rearranged AML. These findings suggest that mutated NPM1 leads to dysregulated HOX expression via a different mechanism than MLL rearrangement.
CoNekT: an open-source framework for comparative genomic and transcriptomic network analyses.

PubMed

Proost, Sebastian; Mutwil, Marek

2018-05-01

The recent accumulation of gene expression data in the form of RNA sequencing creates unprecedented opportunities to study gene regulation and function. Furthermore, comparative analysis of the expression data from multiple species can elucidate which functional gene modules are conserved across species, allowing the study of the evolution of these modules. However, performing such comparative analyses on raw data is not feasible for many biologists. Here, we present CoNekT (Co-expression Network Toolkit), an open source web server, that contains user-friendly tools and interactive visualizations for comparative analyses of gene expression data and co-expression networks. These tools allow analysis and cross-species comparison of (i) gene expression profiles; (ii) co-expression networks; (iii) co-expressed clusters involved in specific biological processes; (iv) tissue-specific gene expression; and (v) expression profiles of gene families. To demonstrate these features, we constructed CoNekT-Plants for green alga, seed plants and flowering plants (Picea abies, Chlamydomonas reinhardtii, Vitis vinifera, Arabidopsis thaliana, Oryza sativa, Zea mays and Solanum lycopersicum) and thus provide a web-tool with the broadest available collection of plant phyla. CoNekT-Plants is freely available from http://conekt.plant.tools, while the CoNekT source code and documentation can be found at https://github.molgen.mpg.de/proost/CoNekT/.
Up-regulation of mismatch repair genes MSH6, PMS2 and MLH1 parallels development of genetic instability and is linked to tumor aggressiveness and early PSA recurrence in prostate cancer.

PubMed

Wilczak, Waldemar; Rashed, Semin; Hube-Magg, Claudia; Kluth, Martina; Simon, Ronald; Büscheck, Franziska; Clauditz, Till Sebastian; Grupp, Katharina; Minner, Sarah; Tsourlakis, Maria Christina; Möller-Koop, Christina; Graefen, Markus; Adam, Meike; Haese, Alexander; Wittmer, Corinna; Sauter, Guido; Izbicki, Jakob Robert; Huland, Hartwig; Schlomm, Thorsten; Steurer, Stefan; Krech, Till; Lebok, Patrick

2017-01-01

DNA mismatch repair (MMR) is integral to the maintenance of genetic stability. We aimed to evaluate the clinical impact of MMR gene expression in prostate cancer. The MMR genes MSH6, MLH1 and PMS2 were analyzed by immunohistochemistry on a tissue microarray containing 11152 prostate cancer specimens. Results were compared with ETS-related gene status and deletions of PTEN, 3p13, 5q21 and 6q15. MSH6, MLH1 and PMS2 expression was detectable in 89.5%, 85.4% and 85.0% of cancers and was particularly strong in cancers with advanced pathological tumor stage (P < 0.0001 each), high Gleason grade (P < 0.0001 each), nodal metastasis (P ≤ 0.0083) and early biochemical recurrence (P < 0.0001). High levels of MMR gene expression paralleled features of genetic instability, such as the number of genomic deletions per cancer; strong expression of all three MMR genes was found in 24%, 29%, 30%, 33% and 42% of cancers with no, one, two, three or four to five deletions (P < 0.0001). The prognostic value of the analyzed MMR genes was largely driven by the subset of cancers lacking ERG fusion (P < 0.0001), while the prognostic impact of MMR gene overexpression was only marginal in ERG-positive cancers. Multivariate analyses suggested an independent prognostic relevance of MMR genes in ERG-negative prostate cancers when compared with prognostic parameters available at the time of initial biopsy. In conclusion, MMR overexpression is common in prostate cancer and is linked to poor outcome as well as features indicating genetic instability. ERG fusion should be analyzed along with MMR gene expression in potential clinical tests. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data.

PubMed

Awazu, Akinori; Tanabe, Takahiro; Kamitani, Mari; Tezuka, Ayumi; Nagano, Atsushi J

2018-05-29

Gene expression levels exhibit stochastic variations among genetically identical organisms under the same environmental conditions. In many recent transcriptome analyses based on RNA sequencing (RNA-seq), variations in gene expression levels among replicates were assumed to follow a negative binomial distribution, although the physiological basis of this assumption remains unclear. In this study, RNA-seq data were obtained from Arabidopsis thaliana under eight conditions (21-27 replicates), and the characteristics of gene-dependent empirical probability density function (ePDF) profiles of gene expression levels were analyzed. For A. thaliana and Saccharomyces cerevisiae, various types of ePDF of gene expression levels were obtained that were classified as Gaussian, power law-like containing a long tail, or intermediate. These ePDF profiles were well fitted with a Gauss-power mixing distribution function derived from a simple model of a stochastic transcriptional network containing a feedback loop. The fitting function suggested that gene expression levels with long-tailed ePDFs would be strongly influenced by feedback regulation. Furthermore, the features of gene expression levels are correlated with their functions, with the levels of essential genes tending to follow a Gaussian-like ePDF while those of genes encoding nucleic acid-binding proteins and transcription factors exhibit long-tailed ePDF.
Characterization of TALE genes expression during the first lineage segregation in mammalian embryos.

PubMed

Sonnet, Wendy; Rezsöhazy, Rene; Donnay, Isabelle

2012-11-01

Three amino acid loop extension (TALE) homeodomain-containing transcription factors are generally recognized for their role in organogenesis and differentiation during embryogenesis. However, very little is known about the expression and function of Meis, Pbx, and Prep genes during early development. In order to determine whether TALE proteins could contribute to the early cell fate decisions in mammalian development, this study aimed to characterize in a systematic manner the pattern of expression of all Meis, Pbx, and Prep genes from the precompaction to blastocyst stage corresponding to the first step of cell differentiation in mammals. To reveal to what extent TALE genes expression at these early stages is a conserved feature among mammals, this study was performed in parallel in the bovine and mouse models. We demonstrated the transcription and translation of TALE genes, before gastrulation in the two species. At least one member of Meis, Pbx, and Prep subfamilies was found expressed at the RNA and protein levels but different patterns of expression were observed between genes and between species, suggesting specific gene regulations. Taken together, these results suggest a previously unexpected involvement of these factors during the early development in mammals. Copyright © 2012 Wiley Periodicals, Inc.
Polycistronic gene expression in Aspergillus niger.

PubMed

Schuetze, Tabea; Meyer, Vera

2017-09-25

Genome mining approaches predict dozens of biosynthetic gene clusters in each of the filamentous fungal genomes sequenced so far. However, the majority of these gene clusters still remain cryptic because they are not expressed in their natural host. Simultaneous expression of all genes belonging to a biosynthetic pathway in a heterologous host is one approach to activate biosynthetic gene clusters and to screen the metabolites produced for bioactivities. Polycistronic expression of all pathway genes under control of a single and tunable promoter would be the method of choice, as this does not only simplify cloning procedures, but also offers control on timing and strength of expression. However, polycistronic gene expression is a feature not commonly found in eukaryotic host systems, such as Aspergillus niger. In this study, we tested the suitability of the viral P2A peptide for co-expression of three genes in A. niger. Two genes descend from Fusarium oxysporum and are essential to produce the secondary metabolite enniatin (esyn1, ekivR). The third gene (luc) encodes the reporter luciferase which was included to study position effects. Expression of the polycistronic gene cassette was put under control of the Tet-On system to ensure tunable gene expression in A. niger. In total, three polycistronic expression cassettes which differed in the position of luc were constructed and targeted to the pyrG locus in A. niger. This allowed direct comparison of the luciferase activity based on the position of the luciferase gene. Doxycycline-mediated induction of the Tet-On expression cassettes resulted in the production of one long polycistronic mRNA as proven by Northern analyses, and ensured comparable production of enniatin in all three strains. Notably, gene position within the polycistronic expression cassette matters, as, luciferase activity was lowest at position one and had a comparable activity at positions two and three. The P2A peptide can be used to express at least three genes polycistronically in A. niger. This approach can now be applied to heterologously express entire secondary metabolite gene clusters polycistronically or to co-express any genes of interest in equimolar amounts.
Developmental Transcriptomic Features of the Carcinogenic Liver Fluke, Clonorchis sinensis

PubMed Central

Cho, Pyo Yun; Kim, Tae Im; Cho, Shin-Hyeong; Choi, Sang-Haeng; Park, Hong-Seog; Kim, Tong-Soo; Hong, Sung-Jong

2011-01-01

Clonorchis sinensis is the causative agent of the life-threatening disease endemic to China, Korea, and Vietnam. It is estimated that about 15 million people are infected with this fluke. C. sinensis provokes inflammation, epithelial hyperplasia, and periductal fibrosis in bile ducts, and may cause cholangiocarcinoma in chronically infected individuals. Accumulation of a large amount of biological information about the adult stage of this liver fluke in recent years has advanced our understanding of the pathological interplay between this parasite and its hosts. However, no developmental gene expression profiles of C. sinensis have been published. In this study, we generated gene expression profiles of three developmental stages of C. sinensis by analyzing expressed sequence tags (ESTs). Complementary DNA libraries were constructed from the adult, metacercaria, and egg developmental stages of C. sinensis. A total of 52,745 ESTs were generated and assembled into 12,830 C. sinensis assembled EST sequences, and then these assemblies were further categorized into groups according to biological functions and developmental stages. Most of the genes that were differentially expressed in the different stages were consistent with the biological and physical features of the particular developmental stage; high energy metabolism, motility and reproduction genes were differentially expressed in adults, minimal metabolism and final host adaptation genes were differentially expressed in metacercariae, and embryonic genes were differentially expressed in eggs. The higher expression of glucose transporters, proteases, and antioxidant enzymes in the adults accounts for active uptake of nutrients and defense against host immune attacks. The types of ion channels present in C. sinensis are consistent with its parasitic nature and phylogenetic placement in the tree of life. We anticipate that the transcriptomic information on essential regulators of development, bile chemotaxis, and physico-metabolic pathways in C. sinensis that presented in this study will guide further studies to identify novel drug targets and diagnostic antigens. PMID:21738807

Characterization and expression of amphioxus ApoD gene encoding an archetype of vertebrate ApoD proteins.

PubMed

Wang, Lei; Zhang, Shicui; Liu, Zhenhui; Li, Hongyan; Wang, Yongjun; Jiang, Shengjuan

2007-01-01

Here we report a homologue of the apolipoprotein D gene (AmphiApoD) in amphioxus, Branchiostoma belcheri tsingtauense, the first such finding in a basal chordate cephalochordate. The main features of the protein predicted from AmphiApoD are characteristic of the apolipoprotein D. Phylogenetic analysis places AmphiApoD at the base of the phylogenetic tree, suggesting that AmphiApoD is the archetype of the vertebrate ApoD genes. Both whole mount in situ hybridization and Northern blotting and RT-PCR as well as in situ hybridization histochemistry reveal that AmphiApoD is expressed in tissues derived from mesoderm and endoderm including notochord and hind-gut, which contrasts with the strong expression patterns of ApoD genes in the ectodermal derivatives in mammals and birds. The expression profiles of the ApoD gene may have been changed to be expressed in the endo-mesodermal derivatives in amphioxus after the vertebrate and cephalochordate lineages diverged; alternatively, the ApoD gene may first have been expressed in the endo-mesoderm during embryogenesis in the last common ancestor of all chordates, and subsequently came to be expressed in the ectodermal derivatives of vertebrates including mammals and birds.
Genome-scale gene expression characteristics define the follicular initiation and developmental rules during folliculogenesis.

PubMed

Shi, Kerong; He, Feng; Yuan, Xuefeng; Zhao, Yaofeng; Deng, Xuemei; Hu, Xiaoxiang; Li, Ning

2013-08-01

The ovarian follicle supplies a unique dynamic system for gametes that ensures the propagation of the species. During folliculogenesis, the vast majority of the germ cells are lost or inactivated because of ovarian follicle atresia, resulting in diminished reproductive potency and potential infertility. Understanding the underlying molecular mechanism of folliculogenesis rules is essential. Primordial (P), preantral (M), and large antral (L) porcine follicles were used to reveal their genome-wide gene expression profiles. Results indicate that primordial follicles (P) process a diverse gene expression pattern compared to growing follicles (M and L). The 5,548 differentially expressed genes display a similar expression mode in M and L, with a correlation coefficient of 0.892. The number of regulated (both up and down) genes in M is more than that in L. Also, their regulation folds in M (2-364-fold) are much more acute than in L (2-75-fold). Differentially expressed gene groups with different regulation patterns in certain follicular stages are identified and presumed to be closely related following follicular developmental rules. Interestingly, functional annotation analysis revealed that these gene groups feature distinct biological processes or molecular functions. Moreover, representative candidate genes from these gene groups have had their RNA or protein expressions within follicles confirmed. Our study emphasized genome-scale gene expression characteristics, which provide novel entry points for understanding the folliculogenesis rules on the molecular level, such as follicular initiation, atresia, and dominance. Transcriptional regulatory circuitries in certain follicular stages are expected to be found among the identified differentially expressed gene groups.
Radiogenomics of hepatocellular carcinoma: multiregion analysis-based identification of prognostic imaging biomarkers by integrating gene data—a preliminary study

NASA Astrophysics Data System (ADS)

Xia, Wei; Chen, Ying; Zhang, Rui; Yan, Zhuangzhi; Zhou, Xiaobo; Zhang, Bo; Gao, Xin

2018-02-01

Our objective was to identify prognostic imaging biomarkers for hepatocellular carcinoma in contrast-enhanced computed tomography (CECT) with biological interpretations by associating imaging features and gene modules. We retrospectively analyzed 371 patients who had gene expression profiles. For the 38 patients with CECT imaging data, automatic intra-tumor partitioning was performed, resulting in three spatially distinct subregions. We extracted a total of 37 quantitative imaging features describing intensity, geometry, and texture from each subregion. Imaging features were selected after robustness and redundancy analysis. Gene modules acquired from clustering were chosen for their prognostic significance. By constructing an association map between imaging features and gene modules with Spearman rank correlations, the imaging features that significantly correlated with gene modules were obtained. These features were evaluated with Cox’s proportional hazard models and Kaplan-Meier estimates to determine their prognostic capabilities for overall survival (OS). Eight imaging features were significantly correlated with prognostic gene modules, and two of them were associated with OS. Among these, the geometry feature volume fraction of the subregion, which was significantly correlated with all prognostic gene modules representing cancer-related interpretation, was predictive of OS (Cox p = 0.022, hazard ratio = 0.24). The texture feature cluster prominence in the subregion, which was correlated with the prognostic gene module representing lipid metabolism and complement activation, also had the ability to predict OS (Cox p = 0.021, hazard ratio = 0.17). Imaging features depicting the volume fraction and textural heterogeneity in subregions have the potential to be predictors of OS with interpretable biological meaning.
Multi-level gene/MiRNA feature selection using deep belief nets and active learning.

PubMed

Ibrahim, Rania; Yousri, Noha A; Ismail, Mohamed A; El-Makky, Nagwa M

2014-01-01

Selecting the most discriminative genes/miRNAs has been raised as an important task in bioinformatics to enhance disease classifiers and to mitigate the dimensionality curse problem. Original feature selection methods choose genes/miRNAs based on their individual features regardless of how they perform together. Considering group features instead of individual ones provides a better view for selecting the most informative genes/miRNAs. Recently, deep learning has proven its ability in representing the data in multiple levels of abstraction, allowing for better discrimination between different classes. However, the idea of using deep learning for feature selection is not widely used in the bioinformatics field yet. In this paper, a novel multi-level feature selection approach named MLFS is proposed for selecting genes/miRNAs based on expression profiles. The approach is based on both deep and active learning. Moreover, an extension to use the technique for miRNAs is presented by considering the biological relation between miRNAs and genes. Experimental results show that the approach was able to outperform classical feature selection methods in hepatocellular carcinoma (HCC) by 9%, lung cancer by 6% and breast cancer by around 10% in F1-measure. Results also show the enhancement in F1-measure of our approach over recently related work in [1] and [2].
Detection of eQTL modules mediated by activity levels of transcription factors.

PubMed

Sun, Wei; Yu, Tianwei; Li, Ker-Chau

2007-09-01

Studies of gene expression quantitative trait loci (eQTL) in different organisms have shown the existence of eQTL hot spots: each being a small segment of DNA sequence that harbors the eQTL of a large number of genes. Two questions of great interest about eQTL hot spots arise: (1) which gene within the hot spot is responsible for the linkages, i.e. which gene is the quantitative trait gene (QTG)? (2) How does a QTG affect the expression levels of many genes linked to it? Answers to the first question can be offered by available biological evidence or by statistical methods. The second question is harder to address. One simple situation is that the QTG encodes a transcription factor (TF), which regulates the expression of genes linked to it. However, previous results have shown that TFs are not overrepresented in the eQTL hot spots. In this article, we consider the scenario that the propagation of genetic perturbation from a QTG to other linked genes is mediated by the TF activity. We develop a procedure to detect the eQTL modules (eQTL hot spots together with linked genes) that are compatible with this scenario. We first detect 27 eQTL modules from a yeast eQTL data, and estimate TF activity profiles using the method of Yu and Li (2005). Then likelihood ratio tests (LRTs) are conducted to find 760 relationships supporting the scenario of TF activity mediation: (DNA polymorphism --> cis-linked gene --> TF activity --> downstream linked gene). They are organized into 4 eQTL modules: an amino acid synthesis module featuring a cis-linked gene LEU2 and the mediating TF Leu3; a pheromone response module featuring a cis-linked gene GPA1 and the mediating TF Ste12; an energy-source control module featuring two cis-linked genes, GSY2 and HAP1, and the mediating TF Hap1; a mitotic exit module featuring four cis-linked genes, AMN1, CSH1, DEM1 and TOS1, and the mediating TF complex Ace2/Swi5. Gene Ontology is utilized to reveal interesting functional groups of the downstream genes in each module. Our methods are implemented in an R package: eqtl.TF, which includes source codes and relevant data. It can be freely downloaded at http://www.stat.ucla.edu/~sunwei/software.htm. http://www.stat.ucla.edu/~sunwei/yeast_eQTL_TF/supplementary.pdf.
Tombusvirus-based vector systems to permit over-expression of genes or that serve as sensors of antiviral RNA silencing in plants.

PubMed

Shamekova, Malika; Mendoza, Maria R; Hsieh, Yi-Cheng; Lindbo, John; Omarov, Rustem T; Scholthof, Herman B

2014-03-01

A next generation Tomato bushy stunt virus (TBSV) coat protein gene replacement vector system is described that can be applied by either RNA inoculation or through agroinfiltration. A vector expressing GFP rapidly yields high levels of transient gene expression in inoculated leaves of various plant species, as illustrated for Nicotiana benthamiana, cowpea, tomato, pepper, and lettuce. A start-codon mutation to down-regulate the dose of the P19 silencing suppressor reduces GFP accumulation, whereas mutations that result in undetectable levels of P19 trigger rapid silencing of GFP. Compared to existing virus vectors the TBSV system has a unique combination of a very broad host range, rapid and high levels of replication and gene expression, and the ability to regulate its suppressor. These features are attractive for quick transient assays in numerous plant species for over-expression of genes of interest, or as a sensor to monitor the efficacy of antiviral RNA silencing. Copyright © 2014. Published by Elsevier Inc.
Hox cluster disintegration with persistent anteroposterior order of expression in Oikopleura dioica.

PubMed

Seo, Hee-Chan; Edvardsen, Rolf Brudvik; Maeland, Anne Dorthea; Bjordal, Marianne; Jensen, Marit Flo; Hansen, Anette; Flaat, Mette; Weissenbach, Jean; Lehrach, Hans; Wincker, Patrick; Reinhardt, Richard; Chourrout, Daniel

2004-09-02

Tunicate embryos and larvae have small cell numbers and simple anatomical features in comparison with other chordates, including vertebrates. Although they branch near the base of chordate phylogenetic trees, their degree of divergence from the common chordate ancestor remains difficult to evaluate. Here we show that the tunicate Oikopleura dioica has a complement of nine Hox genes in which all central genes are lacking but a full vertebrate-like set of posterior genes is present. In contrast to all bilaterians studied so far, Hox genes are not clustered in the Oikopleura genome. Their expression occurs mostly in the tail, with some tissue preference, and a strong partition of expression domains in the nerve cord, in the notochord and in the muscle. In each tissue of the tail, the anteroposterior order of Hox gene expression evokes spatial collinearity, with several alterations. We propose a relationship between the Hox cluster breakdown, the separation of Hox expression domains, and a transition to a determinative mode of development.
Hypoxia-inducible tumour-specific promoters as a dual-targeting transcriptional regulation system for cancer gene therapy

PubMed Central

Javan, Bita; Shahbazi, Majid

2017-01-01

Transcriptional targeting is the best approach for specific gene therapy. Hypoxia is a common feature of the tumour microenvironment. Therefore, targeting gene expression in hypoxic cells by placing transgene under the control of a hypoxia-responsive promoter can be a good strategy for cancer-specific gene therapy. The hypoxia-inducible gene expression system has been investigated more in suicide gene therapy and it can also be of great help in knocking down cancer gene therapy with siRNAs. However, this system needs to be optimised to have maximum efficacy with minimum side effects in normal tissues. The combination of tissue-/tumour-specific promoters with HRE core sequences has been found to enhance the specificity and efficacy of this system. In this review, hypoxia-inducible gene expression system as well as gene therapy strategies targeting tumour hypoxia will be discussed. This review will also focus on hypoxia-inducible tumour-specific promoters as a dual-targeting transcriptional regulation systems developed for cancer-specific gene therapy. PMID:28798809
NCBI GEO: archive for high-throughput functional genomic data.

PubMed

Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Rudnev, Dmitry; Evangelista, Carlos; Kim, Irene F; Soboleva, Alexandra; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Muertter, Rolf N; Edgar, Ron

2009-01-01

The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing. The database has a flexible infrastructure that can capture fully annotated raw and processed data, enabling compliance with major community-derived scientific reporting standards such as 'Minimum Information About a Microarray Experiment' (MIAME). In addition to serving as a centralized data storage hub, GEO offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives. This article summarizes the GEO repository structure, content and operating procedures, as well as recently introduced data mining features. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.
Overlapping gene expression profiles indicative of antigen processing and the interferon pathway characterize inflammatory fibrotic skin diseases.

PubMed

Limpers, Annelies; van Royen-Kerkhof, Annet; van Roon, Joel A G; Radstake, Timothy R D J; Broen, Jasper C A

2014-02-01

Inflammatory fibrotic disorders have been of high interest both for dermatologists and rheumatologists. Although the phenotypic end stage of this group of diseases is ultimately the same, namely fibrosis, patients present with different clinical features and are often treated with distinct therapeutic modalities. This review addresses whether there is evidence for different underlying molecular pathways in the various inflammatory fibrotic diseases such as localized scleroderma, pediatric lichen sclerosus, adult lichen sclerosus, eosinophilic fasciitis and systemic sclerosis. To investigate this, a large number of gene expression microarray studies performed on skin or fibroblasts from patients with these aforementioned diseases were described, (re-)analysed, and compared. As suspected by the heterogeneous phenotype, most diseases showed unique gene expression features. Intriguingly, a clear overlap was observed between adult and pediatric lichen sclerosus and localized scleroderma, in antigen processing and the interferon pathway. Delineating the cause and consequence of these pathways may generate novel tools to better characterize and more effectively treat these patients.
Competing endogenous RNA regulatory network in papillary thyroid carcinoma.

PubMed

Chen, Shouhua; Fan, Xiaobin; Gu, He; Zhang, Lili; Zhao, Wenhua

2018-05-11

The present study aimed to screen all types of RNAs involved in the development of papillary thyroid carcinoma (PTC). RNA‑sequencing data of PTC and normal samples were used for screening differentially expressed (DE) microRNAs (DE‑miRNAs), long non‑coding RNAs (DE‑lncRNAs) and genes (DEGs). Subsequently, lncRNA‑miRNA, miRNA‑gene (that is, miRNA‑mRNA) and gene‑gene interaction pairs were extracted and used to construct regulatory networks. Feature genes in the miRNA‑mRNA network were identified by topological analysis and recursive feature elimination analysis. A support vector machine (SVM) classifier was built using 15 feature genes, and its classification effect was validated using two microarray data sets that were downloaded from the Gene Expression Omnibus (GEO) database. In addition, Gene Ontology function and Kyoto Encyclopedia Genes and Genomes pathway enrichment analyses were conducted for genes identified in the ceRNA network. A total of 506 samples, including 447 tumor samples and 59 normal samples, were obtained from The Cancer Genome Atlas (TCGA); 16 DE‑lncRNAs, 917 DEGs and 30 DE‑miRNAs were screened. The miRNA‑mRNA regulatory network comprised 353 nodes and 577 interactions. From these data, 15 feature genes with high predictive precision (>95%) were extracted from the network and were used to form an SVM classifier with an accuracy of 96.05% (486/506) for PTC samples downloaded from TCGA, and accuracies of 96.81 and 98.46% for GEO downloaded data sets. The ceRNA regulatory network comprised 596 lines (or interactions) and 365 nodes. Genes in the ceRNA network were significantly enriched in 'neuron development', 'differentiation', 'neuroactive ligand‑receptor interaction', 'metabolism of xenobiotics by cytochrome P450', 'drug metabolism' and 'cytokine‑cytokine receptor interaction' pathways. Hox transcript antisense RNA, miRNA‑206 and kallikrein‑related peptidase 10 were nodes in the ceRNA regulatory network of the selected feature gene, and they may serve import roles in the development of PTC.
Hidden among the crowd: differential DNA methylation-expression correlations in cancer occur at important oncogenic pathways

PubMed Central

Mosquera Orgueira, Adrián

2015-01-01

DNA methylation is a frequent epigenetic mechanism that participates in transcriptional repression. Variations in DNA methylation with respect to gene expression are constant, and, for unknown reasons, some genes with highly methylated promoters are sometimes overexpressed. In this study we have analyzed the expression and methylation patterns of thousands of genes in five groups of cancer and normal tissue samples in order to determine local and genome-wide differences. We observed significant changes in global methylation-expression correlation in all the neoplasms, which suggests that differential correlation events are frequent in cancer. A focused analysis in the breast cancer cohort identified 1662 genes whose correlation varies significantly between normal and cancerous breast, but whose DNA methylation and gene expression patterns do not change substantially. These genes were enriched in cancer-related pathways and repressive chromatin features across various model cell lines, such as PRC2 binding and H3K27me3 marks. Substantial changes in methylation-expression correlation indicate that these genes are subject to epigenetic remodeling, where the differential activity of other factors break the expected relationship between both variables. Our findings suggest a complex regulatory landscape where a redistribution of local and large-scale chromatin repressive domains at differentially correlated genes (DCGs) creates epigenetic hotspots that modulate cancer-specific gene expression. PMID:26029238
Continuum theory of gene expression waves during vertebrate segmentation.

PubMed

Jörg, David J; Morelli, Luis G; Soroldoni, Daniele; Oates, Andrew C; Jülicher, Frank

2015-09-01

The segmentation of the vertebrate body plan during embryonic development is a rhythmic and sequential process governed by genetic oscillations. These genetic oscillations give rise to traveling waves of gene expression in the segmenting tissue. Here we present a minimal continuum theory of vertebrate segmentation that captures the key principles governing the dynamic patterns of gene expression including the effects of shortening of the oscillating tissue. We show that our theory can quantitatively account for the key features of segmentation observed in zebrafish, in particular the shape of the wave patterns, the period of segmentation and the segment length as a function of time.
Continuum theory of gene expression waves during vertebrate segmentation

PubMed Central

Jörg, David J; Morelli, Luis G; Soroldoni, Daniele; Oates, Andrew C; Jülicher, Frank

2015-01-01

Abstract The segmentation of the vertebrate body plan during embryonic development is a rhythmic and sequential process governed by genetic oscillations. These genetic oscillations give rise to traveling waves of gene expression in the segmenting tissue. Here we present a minimal continuum theory of vertebrate segmentation that captures the key principles governing the dynamic patterns of gene expression including the effects of shortening of the oscillating tissue. We show that our theory can quantitatively account for the key features of segmentation observed in zebrafish, in particular the shape of the wave patterns, the period of segmentation and the segment length as a function of time. PMID:28725158
Global Expression Profiling in Atopic Eczema Reveals Reciprocal Expression of Inflammatory and Lipid Genes

PubMed Central

Sääf, Annika M.; Tengvall-Linder, Maria; Chang, Howard Y.; Adler, Adam S.; Wahlgren, Carl-Fredrik; Scheynius, Annika; Nordenskjöld, Magnus; Bradley, Maria

2008-01-01

Background Atopic eczema (AE) is a common chronic inflammatory skin disorder. In order to dissect the genetic background several linkage and genetic association studies have been performed. Yet very little is known about specific genes involved in this complex skin disease, and the underlying molecular mechanisms are not fully understood. Methodology/Findings We used human DNA microarrays to identify a molecular picture of the programmed responses of the human genome to AE. The transcriptional program was analyzed in skin biopsy samples from lesional and patch-tested skin from AE patients sensitized to Malassezia sympodialis (M. sympodialis), and corresponding biopsies from healthy individuals. The most notable feature of the global gene-expression pattern observed in AE skin was a reciprocal expression of induced inflammatory genes and repressed lipid metabolism genes. The overall transcriptional response in M. sympodialis patch-tested AE skin was similar to the gene-expression signature identified in lesional AE skin. In the constellation of genes differentially expressed in AE skin compared to healthy control skin, we have identified several potential susceptibility genes that may play a critical role in the pathological condition of AE. Many of these genes, including genes with a role in immune responses, lipid homeostasis, and epidermal differentiation, are localized on chromosomal regions previously linked to AE. Conclusions/Significance Through genome-wide expression profiling, we were able to discover a distinct reciprocal expression pattern of induced inflammatory genes and repressed lipid metabolism genes in skin from AE patients. We found a significant enrichment of differentially expressed genes in AE with cytobands associated to the disease, and furthermore new chromosomal regions were found that could potentially guide future region-specific linkage mapping in AE. The full data set is available at http://microarray-pubs.stanford.edu/eczema. PMID:19107207
Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

PubMed

Caracausi, Maria; Piovesan, Allison; Antonaros, Francesca; Strippoli, Pierluigi; Vitale, Lorenza; Pelleri, Maria Chiara

2017-09-01

The ideal reference, or control, gene for the study of gene expression in a given organism should be expressed at a medium‑high level for easy detection, should be expressed at a constant/stable level throughout different cell types and within the same cell type undergoing different treatments, and should maintain these features through as many different tissues of the organism. From a biological point of view, these theoretical requirements of an ideal reference gene appear to be best suited to housekeeping (HK) genes. Recent advancements in the quality and completeness of human expression microarray data and in their statistical analysis may provide new clues toward the quantitative standardization of human gene expression studies in biology and medicine, both cross‑ and within‑tissue. The systematic approach used by the present study is based on the Transcriptome Mapper tool and exploits the automated reassignment of probes to corresponding genes, intra‑ and inter‑sample normalization, elaboration and representation of gene expression values in linear form within an indexed and searchable database with a graphical interface recording quantitative levels of expression, expression variability and cross‑tissue width of expression for more than 31,000 transcripts. The present study conducted a meta‑analysis of a pool of 646 expression profile data sets from 54 different human tissues and identified actin γ 1 as the HK gene that best fits the combination of all the traditional criteria to be used as a reference gene for general use; two ribosomal protein genes, RPS18 and RPS27, and one aquaporin gene, POM121 transmembrane nucleporin C, were also identified. The present study provided a list of tissue‑ and organ‑specific genes that may be most suited for the following individual tissues/organs: Adipose tissue, bone marrow, brain, heart, kidney, liver, lung, ovary, skeletal muscle and testis; and also provides in these cases a representative, quantitative portrait of the relative, typical gene‑expression profile in the form of searchable database tables.
Differential Gene Expression (DEX) and Alternative Splicing Events (ASE) for Temporal Dynamic Processes Using HMMs and Hierarchical Bayesian Modeling Approaches.

PubMed

Oh, Sunghee; Song, Seongho

2017-01-01

In gene expression profile, data analysis pipeline is categorized into four levels, major downstream tasks, i.e., (1) identification of differential expression; (2) clustering co-expression patterns; (3) classification of subtypes of samples; and (4) detection of genetic regulatory networks, are performed posterior to preprocessing procedure such as normalization techniques. To be more specific, temporal dynamic gene expression data has its inherent feature, namely, two neighboring time points (previous and current state) are highly correlated with each other, compared to static expression data which samples are assumed as independent individuals. In this chapter, we demonstrate how HMMs and hierarchical Bayesian modeling methods capture the horizontal time dependency structures in time series expression profiles by focusing on the identification of differential expression. In addition, those differential expression genes and transcript variant isoforms over time detected in core prerequisite steps can be generally further applied in detection of genetic regulatory networks to comprehensively uncover dynamic repertoires in the aspects of system biology as the coupled framework.
Identification of General Patterns of Sex-Biased Expression in Daphnia, a Genus with Environmental Sex Determination

PubMed Central

Molinier, Cécile; Reisser, Céline M.O.; Fields, Peter; Ségard, Adeline; Galimov, Yan; Haag, Christoph R.

2018-01-01

Daphnia reproduce by cyclic-parthenogenesis, where phases of asexual reproduction are intermitted by sexual production of diapause stages. This life cycle, together with environmental sex determination, allow the comparison of gene expression between genetically identical males and females. We investigated gene expression differences between males and females in four genotypes of Daphnia magna and compared the results with published data on sex-biased gene expression in two other Daphnia species, each representing one of the major phylogenetic clades within the genus. We found that 42% of all annotated genes showed sex-biased expression in D. magna. This proportion is similar both to estimates from other Daphnia species as well as from species with genetic sex determination, suggesting that sex-biased expression is not reduced under environmental sex determination. Among 7453 single copy, one-to-one orthologs in the three Daphnia species, 707 consistently showed sex-biased expression and 675 were biased in the same direction in all three species. Hence these genes represent a core-set of genes with consistent sex-differential expression in the genus. A functional analysis identified that several of them are involved in known sex determination pathways. Moreover, 75% were overexpressed in females rather than males, a pattern that appears to be a general feature of sex-biased gene expression in Daphnia. PMID:29535148
Identification of General Patterns of Sex-Biased Expression in Daphnia, a Genus with Environmental Sex Determination.

PubMed

Molinier, Cécile; Reisser, Céline M O; Fields, Peter; Ségard, Adeline; Galimov, Yan; Haag, Christoph R

2018-05-04

Daphnia reproduce by cyclic-parthenogenesis, where phases of asexual reproduction are intermitted by sexual production of diapause stages. This life cycle, together with environmental sex determination, allow the comparison of gene expression between genetically identical males and females. We investigated gene expression differences between males and females in four genotypes of Daphnia magna and compared the results with published data on sex-biased gene expression in two other Daphnia species, each representing one of the major phylogenetic clades within the genus. We found that 42% of all annotated genes showed sex-biased expression in D. magna This proportion is similar both to estimates from other Daphnia species as well as from species with genetic sex determination, suggesting that sex-biased expression is not reduced under environmental sex determination. Among 7453 single copy, one-to-one orthologs in the three Daphnia species, 707 consistently showed sex-biased expression and 675 were biased in the same direction in all three species. Hence these genes represent a core-set of genes with consistent sex-differential expression in the genus. A functional analysis identified that several of them are involved in known sex determination pathways. Moreover, 75% were overexpressed in females rather than males, a pattern that appears to be a general feature of sex-biased gene expression in Daphnia . Copyright © 2018 Molinier et al.
William's syndrome: gene expression is related to parental origin and regional coordinate control

PubMed Central

Collette, Jeremy C; Chen, Xiao-Ning; Mills, Debra L; Galaburda, Albert M; Reiss, Allan L; Bellugi, Ursula; Korenberg, Julie R

2013-01-01

William's syndrome (WS) features a spectrum of neurocognitive and behavioral abnormalities due to a rare 1.5MB deletion that includes about 24–28 genes on chromosome band 7q11.23. Study of the expression of these genes from the single normal copy provides an opportunity to elucidate the genetic and epigenetic controls on these genes as well as their roles in both WS and normal brain development and function. We used quantitative RT-PCR to determine the transcriptional level of 14 WS gene markers in a cohort of 77 persons with WS and 48 normal controls. Results reported here: (1) show that the expression of the genes deleted in WS is decreased in some but not all cases, (2) demonstrate that the parental origin of the deletion contributes to the level of expression of GTF2I independently of age and gender and (3) indicate that the correlation of expression between GTF2I and some other genes in the WS region differs in WS subjects and normal controls, which in turn points toward a regulatory role for this gene. Interspecies comparisons suggest GTF2I may play a key role in normal brain development. PMID:19282872

Prediction of essential proteins based on gene expression programming.

PubMed

Zhong, Jiancheng; Wang, Jianxin; Peng, Wei; Zhang, Zhen; Pan, Yi

2013-01-01

Essential proteins are indispensable for cell survive. Identifying essential proteins is very important for improving our understanding the way of a cell working. There are various types of features related to the essentiality of proteins. Many methods have been proposed to combine some of them to predict essential proteins. However, it is still a big challenge for designing an effective method to predict them by integrating different features, and explaining how these selected features decide the essentiality of protein. Gene expression programming (GEP) is a learning algorithm and what it learns specifically is about relationships between variables in sets of data and then builds models to explain these relationships. In this work, we propose a GEP-based method to predict essential protein by combing some biological features and topological features. We carry out experiments on S. cerevisiae data. The experimental results show that the our method achieves better prediction performance than those methods using individual features. Moreover, our method outperforms some machine learning methods and performs as well as a method which is obtained by combining the outputs of eight machine learning methods. The accuracy of predicting essential proteins can been improved by using GEP method to combine some topological features and biological features.
The transcriptional landscape of age in human peripheral blood

PubMed Central

Peters, Marjolein J.; Joehanes, Roby; Pilling, Luke C.; Schurmann, Claudia; Conneely, Karen N.; Powell, Joseph; Reinmaa, Eva; Sutphin, George L.; Zhernakova, Alexandra; Schramm, Katharina; Wilson, Yana A.; Kobes, Sayuko; Tukiainen, Taru; Nalls, Michael A.; Hernandez, Dena G.; Cookson, Mark R.; Gibbs, Raphael J.; Hardy, John; Ramasamy, Adaikalavan; Zonderman, Alan B.; Dillman, Allissa; Traynor, Bryan; Smith, Colin; Longo, Dan L.; Trabzuni, Daniah; Troncoso, Juan; van der Brug, Marcel; Weale, Michael E.; O'Brien, Richard; Johnson, Robert; Walker, Robert; Zielke, Ronald H.; Arepalli, Sampath; Ryten, Mina; Singleton, Andrew B.; Ramos, Yolande F.; Göring, Harald H. H.; Fornage, Myriam; Liu, Yongmei; Gharib, Sina A.; Stranger, Barbara E.; De Jager, Philip L.; Aviv, Abraham; Levy, Daniel; Murabito, Joanne M.; Munson, Peter J.; Huan, Tianxiao; Hofman, Albert; Uitterlinden, André G.; Rivadeneira, Fernando; van Rooij, Jeroen; Stolk, Lisette; Broer, Linda; Verbiest, Michael M. P. J.; Jhamai, Mila; Arp, Pascal; Metspalu, Andres; Tserel, Liina; Milani, Lili; Samani, Nilesh J.; Peterson, Pärt; Kasela, Silva; Codd, Veryan; Peters, Annette; Ward-Caviness, Cavin K.; Herder, Christian; Waldenberger, Melanie; Roden, Michael; Singmann, Paula; Zeilinger, Sonja; Illig, Thomas; Homuth, Georg; Grabe, Hans-Jörgen; Völzke, Henry; Steil, Leif; Kocher, Thomas; Murray, Anna; Melzer, David; Yaghootkar, Hanieh; Bandinelli, Stefania; Moses, Eric K.; Kent, Jack W.; Curran, Joanne E.; Johnson, Matthew P.; Williams-Blangero, Sarah; Westra, Harm-Jan; McRae, Allan F.; Smith, Jennifer A.; Kardia, Sharon L. R.; Hovatta, Iiris; Perola, Markus; Ripatti, Samuli; Salomaa, Veikko; Henders, Anjali K.; Martin, Nicholas G.; Smith, Alicia K.; Mehta, Divya; Binder, Elisabeth B.; Nylocks, K Maria; Kennedy, Elizabeth M.; Klengel, Torsten; Ding, Jingzhong; Suchy-Dicey, Astrid M.; Enquobahrie, Daniel A.; Brody, Jennifer; Rotter, Jerome I.; Chen, Yii-Der I.; Houwing-Duistermaat, Jeanine; Kloppenburg, Margreet; Slagboom, P. Eline; Helmer, Quinta; den Hollander, Wouter; Bean, Shannon; Raj, Towfique; Bakhshi, Noman; Wang, Qiao Ping; Oyston, Lisa J.; Psaty, Bruce M.; Tracy, Russell P.; Montgomery, Grant W.; Turner, Stephen T.; Blangero, John; Meulenbelt, Ingrid; Ressler, Kerry J.; Yang, Jian; Franke, Lude; Kettunen, Johannes; Visscher, Peter M.; Neely, G. Gregory; Korstanje, Ron; Hanson, Robert L.; Prokisch, Holger; Ferrucci, Luigi; Esko, Tonu; Teumer, Alexander; van Meurs, Joyce B. J.; Johnson, Andrew D.

2015-01-01

Disease incidences increase with age, but the molecular characteristics of ageing that lead to increased disease susceptibility remain inadequately understood. Here we perform a whole-blood gene expression meta-analysis in 14,983 individuals of European ancestry (including replication) and identify 1,497 genes that are differentially expressed with chronological age. The age-associated genes do not harbor more age-associated CpG-methylation sites than other genes, but are instead enriched for the presence of potentially functional CpG-methylation sites in enhancer and insulator regions that associate with both chronological age and gene expression levels. We further used the gene expression profiles to calculate the ‘transcriptomic age' of an individual, and show that differences between transcriptomic age and chronological age are associated with biological features linked to ageing, such as blood pressure, cholesterol levels, fasting glucose, and body mass index. The transcriptomic prediction model adds biological relevance and complements existing epigenetic prediction models, and can be used by others to calculate transcriptomic age in external cohorts. PMID:26490707
Stem cell and neurogenic gene-expression profiles link prostate basal cells to aggressive prostate cancer

PubMed Central

Zhang, Dingxiao; Park, Daechan; Zhong, Yi; Lu, Yue; Rycaj, Kiera; Gong, Shuai; Chen, Xin; Liu, Xin; Chao, Hsueh-Ping; Whitney, Pamela; Calhoun-Davis, Tammy; Takata, Yoko; Shen, Jianjun; Iyer, Vishwanath R.; Tang, Dean G.

2016-01-01

The prostate gland mainly contains basal and luminal cells constructed as a pseudostratified epithelium. Annotation of prostate epithelial transcriptomes provides a foundation for discoveries that can impact disease understanding and treatment. Here we describe a genome-wide transcriptome analysis of human benign prostatic basal and luminal epithelial populations using deep RNA sequencing. Through molecular and biological characterizations, we show that the differential gene-expression profiles account for their distinct functional properties. Strikingly, basal cells preferentially express gene categories associated with stem cells, neurogenesis and ribosomal RNA (rRNA) biogenesis. Consistent with this profile, basal cells functionally exhibit intrinsic stem-like and neurogenic properties with enhanced rRNA transcription activity. Of clinical relevance, the basal cell gene-expression profile is enriched in advanced, anaplastic, castration-resistant and metastatic prostate cancers. Therefore, we link the cell-type-specific gene signatures to aggressive subtypes of prostate cancer and identify gene signatures associated with adverse clinical features. PMID:26924072
Stem cell and neurogenic gene-expression profiles link prostate basal cells to aggressive prostate cancer.

PubMed

Zhang, Dingxiao; Park, Daechan; Zhong, Yi; Lu, Yue; Rycaj, Kiera; Gong, Shuai; Chen, Xin; Liu, Xin; Chao, Hsueh-Ping; Whitney, Pamela; Calhoun-Davis, Tammy; Takata, Yoko; Shen, Jianjun; Iyer, Vishwanath R; Tang, Dean G

2016-02-29

The prostate gland mainly contains basal and luminal cells constructed as a pseudostratified epithelium. Annotation of prostate epithelial transcriptomes provides a foundation for discoveries that can impact disease understanding and treatment. Here we describe a genome-wide transcriptome analysis of human benign prostatic basal and luminal epithelial populations using deep RNA sequencing. Through molecular and biological characterizations, we show that the differential gene-expression profiles account for their distinct functional properties. Strikingly, basal cells preferentially express gene categories associated with stem cells, neurogenesis and ribosomal RNA (rRNA) biogenesis. Consistent with this profile, basal cells functionally exhibit intrinsic stem-like and neurogenic properties with enhanced rRNA transcription activity. Of clinical relevance, the basal cell gene-expression profile is enriched in advanced, anaplastic, castration-resistant and metastatic prostate cancers. Therefore, we link the cell-type-specific gene signatures to aggressive subtypes of prostate cancer and identify gene signatures associated with adverse clinical features.
Reprogramming Methods Do Not Affect Gene Expression Profile of Human Induced Pluripotent Stem Cells.

PubMed

Trevisan, Marta; Desole, Giovanna; Costanzi, Giulia; Lavezzo, Enrico; Palù, Giorgio; Barzon, Luisa

2017-01-20

Induced pluripotent stem cells (iPSCs) are pluripotent cells derived from adult somatic cells. After the pioneering work by Yamanaka, who first generated iPSCs by retroviral transduction of four reprogramming factors, several alternative methods to obtain iPSCs have been developed in order to increase the yield and safety of the process. However, the question remains open on whether the different reprogramming methods can influence the pluripotency features of the derived lines. In this study, three different strategies, based on retroviral vectors, episomal vectors, and Sendai virus vectors, were applied to derive iPSCs from human fibroblasts. The reprogramming efficiency of the methods based on episomal and Sendai virus vectors was higher than that of the retroviral vector-based approach. All human iPSC clones derived with the different methods showed the typical features of pluripotent stem cells, including the expression of alkaline phosphatase and stemness maker genes, and could give rise to the three germ layer derivatives upon embryoid bodies assay. Microarray analysis confirmed the presence of typical stem cell gene expression profiles in all iPSC clones and did not identify any significant difference among reprogramming methods. In conclusion, the use of different reprogramming methods is equivalent and does not affect gene expression profile of the derived human iPSCs.
The rcsA Promoter of Pantoea stewartii subsp. stewartii Features a Low-Level Constitutive Promoter and an EsaR Quorum-Sensing-Regulated Promoter

PubMed Central

Carlier, Aurelien L.; von Bodman, S. B.

2006-01-01

The upstream region of the Pantoea stewartii rcsA gene features two promoters, one for constitutive basal-level expression and a second autoregulated promoter for induced expression. The EsaR quorum-sensing repressor binds to a site centered between the two promoters, blocking transcription elongation from the regulated promoter under noninducing conditions. PMID:16740966
Homoeolog-specific transcriptional bias in allopolyploid wheat

PubMed Central

2010-01-01

Background Interaction between parental genomes is accompanied by global changes in gene expression which, eventually, contributes to growth vigor and the broader phenotypic diversity of allopolyploid species. In order to gain a better understanding of the effects of allopolyploidization on the regulation of diverged gene networks, we performed a genome-wide analysis of homoeolog-specific gene expression in re-synthesized allohexaploid wheat created by the hybridization of a tetraploid derivative of hexaploid wheat with the diploid ancestor of the wheat D genome Ae. tauschii. Results Affymetrix wheat genome arrays were used for both the discovery of divergent homoeolog-specific mutations and analysis of homoeolog-specific gene expression in re-synthesized allohexaploid wheat. More than 34,000 detectable parent-specific features (PSF) distributed across the wheat genome were used to assess AB genome (could not differentiate A and B genome contributions) and D genome parental expression in the allopolyploid transcriptome. In re-synthesized polyploid 81% of PSFs detected mid-parent levels of gene expression, and only 19% of PSFs showed the evidence of non-additive expression. Non-additive expression in both AB and D genomes was strongly biased toward up-regulation of parental type of gene expression with only 6% and 11% of genes, respectively, being down-regulated. Of all the non-additive gene expression, 84% can be explained by differences in the parental genotypes used to make the allopolyploid. Homoeolog-specific co-regulation of several functional gene categories was found, particularly genes involved in photosynthesis and protein biosynthesis in wheat. Conclusions Here, we have demonstrated that the establishment of interactions between the diverged regulatory networks in allopolyploids is accompanied by massive homoeolog-specific up- and down-regulation of gene expression. This study provides insights into interactions between homoeologous genomes and their role in growth vigor, development, and fertility of allopolyploid species. PMID:20849627
CRAWview: for viewing splicing variation, gene families, and polymorphism in clusters of ESTs and full-length sequences.

PubMed

Chou, A; Burke, J

1999-05-01

DNA sequence clustering has become a valuable method in support of gene discovery and gene expression analysis. Our interest lies in leveraging the sequence diversity within clusters of expressed sequence tags (ESTs) to model gene structure for the study of gene variants that arise from, among other things, alternative mRNA splicing, polymorphism, and divergence after gene duplication, fusion, and translocation events. In previous work, CRAW was developed to discover gene variants from assembled clusters of ESTs. Most importantly, novel gene features (the differing units between gene variants, for example alternative exons, polymorphisms, transposable elements, etc.) that are specialized to tissue, disease, population, or developmental states can be identified when these tools collate DNA source information with gene variant discrimination. While the goal is complete automation of novel feature and gene variant detection, current methods are far from perfect and hence the development of effective tools for visualization and exploratory data analysis are of paramount importance in the process of sifting through candidate genes and validating targets. We present CRAWview, a Java based visualization extension to CRAW. Features that vary between gene forms are displayed using an automatically generated color coded index. The reporting format of CRAWview gives a brief, high level summary report to display overlap and divergence within clusters of sequences as well as the ability to 'drill down' and see detailed information concerning regions of interest. Additionally, the alignment viewing and editing capabilities of CRAWview make it possible to interactively correct frame-shifts and otherwise edit cluster assemblies. We have implemented CRAWview as a Java application across windows NT/95 and UNIX platforms. A beta version of CRAWview will be freely available to academic users from Pangea Systems (http://www.pangeasystems.com). Contact :
Radiogenomics analysis identifies correlations of digital mammography with clinical molecular signatures in breast cancer.

PubMed

Tamez-Peña, Jose-Gerardo; Rodriguez-Rojas, Juan-Andrés; Gomez-Rueda, Hugo; Celaya-Padilla, Jose-Maria; Rivera-Prieto, Roxana-Alicia; Palacios-Corona, Rebeca; Garza-Montemayor, Margarita; Cardona-Huerta, Servando; Treviño, Victor

2018-01-01

In breast cancer, well-known gene expression subtypes have been related to a specific clinical outcome. However, their impact on the breast tissue phenotype has been poorly studied. Here, we investigate the association of imaging data of tumors to gene expression signatures from 71 patients with breast cancer that underwent pre-treatment digital mammograms and tumor biopsies. From digital mammograms, a semi-automated radiogenomics analysis generated 1,078 features describing the shape, signal distribution, and texture of tumors along their contralateral image used as control. From tumor biopsy, we estimated the OncotypeDX and PAM50 recurrence scores using gene expression microarrays. Then, we used multivariate analysis under stringent cross-validation to train models predicting recurrence scores. Few univariate features reached Spearman correlation coefficients above 0.4. Nevertheless, multivariate analysis yielded significantly correlated models for both signatures (correlation of OncotypeDX = 0.49 ± 0.07 and PAM50 = 0.32 ± 0.10 in stringent cross-validation and OncotypeDX = 0.83 and PAM50 = 0.78 for a unique model). Equivalent models trained from the unaffected contralateral breast were not correlated suggesting that the image signatures were tumor-specific and that overfitting was not a considerable issue. We also noted that models were improved by combining clinical information (triple negative status and progesterone receptor). The models used mostly wavelets and fractal features suggesting their importance to capture tumor information. Our results suggest that molecular-based recurrence risk and breast cancer subtypes have observable radiographic phenotypes. To our knowledge, this is the first study associating mammographic information to gene expression recurrence signatures.
Radiogenomics analysis identifies correlations of digital mammography with clinical molecular signatures in breast cancer

PubMed Central

Tamez-Peña, Jose-Gerardo; Rodriguez-Rojas, Juan-Andrés; Gomez-Rueda, Hugo; Celaya-Padilla, Jose-Maria; Rivera-Prieto, Roxana-Alicia; Palacios-Corona, Rebeca; Garza-Montemayor, Margarita; Cardona-Huerta, Servando

2018-01-01

In breast cancer, well-known gene expression subtypes have been related to a specific clinical outcome. However, their impact on the breast tissue phenotype has been poorly studied. Here, we investigate the association of imaging data of tumors to gene expression signatures from 71 patients with breast cancer that underwent pre-treatment digital mammograms and tumor biopsies. From digital mammograms, a semi-automated radiogenomics analysis generated 1,078 features describing the shape, signal distribution, and texture of tumors along their contralateral image used as control. From tumor biopsy, we estimated the OncotypeDX and PAM50 recurrence scores using gene expression microarrays. Then, we used multivariate analysis under stringent cross-validation to train models predicting recurrence scores. Few univariate features reached Spearman correlation coefficients above 0.4. Nevertheless, multivariate analysis yielded significantly correlated models for both signatures (correlation of OncotypeDX = 0.49 ± 0.07 and PAM50 = 0.32 ± 0.10 in stringent cross-validation and OncotypeDX = 0.83 and PAM50 = 0.78 for a unique model). Equivalent models trained from the unaffected contralateral breast were not correlated suggesting that the image signatures were tumor-specific and that overfitting was not a considerable issue. We also noted that models were improved by combining clinical information (triple negative status and progesterone receptor). The models used mostly wavelets and fractal features suggesting their importance to capture tumor information. Our results suggest that molecular-based recurrence risk and breast cancer subtypes have observable radiographic phenotypes. To our knowledge, this is the first study associating mammographic information to gene expression recurrence signatures. PMID:29596496
Pericentromeric Effects Shape the Patterns of Divergence, Retention, and Expression of Duplicated Genes in the Paleopolyploid Soybean[C][W

PubMed Central

Du, Jianchang; Tian, Zhixi; Sui, Yi; Zhao, Meixia; Song, Qijian; Cannon, Steven B.; Cregan, Perry; Ma, Jianxin

2012-01-01

The evolutionary forces that govern the divergence and retention of duplicated genes in polyploids are poorly understood. In this study, we first investigated the rates of nonsynonymous substitution (Ka) and the rates of synonymous substitution (Ks) for a nearly complete set of genes in the paleopolyploid soybean (Glycine max) by comparing the orthologs between soybean and its progenitor species Glycine soja and then compared the patterns of gene divergence and expression between pericentromeric regions and chromosomal arms in different gene categories. Our results reveal strong associations between duplication status and Ka and gene expression levels and overall low Ks and low levels of gene expression in pericentromeric regions. It is theorized that deleterious mutations can easily accumulate in recombination-suppressed regions, because of Hill-Robertson effects. Intriguingly, the genes in pericentromeric regions—the cold spots for meiotic recombination in soybean—showed significantly lower Ka and higher levels of expression than their homoeologs in chromosomal arms. This asymmetric evolution of two members of individual whole genome duplication (WGD)-derived gene pairs, echoing the biased accumulation of singletons in pericentromeric regions, suggests that distinct genomic features between the two distinct chromatin types are important determinants shaping the patterns of divergence and retention of WGD-derived genes. PMID:22227891
ADGO: analysis of differentially expressed gene sets using composite GO annotation.

PubMed

Nam, Dougu; Kim, Sang-Bae; Kim, Seon-Kyu; Yang, Sungjin; Kim, Seon-Young; Chu, In-Sun

2006-09-15

Genes are typically expressed in modular manners in biological processes. Recent studies reflect such features in analyzing gene expression patterns by directly scoring gene sets. Gene annotations have been used to define the gene sets, which have served to reveal specific biological themes from expression data. However, current annotations have limited analytical power, because they are classified by single categories providing only unary information for the gene sets. Here we propose a method for discovering composite biological themes from expression data. We intersected two annotated gene sets from different categories of Gene Ontology (GO). We then scored the expression changes of all the single and intersected sets. In this way, we were able to uncover, for example, a gene set with the molecular function F and the cellular component C that showed significant expression change, while the changes in individual gene sets were not significant. We provided an exemplary analysis for HIV-1 immune response. In addition, we tested the method on 20 public datasets where we found many 'filtered' composite terms the number of which reached approximately 34% (a strong criterion, 5% significance) of the number of significant unary terms on average. By using composite annotation, we can derive new and improved information about disease and biological processes from expression data. We provide a web application (ADGO: http://array.kobic.re.kr/ADGO) for the analysis of differentially expressed gene sets with composite GO annotations. The user can analyze Affymetrix and dual channel array (spotted cDNA and spotted oligo microarray) data for four species: human, mouse, rat and yeast. chu@kribb.re.kr http://array.kobic.re.kr/ADGO.
Embryonic stem cell-like features of testicular carcinoma in situ revealed by genome-wide gene expression profiling.

PubMed

Almstrup, Kristian; Hoei-Hansen, Christina E; Wirkner, Ute; Blake, Jonathon; Schwager, Christian; Ansorge, Wilhelm; Nielsen, John E; Skakkebaek, Niels E; Rajpert-De Meyts, Ewa; Leffers, Henrik

2004-07-15

Carcinoma in situ (CIS) is the common precursor of histologically heterogeneous testicular germ cell tumors (TGCTs), which in recent decades have markedly increased and now are the most common malignancy of young men. Using genome-wide gene expression profiling, we identified >200 genes highly expressed in testicular CIS, including many never reported in testicular neoplasms. Expression was further verified by semiquantitative reverse transcription-PCR and in situ hybridization. Among the highest expressed genes were NANOG and POU5F1, and reverse transcription-PCR revealed possible changes in their stoichiometry on progression into embryonic carcinoma. We compared the CIS expression profile with patterns reported in embryonic stem cells (ESCs), which revealed a substantial overlap that may be as high as 50%. We also demonstrated an over-representation of expressed genes in regions of 17q and 12, reported as unstable in cultured ESCs. The close similarity between CIS and ESCs explains the pluripotency of CIS. Moreover, the findings are consistent with an early prenatal origin of TGCTs and thus suggest that etiologic factors operating in utero are of primary importance for the incidence trends of TGCTs. Finally, some of the highly expressed genes identified in this study are promising candidates for new diagnostic markers for CIS and/or TGCTs.
Dietary tomato and lycopene impact androgen signaling- and carcinogenesis-related gene expression during early TRAMP prostate carcinogenesis

PubMed Central

Wan, Lei; Tan, Hsueh-Li; Thomas-Ahner, Jennifer M.; Pearl, Dennis K.; Erdman, John W.; Moran, Nancy E.; Clinton, Steven K.

2014-01-01

Consumption of tomato products containing the carotenoid lycopene is associated with a reduced risk of prostate cancer. To identify gene expression patterns associated with early testosterone-driven prostate carcinogenesis, which are impacted by dietary tomato and lycopene, wild type (WT) and transgenic adenocarcinoma of the mouse prostate (TRAMP) mice were fed control or tomato- or lycopene-containing diets from 4-10 wk-of-age. Eight-week-old mice underwent sham surgery, castration, or castration followed by testosterone-repletion (2.5 mg/kg/d initiated 1 wk after castration). Ten-wk-old intact TRAMP mice exhibit early multifocal prostatic intraepithelial neoplasia (PIN). Of the 200 prostate cancer-related genes measured by quantitative NanoString®, 189 are detectable, 164 significantly differ by genotype, 179 by testosterone status, and 30 by diet type (P<0.05). In TRAMP, expression of Birc5, Mki67, Aurkb, Ccnb2, Foxm1, and Ccne2 is greater compared to WT and is decreased by castration. In parallel, castration reduces Ki67-positive staining (P<0.0001) compared to intact and testosterone-repleted TRAMP mice. Expression of genes involved in androgen metabolism/signaling pathways are reduced by lycopene feeding (Srd5a1) and by tomato-feeding (Srd5a2, Pxn, and Srebf1). Additionally, tomato-feeding significantly reduced expression of genes associated with stem cell features, Aldh1a and Ly6a, while lycopene-feeding significantly reduced expression of neuroendocrine differentiation-related genes, Ngfr and Syp. Collectively, these studies demonstrate a profile of testosterone-regulated genes associated with early stages of prostate carcinogenesis that are potential mechanistic targets of dietary tomato components. Future studies on androgen signaling/metabolism, stem cell features, and neuroendocrine differentiation pathways may elucidate the mechanisms by which dietary tomato and lycopene impact prostate cancer risk. PMID:25315431
Sequence analysis, identification of evolutionary conserved motifs and expression analysis of murine tcof1 provide further evidence for a potential function for the gene and its human homologue, TCOF1.

PubMed

Dixon, J; Hovanes, K; Shiang, R; Dixon, M J

1997-05-01

The gene mutated in Treacher Collins syndrome, an autosomal dominant disorder of facial development, has recently been cloned. While the function of the predicted protein, Treacle, is unknown, it has been shown to share a number of features with the highly phosphorylated nucleolar phosphoproteins, which play a role in nucleolar-cytoplasmic transport. In the current study, the murine homologue of the Treacher Collins syndrome gene has been isolated and shown to encode a low complexity, serine/alanine-rich protein of 133 kDa. Interspecies comparison indicates that the proteins display 61.5% identity, with the level of conservation being greatest in the regions of acidic/basic amino acid repeats and nuclear localization signals. These features are shared with the nucleolar phosphoproteins. Confirmation that the gene isolated in the current study is orthologous with the Treacher Collins syndrome gene was provided by the demonstration that it mapped to central mouse chromosome 18 in a conserved syntenic region with human chromosome 5q21-q33. Expression analysis in the mouse indicated that the gene was expressed in a wide variety of embryonic and adult tissues. Peak levels of expression in the developing embryo were observed at the edges of the neural folds immediately prior to fusion, and also in the developing branchial arches at the times of critical morphogenetic events. These observations support a role for the gene in the development of the craniofacial complex and provide further evidence that the gene encodes a protein which may be involved in nucleolar-cytoplasmic transport.
Iterative local Gaussian clustering for expressed genes identification linked to malignancy of human colorectal carcinoma

PubMed Central

Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri

2007-01-01

Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric density-estimation-based approach called iterative local Gaussian clustering (ILGC), was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, the ILGC finds three clusters, two large and one small gene clusters, similar to their results which used Gaussian mixture clustering. The correlation of each cluster of genes and clinical properties of malignancy of human colorectal cancer was analysed for the existence of tumor or normal, the existence of distant metastasis and the existence of lymph node metastasis. PMID:18305825
Iterative local Gaussian clustering for expressed genes identification linked to malignancy of human colorectal carcinoma.

PubMed

Wasito, Ito; Hashim, Siti Zaiton M; Sukmaningrum, Sri

2007-12-30

Gene expression profiling plays an important role in the identification of biological and clinical properties of human solid tumors such as colorectal carcinoma. Profiling is required to reveal underlying molecular features for diagnostic and therapeutic purposes. A non-parametric density-estimation-based approach called iterative local Gaussian clustering (ILGC), was used to identify clusters of expressed genes. We used experimental data from a previous study by Muro and others consisting of 1,536 genes in 100 colorectal cancer and 11 normal tissues. In this dataset, the ILGC finds three clusters, two large and one small gene clusters, similar to their results which used Gaussian mixture clustering. The correlation of each cluster of genes and clinical properties of malignancy of human colorectal cancer was analysed for the existence of tumor or normal, the existence of distant metastasis and the existence of lymph node metastasis.
categoryCompare, an analytical tool based on feature annotations

PubMed Central

Flight, Robert M.; Harrison, Benjamin J.; Mohammad, Fahim; Bunge, Mary B.; Moon, Lawrence D. F.; Petruska, Jeffrey C.; Rouchka, Eric C.

2014-01-01

Assessment of high-throughput—omics data initially focuses on relative or raw levels of a particular feature, such as an expression value for a transcript, protein, or metabolite. At a second level, analyses of annotations including known or predicted functions and associations of each individual feature, attempt to distill biological context. Most currently available comparative- and meta-analyses methods are dependent on the availability of identical features across data sets, and concentrate on determining features that are differentially expressed across experiments, some of which may be considered “biomarkers.” The heterogeneity of measurement platforms and inherent variability of biological systems confounds the search for robust biomarkers indicative of a particular condition. In many instances, however, multiple data sets show involvement of common biological processes or signaling pathways, even though individual features are not commonly measured or differentially expressed between them. We developed a methodology, categoryCompare, for cross-platform and cross-sample comparison of high-throughput data at the annotation level. We assessed the utility of the approach using hypothetical data, as well as determining similarities and differences in the set of processes in two instances: (1) denervated skin vs. denervated muscle, and (2) colon from Crohn's disease vs. colon from ulcerative colitis (UC). The hypothetical data showed that in many cases comparing annotations gave superior results to comparing only at the gene level. Improved analytical results depended as well on the number of genes included in the annotation term, the amount of noise in relation to the number of genes expressing in unenriched annotation categories, and the specific method in which samples are combined. In the skin vs. muscle denervation comparison, the tissues demonstrated markedly different responses. The Crohn's vs. UC comparison showed gross similarities in inflammatory response in the two diseases, with particular processes specific to each disease. PMID:24808906
Candidate genes for panhypopituitarism identified by gene expression profiling

PubMed Central

Mortensen, Amanda H.; MacDonald, James W.; Ghosh, Debashis

2011-01-01

Mutations in the transcription factors PROP1 and PIT1 (POU1F1) lead to pituitary hormone deficiency and hypopituitarism in mice and humans. The dysmorphology of developing Prop1 mutant pituitaries readily distinguishes them from those of Pit1 mutants and normal mice. This and other features suggest that Prop1 controls the expression of genes besides Pit1 that are important for pituitary cell migration, survival, and differentiation. To identify genes involved in these processes we used microarray analysis of gene expression to compare pituitary RNA from newborn Prop1 and Pit1 mutants and wild-type littermates. Significant differences in gene expression were noted between each mutant and their normal littermates, as well as between Prop1 and Pit1 mutants. Otx2, a gene critical for normal eye and pituitary development in humans and mice, exhibited elevated expression specifically in Prop1 mutant pituitaries. We report the spatial and temporal regulation of Otx2 in normal mice and Prop1 mutants, and the results suggest Otx2 could influence pituitary development by affecting signaling from the ventral diencephalon and regulation of gene expression in Rathke's pouch. The discovery that Otx2 expression is affected by Prop1 deficiency provides support for our hypothesis that identifying molecular differences in mutants will contribute to understanding the molecular mechanisms that control pituitary organogenesis and lead to human pituitary disease. PMID:21828248
Tumor SHB gene expression affects disease characteristics in human acute myeloid leukemia.

PubMed

Jamalpour, Maria; Li, Xiujuan; Cavelier, Lucia; Gustafsson, Karin; Mostoslavsky, Gustavo; Höglund, Martin; Welsh, Michael

2017-10-01

The mouse Shb gene coding for the Src Homology 2-domain containing adapter protein B has recently been placed in context of BCRABL1-induced myeloid leukemia in mice and the current study was performed in order to relate SHB to human acute myeloid leukemia (AML). Publicly available AML databases were mined for SHB gene expression and patient survival. SHB gene expression was determined in the Uppsala cohort of AML patients by qPCR. Cell proliferation was determined after SHB gene knockdown in leukemic cell lines. Despite a low frequency of SHB gene mutations, many tumors overexpressed SHB mRNA compared with normal myeloid blood cells. AML patients with tumors expressing low SHB mRNA displayed longer survival times. A subgroup of AML exhibiting a favorable prognosis, acute promyelocytic leukemia (APL) with a PMLRARA translocation, expressed less SHB mRNA than AML tumors in general. When examining genes co-expressed with SHB in AML tumors, four other genes ( PAX5, HDAC7, BCORL1, TET1) related to leukemia were identified. A network consisting of these genes plus SHB was identified that relates to certain phenotypic characteristics, such as immune cell, vascular and apoptotic features. SHB knockdown in the APL PMLRARA cell line NB4 and the monocyte/macrophage cell line MM6 adversely affected proliferation, linking SHB gene expression to tumor cell expansion and consequently to patient survival. It is concluded that tumor SHB gene expression relates to AML survival and its subgroup APL. Moreover, this gene is included in a network of genes that plays a role for an AML phenotype exhibiting certain immune cell, vascular and apoptotic characteristics.

RAS oncogene-mediated deregulation of the transcriptome: from molecular signature to function.

PubMed

Schäfer, Reinhold; Sers, Christine

2011-01-01

Transcriptome analysis of cancer cells has developed into a standard procedure to elucidate multiple features of the malignant process and to link gene expression to clinical properties. Gene expression profiling based on microarrays provides essentially correlative information and needs to be transferred to the functional level in order to understand the activity and contribution of individual genes or sets of genes as elements of the gene signature. To date, there exist significant gaps in the functional understanding of gene expression profiles. Moreover, the processes that drive the profound transcriptional alterations that characterize cancer cells remain mainly elusive. We have used pathway-restricted gene expression profiles derived from RAS oncogene-transformed cells and from RAS-expressing cancer cells to identify regulators downstream of the MAPK pathway.We describe the role of epigenetic regulation exemplified by the control of several immune genes in generic cell lines and colorectal cancer cells, particularly the functional interaction between signaling and DNA methylation. Moreover, we assess the role of the architectural transcription factor high mobility AT-hook 2 (HMGA2) as a regulator of the RAS-responsive transcriptome in ovarian epithelial cells. Finally, we describe an integrated approach combining pathway interference in colorectal cancer cells, gene expression profiling and computational analysis of regulatory elements of deregulated target genes. This strategy resulted in the identification of Y-box binding protein 1 (YBX1) as a regulator of MAPK-dependent proliferation and gene expression. The implications for a therapeutic application of HMGA2 gene silencing and the role of YBX1 as a prognostic factor are discussed.
Expression pattern of circadian genes and steroidogenesis-related genes after testosterone stimulation in the human ovary.

PubMed

Chen, Minghui; Xu, Yanwen; Miao, Benyu; Zhao, Hui; Luo, Lu; Shi, Huijuan; Zhou, Canquan

2016-09-10

Previous studies have shown that circadian genes might be involved in the development of polycystic ovarian syndrome (PCOS). Hyperandrogenism is a hallmark feature of PCOS. However, the effect of hyperandrogenism on circadian gene expression in human granulosa cells is unknown, and the general expression pattern of circadian genes in the human ovary is unclear. Expression of the circadian proteins CLOCK and PER2 in human ovaries was observed by immunohistochemistry. The mRNA expression patterns of the circadian genes CLOCK, PER2, and BMAL1, and the steroidogenesis-related genes STAR, CYP11A1, HSD3B2, and CYP19A1 in cultured human luteinized granulosa cells were analyzed over the course of 48 h after testosterone treatment by quantitative polymerase chain reaction. Immunostaining of CLOCK and PER2 protein was detected in the granulosa cells of dominant antral follicles but was absent in the primordial, primary, or preantral follicles of human ovaries. After testosterone stimulation, expression of PER2 showed an oscillating pattern, with two peaks occurring at the 24th and 44th hours; expression of CLOCK increased significantly to the peak at the 24th hour, whereas expression of BMAL1 did not change significantly over time in human luteinized granulosa cells. Among the four steroidogenesis-related genes evaluated, only STAR displayed an oscillating expression pattern with two peaks occurring at the 24th and 40th hours after testosterone stimulation. Circadian genes are expressed in the dominant antral follicles of the human ovary. Oscillating expression of the circadian gene PER2 can be induced by testosterone in human granulosa cells in vitro. Expression of STAR also displayed an oscillating pattern after testosterone stimulation. Our results indicate a potential relationship between the circadian clock and steroidogenesis in the human ovary, and demonstrate the effect of testosterone on circadian gene expression in granulosa cells.
Differential transcriptomic responses of Biomphalaria glabrata (Gastropoda, Mollusca) to bacteria and metazoan parasites, Schistosoma mansoni and Echinostoma paraensei (Digenea, Platyhelminthes)

PubMed Central

Adema, Coen M; Hanington, Patrick C.; Lun, Cheng-Man; Rosenberg, George H.; Aragon, Anthony D; Stout, Barbara A; Richard, Mara L. Lennard; Gross, Paul S.; Loker, Eric S

2009-01-01

A 70-mer oligonucleotide-based microarray (1152 features) that emphasizes stress and immune responses factors was constructed to study transcriptomic responses of the snail Biomphalaria glabrata to different immune challenges. In addition to sequences with relevant putative ID and Gene Ontology (GO) annotation, the array features non-immune factors and unknown B. glabrata ESTs for functional gene discovery. The transcription profiles of B. glabrata (3 biological replicates, each a pool of 5 snails) were recorded at 12 hours post wounding, exposure to Gram negative or Gram positive bacteria (Escherichia coli and Micrococcus luteus, respectively), or infection with compatible trematode parasites (S. mansoni or E. paraensei, 20 miracidia/snail), relative to controls, using universal reference RNA. The data were subjected to Significance Analysis for Microarrays (SAM), with a false positive rate (FPR) ≤10%. Wounding yielded a modest differential expression profile (27 up/21 down) with affected features mostly dissimilar from other treatments. Partially overlapping, yet distinct expression profiles were recorded from snails challenged with E. coli (83 up/20 down) or M. luteus (120 up/42 down), mostly showing up-regulation of defense and stress-related features. Significantly altered expression of selected immune features indicates that B. glabrata detects and responds differently to compatible trematodes. Echinostoma paraensei infection was associated mostly with down regulation of many (immune-) transcripts (42 up/68 down), whereas S. mansoni exposure yielded a preponderance of up-regulated features (140 up/23 down), with only few known immune genes affected. These observations may reflect the divergent strategies developed by trematodes during their evolution as specialized pathogens of snails to negate host defense responses. Clearly, the immune defenses of B. glabrata distinguish and respond differently to various immune challenges. PMID:19962194
Mutual information estimation reveals global associations between stimuli and biological processes

PubMed Central

Suzuki, Taiji; Sugiyama, Masashi; Kanamori, Takafumi; Sese, Jun

2009-01-01

Background Although microarray gene expression analysis has become popular, it remains difficult to interpret the biological changes caused by stimuli or variation of conditions. Clustering of genes and associating each group with biological functions are often used methods. However, such methods only detect partial changes within cell processes. Herein, we propose a method for discovering global changes within a cell by associating observed conditions of gene expression with gene functions. Results To elucidate the association, we introduce a novel feature selection method called Least-Squares Mutual Information (LSMI), which computes mutual information without density estimaion, and therefore LSMI can detect nonlinear associations within a cell. We demonstrate the effectiveness of LSMI through comparison with existing methods. The results of the application to yeast microarray datasets reveal that non-natural stimuli affect various biological processes, whereas others are no significant relation to specific cell processes. Furthermore, we discover that biological processes can be categorized into four types according to the responses of various stimuli: DNA/RNA metabolism, gene expression, protein metabolism, and protein localization. Conclusion We proposed a novel feature selection method called LSMI, and applied LSMI to mining the association between conditions of yeast and biological processes through microarray datasets. In fact, LSMI allows us to elucidate the global organization of cellular process control. PMID:19208155
A method to identify differential expression profiles of time-course gene data with Fourier transformation.

PubMed

Kim, Jaehee; Ogden, Robert Todd; Kim, Haseong

2013-10-18

Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization.The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be potentially used to identify genes which have the same patterns or biological processes, and help facing the present and forthcoming challenges of data analysis in functional genomics.
Homo sapiens exhibit a distinct pattern of CNV genes regulation: an important role of miRNAs and SNPs in expression plasticity.

PubMed

Dweep, Harsh; Kubikova, Nada; Gretz, Norbert; Voskarides, Konstantinos; Felekkis, Kyriacos

2015-07-16

Gene expression regulation is a complex and highly organized process involving a variety of genomic factors. It is widely accepted that differences in gene expression can contribute to the phenotypic variability between species, and that their interpretation can aid in the understanding of the physiologic variability. CNVs and miRNAs are two major players in the regulation of expression plasticity and may be responsible for the unique phenotypic characteristics observed in different lineages. We have previously demonstrated that a close interaction between these two genomic elements may have contributed to the regulation of gene expression during evolution. This work presents the molecular interactions between CNV and non CNV genes with miRNAs and other genomic elements in eight different species. A comprehensive analysis of these interactions indicates a unique nature of human CNV genes regulation as compared to other species. By using genes with short 3' UTR that abolish the "canonical" miRNA-dependent regulation, as a model, we demonstrate a distinct and tight regulation of human genes that might explain some of the unique features of human physiology. In addition, comparison of gene expression regulation between species indicated that there is a significant difference between humans and mice possibly questioning the effectiveness of the latest as experimental models of human diseases.
Homo sapiens exhibit a distinct pattern of CNV genes regulation: an important role of miRNAs and SNPs in expression plasticity

PubMed Central

Dweep, Harsh; Kubikova, Nada; Gretz, Norbert; Voskarides, Konstantinos; Felekkis, Kyriacos

2015-01-01

Gene expression regulation is a complex and highly organized process involving a variety of genomic factors. It is widely accepted that differences in gene expression can contribute to the phenotypic variability between species, and that their interpretation can aid in the understanding of the physiologic variability. CNVs and miRNAs are two major players in the regulation of expression plasticity and may be responsible for the unique phenotypic characteristics observed in different lineages. We have previously demonstrated that a close interaction between these two genomic elements may have contributed to the regulation of gene expression during evolution. This work presents the molecular interactions between CNV and non CNV genes with miRNAs and other genomic elements in eight different species. A comprehensive analysis of these interactions indicates a unique nature of human CNV genes regulation as compared to other species. By using genes with short 3′ UTR that abolish the “canonical” miRNA-dependent regulation, as a model, we demonstrate a distinct and tight regulation of human genes that might explain some of the unique features of human physiology. In addition, comparison of gene expression regulation between species indicated that there is a significant difference between humans and mice possibly questioning the effectiveness of the latest as experimental models of human diseases. PMID:26178010
Gene-assisted selection: applications of association genetics for forest tree breeding

Treesearch

Philip L. Wilcox; Craig E. Echt; Rowland D. Burdon

2007-01-01

This chapter describes application of association genetics in forest tree species for the purposes of selection. We use the term gene-assisted selection (GAS) to denote application of marker-trait associations determined via association genetics, which we anticipate will be based on poly morph isms associated with expressed genes. The salient features of forest trees...
Prognostic Value of FBXO39 and ETS-1 but not BMI-1 in Iranian Colorectal Cancer Patients

PubMed

Motalebzadeh, Jamshid; Shabani, Samira; Rezayati, Saeedeh; Shakournia, Narges; Mirzaei, Rezvan; Mahjoubi, Bahar; Hoseini, Kamal; Mahjoubi, Frouzandeh

2018-05-26

Background: Colorectal cancer (CRC) is one of the most prevalent cancers worldwide. Despite recent progress in diagnosis and treatment, it remains a major health problem and further studies are needed. We here investigated expression profiles of the FBXO39, ETS-1 and BMI-1 genes in CRCs to validate any possible diagnostic/prognostic significance. Material and Methods: Thirty six patients with locally advanced CRC admitted to Hazrate-Rasoul Hospital-Tehran were enrolled. Initially the expression pattern of FBXO39, ETS-1 and BMI-1 genes were determined using RT-PCR in CRC tumor and adjacent normal tissues then real-time RT-PCR was employed to quantify BMI-1 gene expression. Results: FBXO39 expression was restricted to tumor tissues. Interestingly, expression of this gene was detected in all stage-0 tumor samples. There was a significant relation between FBXO39 gene expression and lymph node involvement. The ETS-1 gene was expressed in 66% of all tumor tissues with p-value=0.03 for increase as compared to the adjacent normal samples. In addition, there was a significant relation between ETS-1 gene expression and tumor size and lymph node involvement. RT-PCR demonstrated BMI-1 gene expression in both tumor and normal tissues and quantification by real-time RT-PCR showed no association between BMI-1 levels and CRC clinicopathological features. Conclusion: Expression of FBXO39 and ETS-1 with lymph node involvement may be considered as an alarm for the occurrence of CRC metastasis, and therfore have prognostic value while BMI-1 appears without importance. Creative Commons Attribution License
Characterizing the molecular features of ERG-positive tumors in primary and castration resistant prostate cancer

PubMed Central

Roudier, Martine P; Winters, Brian R; Coleman, Ilsa; Lam, Hung-Ming; Zhang, Xiaotun; Coleman, Roger; Chéry, Lisly; True, Lawrence D.; Higano, Celestia S.; Montgomery, Bruce; Lange, Paul H.; Snyder, Linda A.; Srivistava, Shiv; Corey, Eva; Vessella, Robert L.; Nelson, Peter S.; Üren, Aykut; Morrissey, Colm

2017-01-01

Background The TMPRSS2-ERG gene fusion is detected in approximately half of primary prostate cancers (PCa) yet the prognostic significance remains unclear. We hypothesized that ERG promotes the expression of common genes in primary PCa and metastatic castration-resistant PCa (CRPC), with the objective of identifying ERG-associated pathways, which may promote the transition from primary PCa to CRPC. Methods We constructed tissue microarrays (TMA) from 127 radical prostatectomy specimens, 20 LuCaP patient-derived xenografts (PDX), and 152 CRPC metastases obtained immediately at time of death. Nuclear ERG was assessed by immunohistochemistry (IHC). To characterize the molecular features of ERG-expressing PCa, a subset of IHC confirmed ERG+ or ERG-specimens including 11 radical prostatectomies, 20 LuCaP PDXs, and 45 CRPC metastases underwent gene expression analysis. Genes were ranked based on expression in primary PCa and CRPC. Common genes of interest were targeted for IHC analysis and expression compared with biochemical recurrence (BCR) status. Results IHC revealed that 43% of primary PCa, 35% of the LuCaP PDXs, and 18% of the CRPC metastases were ERG+ (12 of 48 patients [25%] had at least 1 ERG+ metastasis). Based on gene expression data and previous literature, two proteins involved in calcium signaling (NCALD, CACNA1D), a protein involved in inflammation (HLA-DMB), CD3 positive immune cells, and a novel ERG-associated protein, DCLK1 were evaluated in primary PCa and CRPC metastases. In ERG+ primary PCa, a weak association was seen with NCALD and CACNA1D protein expression. HLA-DMB expression and the presence of CD3 positive immune cells were decreased in CRPC metastases compared to primary PCa. DCLK1 was upregulated at the protein level in unpaired ERG+ primary PCa and CRPC metastases (p=0.0013 and p<0.0001, respectively). In primary PCa, ERG status or expression of targeted proteins was not associated with BCR-free survival. However for primary PCa, ERG+DCLK1+ patients exhibited shorter time to BCR (p=0.06) compared with ERG+DCLK1- patients. Conclusions This study examined ERG expression in primary PCa and CRPC. We have identified altered levels of inflammatory mediators associated with ERG expression. We determined expression of DCLK1 correlates with ERG expression and may play a role in primary PCa progression to metastatic CPRC. PMID:26990456
Gene expression profiles of breast biopsies from healthy women identify a group with claudin-low features

PubMed Central

2011-01-01

Background Increased understanding of the variability in normal breast biology will enable us to identify mechanisms of breast cancer initiation and the origin of different subtypes, and to better predict breast cancer risk. Methods Gene expression patterns in breast biopsies from 79 healthy women referred to breast diagnostic centers in Norway were explored by unsupervised hierarchical clustering and supervised analyses, such as gene set enrichment analysis and gene ontology analysis and comparison with previously published genelists and independent datasets. Results Unsupervised hierarchical clustering identified two separate clusters of normal breast tissue based on gene-expression profiling, regardless of clustering algorithm and gene filtering used. Comparison of the expression profile of the two clusters with several published gene lists describing breast cells revealed that the samples in cluster 1 share characteristics with stromal cells and stem cells, and to a certain degree with mesenchymal cells and myoepithelial cells. The samples in cluster 1 also share many features with the newly identified claudin-low breast cancer intrinsic subtype, which also shows characteristics of stromal and stem cells. More women belonging to cluster 1 have a family history of breast cancer and there is a slight overrepresentation of nulliparous women in cluster 1. Similar findings were seen in a separate dataset consisting of histologically normal tissue from both breasts harboring breast cancer and from mammoplasty reductions. Conclusion This is the first study to explore the variability of gene expression patterns in whole biopsies from normal breasts and identified distinct subtypes of normal breast tissue. Further studies are needed to determine the specific cell contribution to the variation in the biology of normal breasts, how the clusters identified relate to breast cancer risk and their possible link to the origin of the different molecular subtypes of breast cancer. PMID:22044755
Analysis of microarray leukemia data using an efficient MapReduce-based K-nearest-neighbor classifier.

PubMed

Kumar, Mukesh; Rath, Nitish Kumar; Rath, Santanu Kumar

2016-04-01

Microarray-based gene expression profiling has emerged as an efficient technique for classification, prognosis, diagnosis, and treatment of cancer. Frequent changes in the behavior of this disease generates an enormous volume of data. Microarray data satisfies both the veracity and velocity properties of big data, as it keeps changing with time. Therefore, the analysis of microarray datasets in a small amount of time is essential. They often contain a large amount of expression, but only a fraction of it comprises genes that are significantly expressed. The precise identification of genes of interest that are responsible for causing cancer are imperative in microarray data analysis. Most existing schemes employ a two-phase process such as feature selection/extraction followed by classification. In this paper, various statistical methods (tests) based on MapReduce are proposed for selecting relevant features. After feature selection, a MapReduce-based K-nearest neighbor (mrKNN) classifier is also employed to classify microarray data. These algorithms are successfully implemented in a Hadoop framework. A comparative analysis is done on these MapReduce-based models using microarray datasets of various dimensions. From the obtained results, it is observed that these models consume much less execution time than conventional models in processing big data. Copyright © 2016 Elsevier Inc. All rights reserved.
Epigenetics of the myotonic dystrophy-associated DMPK gene neighborhood

PubMed Central

Buckley, Lauren; Lacey, Michelle; Ehrlich, Melanie

2016-01-01

Aim: Identify epigenetic marks in the vicinity of DMPK (linked to myotonic dystrophy, DM1) that help explain tissue-specific differences in its expression. Materials & methods: At DMPK and its flanking genes (DMWD, SIX5, BHMG1 and RSPH6A), we analyzed many epigenetic and transcription profiles from myoblasts, myotubes, skeletal muscle, heart and 30 nonmuscle samples. Results: In the DMPK gene neighborhood, muscle-associated DNA hypermethylation and hypomethylation, enhancer chromatin, and CTCF binding were seen. Myogenic DMPK hypermethylation correlated with high expression and decreased alternative promoter usage. Testis/sperm hypomethylation of BHMG1 and RSPH6A was associated with testis-specific expression. G-quadruplex (G4) motifs and sperm-specific hypomethylation were found near the DM1-linked CTG repeats within DMPK. Conclusion: Tissue-specific epigenetic features in DMPK and neighboring genes help regulate its expression. G4 motifs in DMPK DNA and RNA might contribute to DM1 pathology. PMID:26756355
Gene expression analysis in RA: towards personalized medicine

PubMed Central

Burska, A N; Roget, K; Blits, M; Soto Gomez, L; van de Loo, F; Hazelwood, L D; Verweij, C L; Rowe, A; Goulielmos, G N; van Baarsen, L G M; Ponchel, F

2014-01-01

Gene expression has recently been at the forefront of advance in personalized medicine, notably in the field of cancer and transplantation, providing a rational for a similar approach in rheumatoid arthritis (RA). RA is a prototypic inflammatory autoimmune disease with a poorly understood etiopathogenesis. Inflammation is the main feature of RA; however, many biological processes are involved at different stages of the disease. Gene expression signatures offer management tools to meet the current needs for personalization of RA patient's care. This review analyses currently available information with respect to RA diagnostic, prognostic and prediction of response to therapy with a view to highlight the abundance of data, whose comparison is often inconclusive due to the mixed use of material source, experimental methodologies and analysis tools, reinforcing the need for harmonization if gene expression signatures are to become a useful clinical tool in personalized medicine for RA patients. PMID:24589910
A highly sensitive and accurate gene expression analysis by sequencing ("bead-seq") for a single cell.

PubMed

Matsunaga, Hiroko; Goto, Mari; Arikawa, Koji; Shirai, Masataka; Tsunoda, Hiroyuki; Huang, Huan; Kambara, Hideki

2015-02-15

Analyses of gene expressions in single cells are important for understanding detailed biological phenomena. Here, a highly sensitive and accurate method by sequencing (called "bead-seq") to obtain a whole gene expression profile for a single cell is proposed. A key feature of the method is to use a complementary DNA (cDNA) library on magnetic beads, which enables adding washing steps to remove residual reagents in a sample preparation process. By adding the washing steps, the next steps can be carried out under the optimal conditions without losing cDNAs. Error sources were carefully evaluated to conclude that the first several steps were the key steps. It is demonstrated that bead-seq is superior to the conventional methods for single-cell gene expression analyses in terms of reproducibility, quantitative accuracy, and biases caused during sample preparation and sequencing processes. Copyright © 2014 Elsevier Inc. All rights reserved.
The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome.

PubMed

Hurst, Laurence D; Ghanbarian, Avazeh T; Forrest, Alistair R R; Huminiecki, Lukasz

2015-12-01

X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression profiles of X-linked genes. Tissues whose tissue-specific genes are very highly expressed (e.g., secretory tissues, tissues abundant in structural proteins) are also tissues in which gene expression is relatively rare on the X chromosome. These trends cannot be fully accounted for in terms of alternative models of biased expression. In conclusion, the notion that it is hard for genes on the Therian X to be highly expressed, owing to transcriptional traffic jams, provides a simple yet robustly supported rationale of many peculiar features of X's gene content, gene expression, and evolution.
The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome

PubMed Central

Hurst, Laurence D.; Ghanbarian, Avazeh T.; Forrest, Alistair R. R.; Huminiecki, Lukasz

2015-01-01

X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression profiles of X-linked genes. Tissues whose tissue-specific genes are very highly expressed (e.g., secretory tissues, tissues abundant in structural proteins) are also tissues in which gene expression is relatively rare on the X chromosome. These trends cannot be fully accounted for in terms of alternative models of biased expression. In conclusion, the notion that it is hard for genes on the Therian X to be highly expressed, owing to transcriptional traffic jams, provides a simple yet robustly supported rationale of many peculiar features of X’s gene content, gene expression, and evolution. PMID:26685068
Genome-wide characterization of pectin methyl esterase genes reveals members differentially expressed in tolerant and susceptible wheats in response to Fusarium graminearum.

PubMed

Zega, Alessandra; D'Ovidio, Renato

2016-11-01

Pectin methyl esterase (PME) genes code for enzymes that are involved in structural modifications of the plant cell wall during plant growth and development. They are also involved in plant-pathogen interaction. PME genes belong to a multigene family and in this study we report the first comprehensive analysis of the PME gene family in bread wheat (Triticum aestivum L.). Like in other species, the members of the TaPME family are dispersed throughout the genome and their encoded products retain the typical structural features of PMEs. qRT-PCR analysis showed variation in the expression pattern of TaPME genes in different tissues and revealed that these genes are mainly expressed in flowering spikes. In our attempt to identify putative TaPME genes involved in wheat defense, we revealed a strong variation in the expression of the TaPME following Fusarium graminearum infection, the causal agent of Fusarium head blight (FHB). Particularly interesting was the finding that the expression profile of some PME genes was markedly different between the FHB-resistant wheat cultivar Sumai3 and the FHB-susceptible cultivar Bobwhite, suggesting a possible involvement of these PME genes in FHB resistance. Moreover, the expression analysis of the TaPME genes during F. graminearum progression within the spike revealed those genes that responded more promptly to pathogen invasion. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
oPOSSUM: integrated tools for analysis of regulatory motif over-representation

PubMed Central

Ho Sui, Shannan J.; Fulton, Debra L.; Arenillas, David J.; Kwon, Andrew T.; Wasserman, Wyeth W.

2007-01-01

The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/ PMID:17576675
Development of a versatile enrichment analysis tool reveals associations between the maternal brain and mental health disorders, including autism

PubMed Central

2013-01-01

Background A recent study of lateral septum (LS) suggested a large number of autism-related genes with altered expression in the postpartum state. However, formally testing the findings for enrichment of autism-associated genes proved to be problematic with existing software. Many gene-disease association databases have been curated which are not currently incorporated in popular, full-featured enrichment tools, and the use of custom gene lists in these programs can be difficult to perform and interpret. As a simple alternative, we have developed the Modular Single-set Enrichment Test (MSET), a minimal tool that enables one to easily evaluate expression data for enrichment of any conceivable gene list of interest. Results The MSET approach was validated by testing several publicly available expression data sets for expected enrichment in areas of autism, attention deficit hyperactivity disorder (ADHD), and arthritis. Using nine independent, unique autism gene lists extracted from association databases and two recent publications, a striking consensus of enrichment was detected within gene expression changes in LS of postpartum mice. A network of 160 autism-related genes was identified, representing developmental processes such as synaptic plasticity, neuronal morphogenesis, and differentiation. Additionally, maternal LS displayed enrichment for genes associated with bipolar disorder, schizophrenia, ADHD, and depression. Conclusions The transition to motherhood includes the most fundamental social bonding event in mammals and features naturally occurring changes in sociability. Some individuals with autism, schizophrenia, or other mental health disorders exhibit impaired social traits. Genes involved in these deficits may also contribute to elevated sociability in the maternal brain. To date, this is the first study to show a significant, quantitative link between the maternal brain and mental health disorders using large scale gene expression data. Thus, the postpartum brain may provide a novel and promising platform for understanding the complex genetics of improved sociability that may have direct relevance for multiple psychiatric illnesses. This study also provides an important new tool that fills a critical analysis gap and makes evaluation of enrichment using any database of interest possible with an emphasis on ease of use and methodological transparency. PMID:24245670

Development of a versatile enrichment analysis tool reveals associations between the maternal brain and mental health disorders, including autism.

PubMed

Eisinger, Brian E; Saul, Michael C; Driessen, Terri M; Gammie, Stephen C

2013-11-19

A recent study of lateral septum (LS) suggested a large number of autism-related genes with altered expression in the postpartum state. However, formally testing the findings for enrichment of autism-associated genes proved to be problematic with existing software. Many gene-disease association databases have been curated which are not currently incorporated in popular, full-featured enrichment tools, and the use of custom gene lists in these programs can be difficult to perform and interpret. As a simple alternative, we have developed the Modular Single-set Enrichment Test (MSET), a minimal tool that enables one to easily evaluate expression data for enrichment of any conceivable gene list of interest. The MSET approach was validated by testing several publicly available expression data sets for expected enrichment in areas of autism, attention deficit hyperactivity disorder (ADHD), and arthritis. Using nine independent, unique autism gene lists extracted from association databases and two recent publications, a striking consensus of enrichment was detected within gene expression changes in LS of postpartum mice. A network of 160 autism-related genes was identified, representing developmental processes such as synaptic plasticity, neuronal morphogenesis, and differentiation. Additionally, maternal LS displayed enrichment for genes associated with bipolar disorder, schizophrenia, ADHD, and depression. The transition to motherhood includes the most fundamental social bonding event in mammals and features naturally occurring changes in sociability. Some individuals with autism, schizophrenia, or other mental health disorders exhibit impaired social traits. Genes involved in these deficits may also contribute to elevated sociability in the maternal brain. To date, this is the first study to show a significant, quantitative link between the maternal brain and mental health disorders using large scale gene expression data. Thus, the postpartum brain may provide a novel and promising platform for understanding the complex genetics of improved sociability that may have direct relevance for multiple psychiatric illnesses. This study also provides an important new tool that fills a critical analysis gap and makes evaluation of enrichment using any database of interest possible with an emphasis on ease of use and methodological transparency.
Predictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks

PubMed Central

Marbach, Daniel; Roy, Sushmita; Ay, Ferhat; Meyer, Patrick E.; Candeias, Rogerio; Kahveci, Tamer; Bristow, Christopher A.; Kellis, Manolis

2012-01-01

Gaining insights on gene regulation from large-scale functional data sets is a grand challenge in systems biology. In this article, we develop and apply methods for transcriptional regulatory network inference from diverse functional genomics data sets and demonstrate their value for gene function and gene expression prediction. We formulate the network inference problem in a machine-learning framework and use both supervised and unsupervised methods to predict regulatory edges by integrating transcription factor (TF) binding, evolutionarily conserved sequence motifs, gene expression, and chromatin modification data sets as input features. Applying these methods to Drosophila melanogaster, we predict ∼300,000 regulatory edges in a network of ∼600 TFs and 12,000 target genes. We validate our predictions using known regulatory interactions, gene functional annotations, tissue-specific expression, protein–protein interactions, and three-dimensional maps of chromosome conformation. We use the inferred network to identify putative functions for hundreds of previously uncharacterized genes, including many in nervous system development, which are independently confirmed based on their tissue-specific expression patterns. Last, we use the regulatory network to predict target gene expression levels as a function of TF expression, and find significantly higher predictive power for integrative networks than for motif or ChIP-based networks. Our work reveals the complementarity between physical evidence of regulatory interactions (TF binding, motif conservation) and functional evidence (coordinated expression or chromatin patterns) and demonstrates the power of data integration for network inference and studies of gene regulation at the systems level. PMID:22456606
Gene structure and expression characteristic of a novel odorant receptor gene cluster in the parasitoid wasp Microplitis mediator (Hymenoptera: Braconidae).

PubMed

Wang, S-N; Shan, S; Zheng, Y; Peng, Y; Lu, Z-Y; Yang, Y-Q; Li, R-J; Zhang, Y-J; Guo, Y-Y

2017-08-01

Odorant receptors (ORs) expressed in the antennae of parasitoid wasps are responsible for detection of various lipophilic airborne molecules. In the present study, 107 novel OR genes were identified from Microplitis mediator antennal transcriptome data. Phylogenetic analysis of the set of OR genes from M. mediator and Microplitis demolitor revealed that M. mediator OR (MmedOR) genes can be classified into different subfamilies, and the majority of MmedORs in each subfamily shared high sequence identities and clear orthologous relationships to M. demolitor ORs. Within a subfamily, six MmedOR genes, MmedOR98, 124, 125, 126, 131 and 155, shared a similar gene structure and were tightly linked in the genome. To evaluate whether the clustered MmedOR genes share common regulatory features, the transcription profile and expression characteristics of the six closely related OR genes were investigated in M. mediator. Rapid amplification of cDNA ends-PCR experiments revealed that the OR genes within the cluster were transcribed as single mRNAs, and a bicistronic mRNA for two adjacent genes (MmedOR124 and MmedOR98) was also detected in female antennae by reverse transcription PCR. In situ hybridization experiments indicated that each OR gene within the cluster was expressed in a different number of cells. Moreover, there was no co-expression of the two highly related OR genes, MmedOR124 and MmedOR98, which appeared to be individually expressed in a distinct population of neurons. Overall, there were distinct expression profiles of closely related MmedOR genes from the same cluster in M. mediator. These data provide a basic understanding of the olfactory coding in parasitoid wasps. © 2017 The Royal Entomological Society.
Partitioning of functional gene expression data using principal points.

PubMed

Kim, Jaehee; Kim, Haseong

2017-10-12

DNA microarrays offer motivation and hope for the simultaneous study of variations in multiple genes. Gene expression is a temporal process that allows variations in expression levels with a characterized gene function over a period of time. Temporal gene expression curves can be treated as functional data since they are considered as independent realizations of a stochastic process. This process requires appropriate models to identify patterns of gene functions. The partitioning of the functional data can find homogeneous subgroups of entities for the massive genes within the inherent biological networks. Therefor it can be a useful technique for the analysis of time-course gene expression data. We propose a new self-consistent partitioning method of functional coefficients for individual expression profiles based on the orthonormal basis system. A principal points based functional partitioning method is proposed for time-course gene expression data. The method explores the relationship between genes using Legendre coefficients as principal points to extract the features of gene functions. Our proposed method provides high connectivity in connectedness after clustering for simulated data and finds a significant subsets of genes with the increased connectivity. Our approach has comparative advantages that fewer coefficients are used from the functional data and self-consistency of principal points for partitioning. As real data applications, we are able to find partitioned genes through the gene expressions found in budding yeast data and Escherichia coli data. The proposed method benefitted from the use of principal points, dimension reduction, and choice of orthogonal basis system as well as provides appropriately connected genes in the resulting subsets. We illustrate our method by applying with each set of cell-cycle-regulated time-course yeast genes and E. coli genes. The proposed method is able to identify highly connected genes and to explore the complex dynamics of biological systems in functional genomics.
CARSVM: a class association rule-based classification framework and its application to gene expression data.

PubMed

Kianmehr, Keivan; Alhajj, Reda

2008-09-01

In this study, we aim at building a classification framework, namely the CARSVM model, which integrates association rule mining and support vector machine (SVM). The goal is to benefit from advantages of both, the discriminative knowledge represented by class association rules and the classification power of the SVM algorithm, to construct an efficient and accurate classifier model that improves the interpretability problem of SVM as a traditional machine learning technique and overcomes the efficiency issues of associative classification algorithms. In our proposed framework: instead of using the original training set, a set of rule-based feature vectors, which are generated based on the discriminative ability of class association rules over the training samples, are presented to the learning component of the SVM algorithm. We show that rule-based feature vectors present a high-qualified source of discrimination knowledge that can impact substantially the prediction power of SVM and associative classification techniques. They provide users with more conveniences in terms of understandability and interpretability as well. We have used four datasets from UCI ML repository to evaluate the performance of the developed system in comparison with five well-known existing classification methods. Because of the importance and popularity of gene expression analysis as real world application of the classification model, we present an extension of CARSVM combined with feature selection to be applied to gene expression data. Then, we describe how this combination will provide biologists with an efficient and understandable classifier model. The reported test results and their biological interpretation demonstrate the applicability, efficiency and effectiveness of the proposed model. From the results, it can be concluded that a considerable increase in classification accuracy can be obtained when the rule-based feature vectors are integrated in the learning process of the SVM algorithm. In the context of applicability, according to the results obtained from gene expression analysis, we can conclude that the CARSVM system can be utilized in a variety of real world applications with some adjustments.
Comprehensive investigation of clinicopathologic features, oncogenic driver mutations and immunohistochemical markers in peripheral lung squamous cell carcinoma.

PubMed

Zhang, Yang; Zheng, Difan; Li, Yuan; Pan, Yunjian; Sun, Yihua; Chen, Haiquan

2017-11-01

Although the majority of lung squamous cell carcinomas (SQCC) arise in central airways, the prevalence of peripheral (p) SQCC is increasing. This study aimed to have a comprehensive investigation of clinicopathologic features, status of common driver mutations and immunophenotypes of p-SQCC compared to central (c) SQCC. A total of 261 p-SQCC were compared to 444 c-SQCC for clinicopathologic characteristics. Comprehensive mutational analysis of EGFR, KRAS, HER2, BRAF, PIK3CA, DDR2, AKT1, ALK, ROS1, RET and FGFRs were performed. TTF1, CK7, Napsin A and PE10 protein expression were analyzed through immunohistochemistry (IHC). TTF1, CK7, CK8, SPA and TP63 gene expression levels were measured by quantitative real-time PCR. Compared to c-SQCC, p-SQCC were associated with female (14.2% vs . 4.5%, P<0.001), never-smokers (22.6% vs . 13.3%, P=0.001), older age at diagnosis (64.9 vs . 59.5 years, P<0.001) and lower pathologic stage (P<0.001). The frequency of EGFR mutations was significantly higher in p-SQCC than c-SQCC (6.2% vs . 2.2%, P=0.040). Positive protein expression of TTF1 (P=0.010) and CK7 (P=0.001) was significantly more prevalent in p-SQCC. p-SQCC had significantly higher gene expression of SPA (P=0.003), whereas c-SQCC showed higher gene expression of TP63 (P=0.028). Lung p-SQCC had distinctive clinicopathologic characteristics and molecular features compared to c-SQCC, but showed some similarity with adenocarcinoma (ADC).
Three urocortins in medaka: identification and spatial expression in the central nervous system.

PubMed

Hosono, K; Yamashita, J; Kikuchi, Y; Hiraki-Kajiyama, T; Okubo, K

2017-05-01

The urocortin (UCN) group of neuropeptides includes urocortin 1/sauvagine/urotensin 1 (UTS1), urocortin 2 (UCN2) and urocortin 3 (UCN3). In recent years, evidence has accumulated showing that UCNs play pivotal roles in mediating stress response and anxiety in mammals. Evidence has also emerged regarding the evolutionary conservation of UCNs in vertebrates, but very little information is available about UCNs in non-mammalian vertebrates. Indeed, at present, there are no reports of the empirical identification of ucn2 in non-mammalian vertebrates or of the distribution of ucn2 and ucn3 expression in the adult central nervous system (CNS) of these animals. To gain insight into the evolutionary nature of UCNs in vertebrates, we cloned uts1, ucn2 and ucn3 in a teleost fish, medaka and examined the spatial expression of these genes in the adult brain and spinal cord. Although all known UCN2 genes except those in rodents have been reported to likely lack the necessary structural features to produce a functional pre-pro-protein, all three UCN genes in medaka, including ucn2, displayed all of these features, suggesting their functionality. The three UCN genes exhibited distinct spatial expression patterns in the medaka brain: uts1 was primarily expressed in broad regions of the dorsal telencephalon, ucn2 was expressed in restricted regions of the thalamus and brainstem and ucn3 was expressed in discrete nuclei throughout many regions of the brain. We also found that these genes were all expressed throughout the medaka spinal cord, each with a distinct spatial pattern. Given that many of these regions have been implicated in stress responses and anxiety, the three UCNs may serve distinct physiological roles in the medaka CNS, including those involved in stress and anxiety, as shown in the mammalian CNS. © 2017 British Society for Neuroendocrinology.
Genetic Rearrangements Can Modify Chromatin Features at Epialleles

PubMed Central

Foerster, Andrea M.; Dinh, Huy Q.; Sedman, Laura; Wohlrab, Bonnie; Mittelsten Scheid, Ortrun

2011-01-01

Analogous to genetically distinct alleles, epialleles represent heritable states of different gene expression from sequence-identical genes. Alleles and epialleles both contribute to phenotypic heterogeneity. While alleles originate from mutation and recombination, the source of epialleles is less well understood. We analyze active and inactive epialleles that were found at a transgenic insert with a selectable marker gene in Arabidopsis. Both converse expression states are stably transmitted to progeny. The silent epiallele was previously shown to change its state upon loss-of-function of trans-acting regulators and drug treatments. We analyzed the composition of the epialleles, their chromatin features, their nuclear localization, transcripts, and homologous small RNA. After mutagenesis by T-DNA transformation of plants carrying the silent epiallele, we found new active alleles. These switches were associated with different, larger or smaller, and non-overlapping deletions or rearrangements in the 3′ regions of the epiallele. These cis-mutations caused different degrees of gene expression stability depending on the nature of the sequence alteration, the consequences for transcription and transcripts, and the resulting chromatin organization upstream. This illustrates a tight dependence of epigenetic regulation on local structures and indicates that sequence alterations can cause epigenetic changes at some distance in regions not directly affected by the mutation. Similar effects may also be involved in gene expression and chromatin changes in the vicinity of transposon insertions or excisions, recombination events, or DNA repair processes and could contribute to the origin of new epialleles. PMID:22028669
Genetic and epigenetic mechanisms in the pathogenesis of neurofibromatosis type I

DOE Office of Scientific and Technical Information (OSTI.GOV)

Metheny, L.J.; Amedeo, M.S.; Cappione, J.

Neurofibromatosis type I (NF1) is a common genetic disease which leads to a variety of clinical features affecting cells of neural crest origin. In the period since the NF1 gene was isolated 1991, our understanding of the genetics of NF1 has increased remarkably. One of the most striking aspects of NF1 genetics is its complexity, both in terms of gene organization and expression. The gene is large and, when mutated, gives rise to diverse manifestations. A growing body of data suggests that mutations in the NF1 gene alone may not be responsible for all of the features of this disease.more » Epigenetic mechanisms, those which affect the NF1 transcript, play a role in the normal expression of the NF1 gene. Therefore, aberrations in those epigenetic processes are most likely pathogenic. Herein we summarize salient aspects of the vast body of NF1 literature and provide some insights into the myriad of regulatory mechanisms that may go awry in the genesis of this common but complex disease. 58 refs., 3 figs.« less
A View of the Therapy for Bell's Palsy Based on Molecular Biological Analyses of Facial Muscles.

PubMed

Moriyama, Hiroshi; Mitsukawa, Nobuyuki; Itoh, Masahiro; Otsuka, Naruhito

2017-12-01

Details regarding the molecular biological features of Bell's palsy have not been widely reported in textbooks. We genetically analyzed facial muscles and clarified these points. We performed genetic analysis of facial muscle specimens from Japanese patients with severe (House-Brackmann facial nerve grading system V) and moderate (House-Brackmann facial nerve grading system III) dysfunction due to Bell's palsy. Microarray analysis of gene expression was performed using specimens from the healthy and affected sides, and gene expression was compared. Changes in gene expression were defined as an affected side/healthy side ratio of >1.5 or <0.5. We observed that the gene expression in Bell's palsy changes with the degree of facial nerve palsy. Especially, muscle, neuron, and energy category genes tended to fluctuate with the degree of facial nerve palsy. It is expected that this study will aid in the development of new treatments and diagnostic/prognostic markers based on the severity of facial nerve palsy.
Control of developmentally primed erythroid genes by combinatorial co-repressor actions

PubMed Central

Stadhouders, Ralph; Cico, Alba; Stephen, Tharshana; Thongjuea, Supat; Kolovos, Petros; Baymaz, H. Irem; Yu, Xiao; Demmers, Jeroen; Bezstarosti, Karel; Maas, Alex; Barroca, Vilma; Kockx, Christel; Ozgur, Zeliha; van Ijcken, Wilfred; Arcangeli, Marie-Laure; Andrieu-Soler, Charlotte; Lenhard, Boris; Grosveld, Frank; Soler, Eric

2015-01-01

How transcription factors (TFs) cooperate within large protein complexes to allow rapid modulation of gene expression during development is still largely unknown. Here we show that the key haematopoietic LIM-domain-binding protein-1 (LDB1) TF complex contains several activator and repressor components that together maintain an erythroid-specific gene expression programme primed for rapid activation until differentiation is induced. A combination of proteomics, functional genomics and in vivo studies presented here identifies known and novel co-repressors, most notably the ETO2 and IRF2BP2 proteins, involved in maintaining this primed state. The ETO2–IRF2BP2 axis, interacting with the NCOR1/SMRT co-repressor complex, suppresses the expression of the vast majority of archetypical erythroid genes and pathways until its decommissioning at the onset of terminal erythroid differentiation. Our experiments demonstrate that multimeric regulatory complexes feature a dynamic interplay between activating and repressing components that determines lineage-specific gene expression and cellular differentiation. PMID:26593974
Logic Learning Machine and standard supervised methods for Hodgkin's lymphoma prognosis using gene expression data and clinical variables.

PubMed

Parodi, Stefano; Manneschi, Chiara; Verda, Damiano; Ferrari, Enrico; Muselli, Marco

2018-03-01

This study evaluates the performance of a set of machine learning techniques in predicting the prognosis of Hodgkin's lymphoma using clinical factors and gene expression data. Analysed samples from 130 Hodgkin's lymphoma patients included a small set of clinical variables and more than 54,000 gene features. Machine learning classifiers included three black-box algorithms ( k-nearest neighbour, Artificial Neural Network, and Support Vector Machine) and two methods based on intelligible rules (Decision Tree and the innovative Logic Learning Machine method). Support Vector Machine clearly outperformed any of the other methods. Among the two rule-based algorithms, Logic Learning Machine performed better and identified a set of simple intelligible rules based on a combination of clinical variables and gene expressions. Decision Tree identified a non-coding gene ( XIST) involved in the early phases of X chromosome inactivation that was overexpressed in females and in non-relapsed patients. XIST expression might be responsible for the better prognosis of female Hodgkin's lymphoma patients.
Integrated multi-cohort transcriptional meta-analysis of neurodegenerative diseases.

PubMed

Li, Matthew D; Burns, Terry C; Morgan, Alexander A; Khatri, Purvesh

2014-09-04

Neurodegenerative diseases share common pathologic features including neuroinflammation, mitochondrial dysfunction and protein aggregation, suggesting common underlying mechanisms of neurodegeneration. We undertook a meta-analysis of public gene expression data for neurodegenerative diseases to identify a common transcriptional signature of neurodegeneration. Using 1,270 post-mortem central nervous system tissue samples from 13 patient cohorts covering four neurodegenerative diseases, we identified 243 differentially expressed genes, which were similarly dysregulated in 15 additional patient cohorts of 205 samples including seven neurodegenerative diseases. This gene signature correlated with histologic disease severity. Metallothioneins featured prominently among differentially expressed genes, and functional pathway analysis identified specific convergent themes of dysregulation. MetaCore network analyses revealed various novel candidate hub genes (e.g. STAU2). Genes associated with M1-polarized macrophages and reactive astrocytes were strongly enriched in the meta-analysis data. Evaluation of genes enriched in neurons revealed 70 down-regulated genes, over half not previously associated with neurodegeneration. Comparison with aging brain data (3 patient cohorts, 221 samples) revealed 53 of these to be unique to neurodegenerative disease, many of which are strong candidates to be important in neuropathogenesis (e.g. NDN, NAP1L2). ENCODE ChIP-seq analysis predicted common upstream transcriptional regulators not associated with normal aging (REST, RBBP5, SIN3A, SP2, YY1, ZNF143, IKZF1). Finally, we removed genes common to neurodegeneration from disease-specific gene signatures, revealing uniquely robust immune response and JAK-STAT signaling in amyotrophic lateral sclerosis. Our results implicate pervasive bioenergetic deficits, M1-type microglial activation and gliosis as unifying themes of neurodegeneration, and identify numerous novel genes associated with neurodegenerative processes.
Configurable pattern-based evolutionary biclustering of gene expression data

PubMed Central

2013-01-01

Background Biclustering algorithms for microarray data aim at discovering functionally related gene sets under different subsets of experimental conditions. Due to the problem complexity and the characteristics of microarray datasets, heuristic searches are usually used instead of exhaustive algorithms. Also, the comparison among different techniques is still a challenge. The obtained results vary in relevant features such as the number of genes or conditions, which makes it difficult to carry out a fair comparison. Moreover, existing approaches do not allow the user to specify any preferences on these properties. Results Here, we present the first biclustering algorithm in which it is possible to particularize several biclusters features in terms of different objectives. This can be done by tuning the specified features in the algorithm or also by incorporating new objectives into the search. Furthermore, our approach bases the bicluster evaluation in the use of expression patterns, being able to recognize both shifting and scaling patterns either simultaneously or not. Evolutionary computation has been chosen as the search strategy, naming thus our proposal Evo-Bexpa (Evolutionary Biclustering based in Expression Patterns). Conclusions We have conducted experiments on both synthetic and real datasets demonstrating Evo-Bexpa abilities to obtain meaningful biclusters. Synthetic experiments have been designed in order to compare Evo-Bexpa performance with other approaches when looking for perfect patterns. Experiments with four different real datasets also confirm the proper performing of our algorithm, whose results have been biologically validated through Gene Ontology. PMID:23433178
Anticancer drug sensitivity prediction in cell lines from baseline gene expression through recursive feature selection.

PubMed

Dong, Zuoli; Zhang, Naiqian; Li, Chun; Wang, Haiyun; Fang, Yun; Wang, Jun; Zheng, Xiaoqi

2015-06-30

An enduring challenge in personalized medicine is to select right drug for individual patients. Testing drugs on patients in large clinical trials is one way to assess their efficacy and toxicity, but it is impractical to test hundreds of drugs currently under development. Therefore the preclinical prediction model is highly expected as it enables prediction of drug response to hundreds of cell lines in parallel. Recently, two large-scale pharmacogenomic studies screened multiple anticancer drugs on over 1000 cell lines in an effort to elucidate the response mechanism of anticancer drugs. To this aim, we here used gene expression features and drug sensitivity data in Cancer Cell Line Encyclopedia (CCLE) to build a predictor based on Support Vector Machine (SVM) and a recursive feature selection tool. Robustness of our model was validated by cross-validation and an independent dataset, the Cancer Genome Project (CGP). Our model achieved good cross validation performance for most drugs in the Cancer Cell Line Encyclopedia (≥80% accuracy for 10 drugs, ≥75% accuracy for 19 drugs). Independent tests on eleven common drugs between CCLE and CGP achieved satisfactory performance for three of them, i.e., AZD6244, Erlotinib and PD-0325901, using expression levels of only twelve, six and seven genes, respectively. These results suggest that drug response could be effectively predicted from genomic features. Our model could be applied to predict drug response for some certain drugs and potentially play a complementary role in personalized medicine.
Hepatic Leukemia Factor Promotes Resistance To Cell Death: Implications For Therapeutics and Chronotherapy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Waters, Katrina M.; Sontag, Ryan L.; Weber, Thomas J.

Physiological variation related to circadian rhythms and aberrant gene expression patterns are believed to modulate therapeutic efficacy, but the precise molecular determinants remain unclear. Here we examine the regulation of cell death by hepatic leukemia factor (HLF), which is an output regulator of circadian rhythms and is aberrantly expressed in human cancers, using an ectopic expression strategy in JB6 mouse epidermal cells and human keratinocytes. Ectopic HLF expression inhibited cell death in both JB6 cells and human keratinocytes, as induced by serum-starvation, tumor necrosis factor alpha and ionizing radiation. Microarray analysis indicates that HLF regulates a complex multi-gene transcriptional programmore » encompassing upregulation of anti-apoptotic genes, downregulation of pro-apoptotic genes, and many additional changes that are consistent with an anti-death program. Collectively, our results demonstrate that ectopic expression of HLF, an established transcription factor that cycles with circadian rhythms, can recapitulate many features associated with circadian-dependent physiological variation.« less
Bartter syndrome in two sisters with a novel mutation of the CLCNKB gene, one with deafness.

PubMed

Robitaille, Pierre; Merouani, Aicha; He, Ning; Pei, York

2011-09-01

This article describes two sisters with type III Bartter syndrome (BS) due to a novel missense variant of the CLCNKB gene. The phenotypic expression of the disease was very different in these two siblings. In one sister, the disease followed a very severe course, especially in the neonatal period and as a toddler. Both the classic symptoms and the biochemical features of the syndrome were striking. In addition, she presented with sensorineural deafness, a complication yet unreported in this subtype of BS In contrast, the least affected sister was symptom free and the biochemical features of the disease although present remained discrete throughout the prolonged follow-up. It is suggested that such a difference in the phenotypic expression of the disease is possibly secondary to the modifier effect of a gene and/or results from environmental factor(s).
ATP7A is a novel target of retinoic acid receptor β2 in neuroblastoma cells

PubMed Central

Bohlken, A; Cheung, B B; Bell, J L; Koach, J; Smith, S; Sekyere, E; Thomas, W; Norris, M; Haber, M; Lovejoy, D B; Richardson, D R; Marshall, G M

2009-01-01

Increased retinoic acid receptor β (RARβ2) gene expression is a hallmark of cancer cell responsiveness to retinoid anticancer effects. Moreover, low basal or induced RARβ2 expression is a common feature of many human cancers, suggesting that RARβ2 may act as a tumour suppressor gene in the absence of supplemented retinoid. We have previously shown that low RARβ2 expression is a feature of advanced neuroblastoma. Here, we demonstrate that the ABC domain of the RARβ2 protein alone was sufficient for the growth inhibitory effects of RARβ2 on neuroblastoma cells. ATP7A, the copper efflux pump, is a retinoid-responsive gene, was upregulated by ectopic overexpression of RARβ2. The ectopic overexpression of the RARβ2 ABC domain was sufficient to induce ATP7A expression, whereas, RARβ2 siRNA blocked the induction of ATP7A expression in retinoid-treated neuroblastoma cells. Forced downregulation of ATP7A reduced copper efflux and increased viability of retinoid-treated neuroblastoma cells. Copper supplementation enhanced cell growth and reduced retinoid-responsiveness, whereas copper chelation reduced the viability and proliferative capacity. Taken together, our data demonstrates ATP7A expression is regulated by retinoic acid receptor β and it has effects on intracellular copper levels, revealing a link between the anticancer action of retinoids and copper metabolism. PMID:19127267
Selective modes determine evolutionary rates, gene compactness and expression patterns in Brassica.

PubMed

Guo, Yue; Liu, Jing; Zhang, Jiefu; Liu, Shengyi; Du, Jianchang

2017-07-01

It has been well documented that most nuclear protein-coding genes in organisms can be classified into two categories: positively selected genes (PSGs) and negatively selected genes (NSGs). The characteristics and evolutionary fates of different types of genes, however, have been poorly understood. In this study, the rates of nonsynonymous substitution (K a ) and the rates of synonymous substitution (K s ) were investigated by comparing the orthologs between the two sequenced Brassica species, Brassica rapa and Brassica oleracea, and the evolutionary rates, gene structures, expression patterns, and codon bias were compared between PSGs and NSGs. The resulting data show that PSGs have higher protein evolutionary rates, lower synonymous substitution rates, shorter gene length, fewer exons, higher functional specificity, lower expression level, higher tissue-specific expression and stronger codon bias than NSGs. Although the quantities and values are different, the relative features of PSGs and NSGs have been largely verified in the model species Arabidopsis. These data suggest that PSGs and NSGs differ not only under selective pressure (K a /K s ), but also in their evolutionary, structural and functional properties, indicating that selective modes may serve as a determinant factor for measuring evolutionary rates, gene compactness and expression patterns in Brassica. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
PAGER 2.0: an update to the pathway, annotated-list and gene-signature electronic repository for Human Network Biology

PubMed Central

Yue, Zongliang; Zheng, Qi; Neylon, Michael T; Yoo, Minjae; Shin, Jimin; Zhao, Zhiying; Tan, Aik Choon

2018-01-01

Abstract Integrative Gene-set, Network and Pathway Analysis (GNPA) is a powerful data analysis approach developed to help interpret high-throughput omics data. In PAGER 1.0, we demonstrated that researchers can gain unbiased and reproducible biological insights with the introduction of PAGs (Pathways, Annotated-lists and Gene-signatures) as the basic data representation elements. In PAGER 2.0, we improve the utility of integrative GNPA by significantly expanding the coverage of PAGs and PAG-to-PAG relationships in the database, defining a new metric to quantify PAG data qualities, and developing new software features to simplify online integrative GNPA. Specifically, we included 84 282 PAGs spanning 24 different data sources that cover human diseases, published gene-expression signatures, drug–gene, miRNA–gene interactions, pathways and tissue-specific gene expressions. We introduced a new normalized Cohesion Coefficient (nCoCo) score to assess the biological relevance of genes inside a PAG, and RP-score to rank genes and assign gene-specific weights inside a PAG. The companion web interface contains numerous features to help users query and navigate the database content. The database content can be freely downloaded and is compatible with third-party Gene Set Enrichment Analysis tools. We expect PAGER 2.0 to become a major resource in integrative GNPA. PAGER 2.0 is available at http://discovery.informatics.uab.edu/PAGER/. PMID:29126216

Contrasting expression of membrane metalloproteinases, MT1-MMP and MT3-MMP, suggests distinct functions in skeletal development.

PubMed

Yang, Maozhou; Zhang, Bingbing; Zhang, Liang; Gibson, Gary

2008-07-01

Membrane-type 1 matrix metalloproteinase (MT1-MMP) is the most ubiquitous and widely studied of the membrane-type metalloproteinases (MT-MMPs). It was thus surprising to find no published data on chicken MT1-MMP. We report here the characterization of the chicken gene. Its low sequence identity with the MT1-MMP genes of other species, high GC content, and divergent catalytic domain explains the absence of data and our difficulties in characterizing the gene. The absence of structural features in the chicken gene that have been suggested to be critical for the activation of MMP-2 by MT1-MMP; for the effect of MT1-MMP on cell migration and for the recycling of MT1-MMP suggest these features are either not essential or that MT1-MMP does not perform these functions in chickens. Comparison of the expression of chicken MT1-MMP with MT3-MMP and with MMP-2 and MMP-13 has confirmed the previously recognized co-expression of MT1-MMP with MMP-2 and MMP-13 in fibrous and vascular tissues, particularly those surrounding the developing long bones in other species. By contrast, MT3-MMP expression differs markedly from that of MT1-MMP and of both MMP-2 and MMP-13. MT3-MMP is expressed by chondrocytes of the developing articular surface. Similar expression patterns of this group of MT-MMPs and MMPs have been observed in mouse embryos and suggest distinct and specific functions for MT1-MMP and MT3-MMP in skeletal development.
Turning the gene tap off; implications of regulating gene expression for cancer therapeutics

PubMed Central

Curtin, James F.; Candolfi, Marianela; Xiong, Weidong; Lowenstein, Pedro R.; Castro, Maria G.

2008-01-01

Cancer poses a tremendous therapeutic challenge worldwide, highlighting the critical need for developing novel therapeutics. A promising cancer treatment modality is gene therapy, which is a form of molecular medicine designed to introduce into target cells genetic material with therapeutic intent. Anticancer gene therapy strategies currently used in preclinical models, and in some cases in the clinic, include proapoptotic genes, oncolytic/replicative vectors, conditional cytotoxic approaches, inhibition of angiogenesis, inhibition of growth factor signaling, inactivation of oncogenes, inhibition of tumor invasion and stimulation of the immune system. The translation of these novel therapeutic modalities from the preclinical setting to the clinic has been driven by encouraging preclinical efficacy data and advances in gene delivery technologies. One area of intense research involves the ability to accurately regulate the levels of therapeutic gene expression to achieve enhanced efficacy and provide the capability to switch gene expression off completely if adverse side effects should arise. This feature could also be implemented to switch gene expression off when a successful therapeutic outcome ensues. Here, we will review recent developments related to the engineering of transcriptional switches within gene delivery systems, which could be implemented in clinical gene therapy applications directed at the treatment of cancer. PMID:18347132
Comprehensive gene expression profiling following DNA vaccination of rainbow trout against infectious hematopoietic necrosis virus

USGS Publications Warehouse

Purcell, Maureen K.; Nichols, Krista M.; Winton, James R.; Kurath, Gael; Thorgaard, Gary H.; Wheeler, Paul; Hansen, John D.; Herwig, Russell P.; Park, Linda K.

2006-01-01

The DNA vaccine based on the glycoprotein gene of Infectious hematopoietic necrosis virus induces a non-specific anti-viral immune response and long-term specific immunity against IHNV. This study characterized gene expression responses associated with the early anti-viral response. Homozygous rainbow trout were injected intra-muscularly (I.M.) with vector DNA or the IHNV DNA vaccine. Gene expression in muscle tissue (I.M. site) was evaluated using a 16,008 feature salmon cDNA microarray. Eighty different genes were significantly modulated in the vector DNA group while 910 genes were modulated in the IHNV DNA vaccinate group relative to control group. Quantitative reverse-transcriptase PCR was used to examine expression of selected immune genes at the I.M. site and in other secondary tissues. In the localized response (I.M. site), the magnitudes of gene expression changes were much greater in the vaccinate group relative to the vector DNA group for the majority of genes analyzed. At secondary systemic sites (e.g. gill, kidney and spleen), type I IFN-related genes were up-regulated in only the IHNV DNA vaccinated group. The results presented here suggest that the IHNV DNA vaccine induces up-regulation of the type I IFN system across multiple tissues, which is the functional basis of early anti-viral immunity.
Identification and characterization of the grape WRKY family.

PubMed

Zhang, Ying; Feng, Jian Can

2014-01-01

WRKY transcription factors have functions in plant growth and development and in response to biotic and abiotic stresses. Many studies have focused on functional identification of WRKY transcription factors, but little is known about the molecular phylogeny or global expression patterns of the complete WRKY family. In this study, we identified 80 WRKY proteins encoded in the grape genome. Based on the structural features of these proteins, the grape WRKY genes were classified into three groups (groups 1-3). Analysis of WRKY genes expression profiles indicated that 28 WRKY genes were differentially expressed in response to biotic stress caused by grape whiterot and/or salicylic acid (SA). In that 16 WRKY genes upregulated both by whiterot pathogenic bacteria and SA. The results indicated that 16 WRKY proteins participated in SA-dependent defense signal pathway. This study provides a basis for cloning genes with specific functions from grape.
The impact of preserved Klotho gene expression on anti-oxidative stress activity in healthy kidney.

PubMed

Kimura, Takaaki; Shiizaki, Kazuhiro; Kurosu, Hiroshi; Akimoto, Tetsu; Shinzato, Takahiro; Shimizu, Toshihiro; Kurosawa, Akira; Kubo, Taro; Nanmoku, Koji; Kuro-O, Makoto; Yagisawa, Takashi

2018-04-25

Klotho, which was originally identified as an anti-aging gene, forms a complex with fibroblast growth factor 23 (FGF23) receptor in kidney, with subsequent signaling that regulates mineral metabolism. Other biological activities of Klotho including anti-aging effects such as protection from various cellular stress have been shown, however, the precise mechanism of these effects of Klotho gene in the healthy human kidney is not well understood. In this study, we examined the relationships of Klotho and anti-oxidative stress gene expression levels in zero-hour biopsy specimens from 44 donors in kidney transplantation and verified them in animal models whose Klotho gene expression levels were varied. The nitrotyrosine expression level in kidney was evaluated in these animal models. Expression levels of Klotho gene were positively correlated with p53 gene, and antioxidant enzyme genes such as Catalase, superoxide dismutase 1 (SOD1), SOD2, peroxiredoxin 3 (PRDX3), and glutathione peroxidase 1 (GPX1) but not clinical parameters such as age and renal function, and pathological features such as glomerulosclerosis and interstitial fibrosis tubular atrophy. The expression levels of all genes were significantly higher in mice with Klotho overexpression than in wild-type mice, and those except for PRDX3 and GPX1 were significantly lower in Klotho-deficient mice than in wild-type littermate mice. Nitrotyrosine-positive bands of various sizes were observed in kidney from Klotho-deficient mice only. The preservation of Klotho gene expression might induce the anti-oxidative stress mechanism for homeostasis of healthy human kidney independently of its general condition including age, renal function, and histological findings.
Id expression in amphioxus and lamprey highlights the role of gene cooption during neural crest evolution

NASA Technical Reports Server (NTRS)

Meulemans, Daniel; McCauley, David; Bronner-Fraser, Marianne

2003-01-01

Neural crest cells are unique to vertebrates and generate many of the adult structures that differentiate them from their closest invertebrate relatives, the cephalochordates. Id genes are robust markers of neural crest cells at all stages of development. We compared Id gene expression in amphioxus and lamprey to ask if cephalochordates deploy Id genes at the neural plate border and dorsal neural tube in a manner similar to vertebrates. Furthermore, we examined whether Id expression in these cells is a basal vertebrate trait or a derived feature of gnathostomes. We found that while expression of Id genes in the mesoderm and endoderm is conserved between amphioxus and vertebrates, expression in the lateral neural plate border and dorsal neural tube is a vertebrate novelty. Furthermore, expression of lamprey Id implies that recruitment of Id genes to these cells occurred very early in the vertebrate lineage. Based on expression in amphioxus we postulate that Id cooption conferred sensory cell progenitor-like properties upon the lateral neurectoderm, and pharyngeal mesoderm-like properties upon cranial neural crest. Amphioxus Id expression is also consistent with homology between the anterior neurectoderm of amphioxus and the presumptive placodal ectoderm of vertebrates. These observations support the idea that neural crest evolution was driven in large part by cooption of multipurpose transcriptional regulators from other tissues and cell types.
Eye development in the four-eyed fish Anableps anableps: cranial and retinal adaptations to simultaneous aerial and aquatic vision

PubMed Central

Perez, Louise N.; Lorena, Jamily; Costa, Carinne M.; Araujo, Maysa S.; Frota-Lima, Gabriela N.; Matos-Rodrigues, Gabriel E.; Martins, Rodrigo A. P.; Mattox, George M. T.

2017-01-01

The unique eyes of the four-eyed fish Anableps anableps have long intrigued biologists. Key features associated with the bulging eye of Anableps include the expanded frontal bone and the duplicated pupils and cornea. Furthermore, the Anableps retina expresses different photoreceptor genes in dorsal and ventral regions, potentially associated with distinct aerial and aquatic stimuli. To gain insight into the developmental basis of the Anableps unique eye, we examined neurocranium and eye ontogeny, as well as photoreceptor gene expression during larval stages. First, we described six larval stages during which duplication of eye structures occurs. Our osteological analysis of neurocranium ontogeny revealed another distinctive Anablepid feature: an ossified interorbital septum partially separating the orbital cavities. Furthermore, we identified the onset of differences in cell proliferation and cell layer density between dorsal and ventral regions of the retina. Finally, we show that differential photoreceptor gene expression in the retina initiates during development, suggesting that it is inherited and not environmentally determined. In sum, our results shed light on the ontogenetic steps leading to the highly derived Anableps eye. PMID:28381624
Follicular lymphomas with and without translocation t(14;18) differ in gene expression profiles and genetic alterations

PubMed Central

Leich, Ellen; Salaverria, Itziar; Bea, Silvia; Zettl, Andreas; Wright, George; Moreno, Victor; Gascoyne, Randy D.; Chan, Wing-Chung; Braziel, Rita M.; Rimsza, Lisa M.; Weisenburger, Dennis D.; Delabie, Jan; Jaffe, Elaine S.; Lister, Andrew; Fitzgibbon, Jude; Staudt, Louis M.; Hartmann, Elena M.; Mueller-Hermelink, Hans-Konrad; Campo, Elias; Ott, German

2009-01-01

Follicular lymphoma (FL) is genetically characterized by the presence of the t(14;18)(q32;q21) chromosomal translocation in approximately 90% of cases. In contrast to FL carrying the t(14;18), their t(14;18)-negative counterparts are less well studied about their immunohistochemical, genetic, molecular, and clinical features. Within a previously published series of 184 FLs grades 1 to 3A with available gene expression data, we identified 17 FLs lacking the t(14;18). Comparative genomic hybridization and high-resolution single nucleotide polymorphism (SNP) array profiling showed that gains/amplifications of the BCL2 gene locus in 18q were restricted to the t(14;18)-positive FL subgroup. A comparison of gene expression profiles showed an enrichment of germinal center B cell–associated signatures in t(14;18)-positive FL, whereas activated B cell–like, NFκB, proliferation, and bystander cell signatures were enriched in t(14;18)-negative FL. These findings were confirmed by immunohistochemistry in an independent validation series of 84 FLs, in which 32% of t(14;18)-negative FLs showed weak or absent CD10 expression and 91% an increased Ki67 proliferation rate. Although overall survival did not differ between FL with and without t(14;18), our findings suggest distinct molecular features of t(14;18)-negative FL. PMID:19471018
DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, X; Zhou, Z; Thomas, K

Purpose: The goal of this work is to investigate the use of contrast enhanced computed tomographic (CT) features for the prediction of mutations of BAP1, PBRM1, and VHL genes in renal cell carcinoma (RCC). Methods: For this study, we used two patient databases with renal cell carcinoma (RCC). The first one consisted of 33 patients from our institution (UT Southwestern Medical Center, UTSW). The second one consisted of 24 patients from the Cancer Imaging Archive (TCIA), where each patient is connected by a unique identi?er to the tissue samples from the Cancer Genome Atlas (TCGA). From the contrast enhanced CTmore » image of each patient, tumor contour was first delineated by a physician. Geometry, intensity, and texture features were extracted from the delineated tumor. Based on UTSW dataset, we completed feature selection and trained a support vector machine (SVM) classifier to predict mutations of BAP1, PBRM1 and VHL genes. We then used TCIA-TCGA dataset to validate the predictive model build upon UTSW dataset. Results: The prediction accuracy of gene expression of TCIA-TCGA patients was 0.83 (20 of 24), 0.83 (20 of 24), and 0.75 (18 of 24) for BAP1, PBRM1, and VHL respectively. For BAP1 gene, texture feature was the most prominent feature type. For PBRM1 gene, intensity feature was the most prominent. For VHL gene, geometry, intensity, and texture features were all important. Conclusion: Using our feature selection strategy and models, we achieved predictive accuracy over 0.75 for all three genes under the condition of using patient data from one institution for training and data from other institutions for testing. These results suggest that radiogenomics can be used to aid in prognosis and used as convenient surrogates for expensive and time consuming gene assay procedures.« less
Isolation and genome-wide expression and methylation characterization of CD31+ cells from normal and malignant human prostate tissue

PubMed Central

Luo, Wei; Hu, Qiang; Wang, Dan; Deeb, Kristin K.; Ma, Yingyu; Morrison, Carl D.; Liu, Song; Johnson, Candace S.; Trump, Donald L.

2013-01-01

Endothelial cells (ECs) are an important component involved in the angiogenesis. Little is known about the global gene expression and epigenetic regulation in tumor endothelial cells. The identification of gene expression and epigenetic difference between human prostate tumor-derived endothelial cells (TdECs) and those in normal tissues may uncover unique biological features of TdEC and facilitate the discovery of new anti-angiogenic targets. We established a method for isolation of CD31+ endothelial cells from malignant and normal prostate tissues obtained at prostatectomy. TdECs and normal-derived ECs (NdECs) showed >90% enrichment in primary culture and demonstrated microvascular endothelial cell characteristics such as cobblestone morphology in monolayer culture, diI-acetyl-LDL uptake and capillary-tube like formation in Matrigel®. In vitro primary cultures of ECs maintained expression of endothelial markers such as CD31, von Willebrand factor, intercellular adhesion molecule, vascular endothelial growth factor receptor 1, and vascular endothelial growth factor receptor 2. We then conducted a pilot study of transcriptome and methylome analysis of TdECs and matched NdECs from patients with prostate cancer. We observed a wide spectrum of differences in gene expression and methylation patterns in endothelial cells, between malignant and normal prostate tissues. Array-based expression and methylation data were validated by qRT-PCR and bisulfite DNA pyrosequencing. Further analysis of transcriptome and methylome data revealed a number of differentially expressed genes with loci whose methylation change is accompanied by an inverse change in gene expression. Our study demonstrates the feasibility of isolation of ECs from histologically normal prostate and prostate cancer via CD31+ selection. The data, although preliminary, indicates that there exist widespread differences in methylation and transcription between TdECs and NdECs. Interestingly, only a small proportion of perturbed genes were overlapped between American (AA) and Caucasian American (CA) patients with prostate cancer. Our study indicates that identifying gene expression and/or epigenetic differences between TdECs and NdECs may provide us with new anti-angiogenic targets. Future studies will be required to further characterize the isolated ECs and determine the biological features that can be exploited in the prognosis and therapy of prostate cancer. PMID:23978847
Transcriptional profiles of Arabidopsis stomataless mutants reveal developmental and physiological features of life in the absence of stomata

PubMed Central

de Marcos, Alberto; Triviño, Magdalena; Pérez-Bueno, María Luisa; Ballesteros, Isabel; Barón, Matilde; Mena, Montaña; Fenoll, Carmen

2015-01-01

Loss of function of the positive stomata development regulators SPCH or MUTE in Arabidopsis thaliana renders stomataless plants; spch-3 and mute-3 mutants are extreme dwarfs, but produce cotyledons and tiny leaves, providing a system to interrogate plant life in the absence of stomata. To this end, we compared their cotyledon transcriptomes with that of wild-type plants. K-means clustering of differentially expressed genes generated four clusters: clusters 1 and 2 grouped genes commonly regulated in the mutants, while clusters 3 and 4 contained genes distinctively regulated in mute-3. Classification in functional categories and metabolic pathways of genes in clusters 1 and 2 suggested that both mutants had depressed secondary, nitrogen and sulfur metabolisms, while only a few photosynthesis-related genes were down-regulated. In situ quenching analysis of chlorophyll fluorescence revealed limited inhibition of photosynthesis. This and other fluorescence measurements matched the mutant transcriptomic features. Differential transcriptomes of both mutants were enriched in growth-related genes, including known stomata development regulators, which paralleled their epidermal phenotypes. Analysis of cluster 3 was not informative for developmental aspects of mute-3. Cluster 4 comprised genes differentially up−regulated in mute−3, 35% of which were direct targets for SPCH and may relate to the unique cell types of mute−3. A screen of T-DNA insertion lines in genes differentially expressed in the mutants identified a gene putatively involved in stomata development. A collection of lines for conditional overexpression of transcription factors differentially expressed in the mutants rendered distinct epidermal phenotypes, suggesting that these proteins may be novel stomatal development regulators. Thus, our transcriptome analysis represents a useful source of new genes for the study of stomata development and for characterizing physiology and growth in the absence of stomata. PMID:26157447
Computational exploration of cis-regulatory modules in rhythmic expression data using the "Exploration of Distinctive CREs and CRMs" (EDCC) and "CRM Network Generator" (CNG) programs.

PubMed

Bekiaris, Pavlos Stephanos; Tekath, Tobias; Staiger, Dorothee; Danisman, Selahattin

2018-01-01

Understanding the effect of cis-regulatory elements (CRE) and clusters of CREs, which are called cis-regulatory modules (CRM), in eukaryotic gene expression is a challenge of computational biology. We developed two programs that allow simple, fast and reliable analysis of candidate CREs and CRMs that may affect specific gene expression and that determine positional features between individual CREs within a CRM. The first program, "Exploration of Distinctive CREs and CRMs" (EDCC), correlates candidate CREs and CRMs with specific gene expression patterns. For pairs of CREs, EDCC also determines positional preferences of the single CREs in relation to each other and to the transcriptional start site. The second program, "CRM Network Generator" (CNG), prioritizes these positional preferences using a neural network and thus allows unbiased rating of the positional preferences that were determined by EDCC. We tested these programs with data from a microarray study of circadian gene expression in Arabidopsis thaliana. Analyzing more than 1.5 million pairwise CRE combinations, we found 22 candidate combinations, of which several contained known clock promoter elements together with elements that had not been identified as relevant to circadian gene expression before. CNG analysis further identified positional preferences of these CRE pairs, hinting at positional information that may be relevant for circadian gene expression. Future wet lab experiments will have to determine which of these combinations confer daytime specific circadian gene expression.
Computational exploration of cis-regulatory modules in rhythmic expression data using the “Exploration of Distinctive CREs and CRMs” (EDCC) and “CRM Network Generator” (CNG) programs

PubMed Central

Staiger, Dorothee

2018-01-01

Understanding the effect of cis-regulatory elements (CRE) and clusters of CREs, which are called cis-regulatory modules (CRM), in eukaryotic gene expression is a challenge of computational biology. We developed two programs that allow simple, fast and reliable analysis of candidate CREs and CRMs that may affect specific gene expression and that determine positional features between individual CREs within a CRM. The first program, “Exploration of Distinctive CREs and CRMs” (EDCC), correlates candidate CREs and CRMs with specific gene expression patterns. For pairs of CREs, EDCC also determines positional preferences of the single CREs in relation to each other and to the transcriptional start site. The second program, “CRM Network Generator” (CNG), prioritizes these positional preferences using a neural network and thus allows unbiased rating of the positional preferences that were determined by EDCC. We tested these programs with data from a microarray study of circadian gene expression in Arabidopsis thaliana. Analyzing more than 1.5 million pairwise CRE combinations, we found 22 candidate combinations, of which several contained known clock promoter elements together with elements that had not been identified as relevant to circadian gene expression before. CNG analysis further identified positional preferences of these CRE pairs, hinting at positional information that may be relevant for circadian gene expression. Future wet lab experiments will have to determine which of these combinations confer daytime specific circadian gene expression. PMID:29298348
Constitutional trisomy 8 mosaicism as a model for epigenetic studies of aneuploidy

PubMed Central

2013-01-01

Background To investigate epigenetic patterns associated with aneuploidy we used constitutional trisomy 8 mosaicism (CT8M) as a model, enabling analyses of single cell clones, harboring either trisomy or disomy 8, from the same patient; this circumvents any bias introduced by using cells from unrelated, healthy individuals as controls. We profiled gene and miRNA expression as well as genome-wide and promoter specific DNA methylation and hydroxymethylation patterns in trisomic and disomic fibroblasts, using microarrays and methylated DNA immunoprecipitation. Results Trisomy 8-positive fibroblasts displayed a characteristic expression and methylation phenotype distinct from disomic fibroblasts, with the majority (65%) of chromosome 8 genes in the trisomic cells being overexpressed. However, 69% of all deregulated genes and non-coding RNAs were not located on this chromosome. Pathway analysis of the deregulated genes revealed that cancer, genetic disorder, and hematopoiesis were top ranked. The trisomy 8-positive cells displayed depletion of 5-hydroxymethylcytosine and global hypomethylation of gene-poor regions on chromosome 8, thus partly mimicking the inactivated X chromosome in females. Conclusions Trisomy 8 affects genes situated also on other chromosomes which, in cooperation with the observed chromosome 8 gene dosage effect, has an impact on the clinical features of CT8M, as demonstrated by the pathway analysis revealing key features that might explain the increased incidence of hematologic malignancies in CT8M patients. Furthermore, we hypothesize that the general depletion of hydroxymethylation and global hypomethylation of chromosome 8 may be unrelated to gene expression regulation, instead being associated with a general mechanism of chromatin processing and compartmentalization of additional chromosomes. PMID:23816241
Identification, Classification, and Expression Analysis of GRAS Gene Family in Malus domestica

PubMed Central

Fan, Sheng; Zhang, Dong; Gao, Cai; Zhao, Ming; Wu, Haiqin; Li, Youmei; Shen, Yawen; Han, Mingyu

2017-01-01

GRAS genes encode plant-specific transcription factors that play important roles in plant growth and development. However, little is known about the GRAS gene family in apple. In this study, 127 GRAS genes were identified in the apple (Malus domestica Borkh.) genome and named MdGRAS1 to MdGRAS127 according to their chromosomal locations. The chemical characteristics, gene structures and evolutionary relationships of the MdGRAS genes were investigated. The 127 MdGRAS genes could be grouped into eight subfamilies based on their structural features and phylogenetic relationships. Further analysis of gene structures, segmental and tandem duplication, gene phylogeny and tissue-specific expression with ArrayExpress database indicated their diversification in quantity, structure and function. We further examined the expression pattern of MdGRAS genes during apple flower induction with transcriptome sequencing. Eight higher MdGRAS (MdGRAS6, 26, 28, 44, 53, 64, 107, and 122) genes were surfaced. Further quantitative reverse transcription PCR indicated that the candidate eight genes showed distinct expression patterns among different tissues (leaves, stems, flowers, buds, and fruits). The transcription levels of eight genes were also investigated with various flowering related treatments (GA3, 6-BA, and sucrose) and different flowering varieties (Yanfu No. 6 and Nagafu No. 2). They all were affected by flowering-related circumstance and showed different expression level. Changes in response to these hormone or sugar related treatments indicated their potential involvement during apple flower induction. Taken together, our results provide rich resources for studying GRAS genes and their potential clues in genetic improvement of apple flowering, which enriches biological theories of GRAS genes in apple and their involvement in flower induction of fruit trees. PMID:28503152
Identification, Classification, and Expression Analysis of GRAS Gene Family in Malus domestica.

PubMed

Fan, Sheng; Zhang, Dong; Gao, Cai; Zhao, Ming; Wu, Haiqin; Li, Youmei; Shen, Yawen; Han, Mingyu

2017-01-01

GRAS genes encode plant-specific transcription factors that play important roles in plant growth and development. However, little is known about the GRAS gene family in apple. In this study, 127 GRAS genes were identified in the apple ( Malus domestica Borkh.) genome and named MdGRAS1 to MdGRAS127 according to their chromosomal locations. The chemical characteristics, gene structures and evolutionary relationships of the MdGRAS genes were investigated. The 127 MdGRAS genes could be grouped into eight subfamilies based on their structural features and phylogenetic relationships. Further analysis of gene structures, segmental and tandem duplication, gene phylogeny and tissue-specific expression with ArrayExpress database indicated their diversification in quantity, structure and function. We further examined the expression pattern of MdGRAS genes during apple flower induction with transcriptome sequencing. Eight higher MdGRAS ( MdGRAS6, 26, 28, 44, 53, 64, 107 , and 122 ) genes were surfaced. Further quantitative reverse transcription PCR indicated that the candidate eight genes showed distinct expression patterns among different tissues (leaves, stems, flowers, buds, and fruits). The transcription levels of eight genes were also investigated with various flowering related treatments (GA 3 , 6-BA, and sucrose) and different flowering varieties (Yanfu No. 6 and Nagafu No. 2). They all were affected by flowering-related circumstance and showed different expression level. Changes in response to these hormone or sugar related treatments indicated their potential involvement during apple flower induction. Taken together, our results provide rich resources for studying GRAS genes and their potential clues in genetic improvement of apple flowering, which enriches biological theories of GRAS genes in apple and their involvement in flower induction of fruit trees.
An Integrated Bioinformatics Approach Identifies Elevated Cyclin E2 Expression and E2F Activity as Distinct Features of Tamoxifen Resistant Breast Tumors

PubMed Central

Huang, Lei; Zhao, Shuangping; Frasor, Jonna M.; Dai, Yang

2011-01-01

Approximately half of estrogen receptor (ER) positive breast tumors will fail to respond to endocrine therapy. Here we used an integrative bioinformatics approach to analyze three gene expression profiling data sets from breast tumors in an attempt to uncover underlying mechanisms contributing to the development of resistance and potential therapeutic strategies to counteract these mechanisms. Genes that are differentially expressed in tamoxifen resistant vs. sensitive breast tumors were identified from three different publically available microarray datasets. These differentially expressed (DE) genes were analyzed using gene function and gene set enrichment and examined in intrinsic subtypes of breast tumors. The Connectivity Map analysis was utilized to link gene expression profiles of tamoxifen resistant tumors to small molecules and validation studies were carried out in a tamoxifen resistant cell line. Despite little overlap in genes that are differentially expressed in tamoxifen resistant vs. sensitive tumors, a high degree of functional similarity was observed among the three datasets. Tamoxifen resistant tumors displayed enriched expression of genes related to cell cycle and proliferation, as well as elevated activity of E2F transcription factors, and were highly correlated with a Luminal intrinsic subtype. A number of small molecules, including phenothiazines, were found that induced a gene signature in breast cancer cell lines opposite to that found in tamoxifen resistant vs. sensitive tumors and the ability of phenothiazines to down-regulate cyclin E2 and inhibit proliferation of tamoxifen resistant breast cancer cells was validated. Our findings demonstrate that an integrated bioinformatics approach to analyze gene expression profiles from multiple breast tumor datasets can identify important biological pathways and potentially novel therapeutic options for tamoxifen-resistant breast cancers. PMID:21789246
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.

PubMed

Johnson, Benjamin K; Scholz, Matthew B; Teal, Tracy K; Abramovitch, Robert B

2016-02-04

Many tools exist in the analysis of bacterial RNA sequencing (RNA-seq) transcriptional profiling experiments to identify differentially expressed genes between experimental conditions. Generally, the workflow includes quality control of reads, mapping to a reference, counting transcript abundance, and statistical tests for differentially expressed genes. In spite of the numerous tools developed for each component of an RNA-seq analysis workflow, easy-to-use bacterially oriented workflow applications to combine multiple tools and automate the process are lacking. With many tools to choose from for each step, the task of identifying a specific tool, adapting the input/output options to the specific use-case, and integrating the tools into a coherent analysis pipeline is not a trivial endeavor, particularly for microbiologists with limited bioinformatics experience. To make bacterial RNA-seq data analysis more accessible, we developed a Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis (SPARTA). SPARTA is a reference-based bacterial RNA-seq analysis workflow application for single-end Illumina reads. SPARTA is turnkey software that simplifies the process of analyzing RNA-seq data sets, making bacterial RNA-seq analysis a routine process that can be undertaken on a personal computer or in the classroom. The easy-to-install, complete workflow processes whole transcriptome shotgun sequencing data files by trimming reads and removing adapters, mapping reads to a reference, counting gene features, calculating differential gene expression, and, importantly, checking for potential batch effects within the data set. SPARTA outputs quality analysis reports, gene feature counts and differential gene expression tables and scatterplots. SPARTA provides an easy-to-use bacterial RNA-seq transcriptional profiling workflow to identify differentially expressed genes between experimental conditions. This software will enable microbiologists with limited bioinformatics experience to analyze their data and integrate next generation sequencing (NGS) technologies into the classroom. The SPARTA software and tutorial are available at sparta.readthedocs.org.
Roles for Msx and Dlx homeoproteins in vertebrate development.

PubMed

Bendall, A J; Abate-Shen, C

2000-04-18

This review provides a comparative analysis of the expression patterns, functions, and biochemical properties of Msx and Dlx homeobox genes. These comprise multi-gene families that are closely related with respect to sequence features as well as expression patterns during vertebrate development. Thus, members of the Msx and Dlx families are expressed in overlapping, but distinct, patterns and display complementary or antagonistic functions, depending upon the context. A common theme shared among Msx and Dlx genes is that they are required during early, middle, and late phases of development where their differential expression mediates patterning, morphogenesis, and histogenesis of tissues in which they are expressed. With respect to their biochemical properties, Msx proteins function as transcriptional repressors, while Dlx proteins are transcriptional activators. Moreover, their ability to oppose each other's transcriptional actions implies a mechanism underlying their complementary or antagonistic functions during development.
Identifying marker genes in transcription profiling data using a mixture of feature relevance experts.

PubMed

Chow, M L; Moler, E J; Mian, I S

2001-03-08

Transcription profiling experiments permit the expression levels of many genes to be measured simultaneously. Given profiling data from two types of samples, genes that most distinguish the samples (marker genes) are good candidates for subsequent in-depth experimental studies and developing decision support systems for diagnosis, prognosis, and monitoring. This work proposes a mixture of feature relevance experts as a method for identifying marker genes and illustrates the idea using published data from samples labeled as acute lymphoblastic and myeloid leukemia (ALL, AML). A feature relevance expert implements an algorithm that calculates how well a gene distinguishes samples, reorders genes according to this relevance measure, and uses a supervised learning method [here, support vector machines (SVMs)] to determine the generalization performances of different nested gene subsets. The mixture of three feature relevance experts examined implement two existing and one novel feature relevance measures. For each expert, a gene subset consisting of the top 50 genes distinguished ALL from AML samples as completely as all 7,070 genes. The 125 genes at the union of the top 50s are plausible markers for a prototype decision support system. Chromosomal aberration and other data support the prediction that the three genes at the intersection of the top 50s, cystatin C, azurocidin, and adipsin, are good targets for investigating the basic biology of ALL/AML. The same data were employed to identify markers that distinguish samples based on their labels of T cell/B cell, peripheral blood/bone marrow, and male/female. Selenoprotein W may discriminate T cells from B cells. Results from analysis of transcription profiling data from tumor/nontumor colon adenocarcinoma samples support the general utility of the aforementioned approach. Theoretical issues such as choosing SVM kernels and their parameters, training and evaluating feature relevance experts, and the impact of potentially mislabeled samples on marker identification (feature selection) are discussed.

Querying Co-regulated Genes on Diverse Gene Expression Datasets Via Biclustering.

PubMed

Deveci, Mehmet; Küçüktunç, Onur; Eren, Kemal; Bozdağ, Doruk; Kaya, Kamer; Çatalyürek, Ümit V

2016-01-01

Rapid development and increasing popularity of gene expression microarrays have resulted in a number of studies on the discovery of co-regulated genes. One important way of discovering such co-regulations is the query-based search since gene co-expressions may indicate a shared role in a biological process. Although there exist promising query-driven search methods adapting clustering, they fail to capture many genes that function in the same biological pathway because microarray datasets are fraught with spurious samples or samples of diverse origin, or the pathways might be regulated under only a subset of samples. On the other hand, a class of clustering algorithms known as biclustering algorithms which simultaneously cluster both the items and their features are useful while analyzing gene expression data, or any data in which items are related in only a subset of their samples. This means that genes need not be related in all samples to be clustered together. Because many genes only interact under specific circumstances, biclustering may recover the relationships that traditional clustering algorithms can easily miss. In this chapter, we briefly summarize the literature using biclustering for querying co-regulated genes. Then we present a novel biclustering approach and evaluate its performance by a thorough experimental analysis.
Integration of Steady-State and Temporal Gene Expression Data for the Inference of Gene Regulatory Networks

PubMed Central

Wang, Yi Kan; Hurley, Daniel G.; Schnell, Santiago; Print, Cristin G.; Crampin, Edmund J.

2013-01-01

We develop a new regression algorithm, cMIKANA, for inference of gene regulatory networks from combinations of steady-state and time-series gene expression data. Using simulated gene expression datasets to assess the accuracy of reconstructing gene regulatory networks, we show that steady-state and time-series data sets can successfully be combined to identify gene regulatory interactions using the new algorithm. Inferring gene networks from combined data sets was found to be advantageous when using noisy measurements collected with either lower sampling rates or a limited number of experimental replicates. We illustrate our method by applying it to a microarray gene expression dataset from human umbilical vein endothelial cells (HUVECs) which combines time series data from treatment with growth factor TNF and steady state data from siRNA knockdown treatments. Our results suggest that the combination of steady-state and time-series datasets may provide better prediction of RNA-to-RNA interactions, and may also reveal biological features that cannot be identified from dynamic or steady state information alone. Finally, we consider the experimental design of genomics experiments for gene regulatory network inference and show that network inference can be improved by incorporating steady-state measurements with time-series data. PMID:23967277
Sexual selection, genetic conflict, selfish genes, and the atypical patterns of gene expression in spermatogenic cells.

PubMed

Kleene, Kenneth C

2005-01-01

This review proposes that the peculiar patterns of gene expression in spermatogenic cells are the consequence of powerful evolutionary forces known as sexual selection. Sexual selection is generally characterized by intense competition of males for females, an enormous variety of the strategies to maximize male reproductive success, exaggerated male traits at all levels of biological organization, co-evolution of sexual traits in males and females, and conflict between the sexual advantage of the male trait and the reproductive fitness of females and the individual fitness of both sexes. In addition, spermatogenesis is afflicted by selfish genes that promote their transmission to progeny while causing deleterious effects. Sexual selection, selfish genes, and genetic conflict provide compelling explanations for many atypical features of gene expression in spermatogenic cells including the gross overexpression of certain mRNAs, transcripts encoding truncated proteins that cannot carry out basic functions of the proteins encoded by the same genes in somatic cells, the large number of gene families containing paralogous genes encoding spermatogenic cell-specific isoforms, the large number of testis-cancer-associated genes that are expressed only in spermatogenic cells and malignant cells, and the overbearing role of Sertoli cells in regulating the number and quality of spermatozoa.
Global transgenerational gene expression dynamics in two newly synthesized allohexaploid wheat (Triticum aestivum) lines

PubMed Central

2012-01-01

Background Alteration in gene expression resulting from allopolyploidization is a prominent feature in plants, but its spectrum and extent are not fully known. Common wheat (Triticum aestivum) was formed via allohexaploidization about 10,000 years ago, and became the most important crop plant. To gain further insights into the genome-wide transcriptional dynamics associated with the onset of common wheat formation, we conducted microarray-based genome-wide gene expression analysis on two newly synthesized allohexaploid wheat lines with chromosomal stability and a genome constitution analogous to that of the present-day common wheat. Results Multi-color GISH (genomic in situ hybridization) was used to identify individual plants from two nascent allohexaploid wheat lines between Triticum turgidum (2n = 4x = 28; genome BBAA) and Aegilops tauschii (2n = 2x = 14; genome DD), which had a stable chromosomal constitution analogous to that of common wheat (2n = 6x = 42; genome BBAADD). Genome-wide analysis of gene expression was performed for these allohexaploid lines along with their parental plants from T. turgidum and Ae. tauschii, using the Affymetrix Gene Chip Wheat Genome-Array. Comparison with the parental plants coupled with inclusion of empirical mid-parent values (MPVs) revealed that whereas the great majority of genes showed the expected parental additivity, two major patterns of alteration in gene expression in the allohexaploid lines were identified: parental dominance expression and non-additive expression. Genes involved in each of the two altered expression patterns could be classified into three distinct groups, stochastic, heritable and persistent, based on their transgenerational heritability and inter-line conservation. Strikingly, whereas both altered patterns of gene expression showed a propensity of inheritance, identity of the involved genes was highly stochastic, consistent with the involvement of diverse Gene Ontology (GO) terms. Nonetheless, those genes showing non-additive expression exhibited a significant enrichment for vesicle-function. Conclusions Our results show that two patterns of global alteration in gene expression are conditioned by allohexaploidization in wheat, that is, parental dominance expression and non-additive expression. Both altered patterns of gene expression but not the identity of the genes involved are likely to play functional roles in stabilization and establishment of the newly formed allohexaploid plants, and hence, relevant to speciation and evolution of T. aestivum. PMID:22277161
Intrinsic gene expression subsets of diffuse cutaneous systemic sclerosis are stable in serial skin biopsies

PubMed Central

Pendergrass, Sarah A.; Lemaire, Raphael; Francis, Ian; Mahoney, J. Matthew; Lafyatis, Robert; Whitfield, Michael L.

2012-01-01

Skin biopsy gene expression was analyzed by DNA microarray from 13 dSSc patients enrolled in an open label study of rituximab, 9 dSSc patients not treated with rituximab, and 9 healthy controls. These data recapitulate the patient ‘intrinsic’ gene expression subsets described previously including proliferation, inflammatory, and normal-like groups. Serial skin biopsies showed consistent and non-progressing gene expression over time, and importantly, the patients in the inflammatory subset do not move to the fibroproliferative subset, and vice versa. We were unable to detect significant differences in gene expression before and after rituximab treatment, consistent with an apparent lack of clinical response. Serial biopsies from each patient stayed within the same gene expression subset regardless of treatment regimen or the time point at which they were taken. Collectively, these data emphasize the heterogeneous nature of SSc and demonstrate that the intrinsic subsets are an inherent, reproducible and stable feature of the disease that is independent of disease duration. Moreover, these data have fundamental importance for the future development of personalized therapy for SSc; drugs targeting inflammation are likely to benefit those patients with an inflammatory signature, whereas drugs targeting fibrosis are likely to benefit those with a fibroproliferative signature. PMID:22318389
HUMAN GLYCERALDEHYDE 3-PHOSPHATE DEHYDROGENASE-2 (GAPD2) GENE IS EXPRESSED SPECIFICALLY IN SPERMATOGENIC CELLS

EPA Science Inventory

Although the process of glycolysis is highly conserved in eukaryotes, several glycolytic enzymes have unique structural or functional features in spermatogenic cells. We previously identified and characterized the mouse complementary DNA (cDNA) and a gene for 1 of these enzymes, ...
Investigation of MACC1 Gene Expression in Head and Neck Cancer and Cancer Stem Cells.

PubMed

Evran, Ebru; Şahin, Hilal; Akbaş, Kübra; Çiğdem, Sadik; Gündüz, Esra

2016-12-01

By investigating the MACC1 gene (metastasis-associated in colon cancer 1) in cancer stem cells (CSC) resistant to chemotherapy and in cancer stem cells (CSC) resistant to chemotherapy and in cancer cells (CS) sensitive to chemotherapy we determineda steady expression in both types of cells in head and neck cancer. In conformity with the result we examined if this gene could be a competitor gene for chemotherapy. According to literature, the MACC1 gene shows a clear expression in head and neck cancer cells [1]. Here we examined MACC1 expression in CSC and investigated it as a possible biomarker. Our experiments were performed in the UT -SCC -74 in primary head and neck cancer cell line. We examined the MACC -1 gene expression by Real Time PCR from both isolated CSC and CS. Expression of MACC -1 gene of cancer stem cells showed an two-fold increase compared with cancer cells. Based on the positive expression of MACC1 in both CS and CSC, this gene may serve as a potential biomarker in head and neck cancer. By comparing the results of this study with the novel features of MACC1, two important hypotheses could be examined. The first hypothesis is that MACC1 is a possible transcripton factor in colon cancer, which influences a high expression of CSC in head and neck and affects the expression of three biomarkers of the CSC control group biomarkers. The second hypothesisis is that the positive expression of MACC1 in patients with a malignant prognosis of tongue cancer, which belongs to head and neck cancer types, operates a faster development of CSC to cancer cells.
Enhancer modularity and the evolution of new traits.

PubMed

Koshikawa, Shigeyuki

2015-01-01

Animals have modular cis-regulatory regions in their genomes, and expression of a single gene is often regulated by multiple enhancers residing in such a region. In the laboratory, and also in natural populations, loss of an enhancer can result in a loss of gene expression. Although only a few examples have been well characterized to date, some studies have suggested that an evolutionary gain of a new enhancer function can establish a new gene expression domain. Our recent study showed that Drosophila guttifera has more enhancers and additional expression domains of the wingless gene during the pupal stage, compared to D. melanogaster, and that these new features appear to have evolved in the ancestral lineage leading to D. guttifera. (1) Gain of a new expression domain of a developmental regulatory gene (toolkit gene), such as wingless, can cause co-option of the expression of its downstream genes to the new domain, resulting in duplication of a preexisting structure at this new body position. Recently, with the advancement of evo-devo studies, we have learned that the developmental regulatory systems are strikingly similar across various animal taxa, in spite of the great diversity of the animals' morphology. Even behind "new" traits, co-options of essential developmental genes from known systems are very common. We previously provided concrete evidence of gains of enhancer activities of a developmental regulatory gene underlying gains of new traits. (1) Broad occurrence of this scenario is testable and should be validated in the future.
NIS expression in thyroid tumors, relation with prognosis clinicopathological and molecular features

PubMed Central

Tavares, Catarina; Coelho, Maria João; Eloy, Catarina; Melo, Miguel; da Rocha, Adriana Gaspar; Pestana, Ana; Batista, Rui; Ferreira, Luciana Bueno; Rios, Elisabete; Selmi-Ruby, Samia; Cavadas, Bruno; Pereira, Luísa; Sobrinho Simões, Manuel

2018-01-01

Thyroid cancer therapy is based on surgery followed by radioiodine treatment. The incorporation of radioiodine by cancer cells is mediated by sodium iodide symporter (NIS) (codified by the SLC5A5 gene), that is functional only when targeted to the cell membrane. We aimed to evaluate if NIS expression in thyroid primary tumors would be helpful in predicting tumor behavior, response to therapy and prognosis. NIS expression was addressed by qPCR and immunohistochemistry. In order to validate our data, we also studied SLC5A5 expression on 378 primary papillary thyroid carcinomas from The Cancer Genome Atlas (TCGA) database. In our series, SLC5A5 expression was lower in carcinomas with vascular invasion and with extrathyroidal extension and in those harboring BRAFV600E mutation. Analysis of SLC5A5 expression from TCGA database confirmed our results. Furthermore, it showed that larger tumors, with locoregional recurrences and/or distant metastases or harboring RAS, BRAF and/or TERT promoter (TERTp) mutations presented significantly less SLC5A5 expression. Regarding immunohistochemistry, 12/211 of the cases demonstrated NIS in the membrane of tumor cells, those cases showed variable outcomes concerning therapy success, prognosis and all but one were wild type for BRAF, NRAS and TERTp mutations. SLC5A5 mRNA lower expression is associated with features of aggressiveness and with key genetic alterations involving BRAF, RAS and TERTp. Mutations in these genes seem to decrease protein expression and its targeting to the cell membrane. SLC5A5 mRNA expression is more informative than NIS immunohistochemical expression regarding tumor aggressiveness and prognostic features. PMID:29298843
Disease susceptibility of the human macula: differential gene transcription in the retinal pigmented epithelium/choroid.

PubMed

Radeke, Monte J; Peterson, Katie E; Johnson, Lincoln V; Anderson, Don H

2007-09-01

The discoveries of gene variants associated with macular diseases have provided valuable insight into their molecular mechanisms, but they have not clarified why the macula is particularly vulnerable to degenerative disease. Its predisposition may be attributable to specialized structural features and/or functional properties of the underlying macular RPE/choroid. To examine the molecular basis for the macula's disease susceptibility, we compared the gene expression profile of the human RPE/choroid in the macula with the profile in the extramacular region using DNA microarrays. Seventy-five candidate genes with differences in macular:extramacular expression levels were identified by microarray analysis, of which 29 were selected for further analysis. Quantitative PCR confirmed that 21 showed statistically significant differences in expression. Five genes were expressed at higher levels in the macula. Two showed significant changes in the macular:extramacular expression ratio; another two exhibited changes in absolute expression level, as a function of age or AMD. Several of the differentially expressed genes have potential relevance to AMD pathobiology. One is an RPE cell growth factor (TFPI2), five are extracellular matrix components (DCN, MYOC, OGN, SMOC2, TFPI2), and six are related to inflammation (CCL19, CCL26, CXCL14, SLIT2) and/or angiogenesis (CXCL14, SLIT2, TFPI2, WFDC1). The identification of regional differences in gene expression in the RPE/choroid is a first step in clarifying the macula's propensity for degeneration. These findings lay the groundwork for further studies into the roles of the corresponding gene products in the normal, aged, and diseased macula.
Differential prioritization between relevance and redundancy in correlation-based feature selection techniques for multiclass gene expression data.

PubMed

Ooi, Chia Huey; Chetty, Madhu; Teng, Shyh Wei

2006-06-23

Due to the large number of genes in a typical microarray dataset, feature selection looks set to play an important role in reducing noise and computational cost in gene expression-based tissue classification while improving accuracy at the same time. Surprisingly, this does not appear to be the case for all multiclass microarray datasets. The reason is that many feature selection techniques applied on microarray datasets are either rank-based and hence do not take into account correlations between genes, or are wrapper-based, which require high computational cost, and often yield difficult-to-reproduce results. In studies where correlations between genes are considered, attempts to establish the merit of the proposed techniques are hampered by evaluation procedures which are less than meticulous, resulting in overly optimistic estimates of accuracy. We present two realistically evaluated correlation-based feature selection techniques which incorporate, in addition to the two existing criteria involved in forming a predictor set (relevance and redundancy), a third criterion called the degree of differential prioritization (DDP). DDP functions as a parameter to strike the balance between relevance and redundancy, providing our techniques with the novel ability to differentially prioritize the optimization of relevance against redundancy (and vice versa). This ability proves useful in producing optimal classification accuracy while using reasonably small predictor set sizes for nine well-known multiclass microarray datasets. For multiclass microarray datasets, especially the GCM and NCI60 datasets, DDP enables our filter-based techniques to produce accuracies better than those reported in previous studies which employed similarly realistic evaluation procedures.
Genome-wide analysis of WRKY gene family in Cucumis sativus

PubMed Central

2011-01-01

Background WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. Results We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Conclusions Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes. PMID:21955985
Genome-wide analysis of WRKY gene family in Cucumis sativus.

PubMed

Ling, Jian; Jiang, Weijie; Zhang, Ying; Yu, Hongjun; Mao, Zhenchuan; Gu, Xingfang; Huang, Sanwen; Xie, Bingyan

2011-09-28

WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes.
Defining the molecular profile of planarian pluripotent stem cells using a combinatorial RNAseq, RNA interference and irradiation approach.

PubMed

Solana, Jordi; Kao, Damian; Mihaylova, Yuliana; Jaber-Hijazi, Farah; Malla, Sunir; Wilson, Ray; Aboobaker, Aziz

2012-01-01

Planarian stem cells, or neoblasts, drive the almost unlimited regeneration capacities of freshwater planarians. Neoblasts are traditionally described by their morphological features and by the fact that they are the only proliferative cell type in asexual planarians. Therefore, they can be specifically eliminated by irradiation. Irradiation, however, is likely to induce transcriptome-wide changes in gene expression that are not associated with neoblast ablation. This has affected the accurate description of their specific transcriptomic profile. We introduce the use of Smed-histone-2B RNA interference (RNAi) for genetic ablation of neoblast cells in Schmidtea mediterranea as an alternative to irradiation. We characterize the rapid, neoblast-specific phenotype induced by Smed-histone-2B RNAi, resulting in neoblast ablation. We compare and triangulate RNA-seq data after using both irradiation and Smed-histone-2B RNAi over a time course as means of neoblast ablation. Our analyses show that Smed-histone-2B RNAi eliminates neoblast gene expression with high specificity and discrimination from gene expression in other cellular compartments. We compile a high confidence list of genes downregulated by both irradiation and Smed-histone-2B RNAi and validate their expression in neoblast cells. Lastly, we analyze the overall expression profile of neoblast cells. Our list of neoblast genes parallels their morphological features and is highly enriched for nuclear components, chromatin remodeling factors, RNA splicing factors, RNA granule components and the machinery of cell division. Our data reveal that the regulation of planarian stem cells relies on posttranscriptional regulatory mechanisms and suggest that planarians are an ideal model for this understudied aspect of stem cell biology.
Defining the molecular profile of planarian pluripotent stem cells using a combinatorial RNA-seq, RNA interference and irradiation approach

PubMed Central

2012-01-01

Background Planarian stem cells, or neoblasts, drive the almost unlimited regeneration capacities of freshwater planarians. Neoblasts are traditionally described by their morphological features and by the fact that they are the only proliferative cell type in asexual planarians. Therefore, they can be specifically eliminated by irradiation. Irradiation, however, is likely to induce transcriptome-wide changes in gene expression that are not associated with neoblast ablation. This has affected the accurate description of their specific transcriptomic profile. Results We introduce the use of Smed-histone-2B RNA interference (RNAi) for genetic ablation of neoblast cells in Schmidtea mediterranea as an alternative to irradiation. We characterize the rapid, neoblast-specific phenotype induced by Smed-histone-2B RNAi, resulting in neoblast ablation. We compare and triangulate RNA-seq data after using both irradiation and Smed-histone-2B RNAi over a time course as means of neoblast ablation. Our analyses show that Smed-histone-2B RNAi eliminates neoblast gene expression with high specificity and discrimination from gene expression in other cellular compartments. We compile a high confidence list of genes downregulated by both irradiation and Smed-histone-2B RNAi and validate their expression in neoblast cells. Lastly, we analyze the overall expression profile of neoblast cells. Conclusions Our list of neoblast genes parallels their morphological features and is highly enriched for nuclear components, chromatin remodeling factors, RNA splicing factors, RNA granule components and the machinery of cell division. Our data reveal that the regulation of planarian stem cells relies on posttranscriptional regulatory mechanisms and suggest that planarians are an ideal model for this understudied aspect of stem cell biology. PMID:22439894
Changes in global gene expression during in vitro decidualization of rat endometrial stromal cells

PubMed Central

Vallejo, Griselda; Maschi, Darío; Citrinovitz, Ana Cecilia Mestre; Aiba, Kazuhiro; Maronna, Ricardo; Yohai, Victor; Ko, Minoru S. H.; Beato, Miguel; Saragüeta, Patricia

2009-01-01

During the preimplantation phase of pregnancy the endometrial stroma differentiates into decidua, a process that implies numerous morphological changes and is an example of physiological transdifferentiation. Here we show that UIII rat endometrial stromal cells cultured in the presence of calf serum acquired morphological features of decidual cells and expressed decidual markers. To identify genes involved in decidualization we compared gene expression patterns of control and decidualized UIII cells using cDNA microarray. We found 322 annotated genes exhibiting significant differences in expression (>3 fold, FDR > 0.005), of which 312 have not been previously related to decidualization. Analysis of overrepresented functions revealed that protein synthesis, gene expression and chromatin architecture and remodeling are the most relevant modified functions during decidualization. Relevant genes are also found in the functional terms differentiation, cell proliferation, signal transduction, and matrix/structural proteins. Several of these new genes involved in decidualization (Csdc2, Trim27, Eef1a1, Bmp1, Wt1, Aes, Gna12, and Men1) are shown to be also regulated in uterine decidua during normal pregnancy. Thus, the UIII cell culture model will allow future mechanistic studies to define the transcriptional network regulating reprogramming of stromal cells into decidual cells. PMID:19780023
Gene expression profiles of metabolic aggressiveness and tumor recurrence in benign meningioma.

PubMed

Serna, Eva; Morales, José Manuel; Mata, Manuel; Gonzalez-Darder, José; San Miguel, Teresa; Gil-Benso, Rosario; Lopez-Gines, Concha; Cerda-Nicolas, Miguel; Monleon, Daniel

2013-01-01

Around 20% of meningiomas histologically benign may be clinically aggressive and recur. This strongly affects management of meningioma patients. There is a need to evaluate the potential aggressiveness of an individual meningioma. Additional criteria for better classification of meningiomas will improve clinical decisions as well as patient follow up strategy after surgery. The aim of this study was to determine the relationship between gene expression profiles and new metabolic subgroups of benign meningioma with potential clinical relevance. Forty benign and fourteen atypical meningioma tissue samples were included in the study. We obtained metabolic profiles by NMR and recurrence after surgery information for all of them. We measured gene expression by oligonucleotide microarray measurements on 19 of them. To our knowledge, this is the first time that distinct gene expression profiles are reported for benign meningioma molecular subgroups with clinical correlation. Our results show that metabolic aggressiveness in otherwise histological benign meningioma proceeds mostly through alterations in the expression of genes involved in the regulation of transcription, mainly the LMO3 gene. Genes involved in tumor metabolism, like IGF1R, are also differentially expressed in those meningioma subgroups with higher rates of membrane turnover, higher energy demand and increased resistance to apoptosis. These new subgroups of benign meningiomas exhibit different rates of recurrence. This work shows that benign meningioma with metabolic aggressiveness constitute a subgroup of potentially recurrent tumors in which alterations in genes regulating critical features of aggressiveness, like increased angiogenesis or cell invasion, are still no predominant. The determination of these gene expression biosignatures may allow the early detection of clinically aggressive tumors.
In silico gene expression profiling in Cannabis sativa.

PubMed

Massimino, Luca

2017-01-01

The cannabis plant and its active ingredients (i.e., cannabinoids and terpenoids) have been socially stigmatized for half a century. Luckily, with more than 430,000 published scientific papers and about 600 ongoing and completed clinical trials, nowadays cannabis is employed for the treatment of many different medical conditions. Nevertheless, even if a large amount of high-throughput functional genomic data exists, most researchers feature a strong background in molecular biology but lack advanced bioinformatics skills. In this work, publicly available gene expression datasets have been analyzed giving rise to a total of 40,224 gene expression profiles taken from cannabis plant tissue at different developmental stages. The resource presented here will provide researchers with a starting point for future investigations with Cannabis sativa .
Circular RNAs: analysis, expression and potential functions

PubMed Central

Salzman, Julia

2016-01-01

Just a few years ago, it had been assumed that the dominant RNA isoforms produced from eukaryotic genes were variants of messenger RNA, functioning as intermediates in gene expression. In early 2012, however, a surprising discovery was made: circular RNA (circRNA) was shown to be a transcriptional product in thousands of human and mouse genes and in hundreds of cases constituted the dominant RNA isoform. Subsequent studies revealed that the expression of circRNAs is developmentally regulated, tissue and cell-type specific, and shared across the eukaryotic tree of life. These features suggest important functions for these molecules. Here, we describe major advances in the field of circRNA biology, focusing on the regulation of and functional roles played by these molecules. PMID:27246710
Developmental and Environmental Regulation of Aquaporin Gene Expression across Populus Species: Divergence or Redundancy?

PubMed Central

Cohen, David; Bogeat-Triboulot, Marie-Béatrice; Vialet-Chabrand, Silvère; Merret, Rémy; Courty, Pierre-Emmanuel; Moretti, Sébastien; Bizet, François; Guilliot, Agnès; Hummel, Irène

2013-01-01

Aquaporins (AQPs) are membrane channels belonging to the major intrinsic proteins family and are known for their ability to facilitate water movement. While in Populus trichocarpa, AQP proteins form a large family encompassing fifty-five genes, most of the experimental work focused on a few genes or subfamilies. The current work was undertaken to develop a comprehensive picture of the whole AQP gene family in Populus species by delineating gene expression domain and distinguishing responsiveness to developmental and environmental cues. Since duplication events amplified the poplar AQP family, we addressed the question of expression redundancy between gene duplicates. On these purposes, we carried a meta-analysis of all publicly available Affymetrix experiments. Our in-silico strategy controlled for previously identified biases in cross-species transcriptomics, a necessary step for any comparative transcriptomics based on multispecies design chips. Three poplar AQPs were not supported by any expression data, even in a large collection of situations (abiotic and biotic constraints, temporal oscillations and mutants). The expression of 11 AQPs was never or poorly regulated whatever the wideness of their expression domain and their expression level. Our work highlighted that PtTIP1;4 was the most responsive gene of the AQP family. A high functional divergence between gene duplicates was detected across species and in response to tested cues, except for the root-expressed PtTIP2;3/PtTIP2;4 pair exhibiting 80% convergent responses. Our meta-analysis assessed key features of aquaporin expression which had remained hidden in single experiments, such as expression wideness, response specificity and genotype and environment interactions. By consolidating expression profiles using independent experimental series, we showed that the large expansion of AQP family in poplar was accompanied with a strong divergence of gene expression, even if some cases of functional redundancy could be suspected. PMID:23393587

Developmental and environmental regulation of Aquaporin gene expression across Populus species: divergence or redundancy?

PubMed

Cohen, David; Bogeat-Triboulot, Marie-Béatrice; Vialet-Chabrand, Silvère; Merret, Rémy; Courty, Pierre-Emmanuel; Moretti, Sébastien; Bizet, François; Guilliot, Agnès; Hummel, Irène

2013-01-01

Aquaporins (AQPs) are membrane channels belonging to the major intrinsic proteins family and are known for their ability to facilitate water movement. While in Populus trichocarpa, AQP proteins form a large family encompassing fifty-five genes, most of the experimental work focused on a few genes or subfamilies. The current work was undertaken to develop a comprehensive picture of the whole AQP gene family in Populus species by delineating gene expression domain and distinguishing responsiveness to developmental and environmental cues. Since duplication events amplified the poplar AQP family, we addressed the question of expression redundancy between gene duplicates. On these purposes, we carried a meta-analysis of all publicly available Affymetrix experiments. Our in-silico strategy controlled for previously identified biases in cross-species transcriptomics, a necessary step for any comparative transcriptomics based on multispecies design chips. Three poplar AQPs were not supported by any expression data, even in a large collection of situations (abiotic and biotic constraints, temporal oscillations and mutants). The expression of 11 AQPs was never or poorly regulated whatever the wideness of their expression domain and their expression level. Our work highlighted that PtTIP1;4 was the most responsive gene of the AQP family. A high functional divergence between gene duplicates was detected across species and in response to tested cues, except for the root-expressed PtTIP2;3/PtTIP2;4 pair exhibiting 80% convergent responses. Our meta-analysis assessed key features of aquaporin expression which had remained hidden in single experiments, such as expression wideness, response specificity and genotype and environment interactions. By consolidating expression profiles using independent experimental series, we showed that the large expansion of AQP family in poplar was accompanied with a strong divergence of gene expression, even if some cases of functional redundancy could be suspected.
Wt-p53 action in human leukaemia cell lines corresponding to different stages of differentiation.

PubMed

Rizzo, M G; Zepparoni, A; Cristofanelli, B; Scardigli, R; Crescenzi, M; Blandino, G; Giuliacci, S; Ferrari, S; Soddu, S; Sacchi, A

1998-05-01

Recent studies support the potential application of the wt-p53 gene in cancer therapy. Expression of exogenous wt-p53 suppresses a variety of leukaemia phenotypes by acting on cell survival, proliferation and/or differentiation. As for tumour gene therapy, the final fate of the neoplastic cells is one of the most relevant points. We examined the effects of exogenous wt-p53 gene expression in several leukaemia cell lines to identify p53-responsive leukaemia. The temperature-sensitive p53Val135 mutant or the human wt-p53 cDNA was transduced in leukaemia cell lines representative of different acute leukaemia FAB subtypes, including M1 (KG1), M2 (HL-60), M3 (NB4), M5 (U937) and M6 (HEL 92.1.7), as well as blast crisis of chronic myelogenous leukaemia (BC-CML: K562, BV173) showing diverse differentiation features. By morphological, molecular and biochemical analyses, we have shown that exogenous wt-p53 gene expression induces apoptosis only in cells corresponding to M1, M2 and M3 of the FAB classification and in BC-CML showing morphological and cytochemical features of undifferentiated blast cells. In contrast, it promotes differentiation in the others. Interestingly, cell responsiveness was independent of the vector used and the status of the endogenous p53 gene.
HOXB2, an adverse prognostic indicator for stage I lung adenocarcinomas, promotes invasion by transcriptional regulation of metastasis-related genes in HOP-62 non-small cell lung cancer cells.

PubMed

Inamura, Kentaro; Togashi, Yuki; Ninomiya, Hironori; Shimoji, Takashi; Noda, Tetsuo; Ishikawa, Yuichi

2008-01-01

Previously, using microarray and real-time RT-PCR analysis, we established that HOXB2 is an adverse prognostic indicator for Stage I lung adenocarcinomas. HOXB2 is one of the homeobox master development-controlling genes regulating morphogenesis and cell differentiation. The molecular functions of HOXB2 were analyzed with a small interfering RNA (siRNA) approach in HOP-62 human non-small cell lung cancer (NSCLC) cells featuring high HOXB2 expression. Matrigel invasion assays and microarray gene expression analysis were compared between the HOXB2-siRNA cells and the control cells. The Matrigel invasion assays showed attenuation of HOXB2 expression by siRNA to result in a significant decrease of invasiveness compared to the control cells (p = 0.0013, paired t-test). On microarray gene expression analysis, up-regulation of many metastasis-related genes and others correlating with HOXB2 expression was observed in the control case. With attenuation of HOXB2 expression, downregulation was noted for laminins alpha 4 and 5, involved in enriched signaling, and for Mac-2BP (Mac-2 binding protein) and integrin beta 4 amongst the genes having an enriched glycoprotein ontology. HOXB2 promotes invasion of lung cancer cells through the regulation of metastasis-related genes.
Establishing glucose- and ABA-regulated transcription networks in Arabidopsis by microarray analysis and promoter classification using a Relevance Vector Machine.

PubMed

Li, Yunhai; Lee, Kee Khoon; Walsh, Sean; Smith, Caroline; Hadingham, Sophie; Sorefan, Karim; Cawley, Gavin; Bevan, Michael W

2006-03-01

Establishing transcriptional regulatory networks by analysis of gene expression data and promoter sequences shows great promise. We developed a novel promoter classification method using a Relevance Vector Machine (RVM) and Bayesian statistical principles to identify discriminatory features in the promoter sequences of genes that can correctly classify transcriptional responses. The method was applied to microarray data obtained from Arabidopsis seedlings treated with glucose or abscisic acid (ABA). Of those genes showing >2.5-fold changes in expression level, approximately 70% were correctly predicted as being up- or down-regulated (under 10-fold cross-validation), based on the presence or absence of a small set of discriminative promoter motifs. Many of these motifs have known regulatory functions in sugar- and ABA-mediated gene expression. One promoter motif that was not known to be involved in glucose-responsive gene expression was identified as the strongest classifier of glucose-up-regulated gene expression. We show it confers glucose-responsive gene expression in conjunction with another promoter motif, thus validating the classification method. We were able to establish a detailed model of glucose and ABA transcriptional regulatory networks and their interactions, which will help us to understand the mechanisms linking metabolism with growth in Arabidopsis. This study shows that machine learning strategies coupled to Bayesian statistical methods hold significant promise for identifying functionally significant promoter sequences.
Downregulation of ATM Gene and Protein Expression in Canine Mammary Tumors.

PubMed

Raposo-Ferreira, T M M; Bueno, R C; Terra, E M; Avante, M L; Tinucci-Costa, M; Carvalho, M; Cassali, G D; Linde, S D; Rogatto, S R; Laufer-Amorim, R

2016-11-01

The ataxia telangiectasia mutated (ATM) gene encodes a protein associated with DNA damage repair and maintenance of genomic integrity. In women, ATM transcript and protein downregulation have been reported in sporadic breast carcinomas, and the absence of ATM protein expression has been associated with poor prognosis. The aim of this study was to evaluate ATM gene and protein expression in canine mammary tumors and their association with clinical outcome. ATM gene and protein expression was evaluated by reverse transcription-quantitative polymerase chain reaction and immunohistochemistry, respectively, in normal mammary gland samples (n = 10), benign mammary tumors (n = 11), nonmetastatic mammary carcinomas (n = 19), and metastatic mammary carcinomas (n = 11). Lower ATM transcript levels were detected in benign mammary tumors and carcinomas compared with normal mammary glands (P = .011). Similarly, lower ATM protein expression was observed in benign tumors (P = .0003), nonmetastatic mammary carcinomas (P < .0001), and the primary sites of metastatic carcinomas (P < .0001) compared with normal mammary glands. No significant differences in ATM gene or protein levels were detected among benign tumors and nonmetastatic and metastatic mammary carcinomas (P > .05). The levels of ATM gene or protein expression were not significantly associated with clinical and pathological features or with survival. Similar to human breast cancer, the data in this study suggest that ATM gene and protein downregulation is involved in canine mammary gland tumorigenesis. © The Author(s) 2016.
Mixture models for detecting differentially expressed genes in microarrays.

PubMed

Jones, Liat Ben-Tovim; Bean, Richard; McLachlan, Geoffrey J; Zhu, Justin Xi

2006-10-01

An important and common problem in microarray experiments is the detection of genes that are differentially expressed in a given number of classes. As this problem concerns the selection of significant genes from a large pool of candidate genes, it needs to be carried out within the framework of multiple hypothesis testing. In this paper, we focus on the use of mixture models to handle the multiplicity issue. With this approach, a measure of the local FDR (false discovery rate) is provided for each gene. An attractive feature of the mixture model approach is that it provides a framework for the estimation of the prior probability that a gene is not differentially expressed, and this probability can subsequently be used in forming a decision rule. The rule can also be formed to take the false negative rate into account. We apply this approach to a well-known publicly available data set on breast cancer, and discuss our findings with reference to other approaches.
The Three Clades of the Telomere-Associated TLO Gene Family of Candida albicans Have Different Splicing, Localization, and Expression Features

PubMed Central

Anderson, Matthew Z.; Baller, Joshua A.; Dulmage, Keely; Wigen, Lauren

2012-01-01

Candida albicans grows within a wide range of host niches, and this adaptability enhances its success as a commensal and as a pathogen. The telomere-associated TLO gene family underwent a recent expansion from one or two copies in other CUG clade members to 14 expressed copies in C. albicans. This correlates with increased virulence and clinical prevalence relative to those of other Candida clade species. The 14 expressed TLO gene family members have a conserved Med2 domain at the N terminus, suggesting a role in general transcription. The C-terminal half is more divergent, distinguishing three clades: clade α and clade β have no introns and encode proteins that localize primarily to the nucleus; clade γ sometimes undergoes splicing, and the gene products localize within the mitochondria as well as the nuclei. Additionally, TLOα genes are generally expressed at much higher levels than are TLOγ genes. We propose that expansion of the TLO gene family and the predicted role of Tlo proteins in transcription regulation provide C. albicans with the ability to adapt rapidly to the broad range of different environmental niches within the human host. PMID:22923044
TA-GC cloning: A new simple and versatile technique for the directional cloning of PCR products for recombinant protein expression.

PubMed

Niarchos, Athanasios; Siora, Anastasia; Konstantinou, Evangelia; Kalampoki, Vasiliki; Lagoumintzis, George; Poulas, Konstantinos

2017-01-01

During the last few decades, the recombinant protein expression finds more and more applications. The cloning of protein-coding genes into expression vectors is required to be directional for proper expression, and versatile in order to facilitate gene insertion in multiple different vectors for expression tests. In this study, the TA-GC cloning method is proposed, as a new, simple and efficient method for the directional cloning of protein-coding genes in expression vectors. The presented method features several advantages over existing methods, which tend to be relatively more labour intensive, inflexible or expensive. The proposed method relies on the complementarity between single A- and G-overhangs of the protein-coding gene, obtained after a short incubation with T4 DNA polymerase, and T and C overhangs of the novel vector pET-BccI, created after digestion with the restriction endonuclease BccI. The novel protein-expression vector pET-BccI also facilitates the screening of transformed colonies for recombinant transformants. Evaluation experiments of the proposed TA-GC cloning method showed that 81% of the transformed colonies contained recombinant pET-BccI plasmids, and 98% of the recombinant colonies expressed the desired protein. This demonstrates that TA-GC cloning could be a valuable method for cloning protein-coding genes in expression vectors.
TA-GC cloning: A new simple and versatile technique for the directional cloning of PCR products for recombinant protein expression

PubMed Central

Niarchos, Athanasios; Siora, Anastasia; Konstantinou, Evangelia; Kalampoki, Vasiliki; Poulas, Konstantinos

2017-01-01

During the last few decades, the recombinant protein expression finds more and more applications. The cloning of protein-coding genes into expression vectors is required to be directional for proper expression, and versatile in order to facilitate gene insertion in multiple different vectors for expression tests. In this study, the TA-GC cloning method is proposed, as a new, simple and efficient method for the directional cloning of protein-coding genes in expression vectors. The presented method features several advantages over existing methods, which tend to be relatively more labour intensive, inflexible or expensive. The proposed method relies on the complementarity between single A- and G-overhangs of the protein-coding gene, obtained after a short incubation with T4 DNA polymerase, and T and C overhangs of the novel vector pET-BccI, created after digestion with the restriction endonuclease BccI. The novel protein-expression vector pET-BccI also facilitates the screening of transformed colonies for recombinant transformants. Evaluation experiments of the proposed TA-GC cloning method showed that 81% of the transformed colonies contained recombinant pET-BccI plasmids, and 98% of the recombinant colonies expressed the desired protein. This demonstrates that TA-GC cloning could be a valuable method for cloning protein-coding genes in expression vectors. PMID:29091919
GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.

PubMed

Doungpan, Narumol; Engchuan, Worrawat; Chan, Jonathan H; Meechai, Asawin

2016-12-05

Gene expression has been used to identify disease gene biomarkers, but there are ongoing challenges. Single gene or gene-set biomarkers are inadequate to provide sufficient understanding of complex disease mechanisms and the relationship among those genes. Network-based methods have thus been considered for inferring the interaction within a group of genes to further study the disease mechanism. Recently, the Gene-Network-based Feature Set (GNFS), which is capable of handling case-control and multiclass expression for gene biomarker identification, has been proposed, partly taking into account of network topology. However, its performance relies on a greedy search for building subnetworks and thus requires further improvement. In this work, we establish a new approach named Gene Sub-Network-based Feature Selection (GSNFS) by implementing the GNFS framework with two proposed searching and scoring algorithms, namely gene-set-based (GS) search and parent-node-based (PN) search, to identify subnetworks. An additional dataset is used to validate the results. The two proposed searching algorithms of the GSNFS method for subnetwork expansion are concerned with the degree of connectivity and the scoring scheme for building subnetworks and their topology. For each iteration of expansion, the neighbour genes of a current subnetwork, whose expression data improved the overall subnetwork score, is recruited. While the GS search calculated the subnetwork score using an activity score of a current subnetwork and the gene expression values of its neighbours, the PN search uses the expression value of the corresponding parent of each neighbour gene. Four lung cancer expression datasets were used for subnetwork identification. In addition, using pathway data and protein-protein interaction as network data in order to consider the interaction among significant genes were discussed. Classification was performed to compare the performance of the identified gene subnetworks with three subnetwork identification algorithms. The two searching algorithms resulted in better classification and gene/gene-set agreement compared to the original greedy search of the GNFS method. The identified lung cancer subnetwork using the proposed searching algorithm resulted in an improvement of the cross-dataset validation and an increase in the consistency of findings between two independent datasets. The homogeneity measurement of the datasets was conducted to assess dataset compatibility in cross-dataset validation. The lung cancer dataset with higher homogeneity showed a better result when using the GS search while the dataset with low homogeneity showed a better result when using the PN search. The 10-fold cross-dataset validation on the independent lung cancer datasets showed higher classification performance of the proposed algorithms when compared with the greedy search in the original GNFS method. The proposed searching algorithms provide a higher number of genes in the subnetwork expansion step than the greedy algorithm. As a result, the performance of the subnetworks identified from the GSNFS method was improved in terms of classification performance and gene/gene-set level agreement depending on the homogeneity of the datasets used in the analysis. Some common genes obtained from the four datasets using different searching algorithms are genes known to play a role in lung cancer. The improvement of classification performance and the gene/gene-set level agreement, and the biological relevance indicated the effectiveness of the GSNFS method for gene subnetwork identification using expression data.
Gene coexpression measures in large heterogeneous samples using count statistics.

PubMed

Wang, Y X Rachel; Waterman, Michael S; Huang, Haiyan

2014-11-18

With the advent of high-throughput technologies making large-scale gene expression data readily available, developing appropriate computational tools to process these data and distill insights into systems biology has been an important part of the "big data" challenge. Gene coexpression is one of the earliest techniques developed that is still widely in use for functional annotation, pathway analysis, and, most importantly, the reconstruction of gene regulatory networks, based on gene expression data. However, most coexpression measures do not specifically account for local features in expression profiles. For example, it is very likely that the patterns of gene association may change or only exist in a subset of the samples, especially when the samples are pooled from a range of experiments. We propose two new gene coexpression statistics based on counting local patterns of gene expression ranks to take into account the potentially diverse nature of gene interactions. In particular, one of our statistics is designed for time-course data with local dependence structures, such as time series coupled over a subregion of the time domain. We provide asymptotic analysis of their distributions and power, and evaluate their performance against a wide range of existing coexpression measures on simulated and real data. Our new statistics are fast to compute, robust against outliers, and show comparable and often better general performance.
Antisense sequences of the nbl gene induce apoptosis in the human promyelocytic leukemia cell line HL-60.

PubMed

Naora, H; Nishida, T; Shindo, Y; Adachi, M; Naora, H

1998-04-01

Apoptosis is induced by the transcriptional inhibitor actinomycin D (Act D) in various cell types, particularly many leukemic cell lines such as HL-60. A common feature of these cell lines is their high constitutive expression level of the nbl gene, which was originally isolated by virtue of its abundance in a Namalwa Burkitt lymphoma cDNA library. In contrast, cell lines which constitutively express nbl at low levels appear not to undergo typical apoptotic death in response to Act D. Apoptotic induction by Act D in cells which normally express nbl at high levels was found in this study to be closely associated with a decline in nbl mRNA levels, raising the possibility that apoptosis could be induced by lowering nbl expression levels in such cells. Transient expression of nbl antisense sequences in HL-60 cells decreased cell viability, and induced typical apoptotic morphology such as cell shrinkage, chromatin condensation and nuclear fragmentation. Incubation with nbl antisense oligomers also induced similar features in HL-60 cells and in another high nb-expressing cell line, Jurkat, but had little effect in HepG2 cells which constitutively express nbl at low levels. These findings suggest that lowering constitutively high levels of nbl expression can induce apoptosis.
Functional characterization of the late embryogenesis abundant (LEA) protein gene family from Pinus tabuliformis (Pinaceae) in Escherichia coli.

PubMed

Gao, Jie; Lan, Ting

2016-01-19

Late embryogenesis abundant (LEA) proteins are a large and highly diverse gene family present in a wide range of plant species. LEAs are proposed to play a role in various stress tolerance responses. Our study represents the first-ever survey of LEA proteins and their encoding genes in a widely distributed pine (Pinus tabuliformis) in China. Twenty-three LEA genes were identified from the P. tabuliformis belonging to seven groups. Proteins with repeated motifs are an important feature specific to LEA groups. Ten of 23 pine LEA genes were selectively expressed in specific tissues, and showed expression divergence within each group. In addition, we selected 13 genes representing each group and introduced theses genes into Escherichia coli to assess the protective function of PtaLEA under heat and salt stresses. Compared with control cells, the E. coli cells expressing PtaLEA fusion protein exhibited enhanced salt and heat resistance and viability, indicating the protein may play a protective role in cells under stress conditions. Furthermore, among these enhanced tolerance genes, a certain extent of function divergence appeared within a gene group as well as between gene groups, suggesting potential functional diversity of this gene family in conifers.
Molecular, Biochemical, and Dietary Regulation Features of α-Amylase in a Carnivorous Crustacean, the Spiny Lobster Panulirus argus.

PubMed

Rodríguez-Viera, Leandro; Perera, Erick; Martos-Sitcha, Juan Antonio; Perdomo-Morales, Rolando; Casuso, Antonio; Montero-Alejo, Vivian; García-Galano, Tsai; Martínez-Rodríguez, Gonzalo; Mancera, Juan Miguel

2016-01-01

Alpha-amylases are ubiquitously distributed throughout microbials, plants and animals. It is widely accepted that omnivorous crustaceans have higher α-amylase activity and number of isoforms than carnivorous, but contradictory results have been obtained in some species, and carnivorous crustaceans have been less studied. In addition, the physiological meaning of α-amylase polymorphism in crustaceans is not well understood. In this work we studied α-amylase in a carnivorous lobster at the gene, transcript, and protein levels. It was showed that α-amylase isoenzyme composition (i.e., phenotype) in lobster determines carbohydrate digestion efficiency. Most frequent α-amylase phenotype has the lowest digestion efficiency, suggesting this is a favoured trait. We revealed that gene and intron loss have occurred in lobster α-amylase, thus lobsters express a single 1830 bp cDNA encoding a highly conserved protein with 513 amino acids. This protein gives rise to two isoenzymes in some individuals by glycosylation but not by limited proteolysis. Only the glycosylated isoenzyme could be purified by chromatography, with biochemical features similar to other animal amylases. High carbohydrate content in diet down-regulates α-amylase gene expression in lobster. However, high α-amylase activity occurs in lobster gastric juice irrespective of diet and was proposed to function as an early sensor of the carbohydrate content of diet to regulate further gene expression. We concluded that gene/isoenzyme simplicity, post-translational modifications and low Km, coupled with a tight regulation of gene expression, have arose during evolution of α-amylase in the carnivorous lobster to control excessive carbohydrate digestion in the presence of an active α-amylase.
Molecular, Biochemical, and Dietary Regulation Features of α-Amylase in a Carnivorous Crustacean, the Spiny Lobster Panulirus argus

PubMed Central

Martos-Sitcha, Juan Antonio; Perdomo-Morales, Rolando; Casuso, Antonio; Montero-Alejo, Vivian; García-Galano, Tsai; Martínez-Rodríguez, Gonzalo; Mancera, Juan Miguel

2016-01-01

Alpha-amylases are ubiquitously distributed throughout microbials, plants and animals. It is widely accepted that omnivorous crustaceans have higher α-amylase activity and number of isoforms than carnivorous, but contradictory results have been obtained in some species, and carnivorous crustaceans have been less studied. In addition, the physiological meaning of α-amylase polymorphism in crustaceans is not well understood. In this work we studied α-amylase in a carnivorous lobster at the gene, transcript, and protein levels. It was showed that α-amylase isoenzyme composition (i.e., phenotype) in lobster determines carbohydrate digestion efficiency. Most frequent α-amylase phenotype has the lowest digestion efficiency, suggesting this is a favoured trait. We revealed that gene and intron loss have occurred in lobster α-amylase, thus lobsters express a single 1830 bp cDNA encoding a highly conserved protein with 513 amino acids. This protein gives rise to two isoenzymes in some individuals by glycosylation but not by limited proteolysis. Only the glycosylated isoenzyme could be purified by chromatography, with biochemical features similar to other animal amylases. High carbohydrate content in diet down-regulates α-amylase gene expression in lobster. However, high α-amylase activity occurs in lobster gastric juice irrespective of diet and was proposed to function as an early sensor of the carbohydrate content of diet to regulate further gene expression. We concluded that gene/isoenzyme simplicity, post-translational modifications and low Km, coupled with a tight regulation of gene expression, have arose during evolution of α-amylase in the carnivorous lobster to control excessive carbohydrate digestion in the presence of an active α-amylase. PMID:27391425
A method to identify differential expression profiles of time-course gene data with Fourier transformation

PubMed Central

2013-01-01

Background Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis. Results This work aims to identify genes that show different gene expression profiles across time. We propose a statistical procedure to discover gene groups with similar profiles using a nonparametric representation that accounts for the autocorrelation in the data. In particular, we first represent each profile in terms of a Fourier basis, and then we screen out genes that are not differentially expressed based on the Fourier coefficients. Finally, we cluster the remaining gene profiles using a model-based approach in the Fourier domain. We evaluate the screening results in terms of sensitivity, specificity, FDR and FNR, compare with the Gaussian process regression screening in a simulation study and illustrate the results by application to yeast cell-cycle microarray expression data with alpha-factor synchronization. The key elements of the proposed methodology: (i) representation of gene profiles in the Fourier domain; (ii) automatic screening of genes based on the Fourier coefficients and taking into account autocorrelation in the data, while controlling the false discovery rate (FDR); (iii) model-based clustering of the remaining gene profiles. Conclusions Using this method, we identified a set of cell-cycle-regulated time-course yeast genes. The proposed method is general and can be potentially used to identify genes which have the same patterns or biological processes, and help facing the present and forthcoming challenges of data analysis in functional genomics. PMID:24134721
Alterations of Clock Gene RNA Expression in Brain Regions of a Triple Transgenic Model of Alzheimer’s Disease

PubMed Central

Bellanti, Francesco; Iannelli, Giuseppina; Blonda, Maria; Tamborra, Rosanna; Villani, Rosanna; Romano, Adele; Calcagnini, Silvio; Mazzoccoli, Gianluigi; Vinciguerra, Manlio; Gaetani, Silvana; Giudetti, Anna Maria; Vendemiale, Gianluigi; Cassano, Tommaso; Serviddio, Gaetano

2017-01-01

A disruption to circadian rhythmicity and the sleep/wake cycle constitutes a major feature of Alzheimer’s disease (AD). The maintenance of circadian rhythmicity is regulated by endogenous clock genes and a number of external Zeitgebers, including light. This study investigated the light induced changes in the expression of clock genes in a triple transgenic model of AD (3×Tg-AD) and their wild type littermates (Non-Tg). Changes in gene expression were evaluated in four brain areas¾suprachiasmatic nucleus (SCN), hippocampus, frontal cortex and brainstem¾of 6- and 18-month-old Non-Tg and 3×Tg-AD mice after 12 h exposure to light or darkness. Light exposure exerted significant effects on clock gene expression in the SCN, the site of the major circadian pacemaker. These patterns of expression were disrupted in 3×Tg-AD and in 18-month-old compared with 6-month-old Non-Tg mice. In other brain areas, age rather than genotype affected gene expression; the effect of genotype was observed on hippocampal Sirt1 expression, while it modified the expression of genes regulating the negative feedback loop as well as Rorα, Csnk1ɛ and Sirt1 in the brainstem. In conclusion, during the early development of AD, there is a disruption to the normal expression of genes regulating circadian function after exposure to light, particularly in the SCN but also in extra-hypothalamic brain areas supporting circadian regulation, suggesting a severe impairment of functioning of the clock gene pathway. Even though this study did not demonstrate a direct association between these alterations in clock gene expression among brain areas with the cognitive impairments and chrono-disruption that characterize the early onset of AD, our novel results encourage further investigation aimed at testing this hypothesis. PMID:28671110
Cellular Interactions and Immune Response of Spherical Nucleic Acid (SNA) Nanoconjugates

NASA Astrophysics Data System (ADS)

Massich, Matthew David

Spherical nucleic acid (SNA) nanoconjugates consist of a densely packed monolayer shell of highly-oriented oligonucleotides covalently bound to a gold nanoparticle core. The nanoconjugates exhibit several important qualities, which make them useful for various biological applications, such as antisense gene regulation strategies and the intracellular detection of biomolecules. The focus of this thesis was to characterize the nanoconjugates interaction with cultured cells and specifically the immune response to their intracellular presence. The immune response of macrophage cells to internalized nanoconjugates was studied, and due to the dense functionalization of oligonucleotides on the surface of the nanoparticle and the resulting high localized salt concentration the innate immune response to the nanoconjugates is ˜25-fold less when compared to a lipoplex carrying the same sequence. Additionally, genome-wide expression profiling was used to study the biological response of cultured cells to the nanoconjugates. The biological response of HeLa cells to gold nanoparticles stabilized by weakly bound ligands was significant, yet when these same nanoparticles were stably functionalized with covalently attached oligonucleotides the cells showed no measurable response. In human keratinocytes, the oligonucleotide sequences caused 427 genes to be differentially expressed when complexed with Dharmafect, but when the oligonucleotides were conjugated to nanoparticles only 7 genes were differentially expressed. Beyond characterizing the cellular interactions and immune response of the nanoconjugates, the optimal length of siRNA (from 19--34 base pairs) that induces the most gene knockdown while maintaining limited immune activation was determined to be 24 base pairs. Further, the SNAs were shown to be useful as a potential antiviral gene therapy by demonstrating approximately 50% knockdown of the Ebola VP35 gene. Lastly, a scanning probe-enabled method was used to rapidly create nanoscale fibronectin patterns over large areas with a range of feature sizes, thereby opening the field of nanocombinatorics. This allowed the investigation of the relationship between fibronectin feature size and stem cell fate. MSCs cultured on nanoscale fibronectin features directed differentiation toward osteogenesis to a greater extent than cells grown on both microscale features and cells grown on non-patterned fibronectin substrates with osteogenic inducing media, demonstrating a new method for controlling stem cell fate.
Mapping MRI/MRS Parameters with Genetic Over-expression Profiles In Human Prostate Cancer: Demonstrating the Potential

PubMed Central

Lenkinski, Robert E.; Bloch, B. Nicholas; Liu, Fangbing; Frangioni, John V.; Perner, Sven; Rubin, Mark A.; Genega, Elizabeth; Rofsky, Neil M.; Gaston, Sandra M.

2009-01-01

Magnetic resonance imaging (MRI) and MR spectroscopy can probe a variety of physiological (e.g. blood vessel permeability) and metabolic characteristics of prostate cancer. However, little is known about the changes in gene expression that underlie the spectral and imaging features observed in prostate cancer. Tumor induced changes in vascular permeability and angiogenesis are thought to contribute to patterns of dynamic contrast enhanced (DCE) MRI images of prostate cancer even though the genetic basis of tumor vasculogenesis is complex and the specific mechanisms underlying these DCEMRI features have not yet been determined. In order to identify the changes in gene expression that correspond to MRS and DCEMRI patterns in human prostate cancers, we have utilized tissue print micropeel techniques to generate “whole mount” molecular maps of radical prostatectomy specimens that correspond to pre-surgical MRI/MRS studies. These molecular maps include RNA expression profiles from both Affymetrix GeneChip microarrays and quantitative reverse transcriptase PCR (qrt-PCR) analysis, as well as immunohistochemical studies. Using these methods on patients with prostate cancer, we found robust over-expression of choline kinase a in the majority of primary tumors. We also observed overexpression of neuropeptide Y (NPY), a newly identified angiogenic factor, in a subset of DCEMRI positive prostate cancers. These studies set the stage for establishing MRI/MRS parameters as validated biomarkers for human prostate cancer. PMID:18752015
Genomic identification, characterization and differential expression analysis of SBP-box gene family in Brassica napus.

PubMed

Cheng, Hongtao; Hao, Mengyu; Wang, Wenxiang; Mei, Desheng; Tong, Chaobo; Wang, Hui; Liu, Jia; Fu, Li; Hu, Qiong

2016-09-08

SBP-box genes belong to one of the largest families of transcription factors. Though members of this family have been characterized to be important regulators of diverse biological processes, information of SBP-box genes in the third most important oilseed crop Brassica napus is largely undefined. In the present study, by whole genome bioinformatics analysis and transcriptional profiling, 58 putative members of SBP-box gene family in oilseed rape (Brassica napus L.) were identified and their expression pattern in different tissues as well as possible interaction with miRNAs were analyzed. In addition, B. napus lines with contrasting branch angle were used for investigating the involvement of SBP-box genes in plant architecture regulation. Detailed gene information, including genomic organization, structural feature, conserved domain and phylogenetic relationship of the genes were systematically characterized. By phylogenetic analysis, BnaSBP proteins were classified into eight distinct groups representing the clear orthologous relationships to their family members in Arabidopsis and rice. Expression analysis in twelve tissues including vegetative and reproductive organs showed different expression patterns among the SBP-box genes and a number of the genes exhibit tissue specific expression, indicating their diverse functions involved in the developmental process. Forty-four SBP-box genes were ascertained to contain the putative miR156 binding site, with 30 and 14 of the genes targeted by miR156 at the coding and 3'UTR region, respectively. Relative expression level of miR156 is varied across tissues. Different expression pattern of some BnaSBP genes and the negative correlation of transcription levels between miR156 and its target BnaSBP gene were observed in lines with different branch angle. Taken together, this study represents the first systematic analysis of the SBP-box gene family in Brassica napus. The data presented here provides base foundation for understanding the crucial roles of BnaSBP genes in plant development and other biological processes.

GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.

PubMed

Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E

2016-03-11

Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.
Lack of NF1 gene expression in a sporadic schwannoma from a patient without neurofibromatosis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Norton, K.K.; Dowton, B.; Silow-Santiago, I.

The neurofibromatosis type 1 (NF1) gene encodes a tumor suppressor protein, neurofibromin, which is expressed at high levels in Schwann cells and other adult tissues. Loss of NF1 gene expression has been reported in Schwann cell tumors (neurofibrosarcomas) from patients with NF1 and its loss is associated with increased proliferation of these cells. We examined one spinal schwannoma from a patient without clinical features of neurofibromatosis type 1 or 2. The tumor was a typical schwannoma confirmed by standard neuropathologic criteria and expressed S100 by immunocytochemistry. NF1 gene expression in this tumor was examined by in situ hybridization using anmore » NF1-specific riboprobe, Northern blot analysis and reverse-transcribed (RT) PCR. Little or no expression of NF1 RNA could be detected using these methods whereas abundant expression of S100, cyclophilin and beta-action RNA was found in the tumor. Fibroblast and Schwann cells were then individually cultured from this schwannoma and the RNA extracted for Northern blot and RT-PCR analysis. In these cultured Schwann cells both from early and late passages, abundant expression of NF1 RNA could be detected. It is unlikely that our culture technique preferentially expanded {open_quotes}normal{close_quotes} Schwann cells, since NF1 acts as a tumor suppressor gene and its presence would not confer any growth advantage over the tumor-derived, neurofibromin-negative Schwann cells which presumably have an increased proliferation rate. Similarly, the conditions used to expand these Schwann cells do not result in increased NF1 gene expression as shown in previous studies. These results suggest that, in some tumors, expression of the NF1 gene can be downregulated by factors produced within the tumor and that this type of tumor suppressor gene downregulation may represent another mechanism other than mutation for turning off the expression of these growth-suppressing genes and allowing for cell proliferation in tumors.« less
Transcription and translation of the sigG gene is tuned for proper execution of the switch from early to late gene expression in the developing Bacillus subtilis spore

PubMed Central

Mearls, Elizabeth B.; Jackter, Jacquelin; Colquhoun, Jennifer M.; Matthews, Allison J.; Fenton, Colleen

2018-01-01

A cascade of alternative sigma factors directs developmental gene expression during spore formation by the bacterium Bacillus subtilis. As the spore develops, a tightly regulated switch occurs in which the early-acting sigma factor σF is replaced by the late-acting sigma factor σG. The gene encoding σG (sigG) is transcribed by σF and by σG itself in an autoregulatory loop; yet σG activity is not detected until σF-dependent gene expression is complete. This separation in σF and σG activities has been suggested to be due at least in part to a poorly understood intercellular checkpoint pathway that delays sigG expression by σF. Here we report the results of a careful examination of sigG expression during sporulation. Unexpectedly, our findings argue against the existence of a regulatory mechanism to delay sigG transcription by σF and instead support a model in which sigG is transcribed by σF with normal timing, but at levels that are very low. This low-level expression of sigG is the consequence of several intrinsic features of the sigG regulatory and coding sequence—promoter spacing, secondary structure potential of the mRNA, and start codon identity—that dampen its transcription and translation. Especially notable is the presence of a conserved hairpin in the 5’ leader sequence of the sigG mRNA that occludes the ribosome-binding site, reducing translation by up to 4-fold. Finally, we demonstrate that misexpression of sigG from regulatory and coding sequences lacking these features triggers premature σG activity in the forespore during sporulation, as well as inappropriate σG activity during vegetative growth. Altogether, these data indicate that transcription and translation of the sigG gene is tuned to prevent vegetative expression of σG and to ensure the precise timing of the switch from σF to σG in the developing spore. PMID:29702640
Maintenance and Loss of Duplicated Genes by Dosage Subfunctionalization.

PubMed

Gout, Jean-Francois; Lynch, Michael

2015-08-01

Whole-genome duplications (WGDs) have contributed to gene-repertoire enrichment in many eukaryotic lineages. However, most duplicated genes are eventually lost and it is still unclear why some duplicated genes are evolutionary successful whereas others quickly turn to pseudogenes. Here, we show that dosage constraints are major factors opposing post-WGD gene loss in several Paramecium species that share a common ancestral WGD. We propose a model where a majority of WGD-derived duplicates preserve their ancestral function and are retained to produce enough of the proteins performing this same ancestral function. Under this model, the expression level of individual duplicated genes can evolve neutrally as long as they maintain a roughly constant summed expression, and this allows random genetic drift toward uneven contributions of the two copies to total expression. Our analysis suggests that once a high level of imbalance is reached, which can require substantial lengths of time, the copy with the lowest expression level contributes a small enough fraction of the total expression that selection no longer opposes its loss. Extension of our analysis to yeast species sharing a common ancestral WGD yields similar results, suggesting that duplicated-gene retention for dosage constraints followed by divergence in expression level and eventual deterministic gene loss might be a universal feature of post-WGD evolution. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Transcriptome analysis identifies genes involved in sex determination and development of Xenopus laevis gonads.

PubMed

Piprek, Rafal P; Damulewicz, Milena; Kloc, Malgorzata; Kubiak, Jacek Z

Development of the gonads is a complex process, which starts with a period of undifferentiated, bipotential gonads. During this period the expression of sex-determining genes is initiated. Sex determination is a process triggering differentiation of the gonads into the testis or ovary. Sex determination period is followed by sexual differentiation, i.e. appearance of the first testis- and ovary-specific features. In Xenopus laevis W-linked DM-domain gene (DM-W) had been described as a master determinant of the gonadal female sex. However, the data on the expression and function of other genes participating in gonad development in X. laevis, and in anurans, in general, are very limited. We applied microarray technique to analyze the expression pattern of a subset of X. laevis genes previously identified to be involved in gonad development in several vertebrate species. We also analyzed the localization and the expression level of proteins encoded by these genes in developing X. laevis gonads. These analyses pointed to the set of genes differentially expressed in developing testes and ovaries. Gata4, Sox9, Dmrt1, Amh, Fgf9, Ptgds, Pdgf, Fshr, and Cyp17a1 expression was upregulated in developing testes, while DM-W, Fst, Foxl2, and Cyp19a1 were upregulated in developing ovaries. We discuss the possible roles of these genes in development of X. laevis gonads. Copyright © 2018 International Society of Differentiation. Published by Elsevier B.V. All rights reserved.
A Pathway Based Classification Method for Analyzing Gene Expression for Alzheimer's Disease Diagnosis.

PubMed

Voyle, Nicola; Keohane, Aoife; Newhouse, Stephen; Lunnon, Katie; Johnston, Caroline; Soininen, Hilkka; Kloszewska, Iwona; Mecocci, Patrizia; Tsolaki, Magda; Vellas, Bruno; Lovestone, Simon; Hodges, Angela; Kiddle, Steven; Dobson, Richard Jb

2016-01-01

Recent studies indicate that gene expression levels in blood may be able to differentiate subjects with Alzheimer's disease (AD) from normal elderly controls and mild cognitively impaired (MCI) subjects. However, there is limited replicability at the single marker level. A pathway-based interpretation of gene expression may prove more robust. This study aimed to investigate whether a case/control classification model built on pathway level data was more robust than a gene level model and may consequently perform better in test data. The study used two batches of gene expression data from the AddNeuroMed (ANM) and Dementia Case Registry (DCR) cohorts. Our study used Illumina Human HT-12 Expression BeadChips to collect gene expression from blood samples. Random forest modeling with recursive feature elimination was used to predict case/control status. Age and APOE ɛ4 status were used as covariates for all analysis. Gene and pathway level models performed similarly to each other and to a model based on demographic information only. Any potential increase in concordance from the novel pathway level approach used here has not lead to a greater predictive ability in these datasets. However, we have only tested one method for creating pathway level scores. Further, we have been able to benchmark pathways against genes in datasets that had been extensively harmonized. Further work should focus on the use of alternative methods for creating pathway level scores, in particular those that incorporate pathway topology, and the use of an endophenotype based approach.
Featured Article: Transcriptional landscape analysis identifies differently expressed genes involved in follicle-stimulating hormone induced postmenopausal osteoporosis.

PubMed

Maasalu, Katre; Laius, Ott; Zhytnik, Lidiia; Kõks, Sulev; Prans, Ele; Reimann, Ene; Märtson, Aare

2017-01-01

Osteoporosis is a disorder associated with bone tissue reorganization, bone mass, and mineral density. Osteoporosis can severely affect postmenopausal women, causing bone fragility and osteoporotic fractures. The aim of the current study was to compare blood mRNA profiles of postmenopausal women with and without osteoporosis, with the aim of finding different gene expressions and thus targets for future osteoporosis biomarker studies. Our study consisted of transcriptome analysis of whole blood serum from 12 elderly female osteoporotic patients and 12 non-osteoporotic elderly female controls. The transcriptome analysis was performed with RNA sequencing technology. For data analysis, the edgeR package of R Bioconductor was used. Two hundred and fourteen genes were expressed differently in osteoporotic compared with non-osteoporotic patients. Statistical analysis revealed 20 differently expressed genes with a false discovery rate of less than 1.47 × 10 -4 among osteoporotic patients. The expression of 10 genes were up-regulated and 10 down-regulated. Further statistical analysis identified a potential osteoporosis mRNA biomarker pattern consisting of six genes: CACNA1G, ALG13, SBK1, GGT7, MBNL3, and RIOK3. Functional ingenuity pathway analysis identified the strongest candidate genes with regard to potential involvement in a follicle-stimulating hormone activated network of increased osteoclast activity and hypogonadal bone loss. The differentially expressed genes identified in this study may contribute to future research of postmenopausal osteoporosis blood biomarkers.
Amphioxus and lamprey AP-2 genes: implications for neural crest evolution and migration patterns

NASA Technical Reports Server (NTRS)

Meulemans, Daniel; Bronner-Fraser, Marianne

2002-01-01

The neural crest is a uniquely vertebrate cell type present in the most basal vertebrates, but not in cephalochordates. We have studied differences in regulation of the neural crest marker AP-2 across two evolutionary transitions: invertebrate to vertebrate, and agnathan to gnathostome. Isolation and comparison of amphioxus, lamprey and axolotl AP-2 reveals its extensive expansion in the vertebrate dorsal neural tube and pharyngeal arches, implying co-option of AP-2 genes by neural crest cells early in vertebrate evolution. Expression in non-neural ectoderm is a conserved feature in amphioxus and vertebrates, suggesting an ancient role for AP-2 genes in this tissue. There is also common expression in subsets of ventrolateral neurons in the anterior neural tube, consistent with a primitive role in brain development. Comparison of AP-2 expression in axolotl and lamprey suggests an elaboration of cranial neural crest patterning in gnathostomes. However, migration of AP-2-expressing neural crest cells medial to the pharyngeal arch mesoderm appears to be a primitive feature retained in all vertebrates. Because AP-2 has essential roles in cranial neural crest differentiation and proliferation, the co-option of AP-2 by neural crest cells in the vertebrate lineage was a potentially crucial event in vertebrate evolution.
Transgenic mice expressing mutant Pinin exhibit muscular dystrophy, nebulin deficiency and elevated expression of slow-type muscle fiber genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Hsu-Pin; Hsu, Shu-Yuan; Wu, Wen-Ai

Highlights: •Pnn CCD domain functions as a dominant negative mutant regulating Pnn expression and function. •Pnn CCD mutant Tg mice have a muscle wasting phenotype during development and show dystrophic histological features. •Pnn mutant muscles are susceptible to slow fiber type gene transition and NEB reduction. •The Tg mouse generated by overexpression of the Pnn CCD domain displays many characteristics resembling NEB{sup +/−} mice. -- Abstract: Pinin (Pnn) is a nuclear speckle-associated SR-like protein. The N-terminal region of the Pnn protein sequence is highly conserved from mammals to insects, but the C-terminal RS domain-containing region is absent in lower species.more » The N-terminal coiled-coil domain (CCD) is, therefore, of interest not only from a functional point of view, but also from an evolutionarily standpoint. To explore the biological role of the Pnn CCD in a physiological context, we generated transgenic mice overexpressing Pnn mutant in skeletal muscle. We found that overexpression of the CCD reduces endogenous Pnn expression in cultured cell lines as well as in transgenic skeletal muscle fibers. Pnn mutant mice exhibited reduced body mass and impaired muscle function during development. Mutant skeletal muscles show dystrophic histological features with muscle fibers heavily loaded with centrally located myonuclei. Expression profiling and pathway analysis identified over-representation of genes in gene categories associated with muscle contraction, specifically those related to slow type fiber. In addition nebulin (NEB) expression level is repressed in Pnn mutant skeletal muscle. We conclude that Pnn downregulation in skeletal muscle causes a muscular dystrophic phenotype associated with NEB deficiency and the CCD domain is incapable of replacing full length Pnn in terms of functional capacity.« less
Generation of mammalian cells stably expressing multiple genes at predetermined levels.

PubMed

Liu, X; Constantinescu, S N; Sun, Y; Bogan, J S; Hirsch, D; Weinberg, R A; Lodish, H F

2000-04-10

Expression of cloned genes at desired levels in cultured mammalian cells is essential for studying protein function. Controlled levels of expression have been difficult to achieve, especially for cell lines with low transfection efficiency or when expression of multiple genes is required. An internal ribosomal entry site (IRES) has been incorporated into many types of expression vectors to allow simultaneous expression of two genes. However, there has been no systematic quantitative analysis of expression levels in individual cells of genes linked by an IRES, and thus the broad use of these vectors in functional analysis has been limited. We constructed a set of retroviral expression vectors containing an IRES followed by a quantitative selectable marker such as green fluorescent protein (GFP) or truncated cell surface proteins CD2 or CD4. The gene of interest is placed in a multiple cloning site 5' of the IRES sequence under the control of the retroviral long terminal repeat (LTR) promoter. These vectors exploit the approximately 100-fold differences in levels of expression of a retrovirus vector depending on its site of insertion in the host chromosome. We show that the level of expression of the gene downstream of the IRES and the expression level and functional activity of the gene cloned upstream of the IRES are highly correlated in stably infected target cells. This feature makes our vectors extremely useful for the rapid generation of stably transfected cell populations or clonal cell lines expressing specific amounts of a desired protein simply by fluorescent activated cell sorting (FACS) based on the level of expression of the gene downstream of the IRES. We show how these vectors can be used to generate cells expressing high levels of the erythropoietin receptor (EpoR) or a dominant negative Smad3 protein and to generate cells expressing two different cloned proteins, Ski and Smad4. Correlation of a biologic effect with the level of expression of the protein downstream of the IRES provides strong evidence for the function of the protein placed upstream of the IRES.
Integrative topological analysis of mass spectrometry data reveals molecular features with clinical relevance in esophageal squamous cell carcinoma

PubMed Central

Gao, She-Gan; Liu, Rui-Min; Zhao, Yun-Gang; Wang, Pei; Ward, Douglas G.; Wang, Guang-Chao; Guo, Xiang-Qian; Gu, Juan; Niu, Wan-Bin; Zhang, Tian; Martin, Ashley; Guo, Zhi-Peng; Feng, Xiao-Shan; Qi, Yi-Jun; Ma, Yuan-Fang

2016-01-01

Combining MS-based proteomic data with network and topological features of such network would identify more clinically relevant molecules and meaningfully expand the repertoire of proteins derived from MS analysis. The integrative topological indexes representing 95.96% information of seven individual topological measures of node proteins were calculated within a protein-protein interaction (PPI) network, built using 244 differentially expressed proteins (DEPs) identified by iTRAQ 2D-LC-MS/MS. Compared with DEPs, differentially expressed genes (DEGs) and comprehensive features (CFs), structurally dominant nodes (SDNs) based on integrative topological index distribution produced comparable classification performance in three different clinical settings using five independent gene expression data sets. The signature molecules of SDN-based classifier for distinction of early from late clinical TNM stages were enriched in biological traits of protein synthesis, intracellular localization and ribosome biogenesis, which suggests that ribosome biogenesis represents a promising therapeutic target for treating ESCC. In addition, ITGB1 expression selected exclusively by integrative topological measures correlated with clinical stages and prognosis, which was further validated with two independent cohorts of ESCC samples. Thus the integrative topological analysis of PPI networks proposed in this study provides an alternative approach to identify potential biomarkers and therapeutic targets from MS/MS data with functional insights in ESCC. PMID:26898710
Screening key candidate genes and pathways involved in insulinoma by microarray analysis.

PubMed

Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin

2018-06-01

Insulinoma is a rare type tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma.The gene expression data from GSE73338 were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues.A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis.The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecule events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.
Gene expression changes in female zebrafish (Danio rerio) brain in response to acute exposure to methylmercury

USGS Publications Warehouse

Richter, Catherine A.; Garcia-Reyero, Natàlia; Martyniuk, Chris; Knoebl, Iris; Pope, Marie; Wright-Osment, Maureen K.; Denslow, Nancy D.; Tillitt, Donald E.

2011-01-01

Methylmercury (MeHg) is a potent neurotoxicant and endocrine disruptor that accumulates in aquatic systems. Previous studies have shown suppression of hormone levels in both male and female fish, suggesting effects on gonadotropin regulation in the brain. The gene expression profile in adult female zebrafish whole brain induced by acute (96 h) MeHg exposure was investigated. Fish were exposed by injection to 0 or 0.5(mu or u)g MeHg/g. Gene expression changes in the brain were examined using a 22,000-feature zebrafish microarray. At a significance level of p
Systematic genomic identification of colorectal cancer genes delineating advanced from early clinical stage and metastasis

PubMed Central

2013-01-01

Background Colorectal cancer is the third leading cause of cancer deaths in the United States. The initial assessment of colorectal cancer involves clinical staging that takes into account the extent of primary tumor invasion, determining the number of lymph nodes with metastatic cancer and the identification of metastatic sites in other organs. Advanced clinical stage indicates metastatic cancer, either in regional lymph nodes or in distant organs. While the genomic and genetic basis of colorectal cancer has been elucidated to some degree, less is known about the identity of specific cancer genes that are associated with advanced clinical stage and metastasis. Methods We compiled multiple genomic data types (mutations, copy number alterations, gene expression and methylation status) as well as clinical meta-data from The Cancer Genome Atlas (TCGA). We used an elastic-net regularized regression method on the combined genomic data to identify genetic aberrations and their associated cancer genes that are indicators of clinical stage. We ranked candidate genes by their regression coefficient and level of support from multiple assay modalities. Results A fit of the elastic-net regularized regression to 197 samples and integrated analysis of four genomic platforms identified the set of top gene predictors of advanced clinical stage, including: WRN, SYK, DDX5 and ADRA2C. These genetic features were identified robustly in bootstrap resampling analysis. Conclusions We conducted an analysis integrating multiple genomic features including mutations, copy number alterations, gene expression and methylation. This integrated approach in which one considers all of these genomic features performs better than any individual genomic assay. We identified multiple genes that robustly delineate advanced clinical stage, suggesting their possible role in colorectal cancer metastatic progression. PMID:24308539
Photomodulating Gene Expression by Using Caged siRNAs with Single-Aptamer Modification.

PubMed

Zhang, Liangliang; Chen, Changmai; Fan, Xinli; Tang, Xinjing

2018-06-18

Caged siRNAs incorporating terminal modification were rationally designed for photochemical regulation of gene silencing induced by RNA interference (RNAi). Through the conjugation of a single oligonucleotide aptamer at the 5' terminus of the antisense RNA strand, enhancement of the blocking effect for RNA-induced silencing complex (RISC) formation/processing was expected, due both/either to the aptamers themselves and/or to their interaction with large binding proteins. Two oligonucleotide aptamers (AS1411 and MUC-1) were chosen for aptamer-siRNA conjugation through a photolabile linker. This caging strategy was successfully used to photoregulate gene expression both of firefly luciferase and of green fluorescent protein (GFP) in cells. Further patterning experiments revealed that spatial regulation of GFP expression was successfully achieved by using the aptamer-modified caged siRNA and light activation. We expect that further optimized caged siRNAs featuring aptamer conjugation will be promising for practical applications to spatiotemporal photoregulation of gene expression in the future. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Gene expression profiling--Opening the black box of plant ecosystem responses to global change

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leakey, A.D.B.; Ainsworth, E.A.; Bernard, S.M.

The use of genomic techniques to address ecological questions is emerging as the field of genomic ecology. Experimentation under environmentally realistic conditions to investigate the molecular response of plants to meaningful changes in growth conditions and ecological interactions is the defining feature of genomic ecology. Since the impact of global change factors on plant performance are mediated by direct effects at the molecular, biochemical and physiological scales, gene expression analysis promises important advances in understanding factors that have previously been consigned to the 'black box' of unknown mechanism. Various tools and approaches are available for assessing gene expression in modelmore » and non-model species as part of global change biology studies. Each approach has its own unique advantages and constraints. A first generation of genomic ecology studies in managed ecosystems and mesocosms have provided a testbed for the approach and have begun to reveal how the experimental design and data analysis of gene expression studies can be tailored for use in an ecological context.« less
NCBI GEO: mining millions of expression profiles--database and tools.

PubMed

Barrett, Tanya; Suzek, Tugba O; Troup, Dennis B; Wilhite, Stephen E; Ngau, Wing-Chi; Ledoux, Pierre; Rudnev, Dmitry; Lash, Alex E; Fujibuchi, Wataru; Edgar, Ron

2005-01-01

The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest fully public repository for high-throughput molecular abundance data, primarily gene expression data. The database has a flexible and open design that allows the submission, storage and retrieval of many data types. These data include microarray-based experiments measuring the abundance of mRNA, genomic DNA and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. GEO currently holds over 30,000 submissions representing approximately half a billion individual molecular abundance measurements, for over 100 organisms. Here, we describe recent database developments that facilitate effective mining and visualization of these data. Features are provided to examine data from both experiment- and gene-centric perspectives using user-friendly Web-based interfaces accessible to those without computational or microarray-related analytical expertise. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.
Transcriptomic and macroevolutionary evidence for phenotypic uncoupling between frog life history phases

PubMed Central

Wollenberg Valero, Katharina C.; Garcia-Porta, Joan; Rodríguez, Ariel; Arias, Mónica; Shah, Abhijeet; Randrianiaina, Roger Daniel; Brown, Jason L.; Glaw, Frank; Amat, Felix; Künzel, Sven; Metzler, Dirk; Isokpehi, Raphael D.; Vences, Miguel

2017-01-01

Anuran amphibians undergo major morphological transitions during development, but the contribution of their markedly different life-history phases to macroevolution has rarely been analysed. Here we generate testable predictions for coupling versus uncoupling of phenotypic evolution of tadpole and adult life-history phases, and for the underlying expression of genes related to morphological feature formation. We test these predictions by combining evidence from gene expression in two distantly related frogs, Xenopus laevis and Mantidactylus betsileanus, with patterns of morphological evolution in the entire radiation of Madagascan mantellid frogs. Genes linked to morphological structure formation are expressed in a highly phase-specific pattern, suggesting uncoupling of phenotypic evolution across life-history phases. This gene expression pattern agrees with uncoupled rates of trait evolution among life-history phases in the mantellids, which we show to have undergone an adaptive radiation. Our results validate a prevalence of uncoupling in the evolution of tadpole and adult phenotypes of frogs. PMID:28504275
An intronless form of the tobacco extensin gene terminator strongly enhances transient gene expression in plant leaves.

PubMed

Rosenthal, Sun Hee; Diamos, Andrew G; Mason, Hugh S

2018-03-01

We have found interesting features of a plant gene (extensin) 3' flanking region, including extremely efficient polyadenylation which greatly improves transient expression of transgenes when an intron is removed. Its use will greatly benefit studies of gene expression in plants, research in molecular biology, and applications for recombinant proteins. Plants are a promising platform for the production of recombinant proteins. To express high-value proteins in plants efficiently, the optimization of expression cassettes using appropriate regulatory sequences is critical. Here, we characterize the activity of the tobacco extensin (Ext) gene terminator by transient expression in Nicotiana benthamiana, tobacco, and lettuce. Ext is a member of the hydroxyproline-rich glycoprotein (HRGP) superfamily and constitutes the major protein component of cell walls. The present study demonstrates that the Ext terminator with its native intron removed increased transient gene expression up to 13.5-fold compared to previously established terminators. The enhanced transgene expression was correlated with increased mRNA accumulation and reduced levels of read-through transcripts, which could impair gene expression. Analysis of transcript 3'-ends found that the majority of polyadenylated transcripts were cleaved at a YA dinucleotide downstream from a canonical AAUAAA motif and a UG-rich region, both of which were found to be highly conserved among related extensin terminators. Deletion of either of these regions eliminated most of the activity of the terminator. Additionally, a 45 nt polypurine sequence ~ 175 nt upstream from the polyadenylation sites was found to also be necessary for the enhanced expression. We conclude that the use of Ext terminator has great potential to benefit the production of recombinant proteins in plants.
Overlapping but distinct topology for zebrafish V2R-like olfactory receptors reminiscent of odorant receptor spatial expression zones.

PubMed

Ahuja, Gaurav; Reichel, Vera; Kowatschew, Daniel; Syed, Adnan S; Kotagiri, Aswani Kumar; Oka, Yuichiro; Weth, Franco; Korsching, Sigrun I

2018-05-23

The sense of smell is unrivaled in terms of molecular complexity of its input channels. Even zebrafish, a model vertebrate system in many research fields including olfaction, possesses several hundred different olfactory receptor genes, organized in four different gene families. For one of these families, the initially discovered odorant receptors proper, segregation of expression into distinct spatial subdomains within a common sensory surface has been observed both in teleost fish and in mammals. However, for the remaining three families, little to nothing was known about their spatial coding logic. Here we wished to investigate, whether the principle of spatial segregation observed for odorant receptors extends to another olfactory receptor family, the V2R-related OlfC genes. Furthermore we thought to examine, how expression of OlfC genes is integrated into expression zones of odorant receptor genes, which in fish share a single sensory surface with OlfC genes. To select representative genes, we performed a comprehensive phylogenetic study of the zebrafish OlfC family, which identified a novel OlfC gene, reduced the number of pseudogenes to 1, and brought the total family size to 60 intact OlfC receptors. We analyzed the spatial pattern of OlfC-expressing cells for seven representative receptors in three dimensions (height within the epithelial layer, horizontal distance from the center of the olfactory organ, and height within the olfactory organ). We report non-random distributions of labeled neurons for all OlfC genes analysed. Distributions for sparsely expressed OlfC genes are significantly different from each other in nearly all cases, broad overlap notwithstanding. For two of the three coordinates analyzed, OlfC expression zones are intercalated with those of odorant receptor zones, whereas in the third dimension some segregation is observed. Our results show that V2R-related OlfC genes follow the same spatial logic of expression as odorant receptors and their expression zones intermingle with those of odorant receptor genes. Thus, distinctly different expression zones for individual receptor genes constitute a general feature shared by teleost and tetrapod V2R/OlfC and odorant receptor families alike.

Landscape of the spliced leader trans-splicing mechanism in Schistosoma mansoni.

PubMed

Boroni, Mariana; Sammeth, Michael; Gava, Sandra Grossi; Jorge, Natasha Andressa Nogueira; Macedo, Andréa Mara; Machado, Carlos Renato; Mourão, Marina Moraes; Franco, Glória Regina

2018-03-01

Spliced leader dependent trans-splicing (SLTS) has been described as an important RNA regulatory process that occurs in different organisms, including the trematode Schistosoma mansoni. We identified more than seven thousand putative SLTS sites in the parasite, comprising genes with a wide spectrum of functional classes, which underlines the SLTS as a ubiquitous mechanism in the parasite. Also, SLTS gene expression levels span several orders of magnitude, showing that SLTS frequency is not determined by the expression level of the target gene, but by the presence of particular gene features facilitating or hindering the trans-splicing mechanism. Our in-depth investigation of SLTS events demonstrates widespread alternative trans-splicing (ATS) acceptor sites occurring in different regions along the entire gene body, highlighting another important role of SLTS generating alternative RNA isoforms in the parasite, besides the polycistron resolution. Particularly for introns where SLTS directly competes for the same acceptor substrate with cis-splicing, we identified for the first time additional and important features that might determine the type of splicing. Our study substantially extends the current knowledge of RNA processing by SLTS in S. mansoni, and provide basis for future studies on the trans-splicing mechanism in other eukaryotes.
Neutrophil elastase increases MUC5AC mRNA and protein expression in respiratory epithelial cells.

PubMed

Voynow, J A; Young, L R; Wang, Y; Horger, T; Rose, M C; Fischer, B M

1999-05-01

Chronic neutrophil-predominant inflammation and hypersecretion of mucus are common pathophysiological features of cystic fibrosis, chronic bronchitis, and viral- or pollution-triggered asthma. Neutrophils release elastase, a serine protease, that causes increased mucin production and secretion. The molecular mechanisms of elastase-induced mucin production are unknown. We hypothesized that as part of this mechanism, elastase upregulates expression of a major respiratory mucin gene, MUC5AC. A549, a human lung carcinoma cell line that expresses MUC5AC mRNA and protein, and normal human bronchial epithelial cells in an air-liquid interface culture were stimulated with neutrophil elastase. Neutrophil elastase increased MUC5AC mRNA levels in a time-dependent manner in both cell culture systems. Neutrophil elastase treatment also increased MUC5AC protein levels in A549 cells. The mechanism of MUC5AC gene regulation by elastase was determined in A549 cells. The induction of MUC5AC gene expression required serine protease activity; other classes of proteases had no effect on MUC5AC gene expression. Neutrophil elastase increased MUC5AC mRNA levels by enhancing mRNA stability. This is the first report of mucin gene regulation by this mechanism.
Advanced Design of Dumbbell-shaped Genetic Minimal Vectors Improves Non-coding and Coding RNA Expression.

PubMed

Jiang, Xiaoou; Yu, Han; Teo, Cui Rong; Tan, Genim Siu Xian; Goh, Sok Chin; Patel, Parasvi; Chua, Yiqiang Kevin; Hameed, Nasirah Banu Sahul; Bertoletti, Antonio; Patzel, Volker

2016-09-01

Dumbbell-shaped DNA minimal vectors lacking nontherapeutic genes and bacterial sequences are considered a stable, safe alternative to viral, nonviral, and naked plasmid-based gene-transfer systems. We investigated novel molecular features of dumbbell vectors aiming to reduce vector size and to improve the expression of noncoding or coding RNA. We minimized small hairpin RNA (shRNA) or microRNA (miRNA) expressing dumbbell vectors in size down to 130 bp generating the smallest genetic expression vectors reported. This was achieved by using a minimal H1 promoter with integrated transcriptional terminator transcribing the RNA hairpin structure around the dumbbell loop. Such vectors were generated with high conversion yields using a novel protocol. Minimized shRNA-expressing dumbbells showed accelerated kinetics of delivery and transcription leading to enhanced gene silencing in human tissue culture cells. In primary human T cells, minimized miRNA-expressing dumbbells revealed higher stability and triggered stronger target gene suppression as compared with plasmids and miRNA mimics. Dumbbell-driven gene expression was enhanced up to 56- or 160-fold by implementation of an intron and the SV40 enhancer compared with control dumbbells or plasmids. Advanced dumbbell vectors may represent one option to close the gap between durable expression that is achievable with integrating viral vectors and short-term effects triggered by naked RNA.
Gene expression changes in response to aging compared to heat stress, oxidative stress and ionizing radiation in Drosophila melanogaster.

PubMed

Landis, Gary; Shen, Jie; Tower, John

2012-11-01

Gene expression changes in response to aging, heat stress, hyperoxia, hydrogen peroxide, and ionizing radiation were compared using microarrays. A set of 18 genes were up-regulated across all conditions, indicating a general stress response shared with aging, including the heat shock protein (Hsp) genes Hsp70, Hsp83 and l(2)efl, the glutathione-S-transferase gene GstD2, and the mitochondrial unfolded protein response (mUPR) gene ref(2)P. Selected gene expression changes were confirmed using quantitative PCR, Northern analysis and GstD-GFP reporter constructs. Certain genes were altered in only a subset of the conditions, for example, up-regulation of numerous developmental pathway and signaling genes in response to hydrogen peroxide. While aging shared features with each stress, aging was more similar to the stresses most associated with oxidative stress (hyperoxia, hydrogen peroxide, ionizing radiation) than to heat stress. Aging is associated with down-regulation of numerous mitochondrial genes, including electron-transport-chain (ETC) genes and mitochondrial metabolism genes, and a sub-set of these changes was also observed upon hydrogen peroxide stress and ionizing radiation stress. Aging shared the largest number of gene expression changes with hyperoxia. The extensive down-regulation of mitochondrial and ETC genes during aging is consistent with an aging-associated failure in mitochondrial maintenance, which may underlie the oxidative stress-like and proteotoxic stress-like responses observed during aging.
Gene expression changes in response to aging compared to heat stress, oxidative stress and ionizing radiation in Drosophila melanogaster

PubMed Central

Landis, Gary; Shen, Jie; Tower, John

2012-01-01

Gene expression changes in response to aging, heat stress, hyperoxia, hydrogen peroxide, and ionizing radiation were compared using microarrays. A set of 18 genes were up-regulated across all conditions, indicating a general stress response shared with aging, including the heat shock protein (Hsp) genes Hsp70, Hsp83 and l(2)efl, the glutathione-S-transferase gene GstD2, and the mitochondrial unfolded protein response (mUPR) gene ref(2)P. Selected gene expression changes were confirmed using quantitative PCR, Northern analysis and GstD-GFP reporter constructs. Certain genes were altered in only a subset of the conditions, for example, up-regulation of numerous developmental pathway and signaling genes in response to hydrogen peroxide. While aging shared features with each stress, aging was more similar to the stresses most associated with oxidative stress (hyperoxia, hydrogen peroxide, ionizing radiation) than to heat stress. Aging is associated with down-regulation of numerous mitochondrial genes, including electron-transport-chain (ETC) genes and mitochondrial metabolism genes, and a sub-set of these changes was also observed upon hydrogen peroxide stress and ionizing radiation stress. Aging shared the largest number of gene expression changes with hyperoxia. The extensive down-regulation of mitochondrial and ETC genes during aging is consistent with an aging-associated failure in mitochondrial maintenance, which may underlie the oxidative stress-like and proteotoxic stress-like responses observed during aging. PMID:23211361
Primetime for Learning Genes.

PubMed

Keifer, Joyce

2017-02-11

Learning genes in mature neurons are uniquely suited to respond rapidly to specific environmental stimuli. Expression of individual learning genes, therefore, requires regulatory mechanisms that have the flexibility to respond with transcriptional activation or repression to select appropriate physiological and behavioral responses. Among the mechanisms that equip genes to respond adaptively are bivalent domains. These are specific histone modifications localized to gene promoters that are characteristic of both gene activation and repression, and have been studied primarily for developmental genes in embryonic stem cells. In this review, studies of the epigenetic regulation of learning genes in neurons, particularly the brain-derived neurotrophic factor gene ( BDNF ), by methylation/demethylation and chromatin modifications in the context of learning and memory will be highlighted. Because of the unique function of learning genes in the mature brain, it is proposed that bivalent domains are a characteristic feature of the chromatin landscape surrounding their promoters. This allows them to be "poised" for rapid response to activate or repress gene expression depending on environmental stimuli.
Spermatogenesis Drives Rapid Gene Creation and Masculinization of the X Chromosome in Stalk-Eyed Flies (Diopsidae).

PubMed

Baker, Richard H; Narechania, Apurva; DeSalle, Rob; Johns, Philip M; Reinhardt, Josephine A; Wilkinson, Gerald S

2016-03-26

Throughout their evolutionary history, genomes acquire new genetic material that facilitates phenotypic innovation and diversification. Developmental processes associated with reproduction are particularly likely to involve novel genes. Abundant gene creation impacts the evolution of chromosomal gene content and general regulatory mechanisms such as dosage compensation. Numerous studies in model organisms have found complex and, at times contradictory, relationships among these genomic attributes highlighting the need to examine these patterns in other systems characterized by abundant sexual selection. Therefore, we examined the association among novel gene creation, tissue-specific gene expression, and chromosomal gene content within stalk-eyed flies. Flies in this family are characterized by strong sexual selection and the presence of a newly evolved X chromosome. We generated RNA-seq transcriptome data from the testes for three species within the family and from seven additional tissues in the highly dimorphic species,Teleopsis dalmanni Analysis of dipteran gene orthology reveals dramatic testes-specific gene creation in stalk-eyed flies, involving numerous gene families that are highly conserved in other insect groups. Identification of X-linked genes for the three species indicates that the X chromosome arose prior to the diversification of the family. The most striking feature of this X chromosome is that it is highly masculinized, containing nearly twice as many testes-specific genes as expected based on its size. All the major processes that may drive differential sex chromosome gene content-creation of genes with male-specific expression, development of male-specific expression from pre-existing genes, and movement of genes with male-specific expression-are elevated on the X chromosome ofT. dalmanni This masculinization occurs despite evidence that testes expressed genes do not achieve the same levels of gene expression on the X chromosome as they do on the autosomes. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Strategies for comparing gene expression profiles from different microarray platforms: application to a case-control experiment.

PubMed

Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina

2006-06-01

Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.
A novel role for the Bombyx Slbo homologue, BmC/EBP, in insect choriogenesis.

PubMed

Sourmeli, S; Papantonis, A; Lecanidou, R

2005-11-18

One previously unidentified cDNA clone coding for a C/EBP factor, BmC/EBP, was isolated from Bombyx mori follicular cells. This is the first time that a C/EBP factor has been isolated and characterized in Lepidoptera. We provide information concerning structural features and developmental specificity, as well as in vitro interaction properties with chorion gene promoter modules. BmC/EBP was capable of effectively recognizing homologous binding sites from chorion gene promoters derived from flies and other moths, despite significant diversity of chorion structure, gene organization, and gene expression profiles. We propose that the relative concentration of BmC/EBP, in relation to its differential binding affinity for promoter cis-elements, results in activation or repression of silkmoth chorion gene expression.
Non-syndromic odontogenic keratocysts: A rare case report

PubMed Central

Kurdekar, Raghavendra S.; Prakash, Jeevan; Rana, A. S.; Kalra, Puneet

2013-01-01

Odontogenic keratocysts are very well documented in the literature. Multiple odontogenic keratocysts (OKCs) are one of the most frequent features of nevoid basal cell carcinoma syndrome (NBCCS). It is linked with mutation in the PTCH gene (human homolog of the drosophila segment polarity gene, “patched”,). Partial expression of the gene may result in occurrence of only multiple recurring OKC without any associated systemic findings. A rare case of multiple odontogenic keratocysts unassociated with any syndrome is reported, so as to add to the growing number of such cases in the literature. The possibility of this case being a partial expression of the Gorlin-Goltz syndrome is discussed. PMID:24163561
An Adaptive Genetic Association Test Using Double Kernel Machines.

PubMed

Zhan, Xiang; Epstein, Michael P; Ghosh, Debashis

2015-10-01

Recently, gene set-based approaches have become very popular in gene expression profiling studies for assessing how genetic variants are related to disease outcomes. Since most genes are not differentially expressed, existing pathway tests considering all genes within a pathway suffer from considerable noise and power loss. Moreover, for a differentially expressed pathway, it is of interest to select important genes that drive the effect of the pathway. In this article, we propose an adaptive association test using double kernel machines (DKM), which can both select important genes within the pathway as well as test for the overall genetic pathway effect. This DKM procedure first uses the garrote kernel machines (GKM) test for the purposes of subset selection and then the least squares kernel machine (LSKM) test for testing the effect of the subset of genes. An appealing feature of the kernel machine framework is that it can provide a flexible and unified method for multi-dimensional modeling of the genetic pathway effect allowing for both parametric and nonparametric components. This DKM approach is illustrated with application to simulated data as well as to data from a neuroimaging genetics study.
The control of lambda DNA terminase synthesis.

PubMed Central

Murialdo, H; Davidson, A; Chow, S; Gold, M

1987-01-01

Nu1 and A, the genes coding for bacteriophage lambda DNA terminase, rank among the most poorly translated genes expressed in E. coli. To understand the reason for this low level of translation the genes were cloned into plasmids and their expression measured. In addition, the wild type DNA sequences immediately preceding the genes were reduced and modified. It was found that the elements that control translation are contained in the 100 base pairs upstream from the initiation codon. Interchanging these upstream sequences with those of an efficiently translated gene dramatically increased the translation of terminase subunits. It seems unlikely that the rare codons present in the genes, and any feature of their mRNA secondary structure play a role in the control of their translation. The elimination of cos from plasmids containing Nu1 and A also resulted in an increase in terminase production. This result suggests a role for cos in the control of late gene expression. The terminase subunit overproducer strains are potentially very useful for the design of improved DNA packaging and cosmid mapping techniques. Images PMID:3029667
Shutoff of Host Gene Expression in Influenza A Virus and Herpesviruses: Similar Mechanisms and Common Themes

PubMed Central

Rivas, Hembly G.; Schmaling, Summer K.; Gaglia, Marta M.

2016-01-01

The ability to shut off host gene expression is a shared feature of many viral infections, and it is thought to promote viral replication by freeing host cell machinery and blocking immune responses. Despite the molecular differences between viruses, an emerging theme in the study of host shutoff is that divergent viruses use similar mechanisms to enact host shutoff. Moreover, even viruses that encode few proteins often have multiple mechanisms to affect host gene expression, and we are only starting to understand how these mechanisms are integrated. In this review we discuss the multiplicity of host shutoff mechanisms used by the orthomyxovirus influenza A virus and members of the alpha- and gamma-herpesvirus subfamilies. We highlight the surprising similarities in their mechanisms of host shutoff and discuss how the different mechanisms they use may play a coordinated role in gene regulation. PMID:27092522
Characterizing the molecular features of ERG-positive tumors in primary and castration resistant prostate cancer.

PubMed

Roudier, Martine P; Winters, Brian R; Coleman, Ilsa; Lam, Hung-Ming; Zhang, Xiaotun; Coleman, Roger; Chéry, Lisly; True, Lawrence D; Higano, Celestia S; Montgomery, Bruce; Lange, Paul H; Snyder, Linda A; Srivastava, Shiv; Corey, Eva; Vessella, Robert L; Nelson, Peter S; Üren, Aykut; Morrissey, Colm

2016-06-01

The TMPRSS2-ERG gene fusion is detected in approximately half of primary prostate cancers (PCa) yet the prognostic significance remains unclear. We hypothesized that ERG promotes the expression of common genes in primary PCa and metastatic castration-resistant PCa (CRPC), with the objective of identifying ERG-associated pathways, which may promote the transition from primary PCa to CRPC. We constructed tissue microarrays (TMA) from 127 radical prostatectomy specimens, 20 LuCaP patient-derived xenografts (PDX), and 152 CRPC metastases obtained immediately at time of death. Nuclear ERG was assessed by immunohistochemistry (IHC). To characterize the molecular features of ERG-expressing PCa, a subset of IHC confirmed ERG+ or ERG- specimens including 11 radical prostatectomies, 20 LuCaP PDXs, and 45 CRPC metastases underwent gene expression analysis. Genes were ranked based on expression in primary PCa and CRPC. Common genes of interest were targeted for IHC analysis and expression compared with biochemical recurrence (BCR) status. IHC revealed that 43% of primary PCa, 35% of the LuCaP PDXs, and 18% of the CRPC metastases were ERG+ (12 of 48 patients [25%] had at least one ERG+ metastasis). Based on gene expression data and previous literature, two proteins involved in calcium signaling (NCALD, CACNA1D), a protein involved in inflammation (HLA-DMB), CD3 positive immune cells, and a novel ERG-associated protein, DCLK1 were evaluated in primary PCa and CRPC metastases. In ERG+ primary PCa, a weak association was seen with NCALD and CACNA1D protein expression. HLA-DMB association with ERG was decreased and CD3 cell number association with ERG was changed from positive to negative in CRPC metastases compared to primary PCa. DCLK1 was upregulated at the protein level in unpaired ERG+ primary PCa and CRPC metastases (P = 0.0013 and P < 0.0001, respectively). In primary PCa, ERG status or expression of targeted proteins was not associated with BCR-free survival. However, for primary PCa, ERG+DCLK1+ patients exhibited shorter time to BCR (P = 0.06) compared with ERG+DCLK1- patients. This study examined ERG expression in primary PCa and CRPC. We have identified altered levels of inflammatory mediators associated with ERG expression. We determined expression of DCLK1 correlates with ERG expression and may play a role in primary PCa progression to metastatic CPRC. Prostate 76:810-822, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
DiRE: identifying distant regulatory elements of co-expressed genes

PubMed Central

Gotea, Valer; Ovcharenko, Ivan

2008-01-01

Regulation of gene expression in eukaryotic genomes is established through a complex cooperative activity of proximal promoters and distant regulatory elements (REs) such as enhancers, repressors and silencers. We have developed a web server named DiRE, based on the Enhancer Identification (EI) method, for predicting distant regulatory elements in higher eukaryotic genomes, namely for determining their chromosomal location and functional characteristics. The server uses gene co-expression data, comparative genomics and profiles of transcription factor binding sites (TFBSs) to determine TFBS-association signatures that can be used for discriminating specific regulatory functions. DiRE's unique feature is its ability to detect REs outside of proximal promoter regions, as it takes advantage of the full gene locus to conduct the search. DiRE can predict common REs for any set of input genes for which the user has prior knowledge of co-expression, co-function or other biologically meaningful grouping. The server predicts function-specific REs consisting of clusters of specifically-associated TFBSs and it also scores the association of individual transcription factors (TFs) with the biological function shared by the group of input genes. Its integration with the Array2BIO server allows users to start their analysis with raw microarray expression data. The DiRE web server is freely available at http://dire.dcode.org. PMID:18487623
Distribution and regulation of stochasticity and plasticity in Saccharomyces cerevisiae

DOE PAGES

Dar, R. D.; Karig, D. K.; Cooke, J. F.; ...

2010-09-01

Stochasticity is an inherent feature of complex systems with nanoscale structure. In such systems information is represented by small collections of elements (e.g. a few electrons on a quantum dot), and small variations in the populations of these elements may lead to big uncertainties in the information. Unfortunately, little is known about how to work within this inherently noisy environment to design robust functionality into complex nanoscale systems. Here, we look to the biological cell as an intriguing model system where evolution has mediated the trade-offs between fluctuations and function, and in particular we look at the relationships and trade-offsmore » between stochastic and deterministic responses in the gene expression of budding yeast (Saccharomyces cerevisiae). We find gene regulatory arrangements that control the stochastic and deterministic components of expression, and show that genes that have evolved to respond to stimuli (stress) in the most strongly deterministic way exhibit the most noise in the absence of the stimuli. We show that this relationship is consistent with a bursty 2-state model of gene expression, and demonstrate that this regulatory motif generates the most uncertainty in gene expression when there is the greatest uncertainty in the optimal level of gene expression.« less
Mouse Hair Cycle Expression Dynamics Modeled as Coupled Mesenchymal and Epithelial Oscillators

PubMed Central

Tasseff, Ryan; Bheda-Malge, Anjali; DiColandrea, Teresa; Bascom, Charles C.; Isfort, Robert J.; Gelinas, Richard

2014-01-01

The hair cycle is a dynamic process where follicles repeatedly move through phases of growth, retraction, and relative quiescence. This process is an example of temporal and spatial biological complexity. Understanding of the hair cycle and its regulation would shed light on many other complex systems relevant to biological and medical research. Currently, a systematic characterization of gene expression and summarization within the context of a mathematical model is not yet available. Given the cyclic nature of the hair cycle, we felt it was important to consider a subset of genes with periodic expression. To this end, we combined several mathematical approaches with high-throughput, whole mouse skin, mRNA expression data to characterize aspects of the dynamics and the possible cell populations corresponding to potentially periodic patterns. In particular two gene clusters, demonstrating properties of out-of-phase synchronized expression, were identified. A mean field, phase coupled oscillator model was shown to quantitatively recapitulate the synchronization observed in the data. Furthermore, we found only one configuration of positive-negative coupling to be dynamically stable, which provided insight on general features of the regulation. Subsequent bifurcation analysis was able to identify and describe alternate states based on perturbation of system parameters. A 2-population mixture model and cell type enrichment was used to associate the two gene clusters to features of background mesenchymal populations and rapidly expanding follicular epithelial cells. Distinct timing and localization of expression was also shown by RNA and protein imaging for representative genes. Taken together, the evidence suggests that synchronization between expanding epithelial and background mesenchymal cells may be maintained, in part, by inhibitory regulation, and potential mediators of this regulation were identified. Furthermore, the model suggests that impairing this negative regulation will drive a bifurcation which may represent transition into a pathological state such as hair miniaturization. PMID:25375120
Differential Gene Expression in Benign Prostate Epithelium of Men with and without Prostate Cancer: Evidence for a Prostate Cancer Field Effect

PubMed Central

Risk, Michael C; Knudsen, Beatrice S; Coleman, Ilsa; Dumpit, Ruth F; Kristal, Alan R; LeMeur, Nolwenn; Gentleman, Robert C; True, Lawrence D; Nelson, Peter S; Lin, Daniel W

2010-01-01

Background Several malignancies are known to exhibit a “field-effect” whereby regions beyond tumor boundaries harbor histological or molecular changes that are associated with cancer. We sought to determine if histologically benign prostate epithelium collected from men with prostate cancer exhibits features indicative of pre-malignancy or field effect. Methods Prostate needle biopsies from 15 men with high grade(Gleason 8–10) prostate cancer and 15 age- and BMI-matched controls were identified from a biospecimen repository. Benign epithelia from each patient were isolated by laser capture microdissection. RNA was isolated, amplified, and used for microarray hybridization. Quantitative PCR(qPCR) was used to determine the expression of specific genes of interest. Alterations in protein expression were analyzed through immunohistochemistry. Results Overall patterns of gene expression in microdissected benign-associated benign epithelium (BABE) and cancer-associated benign epithelium (CABE) were similar. Two genes previously associated with prostate cancer, PSMA and SSTR1, were significantly upregulated in the CABE group(FDR <1%). Expression of other prostate cancer-associated genes, including ERG, HOXC4, HOXC5 and MME, were also increased in CABE by qRT-PCR, although other genes commonly altered in prostate cancer were not different between the BABE and CABE samples. The expression of MME and PSMA proteins on IHC coincided with their mRNA alterations. Conclusion Gene expression profiles between benign epithelia of patients with and without prostate cancer are very similar. However, these tissues exhibit differences in the expression levels of several genes previously associated with prostate cancer development or progression. These differences may comprise a field effect and represent early events in carcinogenesis. PMID:20935156
The Mhc class II of the Black grouse (Tetrao tetrix) consists of low numbers of B and Y genes with variable diversity and expression.

PubMed

Strand, Tanja; Westerdahl, Helena; Höglund, Jacob; V Alatalo, Rauno; Siitari, Heli

2007-09-01

We found that the Black grouse (Tetrao tetrix) possess low numbers of Mhc class II B (BLB) and Y (YLB) genes with variable diversity and expression. We have therefore shown, for the first time, that another bird species (in this case, a wild lek-breeding galliform) shares several features of the simple Mhc of the domestic chicken (Gallus gallus). The Black grouse BLB genes showed the same level of polymorphism that has been reported in chicken, and we also found indications of balancing selection in the peptide-binding regions. The YLB genes were less variable than the BLB genes, also in accordance with earlier studies in chicken, although their functional significance still remains obscure. We hypothesize that the YLB genes could have been under purifying selection, just as the mammal Mhc-E gene cluster.
Transcriptome analysis of rainbow trout infected with high and low virulence strains of Infectious hematopoietic necrosis virus

USGS Publications Warehouse

Purcell, Maureen K.; Marjara, Inderjit Singh; Batts, William; Kurath, Gael; Hansen, John D.

2010-01-01

There are three main genetic lineages or genogroups of Infectious hematopoietic necrosis virus (IHNV) in N. America. Strains representing the M genogroup are more virulent in rainbow trout relative to the U genogroup. In this study, we used microarray analysis to evaluate potential mechanisms responsible for host-specific virulence in rainbow trout that were given intraperitoneal injections of buffer or a representative M or U type virus strain. Reverse transcriptase quantitative PCR (RT-qPCR) was used to assess viral load and gene expression of select immune genes. Viral load was significantly higher in trout infected with the M virus starting at 24 h post-infection (p.i.) and continuing until 72 h p.i. Microarray analysis of the 48 h time point revealed 153 up-regulated and 248 down-regulated features in response to M virus infection but only 62 up-regulated and 49 down-regulated features following U virus infection. Translation and transcription features were among the most frequent down-regulated features in response to M virus infection and may be associated with the host cell shutoff phenomenon. A greater host cell shutoff response by the M virus may facilitate subversion of the host cell transcriptional machinery and enhance viral replication, suggesting the M virus may be better optimized to manipulate the rainbow trout transcriptional and translational machinery. Anti-viral associated features were the most commonly up-regulated features. A common set of features were up-regulated in both the M and U infection groups, but were induced to a higher magnitude in the M infection group. Gene expression of the anti-viral genes Mx-1 and Vig-1 was correlated but not entirely dependent on viral load in the anterior kidney. Slower replication of the U virus may allow the host more time to induce protective anti-viral immune mechanisms.

GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

PubMed Central

Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H.

2011-01-01

GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. PMID:21998647
Molluscan engrailed expression, serial organization, and shell evolution

NASA Technical Reports Server (NTRS)

Jacobs, D. K.; Wray, C. G.; Wedeen, C. J.; Kostriken, R.; DeSalle, R.; Staton, J. L.; Gates, R. D.; Lindberg, D. R.

2000-01-01

Whether the serial features found in some molluscs are ancestral or derived is considered controversial. Here, in situ hybridization and antibody studies show iterated engrailed-gene expression in transverse rows of ectodermal cells bounding plate field development and spicule formation in the chiton, Lepidochitona cavema, as well as in cells surrounding the valves and in the early development of the shell hinge in the clam, Transennella tantilla. Ectodermal expression of engrailed is associated with skeletogenesis across a range of bilaterian phyla, suggesting a single evolutionary origin of invertebrate skeletons. The shared ancestry of bilaterian-invertebrate skeletons may help explain the sudden appearance of shelly fossils in the Cambrian. Our interpretation departs from the consideration of canonical metameres or segments as units of evolutionary analysis. In this interpretation, the shared ancestry of engrailed-gene function in the terminal/posterior addition of serially repeated elements during development explains the iterative expression of engrailed genes in a range of metazoan body plans.
Pathophysiology of viral-induced exacerbations of COPD

PubMed Central

Alfredo, Potena; Gaetano, Caramori; Paolo, Casolari; Marco, Contoli; Johnston, Sebastian L; Alberto, Papi

2007-01-01

Inflammation of the lower airways is a central feature of chronic obstructive pulmonary disease (COPD). Inflammatory responses are associated with an increased expression of a cascade of proteins including cytokines, chemokines, growth factors, enzymes, adhesion molecules and receptors. In most cases the increased expression of these proteins is the result of enhanced gene transcription: many of these genes are not expressed in normal cells under resting conditions but they are induced in the inflammatory process in a cell-specific manner. Transcription factors regulate the expression of many pro-inflammatory genes and play a key role in the pathogenesis of airway inflammation. Many studies have suggested a role for viral infections as a causative agent of COPD exacerbations. In this review we will focus our attention on the relationship between common respiratory viral infections and the molecular and inflammatory mechanisms that lead to COPD exacerbation. PMID:18268922
Enhancer trap expression patterns provide a novel teaching resource.

PubMed

Geisler, Matt; Jablonska, Barbara; Springer, Patricia S

2002-12-01

A collection of Arabidopsis enhancer trap transposants has been identified for use as a teaching tool. This collection serves to assist students in understanding the patterning and organization of plant tissues and cells, and will be useful in plant anatomy, morphology, and developmental biology courses. Each transposant exhibits reporter gene expression in a specific tissue, cell type, or domain, and these lines collectively offer a glimpse of compartments of gene expression. Some compartments correspond to classical definitions of botanical anatomy and can assist in anatomical identification. Other patterns of reporter gene expression are more complex and do not necessarily correspond to known anatomical features. The sensitivity of the beta-glucuronidase histochemical stain provides the student with a colorful and direct way to visualize difficult aspects of plant development and anatomy, and provides the teacher with an invaluable tool for a practical laboratory session.
Biomarkers identified for prostate cancer patients through genome-scale screening.

PubMed

Wang, Lei-Yun; Cui, Jia-Jia; Zhu, Tao; Shao, Wei-Hua; Zhao, Yi; Wang, Sai; Zhang, Yu-Peng; Wu, Ji-Chu; Zhang, Le

2017-11-03

Prostate cancer is a threat to men and usually occurs in aged males. Though prostate specific antigen level and Gleason score are utilized for evaluation of the prostate cancer in clinic, the biomarkers for this malignancy have not been widely recognized. Furthermore, the outcome varies across individuals receiving comparable treatment regimens and the underlying mechanism is still unclear. We supposed that genetic feature may be responsible for, at least in part, this process and conducted a two-cohort study to compare the genetic difference in tumorous and normal tissues of prostate cancer patients. The Gene Expression Omnibus dataset were used and a total of 41 genes were found significantly differently expressed in tumor tissues as compared with normal prostate tissues. Four genes (SPOCK3, SPON1, PTN and TGFB3) were selected for further evaluation after Gene Ontology analysis, Kyoto Encyclopedia of Genes and Genomes pathway analysis and clinical association analysis. MIR1908 was also found decreased expression level in prostate cancer whose target genes were found expressing in both prostate tumor and normal tissues. These results indicated that these potential biomarkers deserve attention in prostate cancer patients and the underlying mechanism should be further investigated.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Price, Morgan N.; Arkin, Adam P.; Alm, Eric J.

Operons are a major feature of all prokaryotic genomes, but how and why operon structures vary is not well understood. To elucidate the life-cycle of operons, we compared gene order between Escherichia coli K12 and its relatives and identified the recently formed and destroyed operons in E. coli. This allowed us to determine how operons form, how they become closely spaced, and how they die. Our findings suggest that operon evolution is driven by selection on gene expression patterns. First, both operon creation and operon destruction lead to large changes in gene expression patterns. For example, the removal of lysAmore » and ruvA from ancestral operons that contained essential genes allowed their expression to respond to lysine levels and DNA damage, respectively. Second, some operons have undergone accelerated evolution, with multiple new genes being added during a brief period. Third, although most operons are closely spaced because of a neutral bias towards deletion and because of selection against large overlaps, highly expressed operons tend to be widely spaced because of regulatory fine-tuning by intervening sequences. Although operon evolution seems to be adaptive, it need not be optimal: new operons often comprise functionally unrelated genes that were already in proximity before the operon formed.« less
Gene expression profile in cardiovascular disease and preeclampsia: a meta-analysis of the transcriptome based on raw data from human studies deposited in Gene Expression Omnibus.

PubMed

Sitras, V; Fenton, C; Acharya, G

2015-02-01

Cardiovascular disease (CVD) and preeclampsia (PE) share common clinical features. We aimed to identify common transcriptomic signatures involved in CVD and PE in humans. Meta-analysis of individual raw microarray data deposited in GEO, obtained from blood samples of patients with CVD versus controls and placental samples from women with PE versus healthy women with uncomplicated pregnancies. Annotation of cases versus control samples was taken directly from the microarray documentation. Genes that showed a significant differential expression in the majority of experiments were selected for subsequent analysis. Hypergeometric gene list analysis was performed using Bioconductor GOstats package. Bioinformatic analysis was performed in PANTHER. Seven studies in CVD and 5 studies in PE were eligible for meta-analysis. A total of 181 genes were found to be differentially expressed in microarray studies investigating gene expression in blood samples obtained from patients with CVD compared to controls and 925 genes were differentially expressed between preeclamptic and healthy placentas. Among these differentially expressed genes, 22 were common between CVD and PE. Bioinformatic analysis of these genes revealed oxidative stress, p-53 pathway feedback, inflammation mediated by chemokines and cytokines, interleukin signaling, B-cell activation, PDGF signaling, Wnt signaling, integrin signaling and Alzheimer disease pathways to be involved in the pathophysiology of both CVD and PE. Metabolism, development, response to stimulus, immune response and cell communication were the associated biologic processes in both conditions. Gene set enrichment analysis showed the following overlapping pathways between CVD and PE: TGF-β-signaling, apoptosis, graft-versus-host disease, allograft rejection, chemokine signaling, steroid hormone synthesis, type I and II diabetes mellitus, VEGF signaling, pathways in cancer, GNRH signaling, Huntingtons disease and Notch signaling. CVD and PE share same common traits in their gene expression profile indicating common pathways in their pathophysiology. Copyright © 2014 Elsevier Ltd. All rights reserved.
Molecular biological features of male germ cell differentiation

PubMed Central

HIROSE, MIKA; TOKUHIRO, KEIZO; TAINAKA, HITOSHI; MIYAGAWA, YASUSHI; TSUJIMURA, AKIRA; OKUYAMA, AKIHIKO; NISHIMUNE, YOSHITAKE

2007-01-01

Somatic cell differentiation is required throughout the life of a multicellular organism to maintain homeostasis. In contrast, germ cells have only one specific function; to preserve the species by conveying the parental genes to the next generation. Recent studies of the development and molecular biology of the male germ cell have identified many genes, or isoforms, that are specifically expressed in the male germ cell. In the present review, we consider the unique features of male germ cell differentiation. (Reprod Med Biol 2007; 6: 1–9) PMID:29699260
Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies.

PubMed

Martini, Paolo; Risso, Davide; Sales, Gabriele; Romualdi, Chiara; Lanfranchi, Gerolamo; Cagnin, Stefano

2011-04-11

In the last decades, microarray technology has spread, leading to a dramatic increase of publicly available datasets. The first statistical tools developed were focused on the identification of significant differentially expressed genes. Later, researchers moved toward the systematic integration of gene expression profiles with additional biological information, such as chromosomal location, ontological annotations or sequence features. The analysis of gene expression linked to physical location of genes on chromosomes allows the identification of transcriptionally imbalanced regions, while, Gene Set Analysis focuses on the detection of coordinated changes in transcriptional levels among sets of biologically related genes. In this field, meta-analysis offers the possibility to compare different studies, addressing the same biological question to fully exploit public gene expression datasets. We describe STEPath, a method that starts from gene expression profiles and integrates the analysis of imbalanced region as an a priori step before performing gene set analysis. The application of STEPath in individual studies produced gene set scores weighted by chromosomal activation. As a final step, we propose a way to compare these scores across different studies (meta-analysis) on related biological issues. One complication with meta-analysis is batch effects, which occur because molecular measurements are affected by laboratory conditions, reagent lots and personnel differences. Major problems occur when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. We evaluated the power of combining chromosome mapping and gene set enrichment analysis, performing the analysis on a dataset of leukaemia (example of individual study) and on a dataset of skeletal muscle diseases (meta-analysis approach). In leukaemia, we identified the Hox gene set, a gene set closely related to the pathology that other algorithms of gene set analysis do not identify, while the meta-analysis approach on muscular disease discriminates between related pathologies and correlates similar ones from different studies. STEPath is a new method that integrates gene expression profiles, genomic co-expressed regions and the information about the biological function of genes. The usage of the STEPath-computed gene set scores overcomes batch effects in the meta-analysis approaches allowing the direct comparison of different pathologies and different studies on a gene set activation level.
An Overview of Hox Genes in Lophotrochozoa: Evolution and Functionality

PubMed Central

Barucca, Marco; Canapa, Adriana; Biscotti, Maria Assunta

2016-01-01

Hox genes are regulators of animal embryonic development. Changes in the number and sequence of Hox genes as well as in their expression patterns have been related to the evolution of the body plan. Lophotrochozoa is a clade of Protostomia characterized by several phyla which show a wide morphological diversity. Despite that the works summarized in this review emphasize the fragmentary nature of the data available regarding the presence and expression of Hox genes, they also offer interesting insight into the evolution of the Hox cluster and the role played by Hox genes in several phyla. However, the number of genes involved in the cluster of the lophotrochozoan ancestor is still a question of debate. The data presented here suggest that at least nine genes were present while two other genes, Lox4 and Post-2, may either have been present in the ancestor or may have arisen as a result of duplication in the Brachiopoda-Mollusca-Annelida lineage. Spatial and temporal collinearity is a feature of Hox gene expression which was probably present in the ancestor of deuterostomes and protostomes. However, in Lophotrochozoa, it has been detected in only a few species belonging to Annelida and Mollusca. PMID:29615580
Relationships among msx gene structure and function in zebrafish and other vertebrates.

PubMed

Ekker, M; Akimenko, M A; Allende, M L; Smith, R; Drouin, G; Langille, R M; Weinberg, E S; Westerfield, M

1997-10-01

The zebrafish genome contains at least five msx homeobox genes, msxA, msxB, msxC, msxD, and the newly isolated msxE. Although these genes share structural features common to all Msx genes, phylogenetic analyses of protein sequences indicate that the msx genes from zebrafish are not orthologous to the Msx1 and Msx2 genes of mammals, birds, and amphibians. The zebrafish msxB and msxC are more closely related to each other and to the mouse Msx3. Similarly, although the combinatorial expression of the zebrafish msx genes in the embryonic dorsal neuroectoderm, visceral arches, fins, and sensory organs suggests functional similarities with the Msx genes of other vertebrates, differences in the expression patterns preclude precise assignment of orthological relationships. Distinct duplication events may have given rise to the msx genes of modern fish and other vertebrate lineages whereas many aspects of msx gene functions during embryonic development have been preserved.
3D chromosome rendering from Hi-C data using virtual reality

NASA Astrophysics Data System (ADS)

Zhu, Yixin; Selvaraj, Siddarth; Weber, Philip; Fang, Jennifer; Schulze, Jürgen P.; Ren, Bing

2015-01-01

Most genome browsers display DNA linearly, using single-dimensional depictions that are useful to examine certain epigenetic mechanisms such as DNA methylation. However, these representations are insufficient to visualize intrachromosomal interactions and relationships between distal genome features. Relationships between DNA regions may be difficult to decipher or missed entirely if those regions are distant in one dimension but could be spatially proximal when mapped to three-dimensional space. For example, the visualization of enhancers folding over genes is only fully expressed in three-dimensional space. Thus, to accurately understand DNA behavior during gene expression, a means to model chromosomes is essential. Using coordinates generated from Hi-C interaction frequency data, we have created interactive 3D models of whole chromosome structures and its respective domains. We have also rendered information on genomic features such as genes, CTCF binding sites, and enhancers. The goal of this article is to present the procedure, findings, and conclusions of our models and renderings.
An intersectional gene regulatory strategy defines subclass diversity of C. elegans motor neurons.

PubMed

Kratsios, Paschalis; Kerk, Sze Yen; Catela, Catarina; Liang, Joseph; Vidal, Berta; Bayer, Emily A; Feng, Weidong; De La Cruz, Estanisla Daniel; Croci, Laura; Consalez, G Giacomo; Mizumoto, Kota; Hobert, Oliver

2017-07-05

A core principle of nervous system organization is the diversification of neuron classes into subclasses that share large sets of features but differ in select traits. We describe here a molecular mechanism necessary for motor neurons to acquire subclass-specific traits in the nematode Caenorhabditis elegans . Cholinergic motor neuron classes of the ventral nerve cord can be subdivided into subclasses along the anterior-posterior (A-P) axis based on synaptic connectivity patterns and molecular features. The conserved COE-type terminal selector UNC-3 not only controls the expression of traits shared by all members of a neuron class, but is also required for subclass-specific traits expressed along the A-P axis. UNC-3, which is not regionally restricted, requires region-specific cofactors in the form of Hox proteins to co-activate subclass-specific effector genes in post-mitotic motor neurons. This intersectional gene regulatory principle for neuronal subclass diversification may be conserved from nematodes to mice.
Eye development in the four-eyed fish Anableps anableps: cranial and retinal adaptations to simultaneous aerial and aquatic vision.

PubMed

Perez, Louise N; Lorena, Jamily; Costa, Carinne M; Araujo, Maysa S; Frota-Lima, Gabriela N; Matos-Rodrigues, Gabriel E; Martins, Rodrigo A P; Mattox, George M T; Schneider, Patricia N

2017-04-12

The unique eyes of the four-eyed fish Anableps anableps have long intrigued biologists. Key features associated with the bulging eye of Anableps include the expanded frontal bone and the duplicated pupils and cornea. Furthermore, the Anableps retina expresses different photoreceptor genes in dorsal and ventral regions, potentially associated with distinct aerial and aquatic stimuli. To gain insight into the developmental basis of the Anableps unique eye, we examined neurocranium and eye ontogeny, as well as photoreceptor gene expression during larval stages. First, we described six larval stages during which duplication of eye structures occurs. Our osteological analysis of neurocranium ontogeny revealed another distinctive Anablepid feature: an ossified interorbital septum partially separating the orbital cavities. Furthermore, we identified the onset of differences in cell proliferation and cell layer density between dorsal and ventral regions of the retina. Finally, we show that differential photoreceptor gene expression in the retina initiates during development, suggesting that it is inherited and not environmentally determined. In sum, our results shed light on the ontogenetic steps leading to the highly derived Anableps eye. © 2017 The Author(s).
Repression of Virus-Induced Interferon A Promoters by Homeodomain Transcription Factor Ptx1

PubMed Central

Lopez, Sébastien; Island, Marie-Laure; Drouin, Jacques; Bandu, Marie-Thérese; Christeff, Nicolas; Darracq, Nicole; Barbey, Régine; Doly, Janine; Thomas, Dominique; Navarro, Sébastien

2000-01-01

Interferon A (IFN-A) genes are differentially expressed after virus induction. The differential expression of individual IFN-A genes is modulated by substitutions in the proximal positive virus responsive element A (VRE-A) of their promoters and by the presence or absence of a distal negative regulatory element (DNRE). The functional feature of the DNRE is to specifically act by repression of VRE-A activity. With the use of the yeast one-hybrid system, we describe here the identification of a specific DNRE-binding protein, the pituitary homeobox 1 (Ptx1 or Pitx1). Ptx1 is detectable in different cell types that differentially express IFN-A genes, and the endogenous Ptx1 protein binds specifically to the DNRE. Upon virus induction, Ptx1 negatively regulates the transcription of DNRE-containing IFN-A promoters, and the C-terminal region, as well as the homeodomain of the Ptx1 protein, is required for this repression. After virus induction, the expression of the Ptx1 antisense RNA leads to a significant increase of endogenous IFN-A gene transcription and is able to modify the pattern of differential expression of individual IFN-A genes. These studies suggest that Ptx1 contributes to the differential transcriptional strength of the promoters of different IFN-A genes and that these genes may provide new targets for transcriptional regulation by a homeodomain transcription factor. PMID:11003649
Tabu search and binary particle swarm optimization for feature selection using microarray data.

PubMed

Chuang, Li-Yeh; Yang, Cheng-Huei; Yang, Cheng-Hong

2009-12-01

Gene expression profiles have great potential as a medical diagnosis tool because they represent the state of a cell at the molecular level. In the classification of cancer type research, available training datasets generally have a fairly small sample size compared to the number of genes involved. This fact poses an unprecedented challenge to some classification methodologies due to training data limitations. Therefore, a good selection method for genes relevant for sample classification is needed to improve the predictive accuracy, and to avoid incomprehensibility due to the large number of genes investigated. In this article, we propose to combine tabu search (TS) and binary particle swarm optimization (BPSO) for feature selection. BPSO acts as a local optimizer each time the TS has been run for a single generation. The K-nearest neighbor method with leave-one-out cross-validation and support vector machine with one-versus-rest serve as evaluators of the TS and BPSO. The proposed method is applied and compared to the 11 classification problems taken from the literature. Experimental results show that our method simplifies features effectively and either obtains higher classification accuracy or uses fewer features compared to other feature selection methods.
Mapping the Shh long-range regulatory domain

PubMed Central

Anderson, Eve; Devenney, Paul S.; Hill, Robert E.; Lettice, Laura A.

2014-01-01

Coordinated gene expression controlled by long-distance enhancers is orchestrated by DNA regulatory sequences involving transcription factors and layers of control mechanisms. The Shh gene and well-established regulators are an example of genomic composition in which enhancers reside in a large desert extending into neighbouring genes to control the spatiotemporal pattern of expression. Exploiting the local hopping activity of the Sleeping Beauty transposon, the lacZ reporter gene was dispersed throughout the Shh region to systematically map the genomic features responsible for expression activity. We found that enhancer activities are retained inside a genomic region that corresponds to the topological associated domain (TAD) defined by Hi-C. This domain of approximately 900 kb is in an open conformation over its length and is generally susceptible to all Shh enhancers. Similar to the distal enhancers, an enhancer residing within the Shh second intron activates the reporter gene located at distances of hundreds of kilobases away, suggesting that both proximal and distal enhancers have the capacity to survey the Shh topological domain to recognise potential promoters. The widely expressed Rnf32 gene lying within the Shh domain evades enhancer activities by a process that may be common among other housekeeping genes that reside in large regulatory domains. Finally, the boundaries of the Shh TAD do not represent the absolute expression limits of enhancer activity, as expression activity is lost stepwise at a number of genomic positions at the verges of these domains. PMID:25252942
Radial chromatin positioning is shaped by local gene density, not by gene expression

PubMed Central

2009-01-01

G- and R-bands of metaphase chromosomes are characterized by profound differences in gene density, CG content, replication timing, and chromatin compaction. The preferential localization of gene-dense, transcriptionally active, and early replicating chromatin in the nuclear interior and of gene-poor, later replicating chromatin at the nuclear envelope has been demonstrated to be evolutionary-conserved in various cell types. Yet, the impact of different local chromatin features on the radial nuclear arrangement of chromatin is still not well understood. In particular, it is not known whether radial chromatin positioning is preferentially shaped by local gene density per se or by other related parameters such as replication timing or transcriptional activity. The interdependence of these distinct chromatin features on the linear deoxyribonucleic acid (DNA) sequence precludes a simple dissection of these parameters with respect to their importance for the reorganization of the linear DNA organization into the distinct radial chromatin arrangements observed in the nuclear space. To analyze this problem, we generated probe sets of pooled bacterial artificial chromosome (BAC) clones from HSA 11, 12, 18, and 19 representing R/G-band-assigned chromatin, segments with different gene density and gene loci with different expression levels. Using multicolor 3D flourescent in situ hybridization (FISH) and 3D image analysis, we determined their localization in the nucleus and their positions within or outside the corresponding chromosome territory (CT). For each BAC data on local gene density within 2- and 10-Mb windows, as well as GC (guanine and cytosine) content, replication timing and expression levels were determined. A correlation analysis of these parameters with nuclear positioning revealed regional gene density as the decisive parameter determining the radial positioning of chromatin in the nucleus in contrast to band assignment, replication timing, and transcriptional activity. We demonstrate a polarized distribution of gene-dense vs gene-poor chromatin within CTs with respect to the nuclear border. Whereas we confirm previous reports that a particular gene-dense and transcriptionally highly active region of about 2 Mb on 11p15.5 often loops out from the territory surface, gene-dense and highly expressed sequences were not generally found preferentially at the CT surface as previously suggested. PMID:17333233
Insulated hsp70B' promoter: stringent heat-inducible activity in replication-deficient, but not replication-competent adenoviruses.

PubMed

Rohmer, Stanimira; Mainka, Astrid; Knippertz, Ilka; Hesse, Andrea; Nettelbeck, Dirk M

2008-04-01

Key to the realization of gene therapy is the development of efficient and targeted gene transfer vectors. Therapeutic gene transfer by replication-deficient or more recently by conditionally replication-competent/oncolytic adenoviruses has shown much promise. For specific applications, however, it will be advantageous to provide vectors that allow for external control of gene expression. The efficient cellular heat shock system in combination with available technology for focused and controlled hyperthermia suggests heat-regulated transcription control as a promising tool for this purpose. We investigated the feasibility of a short fragment of the human hsp70B' promoter, with and without upstream insulator elements, for the regulation of transgene expression by replication-deficient or oncolytic adenoviruses. Two novel adenoviral vectors with an insulated hsp70B' promoter were developed and showed stringent heat-inducible gene expression with induction ratios up to 8000-fold. In contrast, regulation of gene expression from the hsp70B' promoter without insulation was suboptimal. In replication-competent/oncolytic adenoviruses regulation of the hsp70B' promoter was lost specifically during late replication in permissive cells and could not be restored by the insulators. We developed novel adenovirus gene transfer vectors that feature improved and stringent regulation of transgene expression from the hsp70B' promoter using promoter insulation. These vectors have potential for gene therapy applications that benefit from external modulation of therapeutic gene expression or for combination therapy with hyperthermia. Furthermore, our study reveals that vector replication can deregulate inserted cellular promoters, an observation which is of relevance for the development of replication-competent/oncolytic gene transfer vectors. (c) 2008 John Wiley & Sons, Ltd.
Expression profiling of chickpea genes differentially regulated during a resistance response to Ascochyta rabiei.

PubMed

Coram, Tristan E; Pang, Edwin C K

2006-11-01

Using microarray technology and a set of chickpea (Cicer arietinum L.) unigenes, grasspea (Lathyrus sativus L.) expressed sequence tags (ESTs) and lentil (Lens culinaris Med.) resistance gene analogues, the ascochyta blight (Ascochyta rabiei (Pass.) L.) resistance response was studied in four chickpea genotypes, including resistant, moderately resistant, susceptible and wild relative (Cicer echinospermum L.) genotypes. The experimental system minimized environmental effects and was conducted in reference design, in which samples from mock-inoculated controls acted as reference against post-inoculation samples. Robust data quality was achieved through the use of three biological replicates (including a dye swap), the inclusion of negative controls and strict selection criteria for differentially expressed genes, including a fold change cut-off determined by self-self hybridizations, Student's t-test and multiple testing correction (P < 0.05). Microarray observations were also validated by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). The time course expression patterns of 756 microarray features resulted in the differential expression of 97 genes in at least one genotype at one time point. k-means clustering grouped the genes into clusters of similar observations for each genotype, and comparisons between A. rabiei-resistant and A. rabiei-susceptible genotypes revealed potential gene 'signatures' predictive of effective A. rabiei resistance. These genes included several pathogenesis-related proteins, SNAKIN2 antimicrobial peptide, proline-rich protein, disease resistance response protein DRRG49-C, environmental stress-inducible protein, leucine-zipper protein, polymorphic antigen membrane protein, Ca-binding protein and several unknown proteins. The potential involvement of these genes and their pathways of induction are discussed. This study represents the first large-scale gene expression profiling in chickpea, and future work will focus on the functional validation of the genes of interest.

Mouse and human BAC transgenes recapitulate tissue-specific expression of the vitamin D receptor in mice and rescue the VDR-null phenotype.

PubMed

Lee, Seong Min; Bishop, Kathleen A; Goellner, Joseph J; O'Brien, Charles A; Pike, J Wesley

2014-06-01

The biological actions of 1,25-dihydroxyvitamin D3 (1,25(OH)2D3) are mediated by the vitamin D receptor (VDR), which is expressed in numerous target tissues in a cell type-selective manner. Recent studies using genomic analyses and recombineered bacterial artificial chromosomes (BACs) have defined the specific features of mouse and human VDR gene loci in vitro. In the current study, we introduced recombineered mouse and human VDR BACs as transgenes into mice and explored their expression capabilities in vivo. Individual transgenic mouse strains selectively expressed BAC-derived mouse or human VDR proteins in appropriate vitamin D target tissues, thereby recapitulating the tissue-specific expression of endogenous mouse VDR. The mouse VDR transgene was also regulated by 1,25(OH)2D3 and dibutyryl-cAMP. When crossed into a VDR-null mouse background, both transgenes restored wild-type basal as well as 1,25(OH)2D3-inducible gene expression patterns in the appropriate tissues. This maneuver resulted in the complete rescue of the aberrant phenotype noted in the VDR-null mouse, including systemic features associated with altered calcium and phosphorus homeostasis and disrupted production of parathyroid hormone and fibroblast growth factor 23, and abnormalities associated with the skeleton, kidney, parathyroid gland, and the skin. This study suggests that both mouse and human VDR transgenes are capable of recapitulating basal and regulated expression of the VDR in the appropriate mouse tissues and restore 1,25(OH)2D3 function. These results provide a baseline for further dissection of mechanisms integral to mouse and human VDR gene expression and offer the potential to explore the consequence of selective mutations in VDR proteins in vivo.
Relationship of the Interaction Between Two Quantitative Trait Loci with γ-Globin Expression in β-Thalassemia Intermedia Patients.

PubMed

NickAria, Shiva; Haghpanah, Sezaneh; Ramzi, Mani; Karimi, Mehran

2018-05-10

Globin switching is a significant factor on blood hemoglobin (Hb) level but its molecular mechanisms have not yet been identified, however, several quantitative trait loci (QTL) and polymorphisms involved regions on chromosomes 2p, 6q, 8q and X account for variation in the γ-globin expression level. We studied the effect of interaction between a region on intron six of the TOX gene, chromosome 8q (chr8q) and XmnI locus on the γ-globin promoter, chr11p on γ-globin expression in 150 β-thalassemia intermedia (β-TI) patients, evaluated by statistical interaction analysis. Our results showed a significant interaction between one QTL on intron six of the TOX gene (rs9693712) and XmnI locus that effect γ-globin expression. Interchromosomal interaction mediates through transcriptional machanisms to preserve true genome architectural features, chromosomes localization and DNA bending. This interaction can be a part of the unknown molecular mechanism of globin switching and regulation of gene expression.
Viral Vectors for Gene Delivery to the Central Nervous System

PubMed Central

Lentz, Thomas B.; Gray, Steven J.; Samulski, R. Jude

2011-01-01

The potential benefits of gene therapy for neurological diseases such as Parkinson’s, Amyotrophic Lateral Sclerosis (ALS), Epilepsy, and Alzheimer’s are enormous. Even a delay in the onset of severe symptoms would be invaluable to patients suffering from these and other diseases. Significant effort has been placed in developing vectors capable of delivering therapeutic genes to the CNS in order to treat neurological disorders. At the forefront of potential vectors, viral systems have evolved to efficiently deliver their genetic material to a cell. The biology of different viruses offers unique solutions to the challenges of gene therapy, such as cell targeting, transgene expression and vector production. It is important to consider the natural biology of a vector when deciding whether it will be the most effective for a specific therapeutic function. In this review, we outline desired features of the ideal vector for gene delivery to the CNS and discuss how well available viral vectors compare to this model. Adeno-associated virus, retrovirus, adenovirus and herpesvirus vectors are covered. Focus is placed on features of the natural biology that have made these viruses effective tools for gene delivery with emphasis on their application in the CNS. Our goal is to provide insight into features of the optimal vector and which viral vectors can provide these features. PMID:22001604
Structure and expression of dna methyltransferase genes from apomictic and sexual Boechera species.

PubMed

Taşkin, Kemal Melik; Özbilen, Aslıhan; Sezer, Fatih; Hürkan, Kaan; Güneş, Şebnem

2017-04-01

In this study, we determined the structure of DNA methyltransferase (DNMT) genes in apomict and sexual Boechera species and investigated the expression levels during seed development. Protein and DNA sequences of diploid sexual Boechera stricta DNMT genes obtained from Phytozome 10.3 were used to identify the homologues in apomicts, Boechera holboellii and Boechera divaricarpa. Geneious R8 software was used to map the short-paired reads library of B. holboellii whole genome or B. divaricarpa transcriptome reads to the reference gene sequences. We determined three DNMT genes; for Boechera spp. METHYLTRANSFERASE1 (MET1), CHROMOMETHYLASE 3 (CMT3) and DOMAINS REARRANGED METHYLTRANSFERASE 1/2 (DRM2). We examined the structure of these genes with bioinformatic tools and compared with other DNMT genes in plants. We also examined the levels of expression in silique tissues after fertilization by semi-quantitative PCR. The structure of DNMT proteins in apomict and sexual Boechera species share common features. However, the expression levels of DNMT genes were different in apomict and sexual Boechera species. We found that DRM2 was upregulated in apomictic Boechera species after fertilization. Phylogenetic trees showed that three genes are conserved among green algae, monocotyledons and dicotyledons. Our results indicated a deregulation of DNA methylation machinery during seed development in apomicts. Copyright © 2016 Elsevier Ltd. All rights reserved.
Expression analysis of two gene subfamilies encoding the plasma membrane H+-ATPase in Nicotiana plumbaginifolia reveals the major transport functions of this enzyme.

PubMed

Moriau, L; Michelet, B; Bogaerts, P; Lambert, L; Michel, A; Oufattole, M; Boutry, M

1999-07-01

The plasma membrane H+-ATPase couples ATP hydrolysis to proton transport, thereby establishing the driving force for solute transport across the plasma membrane. In Nicotiana plumbaginifolia, this enzyme is encoded by at least nine pma (plasma membrane H+-ATPase) genes. Four of these are classified into two gene subfamilies, pma1-2-3 and pma4, which are the most highly expressed in plant species. We have isolated genomic clones for pma2 and pma4. Mapping of their transcript 5' end revealed the presence of a long leader that contained small open reading frames, regulatory features typical of other pma genes. The gusA reporter gene was then used to determine the expression of pma2, pma3 and pma4 in N. tabacum. These data, together with those obtained previously for pma1, led to the following conclusions. (i) The four pma-gusA genes were all expressed in root, stem, leaf and flower organs, but each in a cell-type specific manner. Expression in these organs was confirmed at the protein level, using subfamily-specific antibodies. (ii) pma4-gusA was expressed in many cell types and notably in root hair and epidermis, in companion cells, and in guard cells, indicating that in N. plumbaginifolia the same H+-ATPase isoform might be involved in mineral nutrition, phloem loading and control of stomata aperture. (iii) The second gene subfamily is composed, in N. plumbaginifolia, of a single gene (pma4) with a wide expression pattern and, in Arabidopsis thaliana, of three genes (aha1, aha2, aha3), at least two of them having a more restrictive expression pattern. (iv) Some cell types expressed pma2 and pma4 at the same time, which encode H+-ATPases with different enzymatic properties.
Molecular differences between mature and immature dental pulp cells: Bioinformatics and preliminary results.

PubMed

Chen, Long; Jiang, Yifeng; Du, Zhen

2018-04-01

Although previous studies have demonstrated that dental pulp stem cells (DPSCs) from mature and immature teeth exhibit potential for multi-directional differentiation, the molecular and biological difference between the DPSCs from mature and immature permanent teeth has not been fully investigated. In the present study, 500 differentially expressed genes from dental pulp cells (DPCs) in mature and immature permanent teeth were obtained from the Gene Expression Omnibus online database. Based on bioinformatics analysis using the Database for Annotation, Visualization and Integrated Discovery, these genes were divided into a number of subgroups associated with immunity, inflammation and cell signaling. The results of the present study suggest that immune features, response to infection and cell signaling may be different in DPCs from mature and immature permanent teeth; furthermore, DPCs from immature permanent teeth may be more suitable for use in tissue engineering or stem cell therapy. The Online Mendelian Inheritance in Man database stated that Sonic Hedgehog (SHH), a differentially expressed gene in DPCs from mature and immature permanent teeth, serves a crucial role in the development of craniofacial tissues, including teeth, which further confirmed that SHH may cause DPCs from mature and immature permanent teeth to exhibit different biological characteristics. The Search Tool for the Retrieval of Interacting Genes/Proteins database revealed that SHH has functional protein associations with a number of other proteins, including Glioma-associated oncogene (GLI)1, GLI2, growth arrest-specific protein 1, bone morphogenetic protein (BMP)2 and BMP4, in mice and humans. It was also demonstrated that SHH may interact with other genes to regulate the biological characteristics of DPCs. The results of the present study may provide a useful reference basis for selecting suitable DPSCs and molecules for the treatment of these cells to optimize features for tissue engineering or stem cell therapy. Quantitative polymerase chain reaction should be performed to confirm the differential expression of these genes prior to the beginning of a functional study.
Genome-wide identification and expression profiling of serine proteases and homologs in the diamondback moth, Plutella xylostella (L.).

PubMed

Lin, Hailan; Xia, Xiaofeng; Yu, Liying; Vasseur, Liette; Gurr, Geoff M; Yao, Fengluan; Yang, Guang; You, Minsheng

2015-12-10

Serine proteases (SPs) are crucial proteolytic enzymes responsible for digestion and other processes including signal transduction and immune responses in insects. Serine protease homologs (SPHs) lack catalytic activity but are involved in innate immunity. This study presents a genome-wide investigation of SPs and SPHs in the diamondback moth, Plutella xylostella (L.), a globally-distributed destructive pest of cruciferous crops. A total of 120 putative SPs and 101 putative SPHs were identified in the P. xylostella genome by bioinformatics analysis. Based on the features of trypsin, 38 SPs were putatively designated as trypsin genes. The distribution, transcription orientation, exon-intron structure and sequence alignments suggested that the majority of trypsin genes evolved from tandem duplications. Among the 221 SP/SPH genes, ten SP and three SPH genes with one or more clip domains were predicted and designated as PxCLIPs. Phylogenetic analysis of CLIPs in P. xylostella, two other Lepidoptera species (Bombyx mori and Manduca sexta), and two more distantly related insects (Drosophila melanogaster and Apis mellifera) showed that seven of the 13 PxCLIPs were clustered with homologs of the Lepidoptera rather than other species. Expression profiling of the P. xylostella SP and SPH genes in different developmental stages and tissues showed diverse expression patterns, suggesting high functional diversity with roles in digestion and development. This is the first genome-wide investigation on the SP and SPH genes in P. xylostella. The characterized features and profiled expression patterns of the P. xylostella SPs and SPHs suggest their involvement in digestion, development and immunity of this species. Our findings provide a foundation for further research on the functions of this gene family in P. xylostella, and a better understanding of its capacity to rapidly adapt to a wide range of environmental variables including host plants and insecticides.
Adult-onset Alexander disease, associated with a mutation in an alternative GFAP transcript, may be phenotypically modulated by a non-neutral HDAC6 variant.

PubMed

Melchionda, Laura; Fang, Mingyan; Wang, Hairong; Fugnanesi, Valeria; Morbin, Michela; Liu, Xuanzhu; Li, Wenyan; Ceccherini, Isabella; Farina, Laura; Savoiardo, Mario; D'Adamo, Pio; Zhang, Jianguo; Costa, Alfredo; Ravaglia, Sabrina; Ghezzi, Daniele; Zeviani, Massimo

2013-05-01

We studied a family including two half-siblings, sharing the same mother, affected by slowly progressive, adult-onset neurological syndromes. In spite of the diversity of the clinical features, characterized by a mild movement disorder with cognitive impairment in the elder patient, and severe motor-neuron disease (MND) in her half-brother, the brain Magnetic Resonance Imaging (MRI) features were compatible with adult-onset Alexander's disease (AOAD), suggesting different expression of the same, genetically determined, condition. Since mutations in the alpha isoform of glial fibrillary acidic protein, GFAP-α, the only cause so far known of AOAD, were excluded, we applied exome Next Generation Sequencing (NGS) to identify gene variants, which were then functionally validated by molecular characterization of recombinant and patient-derived cells. Exome-NGS revealed a mutation in a previously neglected GFAP isoform, GFAP-ϵ, which disrupts the GFAP-associated filamentous cytoskeletal meshwork of astrocytoma cells. To shed light on the different clinical features in the two patients, we sought for variants in other genes. The male patient had a mutation, absent in his half-sister, in X-linked histone deacetylase 6, a candidate MND susceptibility gene. Exome-NGS is an unbiased approach that not only helps identify new disease genes, but may also contribute to elucidate phenotypic expression.
Comparison of Leaf Sheath Transcriptome Profiles with Physiological Traits of Bread Wheat Cultivars under Salinity Stress

PubMed Central

Trittermann, Christine; Berger, Bettina; Roy, Stuart J.; Seki, Motoaki; Shinozaki, Kazuo; Tester, Mark

2015-01-01

Salinity stress has significant negative effects on plant biomass production and crop yield. Salinity tolerance is controlled by complex systems of gene expression and ion transport. The relationship between specific features of mild salinity stress adaptation and gene expression was analyzed using four commercial varieties of bread wheat (Triticum aestivum) that have different levels of salinity tolerance. The high-throughput phenotyping system in The Plant Accelerator at the Australian Plant Phenomics Facility revealed variation in shoot relative growth rate and salinity tolerance among the four cultivars. Comparative analysis of gene expression in the leaf sheaths identified genes whose functions are potentially linked to shoot biomass development and salinity tolerance. Early responses to mild salinity stress through changes in gene expression have an influence on the acquisition of stress tolerance and improvement in biomass accumulation during the early “osmotic” phase of salinity stress. In addition, results revealed transcript profiles for the wheat cultivars that were different from those of usual stress-inducible genes, but were related to those of plant growth. These findings suggest that, in the process of breeding, selection of specific traits with various salinity stress-inducible genes in commercial bread wheat has led to adaptation to mild salinity conditions. PMID:26244554
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries

PubMed Central

Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P

2008-01-01

Background Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. Results We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. Conclusion EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects. PMID:18402700
EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries.

PubMed

Smith, Robin P; Buchser, William J; Lemmon, Marcus B; Pardinas, Jose R; Bixby, John L; Lemmon, Vance P

2008-04-10

Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. We have developed "EST Express", a suite of analytical tools that identify and annotate ESTs originating from specific mRNA populations. The software consists of a user-friendly GUI powered by PHP and MySQL that allows for online collaboration between researchers and continuity with UniGene, Entrez Gene and RefSeq. Two key features of the software include a novel, simplified Entrez Gene parser and tools to manage cDNA library sequencing projects. We have tested the software on a large data set (2,016 samples) produced by subtractive hybridization. EST Express is an open-source, cross-platform web server application that imports sequences from cDNA libraries, such as those generated through subtractive hybridization or yeast two-hybrid screens. It then provides several layers of annotation based on Entrez Gene and RefSeq to allow the user to highlight useful genes and manage cDNA library projects.
Identification of differentially expressed genes in flax (Linum usitatissimum L.) under saline-alkaline stress by digital gene expression.

PubMed

Yu, Ying; Huang, Wengong; Chen, Hongyu; Wu, Guangwen; Yuan, Hongmei; Song, Xixia; Kang, Qinghua; Zhao, Dongsheng; Jiang, Weidong; Liu, Yan; Wu, Jianzhong; Cheng, Lili; Yao, Yubo; Guan, Fengzhi

2014-10-01

The salinization and alkalization of soil are widespread environmental problems, and alkaline salt stress is more destructive than neutral salt stress. Therefore, understanding the mechanism of plant tolerance to saline-alkaline stress has become a major challenge. However, little attention has been paid to the mechanism of plant alkaline salt tolerance. In this study, gene expression profiling of flax was analyzed under alkaline-salt stress (AS2), neutral salt stress (NSS) and alkaline stress (AS) by digital gene expression. Three-week-old flax seedlings were placed in 25 mM Na2CO3 (pH11.6) (AS2), 50mM NaCl (NSS) and NaOH (pH11.6) (AS) for 18 h. There were 7736, 1566 and 454 differentially expressed genes in AS2, NSS and AS compared to CK, respectively. The GO category gene enrichment analysis revealed that photosynthesis was particularly affected in AS2, carbohydrate metabolism was particularly affected in NSS, and the response to biotic stimulus was particularly affected in AS. We also analyzed the expression pattern of five categories of genes including transcription factors, signaling transduction proteins, phytohormones, reactive oxygen species proteins and transporters under these three stresses. Some key regulatory gene families involved in abiotic stress, such as WRKY, MAPKKK, ABA, PrxR and ion channels, were differentially expressed. Compared with NSS and AS, AS2 triggered more differentially expressed genes and special pathways, indicating that the mechanism of AS2 was more complex than NSS and AS. To the best of our knowledge, this was the first transcriptome analysis of flax in response to saline-alkaline stress. These data indicate that common and diverse features of saline-alkaline stress provide novel insights into the molecular mechanisms of plant saline-alkaline tolerance and offer a number of candidate genes as potential markers of tolerance to saline-alkaline stress. Copyright © 2014 Elsevier B.V. All rights reserved.
Systems biology definition of the core proteome of metabolism and expression is consistent with high-throughput data.

PubMed

Yang, Laurence; Tan, Justin; O'Brien, Edward J; Monk, Jonathan M; Kim, Donghyuk; Li, Howard J; Charusanti, Pep; Ebrahim, Ali; Lloyd, Colton J; Yurkovich, James T; Du, Bin; Dräger, Andreas; Thomas, Alex; Sun, Yuekai; Saunders, Michael A; Palsson, Bernhard O

2015-08-25

Finding the minimal set of gene functions needed to sustain life is of both fundamental and practical importance. Minimal gene lists have been proposed by using comparative genomics-based core proteome definitions. A definition of a core proteome that is supported by empirical data, is understood at the systems-level, and provides a basis for computing essential cell functions is lacking. Here, we use a systems biology-based genome-scale model of metabolism and expression to define a functional core proteome consisting of 356 gene products, accounting for 44% of the Escherichia coli proteome by mass based on proteomics data. This systems biology core proteome includes 212 genes not found in previous comparative genomics-based core proteome definitions, accounts for 65% of known essential genes in E. coli, and has 78% gene function overlap with minimal genomes (Buchnera aphidicola and Mycoplasma genitalium). Based on transcriptomics data across environmental and genetic backgrounds, the systems biology core proteome is significantly enriched in nondifferentially expressed genes and depleted in differentially expressed genes. Compared with the noncore, core gene expression levels are also similar across genetic backgrounds (two times higher Spearman rank correlation) and exhibit significantly more complex transcriptional and posttranscriptional regulatory features (40% more transcription start sites per gene, 22% longer 5'UTR). Thus, genome-scale systems biology approaches rigorously identify a functional core proteome needed to support growth. This framework, validated by using high-throughput datasets, facilitates a mechanistic understanding of systems-level core proteome function through in silico models; it de facto defines a paleome.
VitisExpDB: a database resource for grape functional genomics.

PubMed

Doddapaneni, Harshavardhan; Lin, Hong; Walker, M Andrew; Yao, Jiqiang; Civerolo, Edwin L

2008-02-28

The family Vitaceae consists of many different grape species that grow in a range of climatic conditions. In the past few years, several studies have generated functional genomic information on different Vitis species and cultivars, including the European grape vine, Vitis vinifera. Our goal is to develop a comprehensive web data source for Vitaceae. VitisExpDB is an online MySQL-PHP driven relational database that houses annotated EST and gene expression data for V. vinifera and non-vinifera grape species and varieties. Currently, the database stores approximately 320,000 EST sequences derived from 8 species/hybrids, their annotation (BLAST top match) details and Gene Ontology based structured vocabulary. Putative homologs for each EST in other species and varieties along with information on their percent nucleotide identities, phylogenetic relationship and common primers can be retrieved. The database also includes information on probe sequence and annotation features of the high density 60-mer gene expression chip consisting of approximately 20,000 non-redundant set of ESTs. Finally, the database includes 14 processed global microarray expression profile sets. Data from 12 of these expression profile sets have been mapped onto metabolic pathways. A user-friendly web interface with multiple search indices and extensively hyperlinked result features that permit efficient data retrieval has been developed. Several online bioinformatics tools that interact with the database along with other sequence analysis tools have been added. In addition, users can submit their ESTs to the database. The developed database provides genomic resource to grape community for functional analysis of genes in the collection and for the grape genome annotation and gene function identification. The VitisExpDB database is available through our website http://cropdisease.ars.usda.gov/vitis_at/main-page.htm.
VitisExpDB: A database resource for grape functional genomics

PubMed Central

Doddapaneni, Harshavardhan; Lin, Hong; Walker, M Andrew; Yao, Jiqiang; Civerolo, Edwin L

2008-01-01

Background The family Vitaceae consists of many different grape species that grow in a range of climatic conditions. In the past few years, several studies have generated functional genomic information on different Vitis species and cultivars, including the European grape vine, Vitis vinifera. Our goal is to develop a comprehensive web data source for Vitaceae. Description VitisExpDB is an online MySQL-PHP driven relational database that houses annotated EST and gene expression data for V. vinifera and non-vinifera grape species and varieties. Currently, the database stores ~320,000 EST sequences derived from 8 species/hybrids, their annotation (BLAST top match) details and Gene Ontology based structured vocabulary. Putative homologs for each EST in other species and varieties along with information on their percent nucleotide identities, phylogenetic relationship and common primers can be retrieved. The database also includes information on probe sequence and annotation features of the high density 60-mer gene expression chip consisting of ~20,000 non-redundant set of ESTs. Finally, the database includes 14 processed global microarray expression profile sets. Data from 12 of these expression profile sets have been mapped onto metabolic pathways. A user-friendly web interface with multiple search indices and extensively hyperlinked result features that permit efficient data retrieval has been developed. Several online bioinformatics tools that interact with the database along with other sequence analysis tools have been added. In addition, users can submit their ESTs to the database. Conclusion The developed database provides genomic resource to grape community for functional analysis of genes in the collection and for the grape genome annotation and gene function identification. The VitisExpDB database is available through our website . PMID:18307813
Comparison of transcriptomic responses to pancreas disease (PD) and heart and skeletal muscle inflammation (HSMI) in heart of Atlantic salmon (Salmo salar L).

PubMed

Johansen, Lill-Heidi; Thim, Hanna L; Jørgensen, Sven Martin; Afanasyev, Sergey; Strandskog, Guro; Taksdal, Torunn; Fremmerlid, Kjersti; McLoughlin, Marion; Jørgensen, Jorunn B; Krasnov, Aleksei

2015-10-01

Pancreas disease (PD) and heart and skeletal muscle inflammation (HSMI) are viral diseases associated with SAV (salmonid alphavirus) and PRV (piscine reovirus), which induce systemic infections and pathologies in cardiac and skeletal muscle tissue of farmed Atlantic salmon (Salmo salar L), resulting in severe morbidity and mortality. While general features of the clinical symptoms and pathogenesis of salmonid viral diseases are relatively well studied, much less is known about molecular mechanisms associated with immunity and disease-specific changes. In this study, transcriptomic analyses of heart tissue from PD and HSMI challenged Atlantic salmon were done, focusing on the mature phases of both diseases at respectively 28-35 and 42-77 days post infection. A large number of immune genes was activated in both trials with prevalence of genes associated with early innate antiviral responses, their expression levels being slightly higher in PD challenged fish. Activation of the IFN axis was in parallel with inflammatory changes that involved diverse humoral and cellular factors. Adaptive immune response genes were more pronounced in fish with HSMI, as suggested by increased expression of a large number of genes associated with differentiation and maturation of B lymphocytes and cytotoxic T cells. A similar down-regulation of non-immune genes such as myofiber and mitochondrial proteins between diseases was most likely reflecting myocardial pathology. A suite of genes important for cardiac function including B-type natriuretic peptide and four neuropeptides displayed differential expression between PD and HSMI. Comparison of results revealed common and distinct features and added to the understanding of both diseases at their mature phases with typical clinical pictures. A number of genes that showed disease-specific changes can be of interest for diagnostics. Copyright © 2015 Elsevier Ltd. All rights reserved.
Msn2 Coordinates a Stoichiometric Gene Expression Program

PubMed Central

Stewart-Ornstein, Jacob; Nelson, Christopher; DeRisi, Joe; Weissman, Jonathan S.; El-Samad, Hana

2014-01-01

Summary Background Many cellular processes operate in an “analog” regime in which the magnitude of the response is precisely tailored to the intensity of the stimulus. In order to maintain the coherence of such responses, the cell must provide for proportional expression of multiple target genes across a wide dynamic range of induction states. Our understanding of the strategies used to achieve graded gene regulation is limited. Results In this work, we document a relationship between stress responsive gene expression and the transcription factor Msn2 that is graded over a large range of Msn2 cocnentrations. We use computational modeling, in vivo, and in vitro analysis to dissect the roots of this relationship. Our studies reveal a simple and general strategy based on non-cooperative low-affinity interactions between Msn2 and its cognate binding sites, as well as competition over a large number of Msn2 binding sites in the genome relative to the number of Msn2 molecules. Conclusions In addition to enabling precise tuning of gene expression to the state of the environment, this strategy ensures co-linear activation of target genes, allowing for stoichiometric expression of large groups of genes without extensive promoter tuning. Furthermore, such a strategy enables precise modulation of the activity of any given promoter by addition of binding sites without altering the qualitative relationship between different genes in a regulon. This feature renders a given regulon highly ‘evolvable’. PMID:24210615
Identification of an intact ParaHox cluster with temporal colinearity but altered spatial colinearity in the hemichordate Ptychodera flava

PubMed Central

2013-01-01

Background ParaHox and Hox genes are thought to have evolved from a common ancestral ProtoHox cluster or from tandem duplication prior to the divergence of cnidarians and bilaterians. Similar to Hox clusters, chordate ParaHox genes including Gsx, Xlox, and Cdx, are clustered and their expression exhibits temporal and spatial colinearity. In non-chordate animals, however, studies on the genomic organization of ParaHox genes are limited to only a few animal taxa. Hemichordates, such as the Enteropneust acorn worms, have been used to gain insights into the origins of chordate characters. In this study, we investigated the genomic organization and expression of ParaHox genes in the indirect developing hemichordate acorn worm Ptychodera flava. Results We found that P. flava contains an intact ParaHox cluster with a similar arrangement to that of chordates. The temporal expression order of the P. flava ParaHox genes is the same as that of the chordate ParaHox genes. During embryogenesis, the spatial expression pattern of PfCdx in the posterior endoderm represents a conserved feature similar to the expression of its orthologs in other animals. On the other hand, PfXlox and PfGsx show a novel expression pattern in the blastopore. Nevertheless, during metamorphosis, PfXlox and PfCdx are expressed in the endoderm in a spatially staggered pattern similar to the situation in chordates. Conclusions Our study shows that P. flava ParaHox genes, despite forming an intact cluster, exhibit temporal colinearity but lose spatial colinearity during embryogenesis. During metamorphosis, partial spatial colinearity is retained in the transforming larva. These results strongly suggest that intact ParaHox gene clustering was retained in the deuterostome ancestor and is correlated with temporal colinearity. PMID:23802544
Divergent evolution of arrested development in the dauer stage of Caenorhabditis elegans and the infective stage of Heterodera glycines

PubMed Central

Elling, Axel A; Mitreva, Makedonka; Recknor, Justin; Gai, Xiaowu; Martin, John; Maier, Thomas R; McDermott, Jeffrey P; Hewezi, Tarek; McK Bird, David; Davis, Eric L; Hussey, Richard S; Nettleton, Dan; McCarter, James P; Baum, Thomas J

2007-01-01

Background The soybean cyst nematode Heterodera glycines is the most important parasite in soybean production worldwide. A comprehensive analysis of large-scale gene expression changes throughout the development of plant-parasitic nematodes has been lacking to date. Results We report an extensive genomic analysis of H. glycines, beginning with the generation of 20,100 expressed sequence tags (ESTs). In-depth analysis of these ESTs plus approximately 1,900 previously published sequences predicted 6,860 unique H. glycines genes and allowed a classification by function using InterProScan. Expression profiling of all 6,860 genes throughout the H. glycines life cycle was undertaken using the Affymetrix Soybean Genome Array GeneChip. Our data sets and results represent a comprehensive resource for molecular studies of H. glycines. Demonstrating the power of this resource, we were able to address whether arrested development in the Caenorhabditis elegans dauer larva and the H. glycines infective second-stage juvenile (J2) exhibits shared gene expression profiles. We determined that the gene expression profiles associated with the C. elegans dauer pathway are not uniformly conserved in H. glycines and that the expression profiles of genes for metabolic enzymes of C. elegans dauer larvae and H. glycines infective J2 are dissimilar. Conclusion Our results indicate that hallmark gene expression patterns and metabolism features are not shared in the developmentally arrested life stages of C. elegans and H. glycines, suggesting that developmental arrest in these two nematode species has undergone more divergent evolution than previously thought and pointing to the need for detailed genomic analyses of individual parasite species. PMID:17919324
DDC and COBL, flanking the imprinted GRB10 gene on 7p12, are biallelically expressed.

PubMed

Hitchins, Megan P; Bentley, Louise; Monk, David; Beechey, Colin; Peters, Jo; Kelsey, Gavin; Ishino, Fumitoshi; Preece, Michael A; Stanier, Philip; Moore, Gudrun E

2002-12-01

Maternal duplication of human 7p11.2-p13 has been associated with Silver-Russell syndrome (SRS) in two familial cases. GRB10 is the only imprinted gene identified within this region to date. GRB10 demonstrates an intricate tissue- and isoform-specific imprinting profile in humans, with paternal expression in fetal brain and maternal expression of one isoform in skeletal muscle. The mouse homolog is maternally transcribed. The GRB10 protein is a potent growth inhibitor and represents a candidate for SRS, which is characterized by pre- and postnatal growth retardation and a spectrum of additional dysmorphic features. Since imprinted genes tend to be grouped in clusters, we investigated the imprinting status of the dopa-decarboxylase gene (DDC) and the Cordon-bleu gene (COBL) which flank GRB10 within the 7p11.2-p13 SRS duplicated region. Although both genes were found to replicate asynchronously, suggestive of imprinting, SNP expression analyses showed that neither gene was imprinted in multiple human fetal tissues. The mouse homologues, Ddc and Cobl, which map to the homologous imprinted region on proximal Chr 11, were also biallelically expressed in mice with uniparental maternal or paternal inheritance of this region. With the intent of using mouse Grb10 as an imprinted control, biallelic expression was consistently observed in fetal, postnatal, and adult brain of these mice, in contrast to the maternal-specific transcription previously demonstrated in brain in inter-specific F1 progeny. This may be a further example of over-expression of maternally derived transcripts in inter-specific mouse crosses. GRB10 remains the only imprinted gene identified within 7p11.2-p13.

Low-grade and high-grade mammary carcinomas in WAP-T transgenic mice are independent entities distinguished by Met expression.

PubMed

Otto, Benjamin; Gruner, Katharina; Heinlein, Christina; Wegwitz, Florian; Nollau, Peter; Ylstra, Bauke; Pantel, Klaus; Schumacher, Udo; Baumbusch, Lars O; Martin-Subero, José Ignacio; Siebert, Reiner; Wagener, Christoph; Streichert, Thomas; Deppert, Wolfgang; Tolstonog, Genrich V

2013-03-15

Mammary carcinomas developing in SV40 transgenic WAP-T mice arise in two distinct histological phenotypes: as differentiated low-grade and undifferentiated high-grade tumors. We integrated different types of information such as histological grading, analysis of aCGH-based gene copy number and gene expression profiling to provide a comprehensive molecular description of mammary tumors in WAP-T mice. Applying a novel procedure for the correlation of gene copy number with gene expression on a global scale, we observed in tumor samples a global coherence between genotype and transcription. This coherence can be interpreted as a matched transcriptional regulation inherited from the cells of tumor origin and determined by the activity of cancer driver genes. Despite common recurrent genomic aberrations, e.g. gain of chr. 15 in most WAP-T tumors, loss of chr. 19 frequently occurs only in low-grade tumors. These tumors show features of "basal-like" epithelial differentiation, particularly expression of keratin 14. The high-grade tumors are clearly separated from the low-grade tumors by strong expression of the Met gene and by coexpression of epithelial (e.g. keratin 18) and mesenchymal (e.g. vimentin) markers. In high-grade tumors, the expression of the nonmutated Met protein is associated with Met-locus amplification and Met activity. The role of Met as a cancer driver gene is supported by the contribution of active Met signaling to motility and growth of mammary tumor-derived cells. Finally, we discuss the independent origin of low- and high-grade tumors from distinct cells of tumor origin, possibly luminal progenitors, distinguished by Met gene expression and Met signaling. Copyright © 2012 UICC.
Identification and Characterization of a Cis Antisense RNA of the rpoH Gene of Salmonella enterica Serovar Typhi.

PubMed

Xiong, Changyan; Li, Xuejiao; Liu, Juanli; Zhao, Xin; Xu, Shungao; Huang, Xinxiang

2018-01-01

Antisense RNAs from complementary strands of protein coding genes regulate the expression of genes involved in many cellular processes. Using deep sequencing analysis of the Salmonella enterica serovar Typhi ( S. Typhi) transcriptome, a novel antisense RNA encoded on the strand complementary to the rpoH gene was revealed. In this study, the molecular features of this antisense RNA were assessed using northern blotting and rapid amplification of cDNA ends. The 3,508 nt sequence of RNA was identified as the antisense RNA of the rpoH gene and was named ArpH. ArpH was found to attenuate the invasion of HeLa cells by S. Typhi by regulating the expression of SPI-1 genes. In an rpoH mutant strain, the invasive capacity of S. Typhi was increased, whereas overexpression of ArpH positively regulates rpoH mRNA levels. Results of this study suggest that the cis -encoded antisense RNA ArpH is likely to affect the invasive capacity of S. Typhi by regulating the expression of rpoH .
F-MAP: A Bayesian approach to infer the gene regulatory network using external hints

PubMed Central

Shahdoust, Maryam; Mahjub, Hossein; Sadeghi, Mehdi

2017-01-01

The Common topological features of related species gene regulatory networks suggest reconstruction of the network of one species by using the further information from gene expressions profile of related species. We present an algorithm to reconstruct the gene regulatory network named; F-MAP, which applies the knowledge about gene interactions from related species. Our algorithm sets a Bayesian framework to estimate the precision matrix of one species microarray gene expressions dataset to infer the Gaussian Graphical model of the network. The conjugate Wishart prior is used and the information from related species is applied to estimate the hyperparameters of the prior distribution by using the factor analysis. Applying the proposed algorithm on six related species of drosophila shows that the precision of reconstructed networks is improved considerably compared to the precision of networks constructed by other Bayesian approaches. PMID:28938012
Estimation of Dynamic Systems for Gene Regulatory Networks from Dependent Time-Course Data.

PubMed

Kim, Yoonji; Kim, Jaejik

2018-06-15

Dynamic system consisting of ordinary differential equations (ODEs) is a well-known tool for describing dynamic nature of gene regulatory networks (GRNs), and the dynamic features of GRNs are usually captured through time-course gene expression data. Owing to high-throughput technologies, time-course gene expression data have complex structures such as heteroscedasticity, correlations between genes, and time dependence. Since gene experiments typically yield highly noisy data with small sample size, for a more accurate prediction of the dynamics, the complex structures should be taken into account in ODE models. Hence, this study proposes an ODE model considering such data structures and a fast and stable estimation method for the ODE parameters based on the generalized profiling approach with data smoothing techniques. The proposed method also provides statistical inference for the ODE estimator and it is applied to a zebrafish retina cell network.
Genetics Home Reference: mevalonate kinase deficiency

MedlinePlus

... cell maturation (differentiation), formation of the cell's structural framework (the cytoskeleton), gene activity (expression), and protein production ... Group. Long-term follow-up, clinical features, and quality of life in a series of 103 patients ...
Identification of Methylated Genes Associated with Aggressive Clinicopathological Features in Mantle Cell Lymphoma

PubMed Central

Hernández, Luis; Navarro, Alba; Beà, Sílvia; Pinyol, Magda; López-Guillermo, Armando; Rosenwald, Andreas; Ott, German; Campo, Elías; Jares, Pedro

2011-01-01

Background Mantle cell lymphoma (MCL) is genetically characterized by the t(11;14)(q13;q32) translocation and a high number of secondary chromosomal alterations. The contribution of DNA methylation to MCL lymphomagenesis is not well known. We sought to identify epigenetically silenced genes in these tumours that might have clinical relevance. Methodology/Principal Findings To identify potential methylated genes in MCL we initially investigated seven MCL cell lines treated with epigenetic drugs and gene expression microarray profiling. The methylation status of selected candidate genes was validated by a quantitative assay and subsequently analyzed in a series of primary MCL (n = 38). After pharmacological reversion we identified 252 potentially methylated genes. The methylation analysis of a subset of these genes (n = 25) in the MCL cell lines and normal B lymphocytes confirmed that 80% of them were methylated in the cell lines but not in normal lymphocytes. The subsequent analysis in primary MCL identified five genes (SOX9, HOXA9, AHR, NR2F2, and ROBO1) frequently methylated in these tumours. The gene methylation events tended to occur in the same primary neoplasms and correlated with higher proliferation, increased number of chromosomal abnormalities, and shorter survival of the patients. Conclusions We have identified a set of genes whose methylation degree and gene expression levels correlate with aggressive clinicopathological features of MCL. Our findings also suggest that a subset of MCL might show a CpG island methylator phenotype (CIMP) that may influence the behaviour of the tumours. PMID:21603610
Ars insulator identified in sea urchin possesses an activity to ensure the transgene expression in mouse cells.

PubMed

Tajima, Shoji; Shinohara, Keiko; Fukumoto, Maiko; Zaitsu, Reiko; Miyagawa, Junichi; Hino, Shinjiro; Fan, Jun; Akasaka, Koji; Matsuoka, Masao

2006-04-01

Sea urchin arylsulfatase (Ars) gene locus has features of an insulator, i.e., blocking of enhancer and promoter interaction, and protection of a transgene against positional effects [Akasaka et al. (1999) Cell. Mol. Biol. 45, 555-565]. To examine the effect of Ars insulator on long-term expression of a transgene, the insulator was inserted into LTR of retrovirus vector harboring hrGFP gene as a reporter, and then introduced into mouse myoblast cells. The isolated clones transduced with the reporter gene with or without Ars insulator were cultured for more than 20 wk in the absence of a selection reagent, and the expression of hrGFP was periodically determined. Expression of hrGFP in four clones transduced with the reporter gene without Ars insulator was completely silenced after 20 wk of culture. On the other hand, hrGFP was expressed in all clones with Ars insulator inserted in one of the two different orientations. Histone H3 deacetylation and DNA methylation of the 5'LTR promoter region, signs for heterochromatin and silencing, were suppressed in the clones that were expressing hrGFP. Ars insulator is effective in maintaining a transgene in mouse cells in an orientation-dependent manner, and will be a useful tool to ensure stable expression of a transgene.
Comprehensive Identification of Sexual Dimorphism-Associated Differentially Expressed Genes in Two-Way Factorial Designed RNA-Seq Data on Japanese Quail (Coturnix coturnix japonica)

PubMed Central

Rodriguez-Zas, Sandra; Oh, Jae-Don; Han, Jae Yong; Lee, Kichoon; Park, Tae Sub; Shin, Sangsu; Jiao Jiao, Zhang; Ghosh, Mrinmoy; Jeong, Dong Kee; Cho, Seoae; Kim, Heebal; Song, Ki-Duk; Lee, Hak-Kyo

2015-01-01

Japanese quail (Coturnix coturnix japonica) reach sexual maturity earlier, breed rapidly and successfully, and cost less and require less space than other birds raised commercially. Given the value of this species for food production and experimental use, more studies are necessary to determine chromosomal regions and genes associated with gender and breed-differentiation. This study employed Trinity and edgeR for transcriptome analysis of next-generation RNA-seq data, which included 4 tissues obtained from 3 different breeding lines of Japanese quail (random bred control, heavy weight, low weight). Differentially expressed genes shared between female and male tissue contrast groups were analyzed to identify genes related to sexual dimorphism as well as potential novel candidate genes for molecular sexing. Several of the genes identified in the present study as significant sex-related genes have been previously found in avian gene expression analyses (NIPBL, UBAP2), and other genes found differentially expressed in this study and not previously associated with sex-related differences may be considered potential candidates for molecular sexing (TERA, MYP0, PPR17, CASQ2). Additionally, other genes likely associated with neuronal and brain development (CHKA, NYAP), as well as body development and size differentiation (ANKRD26, GRP87) in quail were identified. Expression of homeobox protein regulating genes (HXC4, ISL1) shared between our two sex-related contrast groups (Female Brain vs. Male Brain and Ovary vs. Testis) indicates that these genes may regulate sex-specific anatomical development. Results reveal genetic features of the quail breed and could allow for more effective molecular sexing as well as selective breeding for traits important in commercial production. PMID:26418419
A peculiar lamin in a peculiar mammal: Expression of lamin LIII in platypus (Ornithorhynchus anatinus).

PubMed

Peter, Annette; Khandekar, Shaunak; Deakin, Janine E; Stick, Reimer

2015-11-01

Platypus (Ornithorhynchus anatinus) holds a unique phylogenetic position at the base of the mammalian lineage due to an amalgamation of mammalian and sauropsid-like features. Here we describe the set of four lamin genes for platypus. Lamins are major components of the nuclear lamina, which constitutes a main component of the nucleoskeleton and is involved in a wide range of nuclear functions. Vertebrate evolution was accompanied by an increase in the number of lamin genes from a single gene in their closest relatives, the tunicates and cephalochordates, to four genes in the vertebrate lineage. Of the four genes the LIII gene is characterized by the presence of two alternatively spliced CaaX-encoding exons. In amphibians and fish LIII is the major lamin protein in oocytes and early embryos. The LIII gene is conserved throughout the vertebrate lineage, with the notable exception of marsupials and placental mammals, which have lost the LIII gene. Here we show that platypus has retained an LIII gene, albeit with a significantly altered structure and with a radically different expression pattern. The platypus LIII gene contains only a single CaaX-encoding exon and the head domain together with coil 1a and part of coil1b of the platypus LIII protein is replaced by a novel short non-helical N-terminus. It is expressed exclusively in the testis. These features resemble those of male germ cell-specific lamins in placental mammals, in particular those of lamin C2. Our data suggest (i) that the specific functions of LIII, which it fulfills in all other vertebrates, is no longer required in mammals and (ii) once it had been freed from these functions has undergone structural alterations and has adopted a new functionality in monotremes. Copyright © 2015 Elsevier GmbH. All rights reserved.
Impact of epidermal growth factor receptor protein and gene alteration on Taiwanese hepatocellular carcinomas.

PubMed

Su, Yu-Hung; Ng, Kwai-Fong; Yu, Ming-Chin; Wu, Ting-Jung; Yeh, Ta-Sen; Lee, Wei-Chen; Lin, Yong-Shiang; Hsieh, Tsung-Han; Lin, Chun-Yen; Yeh, Chau-Ting; Chen, Tse-Ching

2015-09-01

Epidermal growth factor receptor (EGFR) overexpression is associated with disease progression and poor survival in a variety of solid tumors. The role of EGFR in hepatocellular carcinoma (HCC) remains controversial. One hundred thirty-eight HCCs were analyzed for total EGFR (t-EGFR) and phospho-EGFR (p-EGFR) expression and gene amplification using immunohistochemistry and fluorescence in situ hybridization. The role of EGFR was analyzed in relation to the clinicopathological features. Weak to strong p-EGFR immunostaining was noted in 42 of the 138 HCCs. p-EGFR expression correlated with alcoholism (P = 0.03) and chronic hepatitis B infection (P = 0.041). There was no correlation between t-EGFR expression and any of the clinicopathological features. Amplification of the EGFR gene was not identified in the 138 HCCs, but 39.1% of the HCCs showed balanced polysomy of both the EGFR gene and centromere 7. Moreover, 65 tumors showed > 2.2 copies per tumor cell. EGFR copy number gain (CNG) was significantly correlated with gender (P = 0.0491), tumor grade (P = 0.006), and vascular invasion (P = 0.005). HCCs with EGFR CNG also had a poor recurrence-free survival (RFS), as compared with HCCs without EGFR CNG (P = 0.031). When exploring the impact of gender, a significant association of EGFR CNG was found with tumor grade (P = 0.044) and cirrhosis (P = 0.015) exclusively in the male group only; however, the OS and RFS analysis show no significant difference between male and female groups. EGFR CNG was related to crucial clinicopathological features and early recurrence, indicating that EGFR CNG might be a poor prognosis factor for Taiwanese HCC. © 2015 Journal of Gastroenterology and Hepatology Foundation and Wiley Publishing Asia Pty Ltd.
Severe Gardner syndrome in families with mutations restricted to a specific region of the APC gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Davies, D.R.; Armstrong, J.G.; Thakker, N.

Familial adenomatous polyposis (FAP) is associated with a number of extraintestinal manifestations, which include osteomas, epidermoid cysts, and desmoid tumors, often referred to as {open_quotes}Gardner syndrome.{close_quotes} Recent studies have suggested that some of the phenotypic features of FAP are dependent on the position of the mutation within the APC gene. In particular, the correlation between congenital hypertrophy of the retinal pigment epithelium (CHRPE) and APC genotype indicates that affected families may be divided into distinct groups. We have investigated the association between the dento-osseous features of GS on dental panoramic radiographs (DPRs) and APC genotype in a regional cohort ofmore » FAP families. DPRs were performed on 84 affected individuals from 36 families, and the dento-osseous features of FAP were quantified by a weighted scoring system. Significant DPR abnormalities were present in 69% of affected individuals. The APC gene mutation was identified in 27 of these families, and for statistical analysis these were subdivided into three groups. Group 1 comprised 18 affected individuals from seven families with mutations 5{prime} of exon 9; these families (except one) did not express CHRPE. Groups 2 comprised 38 individuals from 16 families with mutations between exon 9 and codon 1444, all of whom expressed CHRPE. Group 3 comprised 11 individuals from four families with mutations 3{prime} of codon 1444, none of whom expressed CHRPE. Families with mutations 3{prime} of codon 1444 had significantly more lesions on DPRs (P < .001) and appeared to have a higher incidence of desmoid tumors. These results suggest that severity of some of the features of Gardner syndrome may correlate with genotype in FAP. 32 refs., 2 figs., 2 tabs.« less
3' Untranslated regions mediate transcriptional interference between convergent genes both locally and ectopically in Saccharomyces cerevisiae.

PubMed

Wang, Luwen; Jiang, Ning; Wang, Lin; Fang, Ou; Leach, Lindsey J; Hu, Xiaohua; Luo, Zewei

2014-01-01

Paired sense and antisense (S/AS) genes located in cis represent a structural feature common to the genomes of both prokaryotes and eukaryotes, and produce partially complementary transcripts. We used published genome and transcriptome sequence data and found that over 20% of genes (645 pairs) in the budding yeast Saccharomyces cerevisiae genome are arranged in convergent pairs with overlapping 3'-UTRs. Using published microarray transcriptome data from the standard laboratory strain of S. cerevisiae, our analysis revealed that expression levels of convergent pairs are significantly negatively correlated across a broad range of environments. This implies an important role for convergent genes in the regulation of gene expression, which may compensate for the absence of RNA-dependent mechanisms such as micro RNAs in budding yeast. We selected four representative convergent gene pairs and used expression assays in wild type yeast and its genetically modified strains to explore the underlying patterns of gene expression. Results showed that convergent genes are reciprocally regulated in yeast populations and in single cells, whereby an increase in expression of one gene produces a decrease in the expression of the other, and vice-versa. Time course analysis of the cell cycle illustrated the functional significance of this relationship for the three pairs with relevant functional roles. Furthermore, a series of genetic modifications revealed that the 3'-UTR sequence plays an essential causal role in mediating transcriptional interference, which requires neither the sequence of the open reading frame nor the translation of fully functional proteins. More importantly, transcriptional interference persisted even when one of the convergent genes was expressed ectopically (in trans) and therefore does not depend on the cis arrangement of convergent genes; we conclude that the mechanism of transcriptional interference cannot be explained by the transcriptional collision model, which postulates a clash between simultaneous transcriptional processes occurring on opposite DNA strands.
3′ Untranslated Regions Mediate Transcriptional Interference between Convergent Genes Both Locally and Ectopically in Saccharomyces cerevisiae

PubMed Central

Wang, Luwen; Jiang, Ning; Wang, Lin; Fang, Ou; Leach, Lindsey J.; Hu, Xiaohua; Luo, Zewei

2014-01-01

Paired sense and antisense (S/AS) genes located in cis represent a structural feature common to the genomes of both prokaryotes and eukaryotes, and produce partially complementary transcripts. We used published genome and transcriptome sequence data and found that over 20% of genes (645 pairs) in the budding yeast Saccharomyces cerevisiae genome are arranged in convergent pairs with overlapping 3′-UTRs. Using published microarray transcriptome data from the standard laboratory strain of S. cerevisiae, our analysis revealed that expression levels of convergent pairs are significantly negatively correlated across a broad range of environments. This implies an important role for convergent genes in the regulation of gene expression, which may compensate for the absence of RNA-dependent mechanisms such as micro RNAs in budding yeast. We selected four representative convergent gene pairs and used expression assays in wild type yeast and its genetically modified strains to explore the underlying patterns of gene expression. Results showed that convergent genes are reciprocally regulated in yeast populations and in single cells, whereby an increase in expression of one gene produces a decrease in the expression of the other, and vice-versa. Time course analysis of the cell cycle illustrated the functional significance of this relationship for the three pairs with relevant functional roles. Furthermore, a series of genetic modifications revealed that the 3′-UTR sequence plays an essential causal role in mediating transcriptional interference, which requires neither the sequence of the open reading frame nor the translation of fully functional proteins. More importantly, transcriptional interference persisted even when one of the convergent genes was expressed ectopically (in trans) and therefore does not depend on the cis arrangement of convergent genes; we conclude that the mechanism of transcriptional interference cannot be explained by the transcriptional collision model, which postulates a clash between simultaneous transcriptional processes occurring on opposite DNA strands. PMID:24465217
Unstable genomes elevate transcriptome dynamics

PubMed Central

Stevens, Joshua B.; Liu, Guo; Abdallah, Batoul Y.; Horne, Steven D.; Ye, Karen J.; Bremer, Steven W.; Ye, Christine J.; Krawetz, Stephen A.; Heng, Henry H.

2015-01-01

The challenge of identifying common expression signatures in cancer is well known, however the reason behind this is largely unclear. Traditionally variation in expression signatures has been attributed to technological problems, however recent evidence suggests that chromosome instability (CIN) and resultant karyotypic heterogeneity may be a large contributing factor. Using a well-defined model of immortalization, we systematically compared the pattern of genome alteration and expression dynamics during somatic evolution. Co-measurement of global gene expression and karyotypic alteration throughout the immortalization process reveals that karyotype changes influence gene expression as major structural and numerical karyotypic alterations result in large gene expression deviation. Replicate samples from stages with stable genomes are more similar to each other than are replicate samples with karyotypic heterogeneity. Karyotypic and gene expression change during immortalization is dynamic as each stage of progression has a unique expression pattern. This was further verified by comparing global expression in two replicates grown in one flask with known karyotypes. Replicates with higher karyotypic instability were found to be less similar than replicates with stable karyotypes. This data illustrates the karyotype, transcriptome, and transcriptome determined pathways are in constant flux during somatic cellular evolution (particularly during the macroevolutionary phase) and this flux is an inextricable feature of CIN and essential for cancer formation. The findings presented here underscore the importance of understanding the evolutionary process of cancer in order to design improved treatment modalities. PMID:24122714
Increased Transcript Complexity in Genes Associated with Chronic Obstructive Pulmonary Disease

PubMed Central

Lackey, Lela; McArthur, Evonne; Laederach, Alain

2015-01-01

Genome-wide association studies aim to correlate genotype with phenotype. Many common diseases including Type II diabetes, Alzheimer’s, Parkinson’s and Chronic Obstructive Pulmonary Disease (COPD) are complex genetic traits with hundreds of different loci that are associated with varied disease risk. Identifying common features in the genes associated with each disease remains a challenge. Furthermore, the role of post-transcriptional regulation, and in particular alternative splicing, is still poorly understood in most multigenic diseases. We therefore compiled comprehensive lists of genes associated with Type II diabetes, Alzheimer’s, Parkinson’s and COPD in an attempt to identify common features of their corresponding mRNA transcripts within each gene set. The SERPINA1 gene is a well-recognized genetic risk factor of COPD and it produces 11 transcript variants, which is exceptional for a human gene. This led us to hypothesize that other genes associated with COPD, and complex disorders in general, are highly transcriptionally diverse. We found that COPD-associated genes have a statistically significant enrichment in transcript complexity stemming from a disproportionately high level of alternative splicing, however, Type II Diabetes, Alzheimer’s and Parkinson’s disease genes were not significantly enriched. We also identified a subset of transcriptionally complex COPD-associated genes (~40%) that are differentially expressed between mild, moderate and severe COPD. Although the genes associated with other lung diseases are not extensively documented, we found preliminary data that idiopathic pulmonary disease genes, but not cystic fibrosis modulators, are also more transcriptionally complex. Interestingly, complex COPD transcripts are more often the product of alternative acceptor site usage. To verify the biological importance of these alternative transcripts, we used RNA-sequencing analyses to determine that COPD-associated genes are frequently expressed in lung and liver tissues and are regulated in a tissue-specific manner. Additionally, many complex COPD-associated genes are spliced differently between COPD and non-COPD patients. Our analysis therefore suggests that post-transcriptional regulation, particularly alternative splicing, is an important feature specific to COPD disease etiology that warrants further investigation. PMID:26480348
Early diagnostic role of PSA combined miR-155 detection in prostate cancer.

PubMed

Guo, T; Wang, X-X; Fu, H; Tang, Y-C; Meng, B-Q; Chen, C-H

2018-03-01

As a kind of malignant tumor in the male genitourinary system, prostate cancer exhibits significantly increased occurrence. Prostate-specific antigen (PSA) expression can be seen in the prostate cancer, prostatitis, and other diseases, therefore, lack of diagnostic specificity. The miR-155 expression is abnormally increased in the tumors. Therefore, this study aims to explore the clinical significance of PSA combined miR-155 detection in the early diagnosis of prostate cancer. A total of 86 patients diagnosed with prostate cancer were enrolled in this study. PSA and miR-155 gene expression in tumor tissue were detected by using Real-time PCR. The serum levels of PSA were measured by using enzyme-linked immunosorbent assay (ELISA). The correlation of PSA and miR-155 expression with age, body mass index (BMI), tumor volume, tumor-node-metastasis (TNM) stage, lymph node metastasis (LNM), and other clinicopathological features were analyzed, respectively. Serum PSA expression and PSA gene in tumor tissue were significantly higher compared to that in adjacent tissues (p<0.05). PSA gene and protein increased significantly with the clinical stage of TNM and decreased following the increase of grade (p<0.05). The miR-155 level was significantly elevated in the tumor tissue compared with para-carcinoma tissue (p<0.05). PSA and miR-155 expressions were positively correlated with TNM stage, tumor volume, and LNM, and negatively correlated with grade (p<0.05). PSA and miR-155 were closely related to the clinicopathological features of prostate cancer. Combined detection is helpful for the early diagnosis of prostate cancer.
Genome-wide analysis of the basic leucine zipper (bZIP) transcription factor gene family in six legume genomes.

PubMed

Wang, Zhihui; Cheng, Ke; Wan, Liyun; Yan, Liying; Jiang, Huifang; Liu, Shengyi; Lei, Yong; Liao, Boshou

2015-12-10

Plant bZIP proteins characteristically harbor a highly conserved bZIP domain with two structural features: a DNA-binding basic region and a leucine (Leu) zipper dimerization region. They have been shown to be diverse transcriptional regulators, playing crucial roles in plant development, physiological processes, and biotic/abiotic stress responses. Despite the availability of six completely sequenced legume genomes, a comprehensive investigation of bZIP family members in legumes has yet to be presented. In this study, we identified 428 bZIP genes encoding 585 distinct proteins in six legumes, Glycine max, Medicago truncatula, Phaseolus vulgaris, Cicer arietinum, Cajanus cajan, and Lotus japonicus. The legume bZIP genes were categorized into 11 groups according to their phylogenetic relationships with genes from Arabidopsis. Four kinds of intron patterns (a-d) within the basic and hinge regions were defined and additional conserved motifs were identified, both presenting high group specificity and supporting the group classification. We predicted the DNA-binding patterns and the dimerization properties, based on the characteristic features in the basic and hinge regions and the Leu zipper, respectively, which indicated that some highly conserved amino acid residues existed across each major group. The chromosome distribution and analysis for WGD-derived duplicated blocks revealed that the legume bZIP genes have expanded mainly by segmental duplication rather than tandem duplication. Expression data further revealed that the legume bZIP genes were expressed constitutively or in an organ-specific, development-dependent manner playing roles in multiple seed developmental stages and tissues. We also detected several key legume bZIP genes involved in drought- and salt-responses by comparing fold changes of expression values in drought-stressed or salt-stressed roots and leaves. In summary, this genome-wide identification, characterization and expression analysis of legume bZIP genes provides valuable information for understanding the molecular functions and evolution of the legume bZIP transcription factor family, and highlights potential legume bZIP genes involved in regulating tissue development and abiotic stress responses.
Predictive model for inflammation grades of chronic hepatitis B: Large-scale analysis of clinical parameters and gene expressions.

PubMed

Zhou, Weichen; Ma, Yanyun; Zhang, Jun; Hu, Jingyi; Zhang, Menghan; Wang, Yi; Li, Yi; Wu, Lijun; Pan, Yida; Zhang, Yitong; Zhang, Xiaonan; Zhang, Xinxin; Zhang, Zhanqing; Zhang, Jiming; Li, Hai; Lu, Lungen; Jin, Li; Wang, Jiucun; Yuan, Zhenghong; Liu, Jie

2017-11-01

Liver biopsy is the gold standard to assess pathological features (eg inflammation grades) for hepatitis B virus-infected patients although it is invasive and traumatic; meanwhile, several gene profiles of chronic hepatitis B (CHB) have been separately described in relatively small hepatitis B virus (HBV)-infected samples. We aimed to analyse correlations among inflammation grades, gene expressions and clinical parameters (serum alanine amino transaminase, aspartate amino transaminase and HBV-DNA) in large-scale CHB samples and to predict inflammation grades by using clinical parameters and/or gene expressions. We analysed gene expressions with three clinical parameters in 122 CHB samples by an improved regression model. Principal component analysis and machine-learning methods including Random Forest, K-nearest neighbour and support vector machine were used for analysis and further diagnosis models. Six normal samples were conducted to validate the predictive model. Significant genes related to clinical parameters were found enriching in the immune system, interferon-stimulated, regulation of cytokine production, anti-apoptosis, and etc. A panel of these genes with clinical parameters can effectively predict binary classifications of inflammation grade (area under the ROC curve [AUC]: 0.88, 95% confidence interval [CI]: 0.77-0.93), validated by normal samples. A panel with only clinical parameters was also valuable (AUC: 0.78, 95% CI: 0.65-0.86), indicating that liquid biopsy method for detecting the pathology of CHB is possible. This is the first study to systematically elucidate the relationships among gene expressions, clinical parameters and pathological inflammation grades in CHB, and to build models predicting inflammation grades by gene expressions and/or clinical parameters as well. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Segmentation gene expression patterns in Bactrocera dorsalis and related insects: regulation and shape of blastoderm and larval cuticle.

PubMed

Suksuwan, Worramin; Cai, Xiaoli; Ngernsiri, Lertluk; Baumgartner, Stefan

2017-01-01

The oriental fruit fly, Bactrocera dorsalis, is regarded as a severe pest of fruit production in Asia. Despite its economic importance, only limited information regarding the molecular and developmental biology of this insect is known to date. We provide a detailed analysis of B. dorsalis embryology, as well as the expression patterns of a number of segmentation genes known to act during patterning of Drosophila and compare these to the patterns of other insect families. An anterior shift of the expression of gap genes was detected when compared to Drosophila. This shift was largely restored during the step where the gap genes control expression of the pair-rule genes. We analyzed and compared the shapes of the embryos of insects of different families, B. dorsalis and the blow fly Lucilia sericata with that of the well-characterized Drosophila melanogaster. We found distinct shapes as well as differences in the ratios of the length of the anterior-posterior axis and the dorsal-ventral axis. These features were integrated into a profile of how the expression patterns of the gap gene Krüppel and the pair-rule gene even-skipped were observed along the A-P axis in three insects families. Since significant differences were observed, we discuss how Krüppel controls the even-skipped stripes. Furthermore, we discuss how the position and angles of the segmentation gene stripes differed from other insects. Finally, we analyzed the outcome of the expression patterns of the late acting segment polarity genes in relation to the anlagen of the naked-cuticle and denticle belt area of the B. dorsalis larva.
Genetic neuropathology of obsessive psychiatric syndromes

PubMed Central

Jaffe, A E; Deep-Soboslay, A; Tao, R; Hauptman, D T; Kaye, W H; Arango, V; Weinberger, D R; Hyde, T M; Kleinman, J E

2014-01-01

Anorexia nervosa (AN), bulimia nervosa (BN) and obsessive-compulsive disorder (OCD) are complex psychiatric disorders with shared obsessive features, thought to arise from the interaction of multiple genes of small effect with environmental factors. Potential candidate genes for AN, BN and OCD have been identified through clinical association and neuroimaging studies; however, recent genome-wide association studies of eating disorders (ED) so far have failed to report significant findings. In addition, few, if any, studies have interrogated postmortem brain tissue for evidence of expression quantitative trait loci (eQTLs) associated with candidate genes, which has particular promise as an approach to elucidating molecular mechanisms of association. We therefore selected single-nucleotide polymorphisms (SNPs) based on candidate gene studies for AN, BN and OCD from the literature, and examined the association of these SNPs with gene expression across the lifespan in prefrontal cortex of a nonpsychiatric control cohort (N=268). Several risk-predisposing SNPs were significantly associated with gene expression among control subjects. We then measured gene expression in the prefrontal cortex of cases previously diagnosed with obsessive psychiatric disorders, for example, ED (N=15) and OCD/obsessive-compulsive personality disorder or tics (OCD/OCPD/Tic; N=16), and nonpsychiatric controls (N=102) and identified 6 and 286 genes that were differentially expressed between ED compared with controls and OCD cases compared with controls, respectively (false discovery rate (FDR) <5%). However, none of the clinical risk SNPs were among the eQTLs and none were significantly associated with gene expression within the broad obsessive cohort, suggesting larger sample sizes or other brain regions may be required to identify candidate molecular mechanisms of clinical association in postmortem brain data sets. PMID:25180571

Genetic neuropathology of obsessive psychiatric syndromes.

PubMed

Jaffe, A E; Deep-Soboslay, A; Tao, R; Hauptman, D T; Kaye, W H; Arango, V; Weinberger, D R; Hyde, T M; Kleinman, J E

2014-09-02

Anorexia nervosa (AN), bulimia nervosa (BN) and obsessive-compulsive disorder (OCD) are complex psychiatric disorders with shared obsessive features, thought to arise from the interaction of multiple genes of small effect with environmental factors. Potential candidate genes for AN, BN and OCD have been identified through clinical association and neuroimaging studies; however, recent genome-wide association studies of eating disorders (ED) so far have failed to report significant findings. In addition, few, if any, studies have interrogated postmortem brain tissue for evidence of expression quantitative trait loci (eQTLs) associated with candidate genes, which has particular promise as an approach to elucidating molecular mechanisms of association. We therefore selected single-nucleotide polymorphisms (SNPs) based on candidate gene studies for AN, BN and OCD from the literature, and examined the association of these SNPs with gene expression across the lifespan in prefrontal cortex of a nonpsychiatric control cohort (N=268). Several risk-predisposing SNPs were significantly associated with gene expression among control subjects. We then measured gene expression in the prefrontal cortex of cases previously diagnosed with obsessive psychiatric disorders, for example, ED (N=15) and OCD/obsessive-compulsive personality disorder or tics (OCD/OCPD/Tic; N=16), and nonpsychiatric controls (N=102) and identified 6 and 286 genes that were differentially expressed between ED compared with controls and OCD cases compared with controls, respectively (false discovery rate (FDR) <5%). However, none of the clinical risk SNPs were among the eQTLs and none were significantly associated with gene expression within the broad obsessive cohort, suggesting larger sample sizes or other brain regions may be required to identify candidate molecular mechanisms of clinical association in postmortem brain data sets.
Schizophrenia, vitamin D, and brain development.

PubMed

Mackay-Sim, Alan; Féron, François; Eyles, Darryl; Burne, Thomas; McGrath, John

2004-01-01

Schizophrenia research is invigorated at present by the recent discovery of several plausible candidate susceptibility genes identified from genetic linkage and gene expression studies of brains from persons with schizophrenia. It is a current challenge to reconcile this gathering evidence for specific candidate susceptibility genes with the "neurodevelopmental hypothesis," which posits that schizophrenia arises from gene-environment interactions that disrupt brain development. We make the case here that schizophrenia may result not from numerous genes of small effect, but a few genes of transcriptional regulation acting during brain development. In particular we propose that low vitamin D during brain development interacts with susceptibility genes to alter the trajectory of brain development, probably by epigenetic regulation that alters gene expression throughout adult life. Vitamin D is an attractive "environmental" candidate because it appears to explain several key epidemiological features of schizophrenia. Vitamin D is an attractive "genetic" candidate because its nuclear hormone receptor regulates gene expression and nervous system development. The polygenic quality of schizophrenia, with linkage to many genes of small effect, maybe brought together via this "vitamin D hypothesis." We also discuss the possibility of a broader set of environmental and genetic factors interacting via the nuclear hormone receptors to affect the development of the brain leading to schizophrenia.
Hierarchical Dirichlet process model for gene expression clustering

PubMed Central

2013-01-01

Clustering is an important data processing tool for interpreting microarray data and genomic network inference. In this article, we propose a clustering algorithm based on the hierarchical Dirichlet processes (HDP). The HDP clustering introduces a hierarchical structure in the statistical model which captures the hierarchical features prevalent in biological data such as the gene express data. We develop a Gibbs sampling algorithm based on the Chinese restaurant metaphor for the HDP clustering. We apply the proposed HDP algorithm to both regulatory network segmentation and gene expression clustering. The HDP algorithm is shown to outperform several popular clustering algorithms by revealing the underlying hierarchical structure of the data. For the yeast cell cycle data, we compare the HDP result to the standard result and show that the HDP algorithm provides more information and reduces the unnecessary clustering fragments. PMID:23587447
Bioinformatic investigation of the role of ubiquitins in cucumber flower morphogenesis

NASA Astrophysics Data System (ADS)

Pawełkowicz, Magdalena; Osipowski, Paweł; Wojcieszek, Michał; Kowalczuk, Cezary; PlÄ der, Wojciech; Przybecki, Zbigniew

2016-09-01

Three cDNA clones were used to screen cucumber genome in order to find genes and proteins. Functional annotation reveals that they are correlated with ubiquitination pathways. Various bioinformatics tools were used to screen and check protein sequences features such as: the presence of specific domains, transmembrane regions, cleavage site and cellular placement. The computational analysis for promotor region shows many binding sites for transcription factors, which could regulate the expression of genes. In order to check gene expression levels in developing flower buds of monoecious (B10) and gynoecious (2gg) cucumber lines, the real - time PCR technique was applied. The expression was checked for the whole buds and only for the 3rd and 4th whorls of bud when generative organ are form which were obtained by Laser Capture Microdissection (LCM) technique.
Gene expression profiling of peripheral blood mononuclear cells (PBMC) from Mycobacterium bovis infected cattle after in vitro antigenic stimulation with purified protein derivative of tuberculin (PPD).

PubMed

Meade, Kieran G; Gormley, Eamonn; Park, Stephen D E; Fitzsimons, Tara; Rosa, Guilherme J M; Costello, Eamon; Keane, Joseph; Coussens, Paul M; MacHugh, David E

2006-09-15

Microarray analysis of messenger RNA (mRNA) abundance was used to investigate the gene expression program of peripheral blood mononuclear cells (PBMC) from cattle infected with Mycobacterium bovis, the causative agent of bovine tuberculosis. An immunospecific bovine microarray platform (BOTL-4) with spot features representing 1336 genes was used for transcriptional profiling of PBMC from six M. bovis-infected cattle stimulated in vitro with bovine purified protein derivative of tuberculin (PPD-bovine). Cells were harvested at four time points (3 h, 6 h, 12 h and 24 h post-stimulation) and a split-plot design with pooled samples was used for the microarray experiment to compare gene expression between PPD-bovine stimulated PBMC and unstimulated controls for each time point. Statistical analyses of these data revealed 224 genes (approximately 17% of transcripts on the array) differentially expressed between stimulated and unstimulated PBMC across the 24 h time course (P<0.05). Of the 224 genes, 87 genes were significantly upregulated and 137 genes were significantly downregulated in M. bovis-infected PBMC stimulated with PPD-bovine across the 24 h time course. However, perturbation of the PBMC transcriptome was most apparent at time points 3 h and 12 h post-stimulation, with 81 and 84 genes differentially expressed, respectively. In addition, a more stringent statistical threshold (P<0.01) revealed 35 genes (approximately 3%) that were differentially expressed across the time course. Real-time quantitative reverse transcription PCR (qRT-PCR) of selected genes validated the microarray results and demonstrated a wide range of differentially expressed genes in PPD-bovine-, PPD-avian- and Concanavalin A (ConA) stimulated PBMC, including the interferon-gamma gene (IFNG), which was upregulated in PBMC stimulated with PPD-bovine (40-fold), PPD-avian (10-fold) and ConA (8-fold) after in vitro culture for 12 h. The pattern of expression of these genes in PPD-bovine stimulated PBMC provides the first description of an M. bovis-specific signature of infection that may provide insights into the molecular basis of the host response to infection. Although the present study was carried out with mixed PBMC cell populations, it will guide future studies to dissect immune cell-specific gene expression patterns in response to M. bovis infection.
An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

PubMed

Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

2017-04-12

Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs) (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.
Function Clustering Self-Organization Maps (FCSOMs) for mining differentially expressed genes in Drosophila and its correlation with the growth medium.

PubMed

Liu, L L; Liu, M J; Ma, M

2015-09-28

The central task of this study was to mine the gene-to-medium relationship. Adequate knowledge of this relationship could potentially improve the accuracy of differentially expressed gene mining. One of the approaches to differentially expressed gene mining uses conventional clustering algorithms to identify the gene-to-medium relationship. Compared to conventional clustering algorithms, self-organization maps (SOMs) identify the nonlinear aspects of the gene-to-medium relationships by mapping the input space into another higher dimensional feature space. However, SOMs are not suitable for huge datasets consisting of millions of samples. Therefore, a new computational model, the Function Clustering Self-Organization Maps (FCSOMs), was developed. FCSOMs take advantage of the theory of granular computing as well as advanced statistical learning methodologies, and are built specifically for each information granule (a function cluster of genes), which are intelligently partitioned by the clustering algorithm provided by the DAVID_6.7 software platform. However, only the gene functions, and not their expression values, are considered in the fuzzy clustering algorithm of DAVID. Compared to the clustering algorithm of DAVID, these experimental results show a marked improvement in the accuracy of classification with the application of FCSOMs. FCSOMs can handle huge datasets and their complex classification problems, as each FCSOM (modeled for each function cluster) can be easily parallelized.
Complete TCRα gene locus control region activity in T cells derived in vitro from embryonic stem cells

PubMed Central

Lahiji, Armin; Kučerová-Levisohn, Martina; Lovett, Jordana; Holmes, Roxanne; Zúñiga-Pflücker, Juan Carlos; Ortiz, Benjamin D.

2013-01-01

Locus Control Regions (LCR) are cis-acting gene regulatory elements with the unique, integration site-independent ability to transfer the characteristics of their locus-of-origin’s gene expression pattern to a linked transgene in mice. LCR activities have been discovered in numerous T cell lineage expressed gene loci. These elements can be adapted to the design of stem cell gene therapy vectors that direct robust therapeutic gene expression to the T cell progeny of engineered stem cells. Currently, transgenic mice provide the only experimental approach that wholly supports all the critical aspects of LCR activity. Herein we report manifestation of all key features of mouse T cell receptor (TCR)-α gene LCR function in T cells derived in vitro from mouse embryonic stem cells (ESC). High level, copy number-related TCRα LCR-linked reporter gene expression levels are cell type-restricted in this system, and upregulated during the expected stage transition of T cell development. We further report that de novo introduction of TCRα LCR linked transgenes into existing T cell lines yields incomplete LCR activity. Together, these data indicate that establishing full TCRα LCR activity requires critical molecular events occurring prior to final T-lineage determination. This study additionally validates a novel, tractable and more rapid approach for the study of LCR activity in T cells, and its translation to therapeutic genetic engineering. PMID:23720809
Arabidopsis Gene Family Profiler (aGFP)--user-oriented transcriptomic database with easy-to-use graphic interface.

PubMed

Dupl'áková, Nikoleta; Renák, David; Hovanec, Patrik; Honysová, Barbora; Twell, David; Honys, David

2007-07-23

Microarray technologies now belong to the standard functional genomics toolbox and have undergone massive development leading to increased genome coverage, accuracy and reliability. The number of experiments exploiting microarray technology has markedly increased in recent years. In parallel with the rapid accumulation of transcriptomic data, on-line analysis tools are being introduced to simplify their use. Global statistical data analysis methods contribute to the development of overall concepts about gene expression patterns and to query and compose working hypotheses. More recently, these applications are being supplemented with more specialized products offering visualization and specific data mining tools. We present a curated gene family-oriented gene expression database, Arabidopsis Gene Family Profiler (aGFP; http://agfp.ueb.cas.cz), which gives the user access to a large collection of normalised Affymetrix ATH1 microarray datasets. The database currently contains NASC Array and AtGenExpress transcriptomic datasets for various tissues at different developmental stages of wild type plants gathered from nearly 350 gene chips. The Arabidopsis GFP database has been designed as an easy-to-use tool for users needing an easily accessible resource for expression data of single genes, pre-defined gene families or custom gene sets, with the further possibility of keyword search. Arabidopsis Gene Family Profiler presents a user-friendly web interface using both graphic and text output. Data are stored at the MySQL server and individual queries are created in PHP script. The most distinguishable features of Arabidopsis Gene Family Profiler database are: 1) the presentation of normalized datasets (Affymetrix MAS algorithm and calculation of model-based gene-expression values based on the Perfect Match-only model); 2) the choice between two different normalization algorithms (Affymetrix MAS4 or MAS5 algorithms); 3) an intuitive interface; 4) an interactive "virtual plant" visualizing the spatial and developmental expression profiles of both gene families and individual genes. Arabidopsis GFP gives users the possibility to analyze current Arabidopsis developmental transcriptomic data starting with simple global queries that can be expanded and further refined to visualize comparative and highly selective gene expression profiles.
Circadian Rhythms Regulate Amelogenesis

PubMed Central

Zheng, Li; Seon, Yoon Ji; Mourão, Marcio A.; Schnell, Santiago; Kim, Doohak; Harada, Hidemitsu; Papagerakis, Silvana; Papagerakis, Petros

2013-01-01

Ameloblasts, the cells responsible for making enamel, modify their morphological features in response to specialized functions necessary for synchronized ameloblast differentiation and enamel formation. Secretory and maturation ameloblasts are characterized by the expression of stage-specific genes which follows strictly controlled repetitive patterns. Circadian rhythms are recognized as key regulators of development and diseases of many tissues including bone. Our aim was to gain novel insights on the role of clock genes in enamel formation and to explore the potential links between circadian rhythms and amelogenesis. Our data shows definitive evidence that the main clock genes (Bmal1, Clock, Per1 and Per2) oscillate in ameloblasts at regular circadian (24h) intervals both at RNA and protein levels. This study also reveals that two markers of ameloblast differentiation i.e. amelogenin (Amelx; a marker of secretory ameloblasts) and kallikrein-related peptidase 4 (Klk4, a marker of maturation ameloblasts) are downstream targets of clock genes. Both, Amelx and Klk4 show 24h oscillatory expression patterns and their expression levels are up-regulated after Bmal1 over-expression in HAT-7 ameloblast cells. Taken together, these data suggest that both the secretory and the maturation stage of amelogenesis might be under circadian control. Changes in clock genes expression patterns might result in significant alterations of enamel apposition and mineralization. PMID:23486183
Parasitization by Scleroderma guani influences expression of superoxide dismutase genes in Tenebrio molitor.

PubMed

Zhu, Jia-Ying; Ze, Sang-Zi; Stanley, David W; Yang, Bin

2014-09-01

Superoxide dismutase (SOD) is an antioxidant enzyme involved in detoxifying reactive oxygen species. In this study, we identified genes encoding the extracellular and intracellular copper-zinc SODs (ecCuZnSOD and icCuZnSOD) and a manganese SOD (MnSOD) in the yellow mealworm beetle, Tenebrio molitor. The cDNAs for ecCuZnSOD, icCuZnSOD, and MnSOD, respectively, encode 24.55, 15.81, and 23.14 kDa polypeptides, which possess structural features typical of other insect SODs. They showed 20-94% identity to other known SOD sequences from Bombyx mori, Musca domestica, Nasonia vitripennis, Pediculus humanus corporis, and Tribolium castaneum. Expression of these genes was analyzed in selected tissues and developmental stages, and following exposure to Escherichia coli and parasitization by Scleroderma guani. We recorded expression of all three SODs in cuticle, fat body, and hemocytes and in the major developmental stages. Relatively higher expressions were detected in late-instar larvae and pupae, compared to other developmental stages. Transcriptional levels were upregulated following bacterial infection. Analysis of pupae parasitized by S. guani revealed that expression of T. molitor SOD genes was significantly induced following parasitization. We infer that these genes act in immune response and in host-parasitoid interactions. © 2014 Wiley Periodicals, Inc.
Deep transcriptome sequencing provides new insights into the structural and functional organization of the wheat genome.

PubMed

Pingault, Lise; Choulet, Frédéric; Alberti, Adriana; Glover, Natasha; Wincker, Patrick; Feuillet, Catherine; Paux, Etienne

2015-02-10

Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of the genome structure on gene organization, function, and regulation. However, because of the lack of a reference genome sequence, such studies have long been hampered and our knowledge of the wheat gene space is still limited. The access to the reference sequence of the wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before. By combining this sequence with RNA-seq data, we construct a fine transcriptome map of the chromosome 3B. More than 8,800 transcription sites are identified, that are distributed throughout the entire chromosome. Expression level, expression breadth, alternative splicing as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level. Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.
Heterologous gene expression driven by carbonic anhydrase gene promoter in Dunaliella salina

NASA Astrophysics Data System (ADS)

Chai, Yurong; Lu, Yumin; Wang, Tianyun; Hou, Weihong; Xue, Lexun

2006-12-01

Dunaliella salina, a halotolerant unicellular green alga without a rigid cell wall, can live in salinities ranging from 0.05 to 5 mol/L NaCl. These features of D. salina make it an ideal host for the production of antibodies, oral vaccine, and commercially valuable polypeptides. To produce high level of heterologous proteins from D. salina, highly efficient promoters are required to drive expression of target genes under controlled condition. In the present study, we cloned a 5' franking region of 1.4 kb from the carbonic anhydrase ( CAH) gene of D. salina by genomic walking and PCR. The fragment was ligated to the pMD18-T vector and characterized. Sequence analysis indicated that this region contained conserved motifs, including a TATA- like box and CAAT-box. Tandem (GT)n repeats that had a potential role of transcriptional control, were also found in this region. The transcription start site (TSS) of the CAH gene was determined by 5' RACE and nested PCR method. Transformation assays showed that the 1.4 kb fragment was able to drive expression of the selectable bar (bialaphos resistance) gene when the fusion was transformed into D. salina by biolistics. Northern blotting hybridizations showed that the bar transcript was most abundant in cells grown in 2 mol/L NaCl, and less abundant in 0.5 mol/L NaCl, indicating that expression of the bar gene was induced at high salinity. These results suggest the potential use of the CAH gene promoter to induce the expression of heterologous genes in D. salina under varied salt condition.
Involvement of Retinoblastoma Protein and HBP1 in Histone H10 Gene Expression

PubMed Central

Lemercier, Claudie; Duncliffe, Kym; Boibessot, Isabelle; Zhang, Hui; Verdel, André; Angelov, Dimitar; Khochbin, Saadi

2000-01-01

The histone H10-encoding gene is expressed in vertebrates in differentiating cells during the arrest of proliferation. In the H10 promoter, a specific regulatory element, which we named the H4 box, exhibits features which implicate a role in mediating H10 gene expression in response to both differentiation and cell cycle control signals. For instance, within the linker histone gene family, the H4 box is found only in the promoters of differentiation-associated subtypes, suggesting that it is specifically involved in differentiation-dependent expression of these genes. In addition, an element nearly identical to the H4 box is conserved in the promoters of histone H4-encoding genes and is known to be involved in their cell cycle-dependent expression. The transcription factors interacting with the H10 H4 box were therefore expected to link differentiation-dependent expression of H10 to the cell cycle control machinery. The aim of this work was to identify such transcription factors and to obtain information concerning the regulatory pathway involved. Interestingly, our cloning strategy led to the isolation of a retinoblastoma protein (RB) partner known as HBP1. HBP1, a high-mobility group box transcription factor, interacted specifically with the H10 H4 box and moreover was expressed in a differentiation-dependent manner. We also showed that the HBP1-encoding gene is able to produce different forms of HBP1. Finally, we demonstrated that both HBP1 and RB were involved in the activation of H10 gene expression. We therefore propose that HBP1 mediates a link between the cell cycle control machinery and cell differentiation signals. Through modulating the expression of specific chromatin-associated proteins such as histone H10, HBP1 plays a vital role in chromatin remodeling events during the arrest of cell proliferation in differentiating cells. PMID:10958660
Expression of FAP, ADAM12, WISP1, and SOX11 is heterogeneous in aggressive fibromatosis and spatially relates to the histologic features of tumor activity.

PubMed

Misemer, Benjamin S; Skubitz, Amy P N; Carlos Manivel, J; Schmechel, Stephen C; Cheng, Edward Y; Henriksen, Jonathan C; Koopmeiners, Joseph S; Corless, Christopher L; Skubitz, Keith M

2014-02-01

Aggressive fibromatosis (AF) represents a group of tumors with a variable and unpredictable clinical course, characterized by a monoclonal proliferation of myofibroblastic cells. The optimal treatment for AF remains unclear. Identification and validation of genes whose expression patterns are associated with AF may elucidate biological mechanisms in AF, and aid treatment selection. This study was designed to examine the protein expression by immunohistochemistry (IHC) of four genes, ADAM12, FAP, SOX11, and WISP1, that were found in an earlier study to be uniquely overexpressed in AF compared with normal tissues. Digital image analysis was performed to evaluate inter- and intratumor heterogeneity, and correlate protein expression with histologic features, including a histopathologic assessment of tumor activity, defined by nuclear chromatin density ratio (CDR). AF tumors exhibited marked inter- and intratumor histologic heterogeneity. Pathologic assessment of tumor activity and digital assessment of average nuclear size and CDR were all significantly correlated. IHC revealed protein expression of all four genes. IHC staining for ADAM12, FAP, and WISP1 correlated with CDR and was higher, whereas SOX11 staining was lower in tumors with earlier recurrence following excision. All four proteins were expressed, and the regional variation in tumor activity within and among AF cases was demonstrated. A spatial correlation between protein expression and nuclear morphology was observed. IHC also correlated with the probability of recurrence following excision. These proteins may be involved in AF pathogenesis and the corresponding pathways could serve as potential targets of therapy. © 2013 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
Rhesus monkey model of liver disease reflecting clinical disease progression and hepatic gene expression analysis

PubMed Central

Wang, Hong; Tan, Tao; Wang, Junfeng; Niu, Yuyu; Yan, Yaping; Guo, Xiangyu; Kang, Yu; Duan, Yanchao; Chang, Shaohui; Liao, Jianpeng; Si, Chenyang; Ji, Weizhi; Si, Wei

2015-01-01

Alcoholic liver disease (ALD) is a significant public health issue with heavy medical and economic burdens. The aetiology of ALD is not yet completely understood. The development of drugs and therapies for ALD is hampered by a lack of suitable animal models that replicate both the histological and metabolic features of human ALD. Here, we characterize a rhesus monkey model of alcohol-induced liver steatosis and hepatic fibrosis that is compatible with the clinical progression of the biochemistry and pathology in humans with ALD. Microarray analysis of hepatic gene expression was conducted to identify potential molecular signatures of ALD progression. The up-regulation of expression of hepatic genes related to liver steatosis (CPT1A, FASN, LEPR, RXRA, IGFBP1, PPARGC1A and SLC2A4) was detected in our rhesus model, as was the down-regulation of such genes (CYP7A1, HMGCR, GCK and PNPLA3) and the up-regulation of expression of hepatic genes related to liver cancer (E2F1, OPCML, FZD7, IGFBP1 and LEF1). Our results demonstrate that this ALD model reflects the clinical disease progression and hepatic gene expression observed in humans. These findings will be useful for increasing the understanding of ALD pathogenesis and will benefit the development of new therapeutic procedures and pharmacological reagents for treating ALD. PMID:26442469
Mismatch repair mRNA and protein expression in intestinal adenocarcinoma in sika deer (Cervus nippon) resembling heritable non-polyposis colorectal cancer in man.

PubMed

Jahns, H; Browne, J A

2015-01-01

Intestinal adenocarcinomas seen in an inbred herd of farmed sika deer (Cervus nippon) morphologically resembled human hereditary non-polyposis colorectal cancer (HNPCC). Features common to both included multiple de novo sites of tumourigenesis in the proximal colon, sessile and non-polyposis mucosal changes, the frequent finding of mucinous type adenocarcinoma, lymphocyte infiltration into the neoplastic tubules and Crohn's-like lymphoid follicles at the deep margin of the tumour. HNPCC is defined by a germline mutation of mismatch repair (MMR) genes resulting in their inactivation and loss of expression. To test the hypothesis that similar MMR gene inactivation occurs in the deer tumours, the expression of the four most important MMR genes, MSH2, MLH1, MSH6 and PMS2, was examined at the mRNA level by reverse transcriptase polymerase chain reaction (n = 12) and at the protein level by immunohistochemistry (n = 40) in tumour and control tissues. All four genes were expressed equally in normal and neoplastic tissues, so MMR gene inactivation could not be implicated in the carcinogenesis of this tumour in sika deer. Copyright © 2014 Elsevier Ltd. All rights reserved.
Transcription Factor Binding Profiles Reveal Cyclic Expression of Human Protein-coding Genes and Non-coding RNAs

PubMed Central

Cheng, Chao; Ung, Matthew; Grant, Gavin D.; Whitfield, Michael L.

2013-01-01

Cell cycle is a complex and highly supervised process that must proceed with regulatory precision to achieve successful cellular division. Despite the wide application, microarray time course experiments have several limitations in identifying cell cycle genes. We thus propose a computational model to predict human cell cycle genes based on transcription factor (TF) binding and regulatory motif information in their promoters. We utilize ENCODE ChIP-seq data and motif information as predictors to discriminate cell cycle against non-cell cycle genes. Our results show that both the trans- TF features and the cis- motif features are predictive of cell cycle genes, and a combination of the two types of features can further improve prediction accuracy. We apply our model to a complete list of GENCODE promoters to predict novel cell cycle driving promoters for both protein-coding genes and non-coding RNAs such as lincRNAs. We find that a similar percentage of lincRNAs are cell cycle regulated as protein-coding genes, suggesting the importance of non-coding RNAs in cell cycle division. The model we propose here provides not only a practical tool for identifying novel cell cycle genes with high accuracy, but also new insights on cell cycle regulation by TFs and cis-regulatory elements. PMID:23874175
MicroRNA Gene Regulatory Networks in Peripheral Nerve Sheath Tumors

DTIC Science & Technology

2012-09-01

chondrosarcoma are identified based on the unique histology, cell of origin, clinical features and site distribution. The following are the major... Chondrosarcoma Chondrosarcoma is a cancer composed of cells derived from transformed cells that produce cartilage. Peripheral chondrosarcoma is a malignant...biosynthesis. This is in line with gene expression analyses previously performed in osteochondroma and chondrosarcoma samples showing modulation of
Unveiling network-based functional features through integration of gene expression into protein networks.

PubMed

Jalili, Mahdi; Gebhardt, Tom; Wolkenhauer, Olaf; Salehzadeh-Yazdi, Ali

2018-06-01

Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease related genes, gene prioritization, clustering protein interactions, developing the modules, extract active subnetworks and static protein complexes or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers. Copyright © 2018 Elsevier B.V. All rights reserved.

Snail1 transcription factor controls telomere transcription and integrity

PubMed Central

Mazzolini, Rocco; Gonzàlez, Núria; Garcia-Garijo, Andrea; Millanes-Romero, Alba; Peiró, Sandra; Smith, Susan

2018-01-01

Abstract Besides controlling epithelial-to-mesenchymal transition (EMT) and cell invasion, the Snail1 transcriptional factor also provides cells with cancer stem cell features. Since telomere maintenance is essential for stemness, we have examined the control of telomere integrity by Snail1. Fluorescence in situ hybridization (FISH) analysis indicates that Snail1-depleted mouse mesenchymal stem cells (MSC) have both a dramatic increase of telomere alterations and shorter telomeres. Remarkably, Snail1-deficient MSC present higher levels of both telomerase activity and the long non-coding RNA called telomeric repeat-containing RNA (TERRA), an RNA that controls telomere integrity. Accordingly, Snail1 expression downregulates expression of the telomerase gene (TERT) as well as of TERRA 2q, 11q and 18q. TERRA and TERT are transiently downregulated during TGFβ-induced EMT in NMuMG cells, correlating with Snail1 expression. Global transcriptome analysis indicates that ectopic expression of TERRA affects the transcription of some genes induced during EMT, such as fibronectin, whereas that of TERT does not modify those genes. We propose that Snail1 repression of TERRA is required not only for telomere maintenance but also for the expression of a subset of mesenchymal genes. PMID:29059385
Low expression of N-myc downstream-regulated gene 2 (NDRG2) correlates with poor prognosis in hepatoblastoma.

PubMed

Gödeke, Jan; Luxenburger, Elke; Trippel, Franziska; Becker, Kristina; Häberle, Beate; Müller-Höcker, Josef; von Schweinitz, Dietrich; Kappler, Roland

2016-03-01

Despite tremendous progress in therapy, about 30% of patients with hepatoblastoma still succumb to the disease. Thus, the development of improved therapies as well as the identification of prognostic factors are urgently needed. In the present study, expression and promoter methylation of the N-myc downstream-regulated gene (NDRG2), a tumor suppressor gene contributing to the regulation of the Wnt signalling pathway, was analysed in 38 hepatoblastoma samples by real-time reverse transcription-PCR and pyrosequencing, respectively. The NDRG2 gene was highly expressed in normal pediatric liver tissue, but was significantly downregulated in heptoblastoma primary tumors. Detailed methylation analysis of CpG sites in the NDRG2 promoter region revealed a general high degree of DNA methylation in hepatoblastoma, which correlated with the suppression of NDRG2. By analyzing clinicopathological features we could demonstrate a strong association between low NDRG2 expression and tumor metastasis. Importantly, the overall survival analysis by Kaplan-Meier revealed that high NDRG2 expression was correlated with a higher survival rate in hepatoblastoma patients. Our data show that downregulation of NDRG2 may play an important role in advanced hepatoblastomas.
Active and Repressive Chromatin Are Interspersed without Spreading in an Imprinted Gene Cluster in the Mammalian Genome

PubMed Central

Regha, Kakkad; Sloane, Mathew A.; Huang, Ru; Pauler, Florian M.; Warczok, Katarzyna E.; Melikant, Balázs; Radolf, Martin; Martens, Joost H.A.; Schotta, Gunnar; Jenuwein, Thomas; Barlow, Denise P.

2010-01-01

SUMMARY The Igf2r imprinted cluster is an epigenetic silencing model in which expression of a ncRNA silences multiple genes in cis. Here, we map a 250 kb region in mouse embryonic fibroblast cells to show that histone modifications associated with expressed and silent genes are mutually exclusive and localized to discrete regions. Expressed genes were modified at promoter regions by H3K4me3 + H3K4me2 + H3K9Ac and on putative regulatory elements flanking active promoters by H3K4me2 + H3K9Ac. Silent genes showed two types of nonoverlapping profile. One type spread over large domains of tissue-specific silent genes and contained H3K27me3 alone. A second type formed localized foci on silent imprinted gene promoters and a nonexpressed pseudogene and contained H3K9me3 + H4K20me3 ± HP1. Thus, mammalian chromosome arms contain active chromatin interspersed with repressive chromatin resembling the type of heterochromatin previously considered a feature of centromeres, telomeres, and the inactive X chromosome. PMID:17679087
Determining Cutoff Point of Ensemble Trees Based on Sample Size in Predicting Clinical Dose with DNA Microarray Data.

PubMed

Yılmaz Isıkhan, Selen; Karabulut, Erdem; Alpar, Celal Reha

2016-01-01

Background/Aim . Evaluating the success of dose prediction based on genetic or clinical data has substantially advanced recently. The aim of this study is to predict various clinical dose values from DNA gene expression datasets using data mining techniques. Materials and Methods . Eleven real gene expression datasets containing dose values were included. First, important genes for dose prediction were selected using iterative sure independence screening. Then, the performances of regression trees (RTs), support vector regression (SVR), RT bagging, SVR bagging, and RT boosting were examined. Results . The results demonstrated that a regression-based feature selection method substantially reduced the number of irrelevant genes from raw datasets. Overall, the best prediction performance in nine of 11 datasets was achieved using SVR; the second most accurate performance was provided using a gradient-boosting machine (GBM). Conclusion . Analysis of various dose values based on microarray gene expression data identified common genes found in our study and the referenced studies. According to our findings, SVR and GBM can be good predictors of dose-gene datasets. Another result of the study was to identify the sample size of n = 25 as a cutoff point for RT bagging to outperform a single RT.
An Adaptive Genetic Association Test Using Double Kernel Machines

PubMed Central

Zhan, Xiang; Epstein, Michael P.; Ghosh, Debashis

2014-01-01

Recently, gene set-based approaches have become very popular in gene expression profiling studies for assessing how genetic variants are related to disease outcomes. Since most genes are not differentially expressed, existing pathway tests considering all genes within a pathway suffer from considerable noise and power loss. Moreover, for a differentially expressed pathway, it is of interest to select important genes that drive the effect of the pathway. In this article, we propose an adaptive association test using double kernel machines (DKM), which can both select important genes within the pathway as well as test for the overall genetic pathway effect. This DKM procedure first uses the garrote kernel machines (GKM) test for the purposes of subset selection and then the least squares kernel machine (LSKM) test for testing the effect of the subset of genes. An appealing feature of the kernel machine framework is that it can provide a flexible and unified method for multi-dimensional modeling of the genetic pathway effect allowing for both parametric and nonparametric components. This DKM approach is illustrated with application to simulated data as well as to data from a neuroimaging genetics study. PMID:26640602
PiiL: visualization of DNA methylation and gene expression data in gene pathways.

PubMed

Moghadam, Behrooz Torabi; Zamani, Neda; Komorowski, Jan; Grabherr, Manfred

2017-08-02

DNA methylation is a major mechanism involved in the epigenetic state of a cell. It has been observed that the methylation status of certain CpG sites close to or within a gene can directly affect its expression, either by silencing or, in some cases, up-regulating transcription. However, a vertebrate genome contains millions of CpG sites, all of which are potential targets for methylation, and the specific effects of most sites have not been characterized to date. To study the complex interplay between methylation status, cellular programs, and the resulting phenotypes, we present PiiL, an interactive gene expression pathway browser, facilitating analyses through an integrated view of methylation and expression on multiple levels. PiiL allows for specific hypothesis testing by quickly assessing pathways or gene networks, where the data is projected onto pathways that can be downloaded directly from the online KEGG database. PiiL provides a comprehensive set of analysis features that allow for quick and specific pattern searches. Individual CpG sites and their impact on host gene expression, as well as the impact on other genes present in the regulatory network, can be examined. To exemplify the power of this approach, we analyzed two types of brain tumors, Glioblastoma multiform and lower grade gliomas. At a glance, we could confirm earlier findings that the predominant methylation and expression patterns separate perfectly by mutations in the IDH genes, rather than by histology. We could also infer the IDH mutation status for samples for which the genotype was not known. By applying different filtering methods, we show that a subset of CpG sites exhibits consistent methylation patterns, and that the status of sites affect the expression of key regulator genes, as well as other genes located downstream in the same pathways. PiiL is implemented in Java with focus on a user-friendly graphical interface. The source code is available under the GPL license from https://github.com/behroozt/PiiL.git .
The Airway Microbiome in Severe Asthma: Associations with Disease Features and Severity

PubMed Central

Huang, Yvonne J.; Nariya, Snehal; Harris, Jeffrey M.; Lynch, Susan V.; Choy, David F.; Arron, Joseph R.; Boushey, Homer

2015-01-01

Background Asthma is heterogeneous, and airway dysbiosis is associated with clinical features in mild-moderate asthma. Whether similar relationships exist among patients with severe asthma is unknown. Objective To evaluate relationships between the bronchial microbiome and features of severe asthma. Methods Bronchial brushings from 40 participants in the BOBCAT study (Bronchoscopic Exploratory Research Study of Biomarkers in Corticosteroid-refractory Asthma) were evaluated using 16S rRNA-based methods. Relationships to clinical and inflammatory features were analyzed among microbiome-profiled subjects. Secondarily, bacterial compositional profiles were compared between severe asthmatics, and previously studied healthy controls (n=7), and mild-moderate asthma subjects (n=41). Results In severe asthma, bronchial bacterial composition was associated with several disease-related features, including body-mass index (BMI; Bray-Curtis distance PERMANOVA, p < 0.05), changes in Asthma Control Questionnaire (ACQ) scores (p < 0.01), sputum total leukocytes (p = 0.06) and bronchial biopsy eosinophils (per mm2; p = 0.07). Bacterial communities associated with worsening ACQ and sputum total leukocytes (predominantly Proteobacteria) differed markedly from those associated with BMI (Bacteroidetes/Firmicutes). In contrast, improving/stable ACQ and bronchial epithelial gene expression of FKBP5, an indicator of steroid responsiveness, correlated with Actinobacteria. Mostly negative correlations were observed between biopsy eosinophils and Proteobacteria. No taxa were associated with a T-helper type 2-related epithelial gene expression signature, but expression of Th17-related genes was associated with Proteobacteria. Severe asthma subjects, compared to healthy controls or mild-moderate asthmatics, were significantly enriched in Actinobacteria, although the largest differences observed involved a Klebsiella genus member (7.8 fold-increase in severe asthma, padj < 0.001) Conclusions Specific microbiota are associated with and may modulate inflammatory processes in severe asthma and related phenotypes. Airway dysbiosis in severe asthma appears to differ from that observed in milder asthma in the setting of inhaled corticosteroid use. PMID:26220531
Natural language indicators of differential gene regulation in the human immune system.

PubMed

Mehl, Matthias R; Raison, Charles L; Pace, Thaddeus W W; Arevalo, Jesusa M G; Cole, Steve W

2017-11-21

Adverse social conditions have been linked to a conserved transcriptional response to adversity (CTRA) in circulating leukocytes that may contribute to social gradients in disease. However, the CNS mechanisms involved remain obscure, in part because CTRA gene-expression profiles often track external social-environmental variables more closely than they do self-reported internal affective states such as stress, depression, or anxiety. This study examined the possibility that variations in patterns of natural language use might provide more sensitive indicators of the automatic threat-detection and -response systems that proximally regulate autonomic induction of the CTRA. In 22,627 audio samples of natural speech sampled from the daily interactions of 143 healthy adults, both total language output and patterns of function-word use covaried with CTRA gene expression. These language features predicted CTRA gene expression substantially better than did conventional self-report measures of stress, depression, and anxiety and did so independently of demographic and behavioral factors (age, sex, race, smoking, body mass index) and leukocyte subset distributions. This predictive relationship held when language and gene expression were sampled more than a week apart, suggesting that associations reflect stable individual differences or chronic life circumstances. Given the observed relationship between personal expression and gene expression, patterns of natural language use may provide a useful behavioral indicator of nonconsciously evaluated well-being (implicit safety vs. threat) that is distinct from conscious affective experience and more closely tracks the neurobiological processes involved in peripheral gene regulation. Copyright © 2017 the Author(s). Published by PNAS.
Gene expression responses of HeLa cells to chemical species generated by an atmospheric plasma flow

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yokoyama, Mayo, E-mail: yokoyama@plasma.ifs.tohoku.ac.jp; Johkura, Kohei, E-mail: kohei@shinshu-u.ac.jp; Sato, Takehiko, E-mail: sato@ifs.tohoku.ac.jp

2014-08-08

Highlights: • Response of HeLa cells to a plasma-irradiated medium was revealed by DNA microarray. • Gene expression pattern was basically different from that in a H{sub 2}O{sub 2}-added medium. • Prominently up-/down-regulated genes were partly shared by the two media. • Gene ontology analysis showed both similar and different responses in the two media. • Candidate genes involved in response to ROS were detected in each medium. - Abstract: Plasma irradiation generates many factors able to affect the cellular condition, and this feature has been studied for its application in the field of medicine. We previously reported that hydrogenmore » peroxide (H{sub 2}O{sub 2}) was the major cause of HeLa cell death among the chemical species generated by high level irradiation of a culture medium by atmospheric plasma. To assess the effect of plasma-induced factors on the response of live cells, HeLa cells were exposed to a medium irradiated by a non-lethal plasma flow level, and their gene expression was broadly analyzed by DNA microarray in comparison with that in a corresponding concentration of 51 μM H{sub 2}O{sub 2}. As a result, though the cell viability was sufficiently maintained at more than 90% in both cases, the plasma-medium had a greater impact on it than the H{sub 2}O{sub 2}-medium. Hierarchical clustering analysis revealed fundamentally different cellular responses between these two media. A larger population of genes was upregulated in the plasma-medium, whereas genes were downregulated in the H{sub 2}O{sub 2}-medium. However, a part of the genes that showed prominent differential expression was shared by them, including an immediate early gene ID2. In gene ontology analysis of upregulated genes, the plasma-medium showed more diverse ontologies than the H{sub 2}O{sub 2}-medium, whereas ontologies such as “response to stimulus” were common, and several genes corresponded to “response to reactive oxygen species.” Genes of AP-1 proteins, e.g., JUN and FOS, were detected and notably elevated in the plasma-medium. These results showed that the medium irradiated with a non-lethal level of plasma flow altered various gene expressions of HeLa cells by giving not only common effects with H{sub 2}O{sub 2} but also some distinctive actions. This study suggests that in addition to H{sub 2}O{sub 2}, other chemical species able to affect the cellular responses exist in the plasma-irradiated medium and provide unique features for it, probably increasing the oxidative stress level.« less
Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction

PubMed Central

Kim, Dokyoon; Joung, Je-Gun; Sohn, Kyung-Ah; Shin, Hyunjung; Park, Yu Rang; Ritchie, Marylyn D; Kim, Ju Han

2015-01-01

Objective Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approach that combines multi-omics data including copy number alteration, methylation, miRNA, and gene expression data for predicting clinical outcome in cancer. However, genomic features likely interact with other genomic features in complex signaling or regulatory networks, since cancer is caused by alterations in pathways or complete processes. Methods Here we propose a new graph-based framework for integrating multi-omics data and genomic knowledge to improve power in predicting clinical outcomes and elucidate interplay between different levels. To highlight the validity of our proposed framework, we used an ovarian cancer dataset from The Cancer Genome Atlas for predicting stage, grade, and survival outcomes. Results Integrating multi-omics data with genomic knowledge to construct pre-defined features resulted in higher performance in clinical outcome prediction and higher stability. For the grade outcome, the model with gene expression data produced an area under the receiver operating characteristic curve (AUC) of 0.7866. However, models of the integration with pathway, Gene Ontology, chromosomal gene set, and motif gene set consistently outperformed the model with genomic data only, attaining AUCs of 0.7873, 0.8433, 0.8254, and 0.8179, respectively. Conclusions Integrating multi-omics data and genomic knowledge to improve understanding of molecular pathogenesis and underlying biology in cancer should improve diagnostic and prognostic indicators and the effectiveness of therapies. PMID:25002459
Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction.

PubMed

Kim, Dokyoon; Joung, Je-Gun; Sohn, Kyung-Ah; Shin, Hyunjung; Park, Yu Rang; Ritchie, Marylyn D; Kim, Ju Han

2015-01-01

Cancer can involve gene dysregulation via multiple mechanisms, so no single level of genomic data fully elucidates tumor behavior due to the presence of numerous genomic variations within or between levels in a biological system. We have previously proposed a graph-based integration approach that combines multi-omics data including copy number alteration, methylation, miRNA, and gene expression data for predicting clinical outcome in cancer. However, genomic features likely interact with other genomic features in complex signaling or regulatory networks, since cancer is caused by alterations in pathways or complete processes. Here we propose a new graph-based framework for integrating multi-omics data and genomic knowledge to improve power in predicting clinical outcomes and elucidate interplay between different levels. To highlight the validity of our proposed framework, we used an ovarian cancer dataset from The Cancer Genome Atlas for predicting stage, grade, and survival outcomes. Integrating multi-omics data with genomic knowledge to construct pre-defined features resulted in higher performance in clinical outcome prediction and higher stability. For the grade outcome, the model with gene expression data produced an area under the receiver operating characteristic curve (AUC) of 0.7866. However, models of the integration with pathway, Gene Ontology, chromosomal gene set, and motif gene set consistently outperformed the model with genomic data only, attaining AUCs of 0.7873, 0.8433, 0.8254, and 0.8179, respectively. Integrating multi-omics data and genomic knowledge to improve understanding of molecular pathogenesis and underlying biology in cancer should improve diagnostic and prognostic indicators and the effectiveness of therapies. © The Author 2014. Published by Oxford University Press on behalf of the American Medical Informatics Association.
AP-2α and AP-2β cooperatively orchestrate homeobox gene expression during branchial arch patterning.

PubMed

Van Otterloo, Eric; Li, Hong; Jones, Kenneth L; Williams, Trevor

2018-01-25

The evolution of a hinged moveable jaw with variable morphology is considered a major factor behind the successful expansion of the vertebrates. DLX homeobox transcription factors are crucial for establishing the positional code that patterns the mandible, maxilla and intervening hinge domain, but how the genes encoding these proteins are regulated remains unclear. Herein, we demonstrate that the concerted action of the AP-2α and AP-2β transcription factors within the mouse neural crest is essential for jaw patterning. In the absence of these two proteins, the hinge domain is lost and there are alterations in the size and patterning of the jaws correlating with dysregulation of homeobox gene expression, with reduced levels of Emx, Msx and Dlx paralogs accompanied by an expansion of Six1 expression. Moreover, detailed analysis of morphological features and gene expression changes indicate significant overlap with various compound Dlx gene mutants. Together, these findings reveal that the AP-2 genes have a major function in mammalian neural crest development, influencing patterning of the craniofacial skeleton via the DLX code, an effect that has implications for vertebrate facial evolution, as well as for human craniofacial disorders. © 2018. Published by The Company of Biologists Ltd.
Classification of ductal carcinoma in situ by gene expression profiling.

PubMed

Hannemann, Juliane; Velds, Arno; Halfwerk, Johannes B G; Kreike, Bas; Peterse, Johannes L; van de Vijver, Marc J

2006-01-01

Ductal carcinoma in situ (DCIS) is characterised by the intraductal proliferation of malignant epithelial cells. Several histological classification systems have been developed, but assessing the histological type/grade of DCIS lesions is still challenging, making treatment decisions based on these features difficult. To obtain insight in the molecular basis of the development of different types of DCIS and its progression to invasive breast cancer, we have studied differences in gene expression between different types of DCIS and between DCIS and invasive breast carcinomas. Gene expression profiling using microarray analysis has been performed on 40 in situ and 40 invasive breast cancer cases. DCIS cases were classified as well- (n = 6), intermediately (n = 18), and poorly (n = 14) differentiated type. Of the 40 invasive breast cancer samples, five samples were grade I, 11 samples were grade II, and 24 samples were grade III. Using two-dimensional hierarchical clustering, the basal-like type, ERB-B2 type, and the luminal-type tumours originally described for invasive breast cancer could also be identified in DCIS. Using supervised classification, we identified a gene expression classifier of 35 genes, which differed between DCIS and invasive breast cancer; a classifier of 43 genes could be identified separating between well- and poorly differentiated DCIS samples.
Classification of ductal carcinoma in situ by gene expression profiling

PubMed Central

Hannemann, Juliane; Velds, Arno; Halfwerk, Johannes BG; Kreike, Bas; Peterse, Johannes L; van de Vijver, Marc J

2006-01-01

Introduction Ductal carcinoma in situ (DCIS) is characterised by the intraductal proliferation of malignant epithelial cells. Several histological classification systems have been developed, but assessing the histological type/grade of DCIS lesions is still challenging, making treatment decisions based on these features difficult. To obtain insight in the molecular basis of the development of different types of DCIS and its progression to invasive breast cancer, we have studied differences in gene expression between different types of DCIS and between DCIS and invasive breast carcinomas. Methods Gene expression profiling using microarray analysis has been performed on 40 in situ and 40 invasive breast cancer cases. Results DCIS cases were classified as well- (n = 6), intermediately (n = 18), and poorly (n = 14) differentiated type. Of the 40 invasive breast cancer samples, five samples were grade I, 11 samples were grade II, and 24 samples were grade III. Using two-dimensional hierarchical clustering, the basal-like type, ERB-B2 type, and the luminal-type tumours originally described for invasive breast cancer could also be identified in DCIS. Conclusion Using supervised classification, we identified a gene expression classifier of 35 genes, which differed between DCIS and invasive breast cancer; a classifier of 43 genes could be identified separating between well- and poorly differentiated DCIS samples. PMID:17069663
[Gene promoter methylation in glucose-6-phosphate dehydrogenase deﬁciency].

PubMed

Xu, Dan-Dan; Wen, Fei-Qiu; Lv, Rong-Yu; Zhang, Min; Chen, Yun-Sheng; Chen, Xiao-Wen

2016-05-01

To investigate the features of methylation in the promoter region of glucose-6-phosphate dehydrogenase (G6PD) gene and the association between gene promoter methylation and G6PD deﬁciency. Fluorescent quantitative PCR was used to measure the mRNA expression of G6PD in 130 children with G6PD deficiency. Sixty-five children without G6PD deficiency served as the control group. The methylation-sensitive high-resolution melting curve analysis and bisulfite PCR sequencing were used to analyze gene promoter methylation in 22 children with G6PD deﬁciency and low G6PD mRNA expression. The G6PD gene promoter methylation was analyzed in 44 girls with normal G6PD mRNA expression (7 from G6PD deficiency group and 37 from control group). Twenty-two (16.9%) children with G6PD deficiency had relatively low mRNA expression of G6PD; among whom, 16 boys showed no methylation, and 6 girls showed partial methylation. Among the 44 girls with normal G6PD mRNA expression, 40 showed partial methylation, and 4 showed no methylation (1 case in the G6PD group and 3 cases in the control group). Gene promoter methylation is not associated with G6PD deficiency in boys. Girls have partial methylation or no methylation in the G6PD gene, suggesting that the methylation may be related to G6PD deficiency in girls.
Analysis of Bos taurus and Sus scrofa X and Y chromosome transcriptome highlights reproductive driver genes

PubMed Central

Khan, Faheem Ahmed; Liu, Hui; Zhou, Hao; Wang, Kai; Qamar, Muhammad Tahir Ul; Pandupuspitasari, Nuruliarizki Shinta; Shujun, Zhang

2017-01-01

The biology of sperm, its capability of fertilizing an egg and its role in sex ratio are the major biological questions in reproductive biology. To answer these question we integrated X and Y chromosome transcriptome across different species: Bos taurus and Sus scrofa and identified reproductive driver genes based on Weighted Gene Co-Expression Network Analysis (WGCNA) algorithm. Our strategy resulted in 11007 and 10445 unique genes consisting of 9 and 11 reproductive modules in Bos taurus and Sus scrofa, respectively. The consensus module calculation yields an overall 167 overlapped genes which were mapped to 846 DEGs in Bos taurus to finally get a list of 67 dual feature genes. We develop gene co-expression network of selected 67 genes that consists of 58 nodes (27 down-regulated and 31 up-regulated genes) enriched to 66 GO biological process (BP) including 6 GO annotations related to reproduction and two KEGG pathways. Moreover, we searched significantly related TF (ISRE, AP1FJ, RP58, CREL) and miRNAs (bta-miR-181a, bta-miR-17-5p, bta-miR-146b, bta-miR-146a) which targeted the genes in co-expression network. In addition we performed genetic analysis including phylogenetic, functional domain identification, epigenetic modifications, mutation analysis of the most important reproductive driver genes PRM1, PPP2R2B and PAFAH1B1 and finally performed a protein docking analysis to visualize their therapeutic and gene expression regulation ability. PMID:28903352
Homeobox genes in the rodent pineal gland: roles in development and phenotype maintenance.

PubMed

Rath, Martin F; Rohde, Kristian; Klein, David C; Møller, Morten

2013-06-01

The pineal gland is a neuroendocrine gland responsible for nocturnal synthesis of melatonin. During early development of the rodent pineal gland from the roof of the diencephalon, homeobox genes of the orthodenticle homeobox (Otx)- and paired box (Pax)-families are expressed and are essential for normal pineal development consistent with the well-established role that homeobox genes play in developmental processes. However, the pineal gland appears to be unusual because strong homeobox gene expression persists in the pineal gland of the adult brain. Accordingly, in addition to developmental functions, homeobox genes appear to be key regulators in postnatal phenotype maintenance in this tissue. In this paper, we review ontogenetic and phylogenetic aspects of pineal development and recent progress in understanding the involvement of homebox genes in rodent pineal development and adult function. A working model is proposed for understanding the sequential action of homeobox genes in controlling development and mature circadian function of the mammalian pinealocyte based on knowledge from detailed developmental and daily gene expression analyses in rats, the pineal phenotypes of homebox gene-deficient mice and studies on development of the retinal photoreceptor; the pinealocyte and retinal photoreceptor share features not seen in other tissues and are likely to have evolved from the same ancestral photodetector cell.
Homeobox genes in the rodent pineal gland: roles in development and phenotype maintenance

PubMed Central

Rath, Martin F.; Rohde, Kristian; Klein, David C.; Møller, Morten

2012-01-01

The pineal gland is a neuroendocrine gland responsible for nocturnal synthesis of melatonin. During early development of the rodent pineal gland from the roof of the diencephalon, homeobox genes of the orthodenticle homeobox (Otx)- and paired box (Pax)-families are expressed and are essential for normal pineal development consistent with the well-established role that homeobox genes play in developmental processes. However, the pineal gland appears to be unusual because strong homeobox gene expression persists in the pineal gland of the adult brain. Accordingly, in addition to developmental functions, homeobox genes appear to be key regulators in postnatal phenotype maintenance in this tissue. In this paper, we review ontogenetic and phylogenetic aspects of pineal development and recent progress in understanding the involvement of homebox genes in rodent pineal development and adult function. A working model is proposed for understanding the sequential action of homeobox genes in controlling development and mature circadian function of the mammalian pinealocyte based on knowledge from detailed developmental and daily gene expression analyses in rats, the pineal phenotypes of homebox gene-deficient mice and studies on development of the retinal photoreceptor; the pinealocyte and retinal photoreceptor share features not seen in other tissues and are likely to have evolved from the same ancestral photodetector cell. PMID:23076630
Cooperative activity of GABP with PU.1 or C/EBPε regulates lamin B receptor gene expression, implicating their roles in granulocyte nuclear maturation1

PubMed Central

Malu, Krishnakumar; Garhwal, Rahul; Pelletier, Margery G. H.; Gotur, Deepali; Halene, Stephanie; Zwerger, Monika; Yang, Zhong-Fa; Rosmarin, Alan G.; Gaines, Peter

2016-01-01

Nuclear segmentation is a hallmark feature of mammalian neutrophil differentiation, but the mechanisms that control this process are poorly understood. Gene expression in maturing neutrophils requires combinatorial actions of lineage-restricted and more widely expressed transcriptional regulators. Examples include interactions of the widely expressed ETS transcription factor, GA-binding protein (GABP), with the relatively lineage-restricted ETS factor, PU.1, and with CCAAT enhancer binding proteins, C/EBPα and C/EBPε. Whether such cooperative interactions between these transcription factors also regulate the expression of genes encoding proteins that control nuclear segmentation is unclear. We investigated the roles of ETS and C/EBP family transcription factors in regulating the gene encoding the lamin B receptor (LBR), an inner nuclear membrane protein whose expression is required for neutrophil nuclear segmentation. Although C/EBPε was previously shown to bind the Lbr promoter, surprisingly, we found that neutrophils derived from Cebpe null mice exhibited normal Lbr gene and protein expression. Instead, GABP provided transcriptional activation through the Lbr promoter in the absence of C/EBPε, and activities supported by GABP were greatly enhanced by either C/EBPε or PU.1. Both GABP and PU.1 bound Ets sites in the Lbr promoter in vitro, and in vivo within both early myeloid progenitors and differentiating neutrophils. These findings demonstrate that GABP, PU.1, and C/EBPε cooperate to control transcription of the gene encoding LBR, a nuclear envelope protein that is required for the characteristic lobulated morphology of mature neutrophils. PMID:27342846
Characterizing mutation-expression network relationships in multiple cancers.

PubMed

Ghazanfar, Shila; Yang, Jean Yee Hwa

2016-08-01

Data made available through large cancer consortia like The Cancer Genome Atlas make for a rich source of information to be studied across and between cancers. In recent years, network approaches have been applied to such data in uncovering the complex interrelationships between mutational and expression profiles, but lack direct testing for expression changes via mutation. In this pan-cancer study we analyze mutation and gene expression information in an integrative manner by considering the networks generated by testing for differences in expression in direct association with specific mutations. We relate our findings among the 19 cancers examined to identify commonalities and differences as well as their characteristics. Using somatic mutation and gene expression information across 19 cancers, we generated mutation-expression networks per cancer. On evaluation we found that our generated networks were significantly enriched for known cancer-related genes, such as skin cutaneous melanoma (p<0.01 using Network of Cancer Genes 4.0). Our framework identified that while different cancers contained commonly mutated genes, there was little concordance between associated gene expression changes among cancers. Comparison between cancers showed a greater overlap of network nodes for cancers with higher overall non-silent mutation load, compared to those with a lower overall non-silent mutation load. This study offers a framework that explores network information through co-analysis of somatic mutations and gene expression profiles. Our pan-cancer application of this approach suggests that while mutations are frequently common among cancer types, the impact they have on the surrounding networks via gene expression changes varies. Despite this finding, there are some cancers for which mutation-associated network behaviour appears to be similar: suggesting a potential framework for uncovering related cancers for which similar therapeutic strategies may be applicable. Our framework for understanding relationships among cancers has been integrated into an interactive R Shiny application, PAn Cancer Mutation Expression Networks (PACMEN), containing dynamic and static network visualization of the mutation-expression networks. PACMEN also features tools for further examination of network topology characteristics among cancers. Copyright © 2016 Elsevier Ltd. All rights reserved.

Cell-Specific Actions of a Human LHX3 Gene Enhancer During Pituitary and Spinal Cord Development

PubMed Central

Park, Soyoung; Mullen, Rachel D.

2013-01-01

The LIM class of homeodomain protein 3 (LHX3) transcription factor is essential for pituitary gland and nervous system development in mammals. In humans, mutations in the LHX3 gene underlie complex pediatric syndromes featuring deficits in anterior pituitary hormones and defects in the nervous system. The mechanisms that control temporal and spatial expression of the LHX3 gene are poorly understood. The proximal promoters of the human LHX3 gene are insufficient to guide expression in vivo and downstream elements including a conserved enhancer region appear to play a role in tissue-specific expression in the pituitary and nervous system. Here we characterized the activity of this downstream enhancer region in regulating gene expression at the cellular level during development. Human LHX3 enhancer-driven Cre reporter transgenic mice were generated to facilitate studies of enhancer actions. The downstream LHX3 enhancer primarily guides gene transcription in α-glycoprotein subunit -expressing cells secreting the TSHβ, LHβ, or FSHβ hormones and expressing the GATA2 and steroidogenic factor 1 transcription factors. In the developing nervous system, the enhancer serves as a targeting module active in V2a interneurons. These results demonstrate that the downstream LHX3 enhancer is important in specific endocrine and neural cell types but also indicate that additional regulatory elements are likely involved in LHX3 gene expression. Furthermore, these studies revealed significant gonadotrope cell heterogeneity during pituitary development, providing insights into the cellular physiology of this key reproductive regulatory cell. The human LHX3 enhancer-driven Cre reporter transgenic mice also provide a valuable tool for further developmental studies of cell determination and differentiation in the pituitary and nervous system. PMID:24100213
Curcumin eliminates the inhibitory effect of advanced glycation end-products (AGEs) on gene expression of AGE receptor-1 in hepatic stellate cells in vitro

PubMed Central

Lin, Jianguo; Tang, Youcai; Kang, Qiaohua; Chen, Anping

2012-01-01

Diabetes is featured by hyperglycemia, which facilitates the formation of advanced glycation end-products (AGEs). AGEs are a causal factor in development of diabetic complications. AGE receptor-1 (AGE-R1) is responsible for detoxification and clearance of AGEs. Type 2 diabetes mellitus is commonly accompanied by non-alcoholic steatohepatitis, which could cause hepatic fibrosis. Little attention has been paid to effects of AGEs on hepatic fibrogenesis. Curcumin, a phytochemical from turmeric, has been reported to inhibit the activation of hepatic stellate cells (HSCs), the major effectors during hepatic fibrogenesis, and to protect against hepatic fibrogenesis in vitro and in vivo. The current study was designed to evaluate effects of AGEs on inducing HSC activation, to assess the role of curcumin in diminishing the AGE effects and to explore the underlying mechanisms. Our results showed that AGEs stimulated HSC activation by inducing cell proliferation and expression of genes relevant to HSC activation, which were abrogated by curcumin. Curcumin induced gene expression of AGE-R1 in passaged HSCs, which might facilitate the attenuation of the stimulatory effects of AGEs on the activation of HSCs. Further experiments revealed that curcumin inhibited the activity of extracellular signal-regulated kinase (ERK) and induced gene expression and the activity of peroxisome proliferator-activated receptor-gamma (PPARγ), leading to the induction of AGE-R1 gene expression. In summary, AGEs stimulated HSC activation. Curcumin eliminated the AGE effects at least partially by inducing AGE-R1 gene expression. The process was mediated by inhibiting ERK activity, inducing gene expression of PPARγ and stimulating its trans-activity. PMID:22449800
A mixture model-based approach to the clustering of microarray expression data.

PubMed

McLachlan, G J; Bean, R W; Peel, D

2002-03-01

This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets. EMMIX-GENE is available at http://www.maths.uq.edu.au/~gjm/emmix-gene/
DEEP--a tool for differential expression effector prediction.

PubMed

Degenhardt, Jost; Haubrock, Martin; Dönitz, Jürgen; Wingender, Edgar; Crass, Torsten

2007-07-01

High-throughput methods for measuring transcript abundance, like SAGE or microarrays, are widely used for determining differences in gene expression between different tissue types, dignities (normal/malignant) or time points. Further analysis of such data frequently aims at the identification of gene interaction networks that form the causal basis for the observed properties of the systems under examination. To this end, it is usually not sufficient to rely on the measured gene expression levels alone; rather, additional biological knowledge has to be taken into account in order to generate useful hypotheses about the molecular mechanism leading to the realization of a certain phenotype. We present a method that combines gene expression data with biological expert knowledge on molecular interaction networks, as described by the TRANSPATH database on signal transduction, to predict additional--and not necessarily differentially expressed--genes or gene products which might participate in processes specific for either of the examined tissues or conditions. In a first step, significance values for over-expression in tissue/condition A or B are assigned to all genes in the expression data set. Genes with a significance value exceeding a certain threshold are used as starting points for the reconstruction of a graph with signaling components as nodes and signaling events as edges. In a subsequent graph traversal process, again starting from the previously identified differentially expressed genes, all encountered nodes 'inherit' all their starting nodes' significance values. In a final step, the graph is visualized, the nodes being colored according to a weighted average of their inherited significance values. Each node's, or sub-network's, predominant color, ranging from green (significant for tissue/condition A) over yellow (not significant for either tissue/condition) to red (significant for tissue/condition B), thus gives an immediate visual clue on which molecules--differentially expressed or not--may play pivotal roles in the tissues or conditions under examination. The described method has been implemented in Java as a client/server application and a web interface called DEEP (Differential Expression Effector Prediction). The client, which features an easy-to-use graphical interface, can freely be downloaded from the following URL: http://deep.bioinf.med.uni-goettingen.de.
Truncating mutations of MAGEL2 cause Prader-Willi phenotypes and autism.

PubMed

Schaaf, Christian P; Gonzalez-Garay, Manuel L; Xia, Fan; Potocki, Lorraine; Gripp, Karen W; Zhang, Baili; Peters, Brock A; McElwain, Mark A; Drmanac, Radoje; Beaudet, Arthur L; Caskey, C Thomas; Yang, Yaping

2013-11-01

Prader-Willi syndrome (PWS) is caused by the absence of paternally expressed, maternally silenced genes at 15q11-q13. We report four individuals with truncating mutations on the paternal allele of MAGEL2, a gene within the PWS domain. The first subject was ascertained by whole-genome sequencing analysis for PWS features. Three additional subjects were identified by reviewing the results of exome sequencing of 1,248 cases in a clinical laboratory. All four subjects had autism spectrum disorder (ASD), intellectual disability and a varying degree of clinical and behavioral features of PWS. These findings suggest that MAGEL2 is a new gene causing complex ASD and that MAGEL2 loss of function can contribute to several aspects of the PWS phenotype.
Patterning of anteroposterior body axis displayed in the expression of Hox genes in sea cucumber Apostichopus japonicus.

PubMed

Kikuchi, Mani; Omori, Akihito; Kurokawa, Daisuke; Akasaka, Koji

2015-09-01

The presence of an anteroposterior body axis is a fundamental feature of bilateria. Within this group, echinoderms have secondarily evolved pentameral symmetric body plans. Although all echinoderms present bilaterally symmetric larval stages, they dramatically rearrange their body axis and develop a pentaradial body plan during metamorphosis. Therefore, the location of their anteroposterior body axis in adult forms remains a contentious issue. Unlike other echinoderms, sea cucumbers present an obvious anteroposterior axis not rearranged during metamorphosis, thus representing an interesting group to study their anteroposterior axis patterning. Hox genes are known to play a broadly conserved role in anteroposterior axis patterning in deuterostomes. Here, we report the expression patterns of Hox genes from early development to pentactula stage in sea cucumber. In early larval stages, five Hox genes (AjHox1, AjHox7, AjHox8, AjHox11/13a, and AjHox11/13b) were expressed sequentially along the archenteron, suggesting that the role of anteroposterior patterning of the Hox genes is conserved in bilateral larvae of echinoderms. In doliolaria and pentactula stages, eight Hox genes (AjHox1, AjHox5, AjHox7, AjHox8, AjHox9/10, AjHox11/13a, AjHox11/13b, and AjHox11/13c) were expressed sequentially along the digestive tract, following a similar expression pattern to that found in the visceral mesoderm of other bilateria. Unlike other echinoderms, pentameral expression patterns of AjHox genes were not observed in sea cucumber. Altogether, we concluded that AjHox genes are involved in the patterning of the digestive tract in both larvae and metamorphosis of sea cucumbers. In addition, the anteroposterior axis in sea cucumbers might be patterned like that of other bilateria.
We can't all be supermodels: the value of comparative transcriptomics to the study of non-model insects

PubMed Central

Oppenheim, Sara J; Baker, Richard H; Simon, Sabrina; DeSalle, Rob

2015-01-01

Insects are the most diverse group of organisms on the planet. Variation in gene expression lies at the heart of this biodiversity and recent advances in sequencing technology have spawned a revolution in researchers' ability to survey tissue-specific transcriptional complexity across a wide range of insect taxa. Increasingly, studies are using a comparative approach (across species, sexes and life stages) that examines the transcriptional basis of phenotypic diversity within an evolutionary context. In the present review, we summarize much of this research, focusing in particular on three critical aspects of insect biology: morphological development and plasticity; physiological response to the environment; and sexual dimorphism. A common feature that is emerging from these investigations concerns the dynamic nature of transcriptome evolution as indicated by rapid changes in the overall pattern of gene expression, the differential expression of numerous genes with unknown function, and the incorporation of novel, lineage-specific genes into the transcriptional profile. PMID:25524309
Effects of inorganic nitrogen sources on the production of PP-V [(10Z)-12-carboxyl-monascorubramine] and the Expression of the nitrate assimilation gene cluster by Penicillium sp. AZ.

PubMed

Arai, Teppei; Umemura, Sara; Ota, Tamaki; Ogihara, Jun; Kato, Jun; Kasumi, Takafumi

2012-01-01

A fungal strain, Penicillium sp. AZ, produced the azaphilone Monascus pigment homolog when cultured in a medium composed of soluble starch, ammonium nitrate, yeast extract, and citrate buffer, pH 5.0. One of the typical features of violet pigment PP-V [(10Z)-12-carboxyl-monascorubramine] is that pyranoid oxygen is replaced with nitrogen. In this study, we found that ammonia and nitrate nitrogen are available for PP-V biosynthesis, and that ammonia nitrogen was much more effective than nitrate nitrogen. Further, we isolated nitrate assimilation gene cluster, niaD, niiA, and crnA, and analyzed the expression of these genes. The expression levels of all these genes increased with sodium nitrate addition to the culture medium. The results obtained here strongly suggest that Penicillium sp. AZ produced PP-V using nitrate in the form of ammonium reduced from nitrate through a bioprocess assimilatory reaction.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Gompers, Andrea L.; Su-Feher, Linda; Ellegood, Jacob

The chromatin remodeling gene CHD8 represents a central node in neurodevelopmental gene networks implicated in autism. In this paper, we examined the impact of germline heterozygous frameshift Chd8 mutation on neurodevelopment in mice. Chd8 +/ del5 mice displayed normal social interactions with no repetitive behaviors but exhibited cognitive impairment correlated with increased regional brain volume, validating that phenotypes of Chd8 +/ del5 mice overlap pathology reported in humans with CHD8 mutations. We applied network analysis to characterize neurodevelopmental gene expression, revealing widespread transcriptional changes in Chd8 +/ del5 mice across pathways disrupted in neurodevelopmental disorders, including neurogenesis, synaptic processes andmore » neuroimmune signaling. We identified a co-expression module with peak expression in early brain development featuring dysregulation of RNA processing, chromatin remodeling and cell-cycle genes enriched for promoter binding by Chd8, and we validated increased neuronal proliferation and developmental splicing perturbation in Chd8 +/ del5 mice. Finally, this integrative analysis offers an initial picture of the consequences of Chd8 haploinsufficiency for brain development.« less
Gene Transfer into Rat Brain Using Adenoviral Vectors

PubMed Central

Puntel, Mariana; Kroeger, Kurt M.; Sanderson, Nicholas S.R.; Thomas, Clare E.; Castro, Maria G.; Lowenstein, Pedro R.

2010-01-01

Viral vector–mediated gene delivery is an attractive procedure for introducing genes into the brain, both for purposes of basic neuroscience research and to develop gene therapy for neurological diseases. Replication-defective adenoviruses possess many features which make them ideal vectors for this purpose—efficiently transducing terminally differentiated cells such as neurons and glial cells, resulting in high levels of transgene expression in vivo. Also, in the absence of anti-adenovirus immunity, these vectors can sustain very long-term transgene expression within the brain parenchyma. This unit provides protocols for the stereotactic injection of adenoviral vectors into the brain, followed by protocols to detect transgene expression or infiltrates of immune cells by immunocytochemistry or immunofluorescence. ELISPOT and neutralizing antibody assay methodologies are provided to quantitate the levels of cellular and humoral immune responses against adenoviruses. Quantitation of adenoviral vector genomes within the rat brain using qPCR is also described. Curr. Protoc. Neurosci. 50:4.24.1–4.24.49. © 2010 by John Wiley & Sons, Inc. PMID:20066657
A novel approach for dimension reduction of microarray.

PubMed

Aziz, Rabia; Verma, C K; Srivastava, Namita

2017-12-01

This paper proposes a new hybrid search technique for feature (gene) selection (FS) using Independent component analysis (ICA) and Artificial Bee Colony (ABC) called ICA+ABC, to select informative genes based on a Naïve Bayes (NB) algorithm. An important trait of this technique is the optimization of ICA feature vector using ABC. ICA+ABC is a hybrid search algorithm that combines the benefits of extraction approach, to reduce the size of data and wrapper approach, to optimize the reduced feature vectors. This hybrid search technique is facilitated by evaluating the performance of ICA+ABC on six standard gene expression datasets of classification. Extensive experiments were conducted to compare the performance of ICA+ABC with the results obtained from recently published Minimum Redundancy Maximum Relevance (mRMR) +ABC algorithm for NB classifier. Also to check the performance that how ICA+ABC works as feature selection with NB classifier, compared the combination of ICA with popular filter techniques and with other similar bio inspired algorithm such as Genetic Algorithm (GA) and Particle Swarm Optimization (PSO). The result shows that ICA+ABC has a significant ability to generate small subsets of genes from the ICA feature vector, that significantly improve the classification accuracy of NB classifier compared to other previously suggested methods. Copyright © 2017 Elsevier Ltd. All rights reserved.
Adaptive expansion of the maize maternally expressed gene (Meg) family involves changes in expression patterns and protein secondary structures of its members

PubMed Central

2014-01-01

Background The Maternally expressed gene (Meg) family is a locally-duplicated gene family of maize which encodes cysteine-rich proteins (CRPs). The founding member of the family, Meg1, is required for normal development of the basal endosperm transfer cell layer (BETL) and is involved in the allocation of maternal nutrients to growing seeds. Despite the important roles of Meg1 in maize seed development, the evolutionary history of the Meg cluster and the activities of the duplicate genes are not understood. Results In maize, the Meg gene cluster resides in a 2.3 Mb-long genomic region that exhibits many features of non-centromeric heterochromatin. Using phylogenetic reconstruction and syntenic alignments, we identified the pedigree of the Meg family, in which 11 of its 13 members arose in maize after allotetraploidization ~4.8 mya. Phylogenetic and population-genetic analyses identified possible signatures suggesting recent positive selection in Meg homologs. Structural analyses of the Meg proteins indicated potentially adaptive changes in secondary structure from α-helix to β-strand during the expansion. Transcriptomic analysis of the maize endosperm indicated that 6 Meg genes are selectively activated in the BETL, and younger Meg genes are more active than older ones. In endosperms from B73 by Mo17 reciprocal crosses, most Meg genes did not display parent-specific expression patterns. Conclusions Recently-duplicated Meg genes have different protein secondary structures, and their expressions in the BETL dominate over those of older members. Together with the signs of positive selections in the young Meg genes, these results suggest that the expansion of the Meg family involves potentially adaptive transitions in which new members with novel functions prevailed over older members. PMID:25084677
Transcriptome analysis of the Populus trichocarpa-Rhizophagus irregularis Mycorrhizal Symbiosis: Regulation of Plant and Fungal Transportomes under Nitrogen Starvation.

PubMed

Calabrese, Silvia; Kohler, Annegret; Niehl, Annette; Veneault-Fourrey, Claire; Boller, Thomas; Courty, Pierre-Emmanuel

2017-06-01

Nutrient transfer is a key feature of the arbuscular mycorrhizal (AM) symbiosis. Valuable mineral nutrients are transferred from the AM fungus to the plant, increasing its fitness and productivity, and, in exchange, the AM fungus receives carbohydrates as an energy source from the plant. Here, we analyzed the transcriptome of the Populus trichocarpa-Rhizophagus irregularis symbiosis using RNA-sequencing of non-mycorrhizal or mycorrhizal fine roots, with a focus on the effect of nitrogen (N) starvation. In R. irregularis, we identified 1,015 differentially expressed genes, whereby N starvation led to a general induction of gene expression. Genes of the functional classes of cell growth, membrane biogenesis and cell structural components were highly abundant. Interestingly, N starvation also led to a general induction of fungal transporters, indicating increased nutrient demand upon N starvation. In non-mycorrhizal P. trichocarpa roots, 1,341 genes were differentially expressed under N starvation. Among the 953 down-regulated genes in N starvation, most were involved in metabolic processes including amino acids, carbohydrate and inorganic ion transport, while the 342 up-regulated genes included many defense-related genes. Mycorrhization led to the up-regulation of 549 genes mainly involved in secondary metabolite biosynthesis and transport; only 24 genes were down-regulated. Mycorrhization specifically induced expression of three ammonium transporters and one phosphate transporter, independently of the N conditions, corroborating the hypothesis that these transporters are important for symbiotic nutrient exchange. In conclusion, our data establish a framework of gene expression in the two symbiotic partners under high-N and low-N conditions. © The Author 2017. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Constraints on the evolution of a doublesex target gene arising from doublesex’s pleiotropic deployment

PubMed Central

Luo, Shengzhan D.; Baker, Bruce S.

2015-01-01

“Regulatory evolution,” that is, changes in a gene’s expression pattern through changes at its regulatory sequence, rather than changes at the coding sequence of the gene or changes of the upstream transcription factors, has been increasingly recognized as a pervasive evolution mechanism. Many somatic sexually dimorphic features of Drosophila melanogaster are the results of gene expression regulated by the doublesex (dsx) gene, which encodes sex-specific transcription factors (DSXF in females and DSXM in males). Rapid changes in such sexually dimorphic features are likely a result of changes at the regulatory sequence of the target genes. We focused on the Flavin-containing monooxygenase-2 (Fmo-2) gene, a likely direct dsx target, to elucidate how sexually dimorphic expression and its evolution are brought about. We found that dsx is deployed to regulate the Fmo-2 transcription both in the midgut and in fat body cells of the spermatheca (a female-specific tissue), through a canonical DSX-binding site in the Fmo-2 regulatory sequence. In the melanogaster group, Fmo-2 transcription in the midgut has evolved rapidly, in contrast to the conserved spermathecal transcription. We identified two cis-regulatory modules (CRM-p and CRM-d) that direct sexually monomorphic or dimorphic Fmo-2 transcription, respectively, in the midguts of these species. Changes of Fmo-2 transcription in the midgut from sexually dimorphic to sexually monomorphic in some species are caused by the loss of CRM-d function, but not the loss of the canonical DSX-binding site. Thus, conferring transcriptional regulation on a CRM level allows the regulation to evolve rapidly in one tissue while evading evolutionary constraints posed by other tissues. PMID:25675536
Improving Classification of Cancer and Mining Biomarkers from Gene Expression Profiles Using Hybrid Optimization Algorithms and Fuzzy Support Vector Machine

PubMed Central

Moteghaed, Niloofar Yousefi; Maghooli, Keivan; Garshasbi, Masoud

2018-01-01

Background: Gene expression data are characteristically high dimensional with a small sample size in contrast to the feature size and variability inherent in biological processes that contribute to difficulties in analysis. Selection of highly discriminative features decreases the computational cost and complexity of the classifier and improves its reliability for prediction of a new class of samples. Methods: The present study used hybrid particle swarm optimization and genetic algorithms for gene selection and a fuzzy support vector machine (SVM) as the classifier. Fuzzy logic is used to infer the importance of each sample in the training phase and decrease the outlier sensitivity of the system to increase the ability to generalize the classifier. A decision-tree algorithm was applied to the most frequent genes to develop a set of rules for each type of cancer. This improved the abilities of the algorithm by finding the best parameters for the classifier during the training phase without the need for trial-and-error by the user. The proposed approach was tested on four benchmark gene expression profiles. Results: Good results have been demonstrated for the proposed algorithm. The classification accuracy for leukemia data is 100%, for colon cancer is 96.67% and for breast cancer is 98%. The results show that the best kernel used in training the SVM classifier is the radial basis function. Conclusions: The experimental results show that the proposed algorithm can decrease the dimensionality of the dataset, determine the most informative gene subset, and improve classification accuracy using the optimal parameters of the classifier with no user interface. PMID:29535919
TRAM (Transcriptome Mapper): database-driven creation and analysis of transcriptome maps from multiple sources

PubMed Central

2011-01-01

Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005
Heterozygous Deletion of FOXA2 Segregates with Disease in a Family with Heterotaxy, Panhypopituitarism, and Biliary Atresia

PubMed Central

Tsai, Ellen A.; Grochowski, Christopher M.; Falsey, Alexandra M.; Rajagopalan, Ramakrishnan; Wendel, Danielle; Devoto, Marcella; Krantz, Ian D.; Loomes, Kathleen M.; Spinner, Nancy B.

2015-01-01

Biliary atresia (BA) is a pediatric cholangiopathy with unknown etiology occurring in isolated and syndromic forms. Laterality defects affecting the cardiovascular and gastrointestinal systems are the most common features present in syndromic BA. Most cases are sporadic, although reports of familial cases have led to the hypothesis of genetic susceptibility in some patients. We identified a child with BA, malrotation, and interrupted inferior vena cava whose father presented with situs inversus, polysplenia, panhypopituitarism, and mildly dysmorphic facial features. Chromosomal microarray analysis demonstrated a 277kb heterozygous deletion on chromosome 20 which included a single gene, FOXA2, in the proband and her father. This deletion was confirmed to be de novo in the father. The proband and her father share a common diagnosis of heterotaxy, but they also each presented with a variety of other issues. Further genetic screening revealed that the proband carried an additional protein-altering polymorphism (rs1904589; p.His165Arg) in the NODAL gene that is not present in the father, and this variant has been shown to decrease expression of the gene. As FOXA2 can be a regulator of NODAL expression, we propose that haploinsufficiency for FOXA2 combined with a decreased expression of NODAL is the likely cause for syndromic BA in this proband. PMID:25765999
TRPM7 maintains progenitor-like features of neuroblastoma cells: implications for metastasis formation

PubMed Central

Middelbeek, Jeroen; Kamermans, Alwin; Kuipers, Arthur J.; Hoogerbrugge, Peter M.; Jalink, Kees; van Leeuwen, Frank N.

2015-01-01

Neuroblastoma is an embryonal tumor derived from poorly differentiated neural crest cells. Current research is aimed at identifying the molecular mechanisms that maintain the progenitor state of neuroblastoma cells and to develop novel therapeutic strategies that induce neuroblastoma cell differentiation. Mechanisms controlling neural crest development are typically dysregulated during neuroblastoma progression, and provide an appealing starting point for drug target discovery. Transcriptional programs involved in neural crest development act as a context dependent gene regulatory network. In addition to BMP, Wnt and Notch signaling, activation of developmental gene expression programs depends on the physical characteristics of the tissue microenvironment. TRPM7, a mechanically regulated TRP channel with kinase activity, was previously found essential for embryogenesis and the maintenance of undifferentiated neural crest progenitors. Hence, we hypothesized that TRPM7 may preserve progenitor-like, metastatic features of neuroblastoma cells. Using multiple neuroblastoma cell models, we demonstrate that TRPM7 expression closely associates with the migratory and metastatic properties of neuroblastoma cells in vitro and in vivo. Moreover, microarray-based expression profiling on control and TRPM7 shRNA transduced neuroblastoma cells indicates that TRPM7 controls a developmental transcriptional program involving the transcription factor SNAI2. Overall, our data indicate that TRPM7 contributes to neuroblastoma progression by maintaining progenitor-like features. PMID:25797249
Codon influence on protein expression in E. coli correlates with mRNA levels

PubMed Central

Boël, Grégory; Wong, Kam-Ho; Su, Min; Luff, Jon; Valecha, Mayank; Everett, John K.; Acton, Thomas B.; Xiao, Rong; Montelione, Gaetano T.; Aalberts, Daniel P.; Hunt, John F.

2016-01-01

Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyze the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli. PMID:26760206
Differential chemokine, chemokine receptor and cytokine expression in Epstein-Barr virus-associated lymphoproliferative diseases.

PubMed

Ohshima, Koichi; Karube, Kennosuke; Hamasaki, Makoto; Tutiya, Takeshi; Yamaguchi, Takahiro; Suefuji, Hiroaki; Suzuki, Keiko; Suzumiya, Junji; Ohga, Shouichi; Kikuchi, Masahiro

2003-08-01

T cell immunity plays an important role in the clinicopathology of Epstein-Barr virus (EBV)-associated diseases. Acute EBV-induced infectious mononucleosis (IM) is a common self-limiting disease, however, other EBV-associated diseases, including chronic active EBV infection (CAEBV), NK cell lymphoma (NKL), and Hodgkin's lymphoma (HL), exhibit distinct clinical features. Chemokines are members of a family of small-secreted proteins. The relationships between chemokines and the chemokine receptor (R) are thought to be important for selectivity of local immunity. Some chemokines, chemokine R and cytokines closely associate with the T cell subtypes, Th1 and Th2 T cells and cytotoxic cells. To clarify the role of T cell immunity in EBV-associated diseases, we conducted gene expression profiling, using chemokine, chemokine R and cytokine DNA chips. Compared to EBV negative non-specific lymphadenitis, CAEBV and NKL exhibited diffuse down- and up-regulation, respectively, of these gene profiles. IM had a predominantly Th1-type profile, whereas HL had a mixed Th1/Th2-type profile. Reduction of the Th1-type cytokine interferon gamma (IFN-gamma) in CAEBV was confirmed by Reverse transcriptase-polymerase chain reaction, whereas IFN-gamma expression was markedly enhanced in NKL, and moderately enhanced in IM. Compared to IM, CAEBV showed slight elevation of "regulated upon activation, normal T expressed and secreted" (RANTES), but almost all other genes assayed were down-regulated. NKL exhibited elevated expression of numerous genes, particularly IFN-gamma-inducible-10 (IP-10) and monokine induced by IFN-gamma (MIG). HL showed variable elevated and reduced expression of various genes, with increased expression of IL-13 receptor and MIG. Our study demonstrated the enormous potential of gene expression profiling for clarifying the pathogenesis of EBV-associated diseases.

RXRα and LXR activate two promoters in placenta- and tumor-specific expression of PLAC1

PubMed Central

Chen, Yaohui; Moradin, Adi; Schlessinger, David; Nagaraja, Ramaiah

2011-01-01

PLAC1 expression, first characterized as restricted to developing placenta among normal tissues, is also found in a wide range of tumors and transformed cell lines. To understand the basis for its unusual expression profile, we have analyzed the gene structure and its mode of transcription. We find that the gene has a hitherto unique feature, with two promoters, P1 and P2, separated by 105 kb. P2 has been described before. Here we define P1 and show that it and P2 are activated by RXRα in conjunction with LXRα or LXRβ. In placenta, P2 is the preferred promoter, whereas various tumor cell lines tend to express predominantly either one or the other promoter. Furthermore, when each promoter is fused to a luciferase reporter gene and transfected into cancer cell lines, the promoter corresponding to the more active endogenous promoter is preferentially transcribed. Joint expression of activating nuclear receptors can partially account for the restricted expression of PLAC1 in placenta, and may be co-opted for preferential P1 or P2 PLAC1 expression in various tumor cells. PMID:21937108
Bayesian feature selection for high-dimensional linear regression via the Ising approximation with applications to genomics.

PubMed

Fisher, Charles K; Mehta, Pankaj

2015-06-01

Feature selection, identifying a subset of variables that are relevant for predicting a response, is an important and challenging component of many methods in statistics and machine learning. Feature selection is especially difficult and computationally intensive when the number of variables approaches or exceeds the number of samples, as is often the case for many genomic datasets. Here, we introduce a new approach--the Bayesian Ising Approximation (BIA)-to rapidly calculate posterior probabilities for feature relevance in L2 penalized linear regression. In the regime where the regression problem is strongly regularized by the prior, we show that computing the marginal posterior probabilities for features is equivalent to computing the magnetizations of an Ising model with weak couplings. Using a mean field approximation, we show it is possible to rapidly compute the feature selection path described by the posterior probabilities as a function of the L2 penalty. We present simulations and analytical results illustrating the accuracy of the BIA on some simple regression problems. Finally, we demonstrate the applicability of the BIA to high-dimensional regression by analyzing a gene expression dataset with nearly 30 000 features. These results also highlight the impact of correlations between features on Bayesian feature selection. An implementation of the BIA in C++, along with data for reproducing our gene expression analyses, are freely available at http://physics.bu.edu/∼pankajm/BIACode. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Rapid endosomal escape of prickly nanodiamonds: implications for gene delivery

PubMed Central

Chu, Zhiqin; Miu, Kaikei; Lung, Pingsai; Zhang, Silu; Zhao, Saisai; Chang, Huan-Cheng; Lin, Ge; Li, Quan

2015-01-01

The prickly nanodiamonds easily entered cells via endocytosis followed by unique intracellular translocation characteristics—quick endosomal escape followed by stable residence in cytoplasm. Endosomal membrane rupturing is identified as the major route of nanodiamonds’ escaping the vesicle confinement and to the cytoplasm. Little cytotoxicity is observed to associate with the nanodiamonds’ cytosolic release. Such features enable its application for gene delivery, which requires both effective cellular uptake and cytosolic release of the gene. Taking green fluorescent protein gene as an example, we demonstrate the successful cytosolic delivery and expression of such a gene using the prickly nanodiamonds as carrier. PMID:26123532
Rapid endosomal escape of prickly nanodiamonds: implications for gene delivery

NASA Astrophysics Data System (ADS)

Chu, Zhiqin; Miu, Kaikei; Lung, Pingsai; Zhang, Silu; Zhao, Saisai; Chang, Huan-Cheng; Lin, Ge; Li, Quan

2015-06-01

The prickly nanodiamonds easily entered cells via endocytosis followed by unique intracellular translocation characteristics—quick endosomal escape followed by stable residence in cytoplasm. Endosomal membrane rupturing is identified as the major route of nanodiamonds’ escaping the vesicle confinement and to the cytoplasm. Little cytotoxicity is observed to associate with the nanodiamonds’ cytosolic release. Such features enable its application for gene delivery, which requires both effective cellular uptake and cytosolic release of the gene. Taking green fluorescent protein gene as an example, we demonstrate the successful cytosolic delivery and expression of such a gene using the prickly nanodiamonds as carrier.
Indications for distinct pathogenic mechanisms of asbestos and silica through gene expression profiling of the response of lung epithelial cells

PubMed Central

Perkins, Timothy N.; Peeters, Paul M.; Shukla, Arti; Arijs, Ingrid; Dragon, Julie; Wouters, Emiel F.M.; Reynaert, Niki L.; Mossman, Brooke T.

2015-01-01

Occupational and environmental exposures to airborne asbestos and silica are associated with the development of lung fibrosis in the forms of asbestosis and silicosis, respectively. However, both diseases display distinct pathologic presentations, likely associated with differences in gene expression induced by different mineral structures, composition and bio-persistent properties. We hypothesized that effects of mineral exposure in the airway epithelium may dictate deviating molecular events that may explain the different pathologies of asbestosis versus silicosis. Using robust gene expression-profiling in conjunction with in-depth pathway analysis, we assessed early (24 h) alterations in gene expression associated with crocidolite asbestos or cristobalite silica exposures in primary human bronchial epithelial cells (NHBEs). Observations were confirmed in an immortalized line (BEAS-2B) by QRT-PCR and protein assays. Utilization of overall gene expression, unsupervised hierarchical cluster analysis and integrated pathway analysis revealed gene alterations that were common to both minerals or unique to either mineral. Our findings reveal that both minerals had potent effects on genes governing cell adhesion/migration, inflammation, and cellular stress, key features of fibrosis. Asbestos exposure was most specifically associated with aberrant cell proliferation and carcinogenesis, whereas silica exposure was highly associated with additional inflammatory responses, as well as pattern recognition, and fibrogenesis. These findings illustrate the use of gene-profiling as a means to determine early molecular events that may dictate pathological processes induced by exogenous cellular insults. In addition, it is a useful approach for predicting the pathogenicity of potentially harmful materials. PMID:25351596
Comparative analyses identify molecular signature of MRI-classified SVZ-associated glioblastoma

PubMed Central

Lin, Chin-Hsing Annie; Rhodes, Christopher T.; Lin, ChenWei; Phillips, Joanna J.; Berger, Mitchel S.

2017-01-01

ABSTRACT Glioblastoma (GBM) is a highly aggressive brain cancer with limited therapeutic options. While efforts to identify genes responsible for GBM have revealed mutations and aberrant gene expression associated with distinct types of GBM, patients with GBM are often diagnosed and classified based on MRI features. Therefore, we seek to identify molecular representatives in parallel with MRI classification for group I and group II primary GBM associated with the subventricular zone (SVZ). As group I and II GBM contain stem-like signature, we compared gene expression profiles between these 2 groups of primary GBM and endogenous neural stem progenitor cells to reveal dysregulation of cell cycle, chromatin status, cellular morphogenesis, and signaling pathways in these 2 types of MRI-classified GBM. In the absence of IDH mutation, several genes associated with metabolism are differentially expressed in these subtypes of primary GBM, implicating metabolic reprogramming occurs in tumor microenvironment. Furthermore, histone lysine methyltransferase EZH2 was upregulated while histone lysine demethylases KDM2 and KDM4 were downregulated in both group I and II primary GBM. Lastly, we identified 9 common genes across large data sets of gene expression profiles among MRI-classified group I/II GBM, a large cohort of GBM subtypes from TCGA, and glioma stem cells by unsupervised clustering comparison. These commonly upregulated genes have known functions in cell cycle, centromere assembly, chromosome segregation, and mitotic progression. Our findings highlight altered expression of genes important in chromosome integrity across all GBM, suggesting a common mechanism of disrupted fidelity of chromosome structure in GBM. PMID:28278055
Dynamic chromatin changes associated with de novo centromere formation in maize euchromatin.

PubMed

Su, Handong; Liu, Yalin; Liu, Yong-Xin; Lv, Zhenling; Li, Hongyao; Xie, Shaojun; Gao, Zhi; Pang, Junling; Wang, Xiu-Jie; Lai, Jinsheng; Birchler, James A; Han, Fangpu

2016-12-01

The inheritance and function of centromeres are not strictly dependent on any specific DNA sequence, but involve an epigenetic component in most species. CENH3, a centromere histone H3 variant, is one of the best-described epigenetic factors in centromere identity, but the chromatin features required during centromere formation have not yet been revealed. We previously identified two de novo centromeres on Zea mays (maize) minichromosomes derived from euchromatic sites with high-density gene distributions but low-density transposon distributions. The distribution of gene location and gene expression in these sites indicates that transcriptionally active regions can initiate de novo centromere formation, and CENH3 seeding shows a preference for gene-free regions or regions with no gene expression. The locations of the expressed genes detected were at relatively hypomethylated loci, and the altered gene expression resulted from de novo centromere formation, but not from the additional copy of the minichromosome. The initial overall DNA methylation level of the two de novo regions was at a low level, but increased substantially to that of native centromeres after centromere formation. These results illustrate the dynamic chromatin changes during euchromatin-originated de novo centromere formation, which provides insight into the mechanism of de novo centromere formation and regulation of subsequent consequences. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Gallus gallus orthologous to human alpha-dystroglycanopathies candidate genes: Gene expression and characterization during chicken embryogenesis.

PubMed

Izquierdo-Lahuerta, Adriana; de Luis, Oscar; Gómez-Esquer, Francisco; Cruces, Jesús; Coloma, Antonio

2016-09-23

Alpha-dystroglycanopathies are a heterogenic group of human rare diseases that have in common defects of α-dystroglycan O-glycosylation. These congenital disorders share common features as muscular dystrophy, malformations on central nervous system and more rarely altered ocular development, as well as mutations on a set of candidate genes involved on those syndromes. Severity of the syndromes is variable, appearing Walker-Warburg as the most severe where mutations at protein O-mannosyl transferases POMT1 and POMT2 genes are frequently described. When studying the lack of MmPomt1 in mouse embryonic development, as a murine model of Walker-Warburg syndrome, MmPomt1 null phenotype was lethal because Reitchert's membrane fails during embryonic development. Here, we report gene expression from Gallus gallus orthologous genes to human candidates on alpha-dystroglycanopathies POMT1, POMT2, POMGnT1, FKTN, FKRP and LARGE, making special emphasis in expression and localization of GgPomt1. Results obtained by quantitative RT-PCR, western-blot and immunochemistry revealed close gene expression patterns among human and chicken at key tissues affected during development when suffering an alpha-dystroglycanopathy, leading us to stand chicken as a useful animal model for molecular characterization of glycosyltransferases involved in the O-glycosylation of α-Dystroglycan and its role in embryonic development. Copyright © 2016 Elsevier Inc. All rights reserved.
Evolutionary and Expression Analyses of the Apple Basic Leucine Zipper Transcription Factor Family

PubMed Central

Zhao, Jiao; Guo, Rongrong; Guo, Chunlei; Hou, Hongmin; Wang, Xiping; Gao, Hua

2016-01-01

Transcription factors (TFs) play essential roles in the regulatory networks controlling many developmental processes in plants. Members of the basic leucine (Leu) zipper (bZIP) TF family, which is unique to eukaryotes, are involved in regulating diverse processes, including flower and vascular development, seed maturation, stress signaling, and defense responses to pathogens. The bZIP proteins have a characteristic bZIP domain composed of a DNA-binding basic region and a Leu zipper dimerization region. In this study, we identified 112 apple (Malus domestica Borkh) bZIP TF-encoding genes, termed MdbZIP genes. Synteny analysis indicated that segmental and tandem duplication events, as well as whole genome duplication, have contributed to the expansion of the apple bZIP family. The family could be divided into 11 groups based on structural features of the encoded proteins, as well as on the phylogenetic relationship of the apple bZIP proteins to those of the model plant Arabidopsis thaliana (AtbZIP genes). Synteny analysis revealed that several paired MdbZIP genes and AtbZIP gene homologs were located in syntenic genomic regions. Furthermore, expression analyses of group A MdbZIP genes showed distinct expression levels in 10 different organs. Moreover, changes in these expression profiles in response to abiotic stress conditions and various hormone treatments identified MdbZIP genes that were responsive to high salinity and drought, as well as to different phytohormones. PMID:27066030
Evolutionary and Expression Analyses of the Apple Basic Leucine Zipper Transcription Factor Family.

PubMed

Zhao, Jiao; Guo, Rongrong; Guo, Chunlei; Hou, Hongmin; Wang, Xiping; Gao, Hua

2016-01-01

Transcription factors (TFs) play essential roles in the regulatory networks controlling many developmental processes in plants. Members of the basic leucine (Leu) zipper (bZIP) TF family, which is unique to eukaryotes, are involved in regulating diverse processes, including flower and vascular development, seed maturation, stress signaling, and defense responses to pathogens. The bZIP proteins have a characteristic bZIP domain composed of a DNA-binding basic region and a Leu zipper dimerization region. In this study, we identified 112 apple (Malus domestica Borkh) bZIP TF-encoding genes, termed MdbZIP genes. Synteny analysis indicated that segmental and tandem duplication events, as well as whole genome duplication, have contributed to the expansion of the apple bZIP family. The family could be divided into 11 groups based on structural features of the encoded proteins, as well as on the phylogenetic relationship of the apple bZIP proteins to those of the model plant Arabidopsis thaliana (AtbZIP genes). Synteny analysis revealed that several paired MdbZIP genes and AtbZIP gene homologs were located in syntenic genomic regions. Furthermore, expression analyses of group A MdbZIP genes showed distinct expression levels in 10 different organs. Moreover, changes in these expression profiles in response to abiotic stress conditions and various hormone treatments identified MdbZIP genes that were responsive to high salinity and drought, as well as to different phytohormones.
A Potato cDNA Encoding a Homologue of Mammalian Multidrug Resistant P-Glycoprotein

NASA Technical Reports Server (NTRS)

Wang, W.; Takezawa, D.; Poovaiah, B. W.

1996-01-01

A homologue of the multidrug resistance (MDR) gene was obtained while screening a potato stolon tip cDNA expression library with S-15-labeled calmodulin. The mammalian MDR gene codes for a membrane-bound P-glycoprotein (170-180 kDa) which imparts multidrug resistance to cancerous cells. The potato cDNA (PMDR1) codes for a polypeptide of 1313 amino acid residues (ca. 144 kDa) and its structural features are very similar to the MDR P-glycoprotein. The N-terminal half of the PMDR1-encoded protein shares striking homology with its C-terminal half, and each half contains a conserved ATP-binding site and six putative transmembrane domains. Southern blot analysis indicated that potato has one or two MDR-like genes. PMDR1 mRNA is constitutively expressed in all organs studied with higher expression in the stem and stolon tip. The PMDR1 expression was highest during tuber initiation and decreased during tuber development.
Dysregulation of gene expression in the striatum of BACHD rats expressing full-length mutant huntingtin and associated abnormalities on molecular and protein levels.

PubMed

Yu-Taeger, Libo; Bonin, Michael; Stricker-Shaver, Janice; Riess, Olaf; Nguyen, Hoa Huu Phuc

2017-05-01

Huntington disease (HD) is an autosomal dominantly inherited neurodegenerative disorder caused by a CAG repeat expansion in the gene coding for the huntingtin protein (HTT). Mutant HTT (mHTT) has been proposed to cause neuronal dysfunction and neuronal loss through multiple mechanisms. Transcriptional changes may be a core pathogenic feature of HD. Utilizing the Affymetrix platform we performed a genome-wide RNA expression analysis in two BACHD transgenic rat lines (TG5 and TG9) at 12 months of age, both of which carry full-length human mHTT but with different expression levels. By defining the threshold of significance at p < 0.01, we found 1608 genes and 871 genes differentially expressed in both TG5 and TG9 rats when compared to the wild type littermates, respectively. We only chose the highly up-/down-regulated genes for further analysis by setting an additional threshold of 1.5 fold change. Comparing gene expression profiles of human HD brains and BACHD rats revealed a high concordance in both functional and IPA (Ingenuity Pathway Analysis) canonical pathways relevant to HD. In addition, we investigated the causes leading to gene expression changes at molecular and protein levels in BACHD rats including the involvement of polyQ-containing transcription factors TATA box-binding protein (TBP), Sp1 and CBP as well as the chromatin structure. We demonstrate that the BACHD rat model recapitulates the gene expression changes of the human disease supporting its role as a preclinical research animal model. We also show for the first time that TFIID complex formation is reduced, while soluble TBP is increased in an HD model. This finding suggests that mHTT is a competitor instead of a recruiter of polyQ-containing transcription factors in the transcription process in HD. Copyright © 2017 Elsevier Ltd. All rights reserved.
Gene Structures, Evolution and Transcriptional Profiling of the WRKY Gene Family in Castor Bean (Ricinus communis L.).

PubMed

Zou, Zhi; Yang, Lifu; Wang, Danhua; Huang, Qixing; Mo, Yeyong; Xie, Guishui

2016-01-01

WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceus plants.
Gene expression analysis identifies new candidate genes associated with the development of black skin spots in Corriedale sheep.

PubMed

Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I

2012-02-01

The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value < 0.005) detected in this study, e.g. C-FOS, KLF4 and UFC1, fulfil biological functions that are plausible to be involved in the formation of black spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.
Induction of CaSR expression circumvents the molecular features of malignant CaSR null colon cancer cells.

PubMed

Singh, Navneet; Chakrabarty, Subhas

2013-11-15

We recently reported on the isolation and characterization of calcium sensing receptor (CaSR) null human colon cancer cells (Singh et al., Int J Cancer 2013; 132: 1996-2005). CaSR null cells possess a myriad of molecular features that are linked to a highly malignant and drug resistant phenotype of colon cancer. The CaSR null phenotype can be maintained in defined human embryonic stem cell culture medium. We now show that the CaSR null cells can be induced to differentiate in conventional culture medium, regained the expression of CaSR with a concurrent reversal of the cellular and molecular features associated with the null phenotype. These features include cellular morphology, expression of colon cancer stem cell markers, expression of survivin and thymidylate synthase and sensitivity to fluorouracil. Other features include the expression of epithelial mesenchymal transition linked molecules and transcription factors, oncogenic miRNAs and tumor suppressive molecule and miRNA. With the exception of cancer stem cell markers, the reversal of molecular features, upon the induction of CaSR expression, is directly linked to the expression and function of CaSR because blocking CaSR induction by shRNA circumvented such reversal. We further report that methylation and demethylation of the CaSR gene promoter underlie CaSR expression. Due to the malignant nature of the CaSR null cells, inclusion of the CaSR null phenotype in disease management may improve on the mortality of this disease. Because CaSR is a robust promoter of differentiation and mediates its action through diverse mechanisms and pathways, inactivation of CaSR may serve as a new paradigm in colon carcinogenesis. Copyright © 2013 UICC.
Systems Level Analysis of Systemic Sclerosis Shows a Network of Immune and Profibrotic Pathways Connected with Genetic Polymorphisms

PubMed Central

Mahoney, J. Matthew; Taroni, Jaclyn; Martyanov, Viktor; Wood, Tammara A.; Greene, Casey S.; Pioli, Patricia A.; Hinchcliff, Monique E.; Whitfield, Michael L.

2015-01-01

Systemic sclerosis (SSc) is a rare systemic autoimmune disease characterized by skin and organ fibrosis. The pathogenesis of SSc and its progression are poorly understood. The SSc intrinsic gene expression subsets (inflammatory, fibroproliferative, normal-like, and limited) are observed in multiple clinical cohorts of patients with SSc. Analysis of longitudinal skin biopsies suggests that a patient's subset assignment is stable over 6–12 months. Genetically, SSc is multi-factorial with many genetic risk loci for SSc generally and for specific clinical manifestations. Here we identify the genes consistently associated with the intrinsic subsets across three independent cohorts, show the relationship between these genes using a gene-gene interaction network, and place the genetic risk loci in the context of the intrinsic subsets. To identify gene expression modules common to three independent datasets from three different clinical centers, we developed a consensus clustering procedure based on mutual information of partitions, an information theory concept, and performed a meta-analysis of these genome-wide gene expression datasets. We created a gene-gene interaction network of the conserved molecular features across the intrinsic subsets and analyzed their connections with SSc-associated genetic polymorphisms. The network is composed of distinct, but interconnected, components related to interferon activation, M2 macrophages, adaptive immunity, extracellular matrix remodeling, and cell proliferation. The network shows extensive connections between the inflammatory- and fibroproliferative-specific genes. The network also shows connections between these subset-specific genes and 30 SSc-associated polymorphic genes including STAT4, BLK, IRF7, NOTCH4, PLAUR, CSK, IRAK1, and several human leukocyte antigen (HLA) genes. Our analyses suggest that the gene expression changes underlying the SSc subsets may be long-lived, but mechanistically interconnected and related to a patients underlying genetic risk. PMID:25569146
The transcriptional diversity of 25 Drosophila cell lines

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cherbas, Lucy; Willingham, Aarron; Zhang, Dayu

2010-12-22

Drosophila melanogaster cell lines are important resources for cell biologists. In this article, we catalog the expression of exons, genes, and unannotated transcriptional signals for 25 lines. Unannotated transcription is substantial (typically 19% of euchromatic signal). Conservatively, we identify 1405 novel transcribed regions; 684 of these appear to be new exons of neighboring, often distant, genes. Sixty-four percent of genes are expressed detectably in at least one line, but only 21% are detected in all lines. Each cell line expresses, on average, 5885 genes, including a common set of 3109. Expression levels vary over several orders of magnitude. Major signalingmore » pathways are well represented: most differentiation pathways are ‘‘off’’ and survival/growth pathways ‘‘on.’’ Roughly 50% of the genes expressed by each line are not part of the common set, and these show considerable individuality. Thirty-one percent are expressed at a higher level in at least one cell line than in any single developmental stage, suggesting that each line is enriched for genes characteristic of small sets of cells. Most remarkable is that imaginal disc-derived lines can generally be assigned, on the basis of expression, to small territories within developing discs. These mappings reveal unexpected stability of even fine-grained spatial determination. No two cell lines show identical transcription factor expression. We conclude that each line has retained features of an individual founder cell superimposed on a common ‘‘cell line‘‘ gene expression pattern. We report the transcriptional profiles of 25 Drosophila melanogaster cell lines, principally by whole-genome tiling microarray analysis of total RNA, carried out as part of the modENCODE project. The data produced in this study add to our knowledge of the cell lines and of the Drosophila transcriptome in several ways. We summarize the expression of previously annotated genes in each of the 25 lines with emphasis on what those patterns reveal about the origins of the lines and the stability of spatial expression patterns. In addition, we offer an initial analysis of previously unannotated transcripts in the cell lines.« less
Promoter methylation and expression of DNA repair genes MGMT and ERCC1 in tissue and blood of rectal cancer patients.

PubMed

Shalaby, Sally M; El-Shal, Amal S; Abdelaziz, Lobna A; Abd-Elbary, Eman; Khairy, Mostafa M

2018-02-20

Rectal cancer involves one-third of colorectal cancers (CRCs). Recently, data supported that DNA methylation have a role in CRC pathogenesis. In the present study we aimed to analyze the methylation status of MGMT and ERCC1 promoter regions in blood and tissue of patients with benign and malignant rectal tumors. We also studied the methylated MGMT and ERCC1 genes and their relations with clinicopathological features. Furthermore, we suggested that methylation may play a critical function in the regulation of MGMT and ERCC1 expression. Fifty patients with non-metastatic cancer rectum and 43 patients with benign rectal lesions were involved in the study. DNA extraction from blood and rectal specimens was done to analyze the methylation status of MGMT and ERCC1 genes by methylation-specific PCR method. RNA was extracted also to determine the expression levels of these genes by real time-PCR. The frequency of MGMT and ERCC1 methylation was significantly higher in rectum cancers than in benign tumors both for the tissue and the blood (p<0.001). There was no relation between MGMT or ERCC1 methylation and clinicopathological features; while they were correlated with the response to therapy. An interesting finding that the agreement of the methylation levels in the blood and rectal tissue was classified as good (κ=0.78) for MGMT gene and as very good (κ=0.85) for ERCC1. Lastly, the MGMT and ERCC1 genes methylation was associated with down-regulation of their mRNA expression when compared with the non-methylated status. Our findings provided evidence that both blood and tumor tissue MGMT and ERCC1 methylation were associated with cancer rectum. MGMT or ERCC1 methylation in blood could be suitable non-invasive biomarkers differentiating benign and malignant rectal tumors. Furthermore, the methylation of the MGMT and ERCC1 promoter regions was associated with down-regulation of their mRNA expression. Copyright © 2017 Elsevier B.V. All rights reserved.
Regulation of Benzoate Degradation in Acinetobacter sp. Strain ADP1 by BenM, a LysR-Type Transcriptional Activator

PubMed Central

Collier, Lauren S.; Gaines, George L.; Neidle, Ellen L.

1998-01-01

In Acinetobacter sp. strain ADP1, benzoate degradation requires the ben genes for converting benzoate to catechol and the cat genes for degrading catechol. Here we describe a novel transcriptional activator, BenM, that regulates the chromosomal ben and cat genes. BenM is homologous to CatM, a LysR-type transcriptional activator of the cat genes. Unusual regulatory features of this system include the abilities of both BenM and CatM to recognize the same inducer, cis,cis-muconate, and to regulate some of the same genes, such as catA and catB. Unlike CatM, BenM responded to benzoate. Benzoate together with cis,cis-muconate increased the BenM-dependent expression of the benABCDE operon synergistically. CatM was not required for this synergism, nor did CatM regulate the expression of a chromosomal benA::lacZ transcriptional fusion. BenM-mediated regulation differs significantly from that of the TOL plasmid-encoded conversion of benzoate to catechol in pseudomonads. The benM gene is immediately upstream of, and divergently transcribed from, benA, and a possible DNA binding site for BenM was identified between the two coding regions. Two mutations in the predicted operator/promoter region rendered ben gene expression either constitutive or inducible by cis,cis-muconate but not benzoate. Mutants lacking BenM, CatM, or both of these regulators degraded aromatic compounds at different rates, and the levels of intermediary metabolites that accumulated depended on the genetic background. These studies indicated that BenM is necessary for ben gene expression but not for expression of the cat genes, which can be regulated by CatM. In a catM-disrupted strain, BenM was able to induce higher levels of catA expression than catB expression. PMID:9573203
Allen Brain Atlas: an integrated spatio-temporal portal for exploring the central nervous system

PubMed Central

Sunkin, Susan M.; Ng, Lydia; Lau, Chris; Dolbeare, Tim; Gilbert, Terri L.; Thompson, Carol L.; Hawrylycz, Michael; Dang, Chinh

2013-01-01

The Allen Brain Atlas (http://www.brain-map.org) provides a unique online public resource integrating extensive gene expression data, connectivity data and neuroanatomical information with powerful search and viewing tools for the adult and developing brain in mouse, human and non-human primate. Here, we review the resources available at the Allen Brain Atlas, describing each product and data type [such as in situ hybridization (ISH) and supporting histology, microarray, RNA sequencing, reference atlases, projection mapping and magnetic resonance imaging]. In addition, standardized and unique features in the web applications are described that enable users to search and mine the various data sets. Features include both simple and sophisticated methods for gene searches, colorimetric and fluorescent ISH image viewers, graphical displays of ISH, microarray and RNA sequencing data, Brain Explorer software for 3D navigation of anatomy and gene expression, and an interactive reference atlas viewer. In addition, cross data set searches enable users to query multiple Allen Brain Atlas data sets simultaneously. All of the Allen Brain Atlas resources can be accessed through the Allen Brain Atlas data portal. PMID:23193282

Some links on this page may take you to non-federal websites. Their policies may differ from this site.