combined gene analysis: Topics by Science.gov

Sample records for combined gene analysis

Consensus strategy in genes prioritization and combined bioinformatics analysis for preeclampsia pathogenesis.

PubMed

Tejera, Eduardo; Cruz-Monteagudo, Maykel; Burgos, Germán; Sánchez, María-Eugenia; Sánchez-Rodríguez, Aminael; Pérez-Castillo, Yunierkis; Borges, Fernanda; Cordeiro, Maria Natália Dias Soeiro; Paz-Y-Miño, César; Rebelo, Irene

2017-08-08

Preeclampsia is a multifactorial disease with unknown pathogenesis. Even when recent studies explored this disease using several bioinformatics tools, the main objective was not directed to pathogenesis. Additionally, consensus prioritization was proved to be highly efficient in the recognition of genes-disease association. However, not information is available about the consensus ability to early recognize genes directly involved in pathogenesis. Therefore our aim in this study is to apply several theoretical approaches to explore preeclampsia; specifically those genes directly involved in the pathogenesis. We firstly evaluated the consensus between 12 prioritization strategies to early recognize pathogenic genes related to preeclampsia. A communality analysis in the protein-protein interaction network of previously selected genes was done including further enrichment analysis. The enrichment analysis includes metabolic pathways as well as gene ontology. Microarray data was also collected and used in order to confirm our results or as a strategy to weight the previously enriched pathways. The consensus prioritized gene list was rationally filtered to 476 genes using several criteria. The communality analysis showed an enrichment of communities connected with VEGF-signaling pathway. This pathway is also enriched considering the microarray data. Our result point to VEGF, FLT1 and KDR as relevant pathogenic genes, as well as those connected with NO metabolism. Our results revealed that consensus strategy improve the detection and initial enrichment of pathogenic genes, at least in preeclampsia condition. Moreover the combination of the first percent of the prioritized genes with protein-protein interaction network followed by communality analysis reduces the gene space. This approach actually identifies well known genes related with pathogenesis. However, genes like HSP90, PAK2, CD247 and others included in the first 1% of the prioritized list need to be further
Gene analysis of PROP1 in dwarfism with combined pituitary hormone deficiency.

PubMed

Takamura, N; Fofanova, O V; Kinoshita, E; Yamashita, S

1999-06-01

The prophet of Pit-1 gene (PROP1), a novel pituitary-specific homeodomain factor, has been proved to be one of the causative genes for combined pituitary hormone deficiency (CPHD). Recently, PROP1 mutations have been identified in CPHD families, including our Russian cohort. The 2-bp deletion, 296delGA (A301G302del), is the most common mutational hot spot. Furthermore, in our cohort, PROP1 mutations are more common in comparison with human POU1F1 gene mutations. Here we review the gene analysis of PROP1 in patients with CPHD.
Gene identification for risk of relapse in stage I lung adenocarcinoma patients: a combined methodology of gene expression profiling and computational gene network analysis.

PubMed

Ludovini, Vienna; Bianconi, Fortunato; Siggillino, Annamaria; Piobbico, Danilo; Vannucci, Jacopo; Metro, Giulio; Chiari, Rita; Bellezza, Guido; Puma, Francesco; Della Fazia, Maria Agnese; Servillo, Giuseppe; Crinò, Lucio

2016-05-24

Risk assessment and treatment choice remains a challenge in early non-small-cell lung cancer (NSCLC). The aim of this study was to identify novel genes involved in the risk of early relapse (ER) compared to no relapse (NR) in resected lung adenocarcinoma (AD) patients using a combination of high throughput technology and computational analysis. We identified 18 patients (n.13 NR and n.5 ER) with stage I AD. Frozen samples of patients in ER, NR and corresponding normal lung (NL) were subjected to Microarray technology and quantitative-PCR (Q-PCR). A gene network computational analysis was performed to select predictive genes. An independent set of 79 ADs stage I samples was used to validate selected genes by Q-PCR.From microarray analysis we selected 50 genes, using the fold change ratio of ER versus NR. They were validated both in pool and individually in patient samples (ER and NR) by Q-PCR. Fourteen increased and 25 decreased genes showed a concordance between two methods. They were used to perform a computational gene network analysis that identified 4 increased (HOXA10, CLCA2, AKR1B10, FABP3) and 6 decreased (SCGB1A1, PGC, TFF1, PSCA, SPRR1B and PRSS1) genes. Moreover, in an independent dataset of ADs samples, we showed that both high FABP3 expression and low SCGB1A1 expression was associated with a worse disease-free survival (DFS).Our results indicate that it is possible to define, through gene expression and computational analysis, a characteristic gene profiling of patients with an increased risk of relapse that may become a tool for patient selection for adjuvant therapy.
Microarray gene expression profiling analysis combined with bioinformatics in multiple sclerosis.

PubMed

Liu, Mingyuan; Hou, Xiaojun; Zhang, Ping; Hao, Yong; Yang, Yiting; Wu, Xiongfeng; Zhu, Desheng; Guan, Yangtai

2013-05-01

Multiple sclerosis (MS) is the most prevalent demyelinating disease and the principal cause of neurological disability in young adults. Recent microarray gene expression profiling studies have identified several genetic variants contributing to the complex pathogenesis of MS, however, expressional and functional studies are still required to further understand its molecular mechanism. The present study aimed to analyze the molecular mechanism of MS using microarray analysis combined with bioinformatics techniques. We downloaded the gene expression profile of MS from Gene Expression Omnibus (GEO) and analysed the microarray data using the differentially coexpressed genes (DCGs) and links package in R and Database for Annotation, Visualization and Integrated Discovery. The regulatory impact factor (RIF) algorithm was used to measure the impact factor of transcription factor. A total of 1,297 DCGs between MS patients and healthy controls were identified. Functional annotation indicated that these DCGs were associated with immune and neurological functions. Furthermore, the RIF result suggested that IKZF1, BACH1, CEBPB, EGR1, FOS may play central regulatory roles in controlling gene expression in the pathogenesis of MS. Our findings confirm the presence of multiple molecular alterations in MS and indicate the possibility for identifying prognostic factors associated with MS pathogenesis.
Identification of Linkages between EDCs in Personal Care Products and Breast Cancer through Data Integration Combined with Gene Network Analysis.

PubMed

Jeong, Hyeri; Kim, Jongwoon; Kim, Youngjun

2017-09-30

Approximately 1000 chemicals have been reported to possibly have endocrine disrupting effects, some of which are used in consumer products, such as personal care products (PCPs) and cosmetics. We conducted data integration combined with gene network analysis to: (i) identify causal molecular mechanisms between endocrine disrupting chemicals (EDCs) used in PCPs and breast cancer; and (ii) screen candidate EDCs associated with breast cancer. Among EDCs used in PCPs, four EDCs having correlation with breast cancer were selected, and we curated 27 common interacting genes between those EDCs and breast cancer to perform the gene network analysis. Based on the gene network analysis, ESR1, TP53, NCOA1, AKT1, and BCL6 were found to be key genes to demonstrate the molecular mechanisms of EDCs in the development of breast cancer. Using GeneMANIA, we additionally predicted 20 genes which could interact with the 27 common genes. In total, 47 genes combining the common and predicted genes were functionally grouped with the gene ontology and KEGG pathway terms. With those genes, we finally screened candidate EDCs for their potential to increase breast cancer risk. This study highlights that our approach can provide insights to understand mechanisms of breast cancer and identify potential EDCs which are in association with breast cancer.
Identification of Linkages between EDCs in Personal Care Products and Breast Cancer through Data Integration Combined with Gene Network Analysis

PubMed Central

Kim, Jongwoon

2017-01-01

Approximately 1000 chemicals have been reported to possibly have endocrine disrupting effects, some of which are used in consumer products, such as personal care products (PCPs) and cosmetics. We conducted data integration combined with gene network analysis to: (i) identify causal molecular mechanisms between endocrine disrupting chemicals (EDCs) used in PCPs and breast cancer; and (ii) screen candidate EDCs associated with breast cancer. Among EDCs used in PCPs, four EDCs having correlation with breast cancer were selected, and we curated 27 common interacting genes between those EDCs and breast cancer to perform the gene network analysis. Based on the gene network analysis, ESR1, TP53, NCOA1, AKT1, and BCL6 were found to be key genes to demonstrate the molecular mechanisms of EDCs in the development of breast cancer. Using GeneMANIA, we additionally predicted 20 genes which could interact with the 27 common genes. In total, 47 genes combining the common and predicted genes were functionally grouped with the gene ontology and KEGG pathway terms. With those genes, we finally screened candidate EDCs for their potential to increase breast cancer risk. This study highlights that our approach can provide insights to understand mechanisms of breast cancer and identify potential EDCs which are in association with breast cancer. PMID:28973975
Gene expression patterns combined with bioinformatics analysis identify genes associated with cholangiocarcinoma.

PubMed

Li, Chen; Shen, Weixing; Shen, Sheng; Ai, Zhilong

2013-12-01

To explore the molecular mechanisms of cholangiocarcinoma (CC), microarray technology was used to find biomarkers for early detection and diagnosis. The gene expression profiles from 6 patients with CC and 5 normal controls were downloaded from Gene Expression Omnibus and compared. As a result, 204 differentially co-expressed genes (DCGs) in CC patients compared to normal controls were identified using a computational bioinformatics analysis. These genes were mainly involved in coenzyme metabolic process, peptidase activity and oxidation reduction. A regulatory network was constructed by mapping the DCGs to known regulation data. Four transcription factors, FOXC1, ZIC2, NKX2-2 and GCGR, were hub nodes in the network. In conclusion, this study provides a set of targets useful for future investigations into molecular biomarker studies. Copyright © 2013 Elsevier Ltd. All rights reserved.
Transcriptome Sequencing of Dianthus spiculifolius and Analysis of the Genes Involved in Responses to Combined Cold and Drought Stress.

PubMed

Zhou, Aimin; Ma, Hongping; Liu, Enhui; Jiang, Tongtong; Feng, Shuang; Gong, Shufang; Wang, Jingang

2017-04-17

Dianthus spiculifolius , a perennial herbaceous flower and a member of the Caryophyllaceae family, has strong resistance to cold and drought stresses. To explore the transcriptional responses of D. spiculifolius to individual and combined stresses, we performed transcriptome sequencing of seedlings under normal conditions or subjected to cold treatment (CT), simulated drought treatment (DT), or their combination (CTDT). After de novo assembly of the obtained reads, 112,015 unigenes were generated. Analysis of differentially expressed genes (DEGs) showed that 2026, 940, and 2346 genes were up-regulated and 1468, 707, and 1759 were down-regulated in CT, DT, and CTDT samples, respectively. Among all the DEGs, 182 up-regulated and 116 down-regulated genes were identified in all the treatment groups. Analysis of metabolic pathways and regulatory networks associated with the DEGs revealed overlaps and cross-talk between cold and drought stress response pathways. The expression profiles of the selected DEGs in CT, DT, and CTDT samples were characterized and confirmed by quantitative RT-PCR. These DEGs and metabolic pathways may play important roles in the response of D. spiculifolius to the combined stress. Functional characterization of these genes and pathways will provide new targets for enhancement of plant stress tolerance through genetic manipulation.
Transcriptome Sequencing of Dianthus spiculifolius and Analysis of the Genes Involved in Responses to Combined Cold and Drought Stress

PubMed Central

Zhou, Aimin; Ma, Hongping; Liu, Enhui; Jiang, Tongtong; Feng, Shuang; Gong, Shufang; Wang, Jingang

2017-01-01

Dianthus spiculifolius, a perennial herbaceous flower and a member of the Caryophyllaceae family, has strong resistance to cold and drought stresses. To explore the transcriptional responses of D. spiculifolius to individual and combined stresses, we performed transcriptome sequencing of seedlings under normal conditions or subjected to cold treatment (CT), simulated drought treatment (DT), or their combination (CTDT). After de novo assembly of the obtained reads, 112,015 unigenes were generated. Analysis of differentially expressed genes (DEGs) showed that 2026, 940, and 2346 genes were up-regulated and 1468, 707, and 1759 were down-regulated in CT, DT, and CTDT samples, respectively. Among all the DEGs, 182 up-regulated and 116 down-regulated genes were identified in all the treatment groups. Analysis of metabolic pathways and regulatory networks associated with the DEGs revealed overlaps and cross-talk between cold and drought stress response pathways. The expression profiles of the selected DEGs in CT, DT, and CTDT samples were characterized and confirmed by quantitative RT-PCR. These DEGs and metabolic pathways may play important roles in the response of D. spiculifolius to the combined stress. Functional characterization of these genes and pathways will provide new targets for enhancement of plant stress tolerance through genetic manipulation. PMID:28420173
Combining Genotype, Phenotype, and Environment to Infer Potential Candidate Genes.

PubMed

Talbot, Benoit; Chen, Ting-Wen; Zimmerman, Shawna; Joost, Stéphane; Eckert, Andrew J; Crow, Taylor M; Semizer-Cuming, Devrim; Seshadri, Chitra; Manel, Stéphanie

2017-03-01

Population genomic analysis can be an important tool in understanding local adaptation. Identification of potential adaptive loci in such analyses is usually based on the survey of a large genomic dataset in combination with environmental variables. Phenotypic data are less commonly incorporated into such studies, although combining a genome scan analysis with a phenotypic trait analysis can greatly improve the insights obtained from each analysis individually. Here, we aimed to identify loci potentially involved in adaptation to climate in 283 Loblolly pine (Pinus taeda) samples from throughout the species' range in the southeastern United States. We analyzed associations between phenotypic, molecular, and environmental variables from datasets of 3082 single nucleotide polymorphism (SNP) loci and 3 categories of phenotypic traits (gene expression, metabolites, and whole-plant traits). We found only 6 SNP loci that displayed potential signals of local adaptation. Five of the 6 identified SNPs are linked to gene expression traits for lignin development, and 1 is linked with whole-plant traits. We subsequently compared the 6 candidate genes with environmental variables and found a high correlation in only 3 of them (R2 > 0.2). Our study highlights the need for a combination of genotypes, phenotypes, and environmental variables, and for an appropriate sampling scheme and study design, to improve confidence in the identification of potential candidate genes. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Combining classifiers to predict gene function in Arabidopsis thaliana using large-scale gene expression measurements.

PubMed

Lan, Hui; Carson, Rachel; Provart, Nicholas J; Bonner, Anthony J

2007-09-21

Arabidopsis thaliana is the model species of current plant genomic research with a genome size of 125 Mb and approximately 28,000 genes. The function of half of these genes is currently unknown. The purpose of this study is to infer gene function in Arabidopsis using machine-learning algorithms applied to large-scale gene expression data sets, with the goal of identifying genes that are potentially involved in plant response to abiotic stress. Using in house and publicly available data, we assembled a large set of gene expression measurements for A. thaliana. Using those genes of known function, we first evaluated and compared the ability of basic machine-learning algorithms to predict which genes respond to stress. Predictive accuracy was measured using ROC50 and precision curves derived through cross validation. To improve accuracy, we developed a method for combining these classifiers using a weighted-voting scheme. The combined classifier was then trained on genes of known function and applied to genes of unknown function, identifying genes that potentially respond to stress. Visual evidence corroborating the predictions was obtained using electronic Northern analysis. Three of the predicted genes were chosen for biological validation. Gene knockout experiments confirmed that all three are involved in a variety of stress responses. The biological analysis of one of these genes (At1g16850) is presented here, where it is shown to be necessary for the normal response to temperature and NaCl. Supervised learning methods applied to large-scale gene expression measurements can be used to predict gene function. However, the ability of basic learning methods to predict stress response varies widely and depends heavily on how much dimensionality reduction is used. Our method of combining classifiers can improve the accuracy of such predictions - in this case, predictions of genes involved in stress response in plants - and it effectively chooses the appropriate amount
Combining multiple tools outperforms individual methods in gene set enrichment analyses.

PubMed

Alhamdoosh, Monther; Ng, Milica; Wilson, Nicholas J; Sheridan, Julie M; Huynh, Huy; Wilson, Michael J; Ritchie, Matthew E

2017-02-01

Gene set enrichment (GSE) analysis allows researchers to efficiently extract biological insight from long lists of differentially expressed genes by interrogating them at a systems level. In recent years, there has been a proliferation of GSE analysis methods and hence it has become increasingly difficult for researchers to select an optimal GSE tool based on their particular dataset. Moreover, the majority of GSE analysis methods do not allow researchers to simultaneously compare gene set level results between multiple experimental conditions. The ensemble of genes set enrichment analyses (EGSEA) is a method developed for RNA-sequencing data that combines results from twelve algorithms and calculates collective gene set scores to improve the biological relevance of the highest ranked gene sets. EGSEA's gene set database contains around 25 000 gene sets from sixteen collections. It has multiple visualization capabilities that allow researchers to view gene sets at various levels of granularity. EGSEA has been tested on simulated data and on a number of human and mouse datasets and, based on biologists' feedback, consistently outperforms the individual tools that have been combined. Our evaluation demonstrates the superiority of the ensemble approach for GSE analysis, and its utility to effectively and efficiently extrapolate biological functions and potential involvement in disease processes from lists of differentially regulated genes. EGSEA is available as an R package at http://www.bioconductor.org/packages/EGSEA/ . The gene sets collections are available in the R package EGSEAdata from http://www.bioconductor.org/packages/EGSEAdata/ . monther.alhamdoosh@csl.com.au mritchie@wehi.edu.au. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Combined analysis of DNA methylome and transcriptome reveal novel candidate genes with susceptibility to bovine Staphylococcus aureus subclinical mastitis.

PubMed

Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying

2016-07-14

Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis.
Combined analysis of DNA methylome and transcriptome reveal novel candidate genes with susceptibility to bovine Staphylococcus aureus subclinical mastitis

PubMed Central

Song, Minyan; He, Yanghua; Zhou, Huangkai; Zhang, Yi; Li, Xizhi; Yu, Ying

2016-01-01

Subclinical mastitis is a widely spread disease of lactating cows. Its major pathogen is Staphylococcus aureus (S. aureus). In this study, we performed genome-wide integrative analysis of DNA methylation and transcriptional expression to identify candidate genes and pathways relevant to bovine S. aureus subclinical mastitis. The genome-scale DNA methylation profiles of peripheral blood lymphocytes in cows with S. aureus subclinical mastitis (SA group) and healthy controls (CK) were generated by methylated DNA immunoprecipitation combined with microarrays. We identified 1078 differentially methylated genes in SA cows compared with the controls. By integrating DNA methylation and transcriptome data, 58 differentially methylated genes were shared with differently expressed genes, in which 20.7% distinctly hypermethylated genes showed down-regulated expression in SA versus CK, whereas 14.3% dramatically hypomethylated genes showed up-regulated expression. Integrated pathway analysis suggested that these genes were related to inflammation, ErbB signalling pathway and mismatch repair. Further functional analysis revealed that three genes, NRG1, MST1 and NAT9, were strongly correlated with the progression of S. aureus subclinical mastitis and could be used as powerful biomarkers for the improvement of bovine mastitis resistance. Our studies lay the groundwork for epigenetic modification and mechanistic studies on susceptibility of bovine mastitis. PMID:27411928
Regression Analysis of Combined Gene Expression Regulation in Acute Myeloid Leukemia

PubMed Central

Li, Yue; Liang, Minggao; Zhang, Zhaolei

2014-01-01

Gene expression is a combinatorial function of genetic/epigenetic factors such as copy number variation (CNV), DNA methylation (DM), transcription factors (TF) occupancy, and microRNA (miRNA) post-transcriptional regulation. At the maturity of microarray/sequencing technologies, large amounts of data measuring the genome-wide signals of those factors became available from Encyclopedia of DNA Elements (ENCODE) and The Cancer Genome Atlas (TCGA). However, there is a lack of an integrative model to take full advantage of these rich yet heterogeneous data. To this end, we developed RACER (Regression Analysis of Combined Expression Regulation), which fits the mRNA expression as response using as explanatory variables, the TF data from ENCODE, and CNV, DM, miRNA expression signals from TCGA. Briefly, RACER first infers the sample-specific regulatory activities by TFs and miRNAs, which are then used as inputs to infer specific TF/miRNA-gene interactions. Such a two-stage regression framework circumvents a common difficulty in integrating ENCODE data measured in generic cell-line with the sample-specific TCGA measurements. As a case study, we integrated Acute Myeloid Leukemia (AML) data from TCGA and the related TF binding data measured in K562 from ENCODE. As a proof-of-concept, we first verified our model formalism by 10-fold cross-validation on predicting gene expression. We next evaluated RACER on recovering known regulatory interactions, and demonstrated its superior statistical power over existing methods in detecting known miRNA/TF targets. Additionally, we developed a feature selection procedure, which identified 18 regulators, whose activities clustered consistently with cytogenetic risk groups. One of the selected regulators is miR-548p, whose inferred targets were significantly enriched for leukemia-related pathway, implicating its novel role in AML pathogenesis. Moreover, survival analysis using the inferred activities identified C-Fos as a potential AML
Meta-Analysis of Tumor Stem-Like Breast Cancer Cells Using Gene Set and Network Analysis

PubMed Central

Lee, Won Jun; Kim, Sang Cheol; Yoon, Jung-Ho; Yoon, Sang Jun; Lim, Johan; Kim, You-Sun; Kwon, Sung Won; Park, Jeong Hill

2016-01-01

Generally, cancer stem cells have epithelial-to-mesenchymal-transition characteristics and other aggressive properties that cause metastasis. However, there have been no confident markers for the identification of cancer stem cells and comparative methods examining adherent and sphere cells are widely used to investigate mechanism underlying cancer stem cells, because sphere cells have been known to maintain cancer stem cell characteristics. In this study, we conducted a meta-analysis that combined gene expression profiles from several studies that utilized tumorsphere technology to investigate tumor stem-like breast cancer cells. We used our own gene expression profiles along with the three different gene expression profiles from the Gene Expression Omnibus, which we combined using the ComBat method, and obtained significant gene sets using the gene set analysis of our datasets and the combined dataset. This experiment focused on four gene sets such as cytokine-cytokine receptor interaction that demonstrated significance in both datasets. Our observations demonstrated that among the genes of four significant gene sets, six genes were consistently up-regulated and satisfied the p-value of < 0.05, and our network analysis showed high connectivity in five genes. From these results, we established CXCR4, CXCL1 and HMGCS1, the intersecting genes of the datasets with high connectivity and p-value of < 0.05, as significant genes in the identification of cancer stem cells. Additional experiment using quantitative reverse transcription-polymerase chain reaction showed significant up-regulation in MCF-7 derived sphere cells and confirmed the importance of these three genes. Taken together, using meta-analysis that combines gene set and network analysis, we suggested CXCR4, CXCL1 and HMGCS1 as candidates involved in tumor stem-like breast cancer cells. Distinct from other meta-analysis, by using gene set analysis, we selected possible markers which can explain the biological
Combining Genome Wide Association Study and lung eQTL analysis provides evidence for novel genes associated with asthma

PubMed Central

Nieuwenhuis, Maartje A.; Siedlinski, Matteusz; van den Berge, Maarten; Granell, Raquel; Li, Xingnan; Niens, Marijke; van der Vlies, Pieter; Altmüller, Janine; Nürnberg, Peter; Kerkhof, Marjan; van Schayck, Onno C.; Riemersma, Ronald A.; van der Molen, Thys; de Monchy, Jan G.; Bossé, Yohan; Sandford, Andrew; Bruijnzeel-Koomen, Carla A.; van Wijk, Roy G.; ten Hacken, Nick H.; Timens, Wim; Boezen, H. Marike; Henderson, John; Kabesch, Michael; Vonk, Judith M.; Postma, Dirkje S.; Koppelman, Gerard H.

2016-01-01

Background Genome wide association studies (GWAS) of asthma have identified single nucleotide polymorphisms (SNPs) that modestly increase the risk for asthma. This could be due to phenotypic heterogeneity of asthma. Bronchial hyperresponsiveness (BHR) is a phenotypic hallmark of asthma. We aim to identify susceptibility genes for asthma combined with BHR and analyse the presence of cis-eQTLs among replicated SNPs. Secondly, we compare the genetic association of SNPs previously associated with (doctor diagnosed) asthma to our GWAS of asthma with BHR. Methods A GWAS was performed in 920 asthmatics with BHR and 980 controls. Top SNPs of our GWAS were analysed in four replication cohorts and lung cis-eQTL analysis was performed on replicated SNPs. We investigated association of SNPs previously associated with asthma in our data. Results 368 SNPs were followed up for replication. Six SNPs in genes encoding ABI3BP, NAF1, MICA and the 17q21 locus replicated in one or more cohorts, with one locus (17q21) achieving genome wide significance after meta-analysis. Five out of 6 replicated SNPs regulated 35 gene transcripts in whole lung. Eight of 20 asthma associated SNPs from previous GWAS were significantly associated with asthma and BHR. Three SNPs, in IL-33 and GSDMB, showed larger effect sizes in our data compared to published literature. Conclusions Combining GWAS with subsequent lung eQTL analysis revealed disease associated SNPs regulating lung mRNA expression levels of potential new asthma genes. Adding BHR to the asthma definition does not lead to an overall larger genetic effect size than analysing (doctor’s diagnosed) asthma. PMID:27439200
Toward the identification of causal genes in complex diseases: a gene-centric joint test of significance combining genomic and transcriptomic data.

PubMed

Charlesworth, Jac C; Peralta, Juan M; Drigalenko, Eugene; Göring, Harald Hh; Almasy, Laura; Dyer, Thomas D; Blangero, John

2009-12-15

Gene identification using linkage, association, or genome-wide expression is often underpowered. We propose that formal combination of information from multiple gene-identification approaches may lead to the identification of novel loci that are missed when only one form of information is available. Firstly, we analyze the Genetic Analysis Workshop 16 Framingham Heart Study Problem 2 genome-wide association data for HDL-cholesterol using a "gene-centric" approach. Then we formally combine the association test results with genome-wide transcriptional profiling data for high-density lipoprotein cholesterol (HDL-C), from the San Antonio Family Heart Study, using a Z-transform test (Stouffer's method). We identified 39 genes by the joint test at a conservative 1% false-discovery rate, including 9 from the significant gene-based association test and 23 whose expression was significantly correlated with HDL-C. Seven genes identified as significant in the joint test were not independently identified by either the association or expression tests. This combined approach has increased power and leads to the direct nomination of novel candidate genes likely to be involved in the determination of HDL-C levels. Such information can then be used as justification for a more exhaustive search for functional sequence variation within the nominated genes. We anticipate that this type of analysis will improve our speed of identification of regulatory genes causally involved in disease risk.
Combining Gene Signatures Improves Prediction of Breast Cancer Survival

PubMed Central

Zhao, Xi; Naume, Bjørn; Langerød, Anita; Frigessi, Arnoldo; Kristensen, Vessela N.; Børresen-Dale, Anne-Lise; Lingjærde, Ole Christian

2011-01-01

Background Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123) and test set (n = 81), respectively. Gene sets from eleven previously published gene signatures are included in the study. Principal Findings To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014). Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001). The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. Conclusion Combining the predictive strength of multiple gene signatures improves prediction of breast
Combined Analysis of the Fruit Metabolome and Transcriptome Reveals Candidate Genes Involved in Flavonoid Biosynthesis in Actinidia arguta.

PubMed

Li, Yukuo; Fang, Jinbao; Qi, Xiujuan; Lin, Miaomiao; Zhong, Yunpeng; Sun, Leiming; Cui, Wen

2018-05-15

To assess the interrelation between the change of metabolites and the change of fruit color, we performed a combined metabolome and transcriptome analysis of the flesh in two different Actinidia arguta cultivars: "HB" ("Hongbaoshixing") and "YF" ("Yongfengyihao") at two different fruit developmental stages: 70d (days after full bloom) and 100d (days after full bloom). Metabolite and transcript profiling was obtained by ultra-performance liquid chromatography quadrupole time-of-flight tandem mass spectrometer and high-throughput RNA sequencing, respectively. The identification and quantification results of metabolites showed that a total of 28,837 metabolites had been obtained, of which 13,715 were annotated. In comparison of HB100 vs. HB70, 41 metabolites were identified as being flavonoids, 7 of which, with significant difference, were identified as bracteatin, luteolin, dihydromyricetin, cyanidin, pelargonidin, delphinidin and (-)-epigallocatechin. Association analysis between metabolome and transcriptome revealed that there were two metabolic pathways presenting significant differences during fruit development, one of which was flavonoid biosynthesis, in which 14 structural genes were selected to conduct expression analysis, as well as 5 transcription factor genes obtained by transcriptome analysis. RT-qPCR results and cluster analysis revealed that AaF3H , AaLDOX , AaUFGT , AaMYB , AabHLH , and AaHB2 showed the best possibility of being candidate genes. A regulatory network of flavonoid biosynthesis was established to illustrate differentially expressed candidate genes involved in accumulation of metabolites with significant differences, inducing red coloring during fruit development. Such a regulatory network linking genes and flavonoids revealed a system involved in the pigmentation of all-red-fleshed and all-green-fleshed A. arguta , suggesting this conjunct analysis approach is not only useful in understanding the relationship between genotype and phenotype

A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

PubMed

Ishikawa, Akira

2017-11-27

Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.
Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

PubMed

Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

2015-06-01

To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.
Unconditional analyses can increase efficiency in assessing gene-environment interaction of the case-combined-control design.

PubMed

Goldstein, Alisa M; Dondon, Marie-Gabrielle; Andrieu, Nadine

2006-08-01

A design combining both related and unrelated controls, named the case-combined-control design, was recently proposed to increase the power for detecting gene-environment (GxE) interaction. Under a conditional analytic approach, the case-combined-control design appeared to be more efficient and feasible than a classical case-control study for detecting interaction involving rare events. We now propose an unconditional analytic strategy to further increase the power for detecting gene-environment (GxE) interactions. This strategy allows the estimation of GxE interaction and exposure (E) main effects under certain assumptions (e.g. no correlation in E between siblings and the same exposure frequency in both control groups). Only the genetic (G) main effect cannot be estimated because it is biased. Using simulations, we show that unconditional logistic regression analysis is often more efficient than conditional analysis for detecting GxE interaction, particularly for a rare gene and strong effects. The unconditional analysis is also at least as efficient as the conditional analysis when the gene is common and the main and joint effects of E and G are small. Under the required assumptions, the unconditional analysis retains more information than does the conditional analysis for which only discordant case-control pairs are informative leading to more precise estimates of the odds ratios.
Identification of a new gene regulatory circuit involving B cell receptor activated signaling using a combined analysis of experimental, clinical and global gene expression data

PubMed Central

Schrader, Alexandra; Meyer, Katharina; Walther, Neele; Stolz, Ailine; Feist, Maren; Hand, Elisabeth; von Bonin, Frederike; Evers, Maurits; Kohler, Christian; Shirneshan, Katayoon; Vockerodt, Martina; Klapper, Wolfram; Szczepanowski, Monika; Murray, Paul G.; Bastians, Holger; Trümper, Lorenz; Spang, Rainer; Kube, Dieter

2016-01-01

To discover new regulatory pathways in B lymphoma cells, we performed a combined analysis of experimental, clinical and global gene expression data. We identified a specific cluster of genes that was coherently expressed in primary lymphoma samples and suppressed by activation of the B cell receptor (BCR) through αIgM treatment of lymphoma cells in vitro. This gene cluster, which we called BCR.1, includes numerous cell cycle regulators. A reduced expression of BCR.1 genes after BCR activation was observed in different cell lines and also in CD10+ germinal center B cells. We found that BCR activation led to a delayed entry to and progression of mitosis and defects in metaphase. Cytogenetic changes were detected upon long-term αIgM treatment. Furthermore, an inverse correlation of BCR.1 genes with c-Myc co-regulated genes in distinct groups of lymphoma patients was observed. Finally, we showed that the BCR.1 index discriminates activated B cell-like and germinal centre B cell-like diffuse large B cell lymphoma supporting the functional relevance of this new regulatory circuit and the power of guided clustering for biomarker discovery. PMID:27166259
Gene expression profiling combined with bioinformatics analysis identify biomarkers for Parkinson disease.

PubMed

Diao, Hongyu; Li, Xinxing; Hu, Sheng; Liu, Yunhui

2012-01-01

Parkinson disease (PD) progresses relentlessly and affects approximately 4% of the population aged over 80 years old. It is difficult to diagnose in its early stages. The purpose of our study is to identify molecular biomarkers for PD initiation using a computational bioinformatics analysis of gene expression. We downloaded the gene expression profile of PD from Gene Expression Omnibus and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in PD patients compared to controls. Besides, we built a regulatory network by mapping the DCGs to known regulatory data between transcription factors (TFs) and target genes and calculated the regulatory impact factor of each transcription factor. As the results, a total of 1004 genes associated with PD initiation were identified. Pathway enrichment of these genes suggests that biological processes of protein turnover were impaired in PD. In the regulatory network, HLF, E2F1 and STAT4 were found have altered expression levels in PD patients. The expression levels of other transcription factors, NKX3-1, TAL1, RFX1 and EGR3, were not found altered. However, they regulated differentially expressed genes. In conclusion, we suggest that HLF, E2F1 and STAT4 may be used as molecular biomarkers for PD; however, more work is needed to validate our result.
Gene Expression Profiling Combined with Bioinformatics Analysis Identify Biomarkers for Parkinson Disease

PubMed Central

Diao, Hongyu; Li, Xinxing; Hu, Sheng; Liu, Yunhui

2012-01-01

Parkinson disease (PD) progresses relentlessly and affects approximately 4% of the population aged over 80 years old. It is difficult to diagnose in its early stages. The purpose of our study is to identify molecular biomarkers for PD initiation using a computational bioinformatics analysis of gene expression. We downloaded the gene expression profile of PD from Gene Expression Omnibus and identified differentially coexpressed genes (DCGs) and dysfunctional pathways in PD patients compared to controls. Besides, we built a regulatory network by mapping the DCGs to known regulatory data between transcription factors (TFs) and target genes and calculated the regulatory impact factor of each transcription factor. As the results, a total of 1004 genes associated with PD initiation were identified. Pathway enrichment of these genes suggests that biological processes of protein turnover were impaired in PD. In the regulatory network, HLF, E2F1 and STAT4 were found have altered expression levels in PD patients. The expression levels of other transcription factors, NKX3-1, TAL1, RFX1 and EGR3, were not found altered. However, they regulated differentially expressed genes. In conclusion, we suggest that HLF, E2F1 and STAT4 may be used as molecular biomarkers for PD; however, more work is needed to validate our result. PMID:23284986
Combining mouse mammary gland gene expression and comparative mapping for the identification of candidate genes for QTL of milk production traits in cattle

PubMed Central

Ron, Micha; Israeli, Galit; Seroussi, Eyal; Weller, Joel I; Gregg, Jeffrey P; Shani, Moshe; Medrano, Juan F

2007-01-01

Background Many studies have found segregating quantitative trait loci (QTL) for milk production traits in different dairy cattle populations. However, even for relatively large effects with a saturated marker map the confidence interval for QTL location by linkage analysis spans tens of map units, or hundreds of genes. Combining mapping and arraying has been suggested as an approach to identify candidate genes. Thus, gene expression analysis in the mammary gland of genes positioned in the confidence interval of the QTL can bridge the gap between fine mapping and quantitative trait nucleotide (QTN) determination. Results We hybridized Affymetrix microarray (MG-U74v2), containing 12,488 murine probes, with RNA derived from mammary gland of virgin, pregnant, lactating and involuting C57BL/6J mice in a total of nine biological replicates. We combined microarray data from two additional studies that used the same design in mice with a total of 75 biological replicates. The same filtering and normalization was applied to each microarray data using GeneSpring software. Analysis of variance identified 249 differentially expressed probe sets common to the three experiments along the four developmental stages of puberty, pregnancy, lactation and involution. 212 genes were assigned to their bovine map positions through comparative mapping, and thus form a list of candidate genes for previously identified QTLs for milk production traits. A total of 82 of the genes showed mammary gland-specific expression with at least 3-fold expression over the median representing all tissues tested in GeneAtlas. Conclusion This work presents a web tool for candidate genes for QTL (cgQTL) that allows navigation between the map of bovine milk production QTL, potential candidate genes and their level of expression in mammary gland arrays and in GeneAtlas. Three out of four confirmed genes that affect QTL in livestock (ABCG2, DGAT1, GDF8, IGF2) were over expressed in the target organ. Thus, cg
Analysis of Gene Expression Profiles of Soft Tissue Sarcoma Using a Combination of Knowledge-Based Filtering with Integration of Multiple Statistics

PubMed Central

Doi, Ayano; Ichinohe, Risa; Ikuyo, Yoriko; Takahashi, Teruyoshi; Marui, Shigetaka; Yasuhara, Koji; Nakamura, Tetsuro; Sugita, Shintaro; Sakamoto, Hiromi; Yoshida, Teruhiko; Hasegawa, Tadashi

2014-01-01

The diagnosis and treatment of soft tissue sarcomas (STS) have been difficult. Of the diverse histological subtypes, undifferentiated pleomorphic sarcoma (UPS) is particularly difficult to diagnose accurately, and its classification per se is still controversial. Recent advances in genomic technologies provide an excellent way to address such problems. However, it is often difficult, if not impossible, to identify definitive disease-associated genes using genome-wide analysis alone, primarily because of multiple testing problems. In the present study, we analyzed microarray data from 88 STS patients using a combination method that used knowledge-based filtering and a simulation based on the integration of multiple statistics to reduce multiple testing problems. We identified 25 genes, including hypoxia-related genes (e.g., MIF, SCD1, P4HA1, ENO1, and STAT1) and cell cycle- and DNA repair-related genes (e.g., TACC3, PRDX1, PRKDC, and H2AFY). These genes showed significant differential expression among histological subtypes, including UPS, and showed associations with overall survival. STAT1 showed a strong association with overall survival in UPS patients (logrank p = 1.84×10−6 and adjusted p value 2.99×10−3 after the permutation test). According to the literature, the 25 genes selected are useful not only as markers of differential diagnosis but also as prognostic/predictive markers and/or therapeutic targets for STS. Our combination method can identify genes that are potential prognostic/predictive factors and/or therapeutic targets in STS and possibly in other cancers. These disease-associated genes deserve further preclinical and clinical validation. PMID:25188299
Detection of doublecortin domain-containing 2 (DCDC2), a new candidate tumor suppressor gene of hepatocellular carcinoma, by triple combination array analysis

PubMed Central

2013-01-01

Background To detect genes correlated with hepatocellular carcinoma (HCC), we developed a triple combination array consisting of methylation array, gene expression array and single nucleotide polymorphism (SNP) array analysis. Methods A surgical specimen obtained from a 68-year-old female HCC patient was analyzed by triple combination array, which identified doublecortin domain-containing 2 (DCDC2) as a candidate tumor suppressor gene of HCC. Subsequently, samples from 48 HCC patients were evaluated for their DCDC2 methylation and expression status using methylation specific PCR (MSP) and semi-quantitative reverse transcriptase (RT) PCR, respectively. Then, we investigated the relationship between clinicopathological factors and methylation status of DCDC2. Results DCDC2 was revealed to be hypermethylated (methylation value 0.846, range 0–1.0) in cancer tissue, compared with adjacent normal tissue (0.212) by methylation array in the 68-year-old female patient. Expression array showed decreased expression of DCDC2 in cancerous tissue. SNP array showed that the copy number of chromosome 6p22.1, in which DCDC2 resides, was normal. MSP revealed hypermethylation of the promoter region of DCDC2 in 41 of the tumor samples. DCDC2 expression was significantly decreased in the cases with methylation (P = 0.048). Furthermore, the methylated cases revealed worse prognosis for overall survival than unmethylated cases (P = 0.048). Conclusions The present study indicates that triple combination array is an effective method to detect novel genes related to HCC. We propose that DCDC2 is a tumor suppressor gene of HCC. PMID:24034596
[Analysis of the frequencies of genotype combinations of 4 polymorphisms of genes acting on the folate cycle in the Spanish population].

PubMed

Martínez-Frías, María Luisa; Bermejo, Eva; Pérez, Belén; Desviat, Lourdes R; Castro, Margarita; Leal, Fátima; Mansilla, Elena; Martínez-Fernández, María Luisa; Rodríguez-Pinilla, Elvira; Rodríguez, Laura; Ugarte, Magdalena

2008-06-21

Studies on different populations have shown a great variability of the frequencies of different polymorphisms in genes acting in the folate cycle. The present study was aimed to analyze the frequency in the Spanish population of each genotype combination of four polymorphisms, one of them -1561C-T of the glutamate carboxypeptidase II (GCPII) gene- being the first time that is studied in Spain. The study included a meta-analysis of the published data. Using the Spanish Collaborative Study of Congenital Malformations (ECEMC) Network, blood samples of 190 mother-child couples with newborns without any congenital defect, were obtained from 15 Spanish autonomous regions. The study polymorphisms were the 677C-T and 1298A-C polymorphisms of the methylenetetrahydrofolate reductase (MTHFR), the 66A-G of the methionine synthase reductase (MTRR), and the 1561C-T polymorphism of the GCPII gene. To estimate the range for the population frequencies, 99% confidence intervals were calculated. The frequencies observed in our country were significantly different from others, being similar to those obtained in countries of the Mediterranean European area. The 1561C-T polymorphism of the GCPII gene has a frequency in Spain of 5.11%, which is also similar to the values observed in France (5%) and in Italy (6%). On the other hand, the frequency of the genotypes CTCC, TTAC is quite few, while the genotype TTCC was not observed in any mother or infants. A meta-analysis was performed for a big sample (23,612 individuals) and the results showed that with a 99% of probability the values for the genotype combinations CTCC, TTAC, and TTCC were within 0.10-0.24; 0.20-0.36; and 0.003-0.05, respectively. Our results are important to further analyze the relationship with some health problems and individual susceptibilities. Indeed, considering the published observations of the structure and function of the MTHFR enzyme, it is understandable that those genotype combinations that are quite little
[Polymorphism of POU1F1 gene and PRL gene and their combined effects on milk performance traits in Chinese Holstein cattle].

PubMed

Jia, Xiang-Jie; Wang, Chang-Fa; Yang, Gui-Wen; Huang, Jin-Ming; Li, Qiu-Ling; Zhong, Ji-Feng

2011-12-01

Three novel SNPs were found by DNA sequencing, PCR-RFLP and CRS-PCR methods were used for genotyping in 979 Chinese Holstein cattle. One SNP, G1178C, was identified in exon 2 of POU1F1 gene. Two novel SNPs, A906G and A1134G, were identified in 5'-flanking regulatory region (5'-UTR) of PRL gene. The association between polymorphisms of the two genes and milk performance traits were analyzed with PROC GLM of SAS. The results showed that GC genotype at 1178 locus of POU1F1 gene was advantageous for milk yield, milk protein yield, and milk fat yield. AG genotype at 906 locus was advantageous for milk yield. There was no significant difference between 1134 locus and milk performance traits of 5'-UTR of PRL gene. Analysis of genotype combination effect on milk production traits showed that the effect of combined genotype was not simple sum of single genotypes and the effects of gene pyramiding seemed to be more important in molecular breeding.
Identification of essential genes and synthetic lethal gene combinations in Escherichia coli K-12.

PubMed

Mori, Hirotada; Baba, Tomoya; Yokoyama, Katsushi; Takeuchi, Rikiya; Nomura, Wataru; Makishi, Kazuichi; Otsuka, Yuta; Dose, Hitomi; Wanner, Barry L

2015-01-01

Here we describe the systematic identification of single genes and gene pairs, whose knockout causes lethality in Escherichia coli K-12. During construction of precise single-gene knockout library of E. coli K-12, we identified 328 essential gene candidates for growth in complex (LB) medium. Upon establishment of the Keio single-gene deletion library, we undertook the development of the ASKA single-gene deletion library carrying a different antibiotic resistance. In addition, we developed tools for identification of synthetic lethal gene combinations by systematic construction of double-gene knockout mutants. We introduce these methods herein.
Clinical omics analysis of colorectal cancer incorporating copy number aberrations and gene expression data.

PubMed

Yoshida, Tsuyoshi; Kobayashi, Takumi; Itoda, Masaya; Muto, Taika; Miyaguchi, Ken; Mogushi, Kaoru; Shoji, Satoshi; Shimokawa, Kazuro; Iida, Satoru; Uetake, Hiroyuki; Ishikawa, Toshiaki; Sugihara, Kenichi; Mizushima, Hiroshi; Tanaka, Hiroshi

2010-07-29

Colorectal cancer (CRC) is one of the most frequently occurring cancers in Japan, and thus a wide range of methods have been deployed to study the molecular mechanisms of CRC. In this study, we performed a comprehensive analysis of CRC, incorporating copy number aberration (CRC) and gene expression data. For the last four years, we have been collecting data from CRC cases and organizing the information as an "omics" study by integrating many kinds of analysis into a single comprehensive investigation. In our previous studies, we had experienced difficulty in finding genes related to CRC, as we observed higher noise levels in the expression data than in the data for other cancers. Because chromosomal aberrations are often observed in CRC, here, we have performed a combination of CNA analysis and expression analysis in order to identify some new genes responsible for CRC. This study was performed as part of the Clinical Omics Database Project at Tokyo Medical and Dental University. The purpose of this study was to investigate the mechanism of genetic instability in CRC by this combination of expression analysis and CNA, and to establish a new method for the diagnosis and treatment of CRC. Comprehensive gene expression analysis was performed on 79 CRC cases using an Affymetrix Gene Chip, and comprehensive CNA analysis was performed using an Affymetrix DNA Sty array. To avoid the contamination of cancer tissue with normal cells, laser micro-dissection was performed before DNA/RNA extraction. Data analysis was performed using original software written in the R language. We observed a high percentage of CNA in colorectal cancer, including copy number gains at 7, 8q, 13 and 20q, and copy number losses at 8p, 17p and 18. Gene expression analysis provided many candidates for CRC-related genes, but their association with CRC did not reach the level of statistical significance. The combination of CNA and gene expression analysis, together with the clinical information
Down-weighting overlapping genes improves gene set analysis

PubMed Central

2012-01-01

Background The identification of gene sets that are significantly impacted in a given condition based on microarray data is a crucial step in current life science research. Most gene set analysis methods treat genes equally, regardless how specific they are to a given gene set. Results In this work we propose a new gene set analysis method that computes a gene set score as the mean of absolute values of weighted moderated gene t-scores. The gene weights are designed to emphasize the genes appearing in few gene sets, versus genes that appear in many gene sets. We demonstrate the usefulness of the method when analyzing gene sets that correspond to the KEGG pathways, and hence we called our method Pathway Analysis with Down-weighting of Overlapping Genes (PADOG). Unlike most gene set analysis methods which are validated through the analysis of 2-3 data sets followed by a human interpretation of the results, the validation employed here uses 24 different data sets and a completely objective assessment scheme that makes minimal assumptions and eliminates the need for possibly biased human assessments of the analysis results. Conclusions PADOG significantly improves gene set ranking and boosts sensitivity of analysis using information already available in the gene expression profiles and the collection of gene sets to be analyzed. The advantages of PADOG over other existing approaches are shown to be stable to changes in the database of gene sets to be analyzed. PADOG was implemented as an R package available at: http://bioinformaticsprb.med.wayne.edu/PADOG/or http://www.bioconductor.org. PMID:22713124
Mitochondrial gene sequences alone or combined with ITS region sequences provide firm molecular criteria for the classification of Lecanicillium species.

PubMed

Kouvelis, Vassili N; Sialakouma, Aphrodite; Typas, Milton A

2008-07-01

The recent revision of Verticillium sect. Prostrata led to the introduction of the genus Lecanicillium, which comprises the majority of the entomopathogenic strains. Sixty-five strains previously classified as Verticillium lecanii or Verticillium sp. from different geographical regions and hosts were examined and their phylogenetic relationships were determined using sequences from three mitochondrial (mt) genes [the small rRNA subunit (rns), the NADH dehydrogenase subunits 1 (nad1) and 3 (nad3)] and the ITS region. In general, single gene phylogenetic trees differentiated and placed the strains examined in well-supported (by BS analysis) groups of L. lecanii, L. longisporum, L. muscarium, and L. nodulosum, although in some cases a few uncertainties still remained. nad1 was the most informative single gene in phylogenetic analyses and was also found to contain group I introns with putative open reading frames (ORFs) encoding for GIY-YIG endonucleases. The combined use of mt gene sequences resolved taxonomic uncertainties arisen from ITS analysis and, alone or in combination with ITS sequences, helped in placing uncharacterised Verticillium lecanii and Verticillium sp. firmly into Lecanicillium species. Combined gene data from all the mt genes and all the mt genes and the ITS region together, were very similar. Furthermore, a relaxed correlation with host specificity -- at least for Homoptera -- was indicated for the rns and the combined mt gene sequences. Thus, the usefulness of mt gene sequences as a convenient molecular tool in phylogenetic studies of entomopathogenic fungi was demonstrated.
Combining Selective Pressures to Enhance the Durability of Disease Resistance Genes.

PubMed

2016-01-01

The efficacy of disease resistance genes in plants decreases over time because of the selection of virulent pathogen genotypes. A key goal of crop protection programs is to increase the durability of the resistance conferred by these genes. The spatial and temporal deployment of plant disease resistance genes is considered to be a major factor determining their durability. In the literature, four principal strategies combining resistance genes over time and space have been considered to delay the evolution of virulent pathogen genotypes. We reviewed this literature with the aim of determining which deployment strategy results in the greatest durability of resistance genes. Although theoretical and empirical studies comparing deployment strategies of more than one resistance gene are very scarce, they suggest that the overall durability of disease resistance genes can be increased by combining their presence in the same plant (pyramiding). Retrospective analyses of field monitoring data also suggest that the pyramiding of disease resistance genes within a plant is the most durable strategy. By extension, we suggest that the combination of disease resistance genes with other practices for pathogen control (pesticides, farming practices) may be a relevant management strategy to slow down the evolution of virulent pathogen genotypes.
The relationship between gene transcription and combinations of histone modifications

NASA Astrophysics Data System (ADS)

Cui, Xiangjun; Li, Hong; Luo, Liaofu

2012-09-01

Histone modification is an important subject of epigenetics which plays an intrinsic role in transcriptional regulation. It is known that multiple histone modifications act in a combinatorial fashion. In this study, we demonstrated that the pathways within constructed Bayesian networks can give an indication for the combinations among 12 histone modifications which have been studied in the TSS+1kb region in S. cerevisiae. After Bayesian networks for the genes with high transcript levels (H-network) and low transcript levels (L-network) were constructed, the combinations of modifications within the two networks were analyzed from the view of transcript level. The results showed that different combinations played dissimilar roles in the regulation of gene transcription when there exist differences for gene expression at transcription level.
Combined analysis of fourteen nuclear genes refines the Ursidae phylogeny.

PubMed

Pagès, Marie; Calvignac, Sébastien; Klein, Catherine; Paris, Mathilde; Hughes, Sandrine; Hänni, Catherine

2008-04-01

Despite numerous studies, questions remain about the evolutionary history of Ursidae and additional independent genetic markers were needed to elucidate these ambiguities. For this purpose, we sequenced ten nuclear genes for all the eight extant bear species. By combining these new sequences with those of four other recently published nuclear markers, we provide new insights into the phylogenetic relationships of the Ursidae family members. The hypothesis that the giant panda was the first species to diverge among ursids is definitively confirmed and the precise branching order within the Ursus genus is clarified for the first time. Moreover, our analyses indicate that the American and the Asiatic black bears do not cluster as sister taxa, as had been previously hypothesised. Sun and sloth bears clearly appear as the most basal ursine species but uncertainties about their exact relationships remain. Since our larger dataset did not enable us to clarify this last question, identifying rare genomic changes in bear genomes could be a promising solution for further studies.
Combining growth-promoting genes leads to positive epistasis in Arabidopsis thaliana

PubMed Central

Vanhaeren, Hannes; Gonzalez, Nathalie; Coppens, Frederik; De Milde, Liesbeth; Van Daele, Twiggy; Vermeersch, Mattias; Eloy, Nubia B; Storme, Veronique; Inzé, Dirk

2014-01-01

Several genes positively influence final leaf size in Arabidopsis when mutated or overexpressed. The connections between these growth regulators are still poorly understood although such knowledge would further contribute to understand the processes driving leaf growth. In this study, we performed a combinatorial screen with 13 transgenic Arabidopsis lines with an increased leaf size. We found that from 61 analyzed combinations, 39% showed an additional increase in leaf size and most resulted from a positive epistasis on growth. Similar to what is found in other organisms in which such an epistasis assay was performed, only few genes were highly connected in synergistic combinations as we observed a positive epistasis in the majority of the combinations with samba, BRI1OE or SAUR19OE. Furthermore, positive epistasis was found with combinations of genes with a similar mode of action, but also with genes which affect distinct processes, such as cell proliferation and cell expansion. DOI: http://dx.doi.org/10.7554/eLife.02252.001 PMID:24843021
Laser-capture micro dissection combined with next-generation sequencing analysis of cell type-specific deafness gene expression in the mouse cochlea.

PubMed

Nishio, Shin-Ya; Takumi, Yutaka; Usami, Shin-Ichi

2017-05-01

Cochlear implantation (CI), which directly stimulates the cochlear nerves, is the most effective and widely used medical intervention for patients with severe to profound sensorineural hearing loss. The etiology of the hearing loss is speculated to have a major influence of CI outcomes, particularly in cases resulting from mutations in genes preferentially expressed in the spiral ganglion region. To elucidate precise gene expression levels in each part of the cochlea, we performed laser-capture micro dissection in combination with next-generation sequencing analysis and determined the expression levels of all known deafness-associated genes in the organ of Corti, spiral ganglion, lateral wall, and spiral limbs. The results were generally consistent with previous reports based on immunocytochemistry or in situ hybridization. As a notable result, the genes associated with many kinds of syndromic hearing loss (such as Clpp, Hars2, Hsd17b4, Lars2 for Perrault syndrome, Polr1c and Polr1d for Treacher Collins syndrome, Ndp for Norrie Disease, Kal for Kallmann syndrome, Edn3 and Snai2 for Waardenburg Syndrome, Col4a3 for Alport syndrome, Sema3e for CHARGE syndrome, Col9a1 for Sticker syndrome, Cdh23, Cib2, Clrn1, Pcdh15, Ush1c, Ush2a, Whrn for Usher syndrome and Wfs1 for Wolfram syndrome) showed higher levels of expression in the spiral ganglion than in other parts of the cochlea. This dataset will provide a base for more detailed analysis in order to clarify gene functions in the cochlea as well as predict CI outcomes based on gene expression data. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

PubMed

Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

2009-02-01

High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.
Complex regulation of the aflatoxin biosynthesis gene cluster of Aspergillus flavus in relation to various combinations of water activity and temperature.

PubMed

Schmidt-Heydt, Markus; Abdel-Hadi, Ahmed; Magan, Naresh; Geisen, Rolf

2009-11-15

A microarray analysis was performed to study the effect of varying combinations of water activity and temperature on the activation of aflatoxin biosynthesis genes in Aspergillusflavus grown on YES medium. Generally A. flavus showed expression of the aflatoxin biosynthetic genes at all parameter combinations tested. Certain combinations of a(w) and temperature, especially combinations which imposed stress on the fungus resulted in a significant reduction of the growth rate. At these conditions induction of the whole aflatoxin biosynthesis gene cluster occurred, however the produced aflatoxin B(1) was low. At all other combinations (25 degrees C/0.95 and 0.99; 30 degrees C/0.95 and 0.99; 35 degrees C/0.95 and 0.99) a reduced basal level of cluster gene expression occurred. At these combinations a high growth rate was obtained as well as high aflatoxin production. When single genes were compared, two groups with different expression profiles in relation to water activity/temperature combinations occurred. These two groups were co-ordinately localized within the aflatoxin gene cluster. The ratio of aflR/aflJ expression was correlated with increased aflatoxin biosynthesis.
Gene Circuit Analysis of the Terminal Gap Gene huckebein

PubMed Central

Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

2009-01-01

The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network. PMID:19876378
Gene circuit analysis of the terminal gap gene huckebein.

PubMed

Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

2009-10-01

The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network.
Analysis of the clonal repertoire of gene-corrected cells in gene therapy.

PubMed

Paruzynski, Anna; Glimm, Hanno; Schmidt, Manfred; Kalle, Christof von

2012-01-01

Gene therapy-based clinical phase I/II studies using integrating retroviral vectors could successfully treat different monogenetic inherited diseases. However, with increased efficiency of this therapy, severe side effects occurred in various gene therapy trials. In all cases, integration of the vector close to or within a proto-oncogene contributed substantially to the development of the malignancies. Thus, the in-depth analysis of integration site patterns is of high importance to uncover potential clonal outgrowth and to assess the safety of gene transfer vectors and gene therapy protocols. The standard and nonrestrictive linear amplification-mediated PCR (nrLAM-PCR) in combination with high-throughput sequencing exhibits technologies that allow to comprehensively analyze the clonal repertoire of gene-corrected cells and to assess the safety of the used vector system at an early stage on the molecular level. It enables clarifying the biological consequences of the vector system on the fate of the transduced cell. Furthermore, the downstream performance of real-time PCR allows a quantitative estimation of the clonality of individual cells and their clonal progeny. Here, we present a guideline that should allow researchers to perform comprehensive integration site analysis in preclinical and clinical studies. Copyright Â© 2012 Elsevier Inc. All rights reserved.
Synergistically combined gene delivery for enhanced VEGF secretion and anti-apoptosis

PubMed Central

Won, Young-Wook; Lee, Minhyung; Kim, Hyun Ah; Nam, Kihoon; Bull, David A.; Kim, Sung Wan

2013-01-01

With current pharmacological treatments, preventing the remodeling of the left ventricle and the progression to heart failure is a difficult task. Gene therapy is considered to provide a direct treatment to the long-term complications of ischemic heart diseases. Although current gene therapies that use single molecular targets seem potentially possible, they have not achieved a success in the treatment of ischemic diseases. With an efficient polymeric gene carrier, PAM-ABP, we designed a synergistically combined gene delivery strategy to enhance vascular endothelial growth factor (VEGF) secretion and prolong anti-apoptotic effects. A hypoxia-inducible plasmid expressing both hypoxia-inducible heme oxygenase-1 (HO-1) and the Src homology domain-2 containing tyrosine phosphatase-1 microRNA (miSHP 1) and a hypoxia-responsive VEGF plasmid were combined in this study. The positive feedback circuit between HO-1 and VEGF, and the negative regulatory role of SHP-1 in angiogenesis enhance VEGF secretion synergistically. The synergy in VEGF secretion as a consequence of the gene combination and the prolonged HO-1 activity was confirmed in hypoxic cardiomyocytes and cardiomyocyte apoptosis under hypoxia, and was decreased synergistically. These results suggest that the synergistic combination of VEGF, HO-1, and miSHP-1 may be promising for the clinical treatment of ischemic diseases. PMID:24007285
Analysis of multiplex gene expression maps obtained by voxelation.

PubMed

An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

2009-04-29

Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental
An Integrative Genetics Approach to Identify Candidate Genes Regulating BMD: Combining Linkage, Gene Expression, and Association

PubMed Central

Farber, Charles R; van Nas, Atila; Ghazalpour, Anatole; Aten, Jason E; Doss, Sudheer; Sos, Brandon; Schadt, Eric E; Ingram-Drake, Leslie; Davis, Richard C; Horvath, Steve; Smith, Desmond J; Drake, Thomas A; Lusis, Aldons J

2009-01-01

Numerous quantitative trait loci (QTLs) affecting bone traits have been identified in the mouse; however, few of the underlying genes have been discovered. To improve the process of transitioning from QTL to gene, we describe an integrative genetics approach, which combines linkage analysis, expression QTL (eQTL) mapping, causality modeling, and genetic association in outbred mice. In C57BL/6J × C3H/HeJ (BXH) F2 mice, nine QTLs regulating femoral BMD were identified. To select candidate genes from within each QTL region, microarray gene expression profiles from individual F2 mice were used to identify 148 genes whose expression was correlated with BMD and regulated by local eQTLs. Many of the genes that were the most highly correlated with BMD have been previously shown to modulate bone mass or skeletal development. Candidates were further prioritized by determining whether their expression was predicted to underlie variation in BMD. Using network edge orienting (NEO), a causality modeling algorithm, 18 of the 148 candidates were predicted to be causally related to differences in BMD. To fine-map QTLs, markers in outbred MF1 mice were tested for association with BMD. Three chromosome 11 SNPs were identified that were associated with BMD within the Bmd11 QTL. Finally, our approach provides strong support for Wnt9a, Rasd1, or both underlying Bmd11. Integration of multiple genetic and genomic data sets can substantially improve the efficiency of QTL fine-mapping and candidate gene identification. PMID:18767929
The human RHOX gene cluster: target genes and functional analysis of gene variants in infertile men.

PubMed

Borgmann, Jennifer; Tüttelmann, Frank; Dworniczak, Bernd; Röpke, Albrecht; Song, Hye-Won; Kliesch, Sabine; Wilkinson, Miles F; Laurentino, Sandra; Gromoll, Jörg

2016-11-15

The X-linked reproductive homeobox (RHOX) gene cluster encodes transcription factors preferentially expressed in reproductive tissues. This gene cluster has important roles in male fertility based on phenotypic defects of Rhox-mutant mice and the finding that aberrant RHOX promoter methylation is strongly associated with abnormal human sperm parameters. However, little is known about the molecular mechanism of RHOX function in humans. Using gene expression profiling, we identified genes regulated by members of the human RHOX gene cluster. Some genes were uniquely regulated by RHOXF1 or RHOXF2/2B, while others were regulated by both of these transcription factors. Several of these regulated genes encode proteins involved in processes relevant to spermatogenesis; e.g. stress protection and cell survival. One of the target genes of RHOXF2/2B is RHOXF1, suggesting cross-regulation to enhance transcriptional responses. The potential role of RHOX in human infertility was addressed by sequencing all RHOX exons in a group of 250 patients with severe oligozoospermia. This revealed two mutations in RHOXF1 (c.515G > A and c.522C > T) and four in RHOXF2/2B (-73C > G, c.202G > A, c.411C > T and c.679G > A), of which only one (c.202G > A) was found in a control group of men with normal sperm concentration. Functional analysis demonstrated that c.202G > A and c.679G > A significantly impaired the ability of RHOXF2/2B to regulate downstream genes. Molecular modelling suggested that these mutations alter RHOXF2/F2B protein conformation. By combining clinical data with in vitro functional analysis, we demonstrate how the X-linked RHOX gene cluster may function in normal human spermatogenesis and we provide evidence that it is impaired in human male fertility.
Bacterial reference genes for gene expression studies by RT-qPCR: survey and analysis.

PubMed

Rocha, Danilo J P; Santos, Carolina S; Pacheco, Luis G C

2015-09-01

The appropriate choice of reference genes is essential for accurate normalization of gene expression data obtained by the method of reverse transcription quantitative real-time PCR (RT-qPCR). In 2009, a guideline called the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) highlighted the importance of the selection and validation of more than one suitable reference gene for obtaining reliable RT-qPCR results. Herein, we searched the recent literature in order to identify the bacterial reference genes that have been most commonly validated in gene expression studies by RT-qPCR (in the first 5 years following publication of the MIQE guidelines). Through a combination of different search parameters with the text mining tool MedlineRanker, we identified 145 unique bacterial genes that were recently tested as candidate reference genes. Of these, 45 genes were experimentally validated and, in most of the cases, their expression stabilities were verified using the software tools geNorm and NormFinder. It is noteworthy that only 10 of these reference genes had been validated in two or more of the studies evaluated. An enrichment analysis using Gene Ontology classifications demonstrated that genes belonging to the functional categories of DNA Replication (GO: 0006260) and Transcription (GO: 0006351) rendered a proportionally higher number of validated reference genes. Three genes in the former functional class were also among the top five most stable genes identified through an analysis of gene expression data obtained from the Pathosystems Resource Integration Center. These results may provide a guideline for the initial selection of candidate reference genes for RT-qPCR studies in several different bacterial species.
Combining Shapley value and statistics to the analysis of gene expression data in children exposed to air pollution

PubMed Central

Moretti, Stefano; van Leeuwen, Danitsja; Gmuender, Hans; Bonassi, Stefano; van Delft, Joost; Kleinjans, Jos; Patrone, Fioravante; Merlo, Domenico Franco

2008-01-01

Background In gene expression analysis, statistical tests for differential gene expression provide lists of candidate genes having, individually, a sufficiently low p-value. However, the interpretation of each single p-value within complex systems involving several interacting genes is problematic. In parallel, in the last sixty years, game theory has been applied to political and social problems to assess the power of interacting agents in forcing a decision and, more recently, to represent the relevance of genes in response to certain conditions. Results In this paper we introduce a Bootstrap procedure to test the null hypothesis that each gene has the same relevance between two conditions, where the relevance is represented by the Shapley value of a particular coalitional game defined on a microarray data-set. This method, which is called Comparative Analysis of Shapley value (shortly, CASh), is applied to data concerning the gene expression in children differentially exposed to air pollution. The results provided by CASh are compared with the results from a parametric statistical test for testing differential gene expression. Both lists of genes provided by CASh and t-test are informative enough to discriminate exposed subjects on the basis of their gene expression profiles. While many genes are selected in common by CASh and the parametric test, it turns out that the biological interpretation of the differences between these two selections is more interesting, suggesting a different interpretation of the main biological pathways in gene expression regulation for exposed individuals. A simulation study suggests that CASh offers more power than t-test for the detection of differential gene expression variability. Conclusion CASh is successfully applied to gene expression analysis of a data-set where the joint expression behavior of genes may be critical to characterize the expression response to air pollution. We demonstrate a synergistic effect between
In vitro therapeutic effect of PDT combined with VEGF-A gene therapy

NASA Astrophysics Data System (ADS)

Lecaros, Rumwald Leo G.; Huang, Leaf; Hsu, Yih-Chih

2014-02-01

Vascular endothelial growth factor A (VEGF-A), commonly known as VEGF, is one of the primary factors that affect tumor angiogenesis. It was found to be expressed in cancer cell lines including oral squamous cell carcinoma. Photodynamic therapy (PDT) is a novel therapeutic modality to treat cancer by using a photosensitizer which is activated by a light source to produce reactive oxygen species and mediates oxygen-independent hypoxic conditions to tumor. Another emerging treatment to cure cancer is the use of interference RNA (e.g. siRNA) to silence a specific mRNA sequence. VEGF-A was found to be expressed in oral squamous cell carcinoma and overexpressed after 24 hour post-PDT by Western blot analysis. Cell viability was found to decrease at 25 nM of transfected VEGF-A siRNA. In vitro combined therapy of PDT and VEGF-A siRNA showed better response as compared with PDT and gene therapy alone. The results suggest that PDT combined with targeted gene therapy has a potential mean to achieve better therapeutic outcome.
Combined gene expression analysis of whole-tissue and microdissected pancreatic ductal adenocarcinoma identifies genes specifically overexpressed in tumor epithelia.

PubMed

Badea, Liviu; Herlea, Vlad; Dima, Simona Olimpia; Dumitrascu, Traian; Popescu, Irinel

2008-01-01

The precise details of pancreatic ductal adenocarcinoma (PDAC) pathogenesis are still insufficiently known, requiring the use of high-throughput methods. However, PDAC is especially difficult to study using microarrays due to its strong desmoplastic reaction, which involves a hyperproliferating stroma that effectively "masks" the contribution of the minoritary neoplastic epithelial cells. Thus it is not clear which of the genes that have been found differentially expressed between normal and whole tumor tissues are due to the tumor epithelia and which simply reflect the differences in cellular composition. To address this problem, laser microdissection studies have been performed, but these have to deal with much smaller tissue sample quantities and therefore have significantly higher experimental noise. In this paper we combine our own large sample whole-tissue study with a previously published smaller sample microdissection study by Grützmann et al. to identify the genes that are specifically overexpressed in PDAC tumor epithelia. The overlap of this list of genes with other microarray studies of pancreatic cancer as well as with the published literature is impressive. Moreover, we find a number of genes whose over-expression appears to be inversely correlated with patient survival: keratin 7, laminin gamma 2, stratifin, platelet phosphofructokinase, annexin A2, MAP4K4 and OACT2 (MBOAT2), which are all specifically upregulated in the neoplastic epithelia, rather than the tumor stroma. We improve on other microarray studies of PDAC by putting together the higher statistical power due to a larger number of samples with information about cell-type specific expression and patient survival.
Integrated analysis of gene expression and methylation profiles of 48 candidate genes in breast cancer patients.

PubMed

Li, Zibo; Heng, Jianfu; Yan, Jinhua; Guo, Xinwu; Tang, Lili; Chen, Ming; Peng, Limin; Wu, Yepeng; Wang, Shouman; Xiao, Zhi; Deng, Zhongping; Dai, Lizhong; Wang, Jun

2016-11-01

Gene-specific methylation and expression have shown biological and clinical importance for breast cancer diagnosis and prognosis. Integrated analysis of gene methylation and gene expression may identify genes associated with biology mechanism and clinical outcome of breast cancer and aid in clinical management. Using high-throughput microfluidic quantitative PCR, we analyzed the expression profiles of 48 candidate genes in 96 Chinese breast cancer patients and investigated their correlation with gene methylation and associations with breast cancer clinical parameters. Breast cancer-specific gene expression alternation was found in 25 genes with significant expression difference between paired tumor and normal tissues. A total of 9 genes (CCND2, EGFR, GSTP1, PGR, PTGS2, RECK, SOX17, TNFRSF10D, and WIF1) showed significant negative correlation between methylation and gene expression, which were validated in the TCGA database. Total 23 genes (ACADL, APC, BRCA2, CADM1, CAV1, CCND2, CST6, EGFR, ESR2, GSTP1, ICAM5, NPY, PGR, PTGS2, RECK, RUNX3, SFRP1, SOX17, SYK, TGFBR2, TNFRSF10D, WIF1, and WRN) annotated with potential TFBSs in the promoter regions showed negative correlation between methylation and expression. In logistics regression analysis, 31 of the 48 genes showed improved performance in disease prediction with combination of methylation and expression coefficient. Our results demonstrated the complex correlation and the possible regulatory mechanisms between DNA methylation and gene expression. Integration analysis of methylation and expression of candidate genes could improve performance in breast cancer prediction. These findings would contribute to molecular characterization and identification of biomarkers for potential clinical applications.
Combining Genome-Scale Experimental and Computational Methods To Identify Essential Genes in Rhodobacter sphaeroides

DOE PAGES

Burger, Brian T.; Imam, Saheed; Scarborough, Matthew J.; ...

2017-06-06

Rhodobacter sphaeroides is one of the best-studied alphaproteobacteria from biochemical, genetic, and genomic perspectives. To gain a better systems-level understanding of this organism, we generated a large transposon mutant library and used transposon sequencing (Tn-seq) to identify genes that are essential under several growth conditions. Using newly developed Tn-seq analysis software (TSAS), we identified 493 genes as essential for aerobic growth on a rich medium. We then used the mutant library to identify conditionally essential genes under two laboratory growth conditions, identifying 85 additional genes required for aerobic growth in a minimal medium and 31 additional genes required for photosyntheticmore » growth. In all instances, our analyses confirmed essentiality for many known genes and identified genes not previously considered to be essential. We used the resulting Tn-seq data to refine and improve a genome-scale metabolic network model (GEM) for R. sphaeroides. Together, we demonstrate how genetic, genomic, and computational approaches can be combined to obtain a systems-level understanding of the genetic framework underlying metabolic diversity in bacterial species.« less
Tissue Non-Specific Genes and Pathways Associated with Diabetes: An Expression Meta-Analysis.

PubMed

Mei, Hao; Li, Lianna; Liu, Shijian; Jiang, Fan; Griswold, Michael; Mosley, Thomas

2017-01-21

We performed expression studies to identify tissue non-specific genes and pathways of diabetes by meta-analysis. We searched curated datasets of the Gene Expression Omnibus (GEO) database and identified 13 and five expression studies of diabetes and insulin responses at various tissues, respectively. We tested differential gene expression by empirical Bayes-based linear method and investigated gene set expression association by knowledge-based enrichment analysis. Meta-analysis by different methods was applied to identify tissue non-specific genes and gene sets. We also proposed pathway mapping analysis to infer functions of the identified gene sets, and correlation and independent analysis to evaluate expression association profile of genes and gene sets between studies and tissues. Our analysis showed that PGRMC1 and HADH genes were significant over diabetes studies, while IRS1 and MPST genes were significant over insulin response studies, and joint analysis showed that HADH and MPST genes were significant over all combined data sets. The pathway analysis identified six significant gene sets over all studies. The KEGG pathway mapping indicated that the significant gene sets are related to diabetes pathogenesis. The results also presented that 12.8% and 59.0% pairwise studies had significantly correlated expression association for genes and gene sets, respectively; moreover, 12.8% pairwise studies had independent expression association for genes, but no studies were observed significantly different for expression association of gene sets. Our analysis indicated that there are both tissue specific and non-specific genes and pathways associated with diabetes pathogenesis. Compared to the gene expression, pathway association tends to be tissue non-specific, and a common pathway influencing diabetes development is activated through different genes at different tissues.
EGFR Gene Amplification and KRAS Mutation Predict Response to Combination Targeted Therapy in Metastatic Colorectal Cancer.

PubMed

Khan, Sajid A; Zeng, Zhaoshi; Shia, Jinru; Paty, Philip B

2017-07-01

Genetic variability in KRAS and EGFR predicts response to cetuximab in irinotecan refractory colorectal cancer. Whether these markers or others remain predictive in combination biologic therapies including bevacizumab is unknown. We identified predictive biomarkers from patients with irinotecan refractory metastatic colorectal cancer treated with cetuximab plus bevacizumab. Patients who received cetuximab plus bevacizumab for irinotecan refractory colorectal cancer in either of two Phase II trials conducted were identified. Tumor tissue was available for 33 patients. Genomic DNA was extracted and used for mutational analysis of KRAS, BRAF, and p53 genes. Fluorescence in situ hybridization was performed to assess EGFR copy number. The status of single genes and various combinations were tested for association with response. Seven of 33 patients responded to treatment. KRAS mutations were found in 14/33 cases, and 0 responded to treatment (p = 0.01). EGFR gene amplification was seen in 3/33 of tumors and in every case was associated with response to treatment (p < 0.001). TP53 and BRAF mutations were found in 18/33 and 0/33 tumors, respectively, and there were no associations with response to either gene. EGFR gene amplification and KRAS mutations are predictive markers for patients receiving combination biologic therapy of cetuximab plus bevacizumab for metastatic colorectal cancer. One marker or the other is present in the tumor of half of all patients allowing treatment response to be predicted with a high degree of certainty. The role for molecular markers in combination biologic therapy seems promising.
Weighted functional linear regression models for gene-based association analysis.

PubMed

Belonogova, Nadezhda M; Svishcheva, Gulnara R; Wilson, James F; Campbell, Harry; Axenovich, Tatiana I

2018-01-01

Functional linear regression models are effectively used in gene-based association analysis of complex traits. These models combine information about individual genetic variants, taking into account their positions and reducing the influence of noise and/or observation errors. To increase the power of methods, where several differently informative components are combined, weights are introduced to give the advantage to more informative components. Allele-specific weights have been introduced to collapsing and kernel-based approaches to gene-based association analysis. Here we have for the first time introduced weights to functional linear regression models adapted for both independent and family samples. Using data simulated on the basis of GAW17 genotypes and weights defined by allele frequencies via the beta distribution, we demonstrated that type I errors correspond to declared values and that increasing the weights of causal variants allows the power of functional linear models to be increased. We applied the new method to real data on blood pressure from the ORCADES sample. Five of the six known genes with P < 0.1 in at least one analysis had lower P values with weighted models. Moreover, we found an association between diastolic blood pressure and the VMP1 gene (P = 8.18×10-6), when we used a weighted functional model. For this gene, the unweighted functional and weighted kernel-based models had P = 0.004 and 0.006, respectively. The new method has been implemented in the program package FREGAT, which is freely available at https://cran.r-project.org/web/packages/FREGAT/index.html.
Systematic analysis of human kinase genes: a large number of genes and alternative splicing events result in functional and structural diversity

PubMed Central

Milanesi, Luciano; Petrillo, Mauro; Sepe, Leandra; Boccia, Angelo; D'Agostino, Nunzio; Passamano, Myriam; Di Nardo, Salvatore; Tasco, Gianluca; Casadio, Rita; Paolella, Giovanni

2005-01-01

Background Protein kinases are a well defined family of proteins, characterized by the presence of a common kinase catalytic domain and playing a significant role in many important cellular processes, such as proliferation, maintenance of cell shape, apoptosys. In many members of the family, additional non-kinase domains contribute further specialization, resulting in subcellular localization, protein binding and regulation of activity, among others. About 500 genes encode members of the kinase family in the human genome, and although many of them represent well known genes, a larger number of genes code for proteins of more recent identification, or for unknown proteins identified as kinase only after computational studies. Results A systematic in silico study performed on the human genome, led to the identification of 5 genes, on chromosome 1, 11, 13, 15 and 16 respectively, and 1 pseudogene on chromosome X; some of these genes are reported as kinases from NCBI but are absent in other databases, such as KinBase. Comparative analysis of 483 gene regions and subsequent computational analysis, aimed at identifying unannotated exons, indicates that a large number of kinase may code for alternately spliced forms or be incorrectly annotated. An InterProScan automated analysis was perfomed to study domain distribution and combination in the various families. At the same time, other structural features were also added to the annotation process, including the putative presence of transmembrane alpha helices, and the cystein propensity to participate into a disulfide bridge. Conclusion The predicted human kinome was extended by identifiying both additional genes and potential splice variants, resulting in a varied panorama where functionality may be searched at the gene and protein level. Structural analysis of kinase proteins domains as defined in multiple sources together with transmembrane alpha helices and signal peptide prediction provides hints to function assignment
[Using exon combined target region capture sequencing chip to detect the disease-causing genes of retinitis pigmentosa].

PubMed

Rong, Weining; Chen, Xuejuan; Li, Huiping; Liu, Yani; Sheng, Xunlun

2014-06-01

To detect the disease-causing genes of 10 retinitis pigmentosa pedigrees by using exon combined target region capture sequencing chip. Pedigree investigation study. From October 2010 to December 2013, 10 RP pedigrees were recruited for this study in Ningxia Eye Hospital. All the patients and family members received complete ophthalmic examinations. DNA was abstracted from patients, family members and controls. Using exon combined target region capture sequencing chip to screen the candidate disease-causing mutations. Polymerase chain reaction (PCR) and direct sequencing were used to confirm the disease-causing mutations. Seventy patients and 23 normal family members were recruited from 10 pedigrees. Among 10 RP pedigrees, 1 was autosomal dominant pedigrees and 9 were autosomal recessive pedigrees. 7 mutations related to 5 genes of 5 pedigrees were detected. A frameshift mutation on BBS7 gene was detected in No.2 pedigree, the patients of this pedigree combined with central obesity, polydactyly and mental handicap. No.2 pedigree was diagnosed as Bardet-Biedl syndrome finally. A missense mutation was detected in No.7 and No.10 pedigrees respectively. Because the patients suffered deafness meanwhile, the final diagnosis was Usher syndrome. A missense mutation on C3 gene related to age-related macular degeneration was also detected in No. 7 pedigrees. A nonsense mutation and a missense mutation on CRB1 gene were detected in No. 1 pedigree and a splicesite mutation on PROM1 gene was detected in No. 5 pedigree. Retinitis pigmentosa is a kind of genetic eye disease with diversity clinical phenotypes. Rapid and effective genetic diagnosis technology combined with clinical characteristics analysis is helpful to improve the level of clinical diagnosis of RP.

Molecular characterization and combined genotype association study of bovine cluster of differentiation 14 gene with clinical mastitis in crossbred dairy cattle.

PubMed

Selvan, A Sakthivel; Gupta, I D; Verma, A; Chaudhari, M V; Magotra, A

2016-07-01

The present study was undertaken with the objectives to characterize and to analyze combined genotypes of cluster of differentiation 14 (CD14) gene to explore its association with clinical mastitis in Karan Fries (KF) cows maintained in the National Dairy Research Institute herd, Karnal. Genomic DNA was extracted using blood of randomly selected 94 KF lactating cattle by phenol-chloroform method. After checking its quality and quantity, polymerase chain reaction (PCR) was carried out using six sets of reported gene-specific primers to amplify complete KF CD14 gene. The forward and reverse sequences for each PCR fragments were assembled to form complete sequence for the respective region of KF CD14 gene. The multiple sequence alignments of the edited sequence with the corresponding reference with reported Bos taurus sequence (EU148610.1) were performed with ClustalW software to identify single nucleotide polymorphisms (SNPs). Basic Local Alignment Search Tool analysis was performed to compare the sequence identity of KF CD14 gene with other species. The restriction fragment length polymorphism (RFLP) analysis was carried out in all KF cows using Helicobacter pylori 188I (Hpy188I) (contig 2) and Haemophilus influenzae I (HinfI) (contig 4) restriction enzyme (RE). Cows were assigned genotypes obtained by PCR-RFLP analysis, and association study was done using Chi-square (χ (2)) test. The genotypes of both contigs (loci) number 2 and 4 were combined with respect to each animal to construct combined genotype patterns. Two types of sequences of KF were obtained: One with 2630 bp having one insertion at 616 nucleotide (nt) position and one deletion at 1117 nt position, and the another sequence was of 2629 bp having only one deletion at 615 nt position. ClustalW, multiple alignments of KF CD14 gene sequence with B. taurus cattle sequence (EU148610.1), revealed 24 nt changes (SNPs). Cows were also screened using PCR-RFLP with Hpy188I (contig 2) and HinfI (contig 4) RE
Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function

PubMed Central

Tian, Weidong; Zhang, Lan V; Taşan, Murat; Gibbons, Francis D; King, Oliver D; Park, Julie; Wunderlich, Zeba; Cherry, J Michael; Roth, Frederick P

2008-01-01

Background: Learning the function of genes is a major goal of computational genomics. Methods for inferring gene function have typically fallen into two categories: 'guilt-by-profiling', which exploits correlation between function and other gene characteristics; and 'guilt-by-association', which transfers function from one gene to another via biological relationships. Results: We have developed a strategy ('Funckenstein') that performs guilt-by-profiling and guilt-by-association and combines the results. Using a benchmark set of functional categories and input data for protein-coding genes in Saccharomyces cerevisiae, Funckenstein was compared with a previous combined strategy. Subsequently, we applied Funckenstein to 2,455 Gene Ontology terms. In the process, we developed 2,455 guilt-by-profiling classifiers based on 8,848 gene characteristics and 12 functional linkage graphs based on 23 biological relationships. Conclusion: Funckenstein outperforms a previous combined strategy using a common benchmark dataset. The combination of 'guilt-by-profiling' and 'guilt-by-association' gave significant improvement over the component classifiers, showing the greatest synergy for the most specific functions. Performance was evaluated by cross-validation and by literature examination of the top-scoring novel predictions. These quantitative predictions should help prioritize experimental study of yeast gene functions. PMID:18613951
A Streamlined Protocol for Molecular Testing of the DMD Gene within a Diagnostic Laboratory: A Combination of Array Comparative Genomic Hybridization and Bidirectional Sequence Analysis

PubMed Central

Marquis-Nicholson, Renate; Lai, Daniel; Love, Jennifer M.; Love, Donald R.

2013-01-01

Purpose. The aim of this study was to develop a streamlined mutation screening protocol for the DMD gene in order to confirm a clinical diagnosis of Duchenne or Becker muscular dystrophy in affected males and to clarify the carrier status of female family members. Methods. Sequence analysis and array comparative genomic hybridization (aCGH) were used to identify mutations in the dystrophin DMD gene. We analysed genomic DNA from six individuals with a range of previously characterised mutations and from eight individuals who had not previously undergone any form of molecular analysis. Results. We successfully identified the known mutations in all six patients. A molecular diagnosis was also made in three of the four patients with a clinical diagnosis who had not undergone prior genetic screening, and testing for familial mutations was successfully completed for the remaining four patients. Conclusion. The mutation screening protocol described here meets best practice guidelines for molecular testing of the DMD gene in a diagnostic laboratory. The aCGH method is a superior alternative to more conventional assays such as multiplex ligation-dependent probe amplification (MLPA). The combination of aCGH and sequence analysis will detect mutations in 98% of patients with the Duchenne or Becker muscular dystrophy. PMID:23476807
Phylogeny of sipunculan worms: A combined analysis of four gene regions and morphology.

PubMed

Schulze, Anja; Cutler, Edward B; Giribet, Gonzalo

2007-01-01

The intra-phyletic relationships of sipunculan worms were analyzed based on DNA sequence data from four gene regions and 58 morphological characters. Initially we analyzed the data under direct optimization using parsimony as optimality criterion. An implied alignment resulting from the direct optimization analysis was subsequently utilized to perform a Bayesian analysis with mixed models for the different data partitions. For this we applied a doublet model for the stem regions of the 18S rRNA. Both analyses support monophyly of Sipuncula and most of the same clades within the phylum. The analyses differ with respect to the relationships among the major groups but whereas the deep nodes in the direct optimization analysis generally show low jackknife support, they are supported by 100% posterior probability in the Bayesian analysis. Direct optimization has been useful for handling sequences of unequal length and generating conservative phylogenetic hypotheses whereas the Bayesian analysis under mixed models provided high resolution in the basal nodes of the tree.
Analysis of Polymorphism of Angiotensin System Genes (ACE, AGTR1, and AGT) and Gene ITGB3 in Patients with Arterial Hypertension in Combination with Metabolic Syndrome.

PubMed

Zotova, T Yu; Kubanova, A P; Azova, M M; Aissa, A Ait; Gigani, O O; Frolov, V A

2016-07-01

Changes in the frequencies of genotypes and mutant alleles of ACE, AGTR1, AGT, and ITGB3 genes were analyzed in patients with arterial hypertension coupled with metabolic syndrome (N=15) and compared with population data and corresponding parameters in patients with isolated hypertension (N=15). Increased frequency of genotype ID of ACE gene (hypertension predictor) was confirmed for both groups. In case of isolated hypertension, M235M genotype (gene AGT) was more frequent, in case of hypertension combined with metabolic syndrome, the frequency of genotypes A1166C and C1166C of the gene AGTR1 was higher in comparison with population data. Comparison of mutant allele frequencies in the two groups showed that at the 90% significance level allele T of the AGT gene was more frequent in hypertension coupled with metabolic syndrome (OR=1.26) and genotype A1166A of the AGTR1 gene was more frequent in the group with isolated hypertension.
Synergistic effects of arsenic trioxide combined with ascorbic acid in human osteosarcoma MG-63 cells: a systems biology analysis.

PubMed

Huang, X C; Maimaiti, X Y M; Huang, C W; Zhang, L; Li, Z B; Chen, Z G; Gao, X; Chen, T Y

2014-01-01

To further understand the synergistic mechanism of As2O3 and asscorbic acid (AA) in human osteosarcoma MG-63 cells by systems biology analysis. Human osteosarcoma MG-63 cells were treated by As2O3 (1 µmol/L), AA (62.5 µmol/L) and combined drugs (1 µmol/L As2O3 plus 62.5 µmol/L AA). Dynamic morphological characteristics were recorded by Cell-IQ system, and growth rate was calculated. Illumina beadchip assay was used to analyze the differential expression genes in different groups. Synergic effects on differential expression genes (DEGs) were analyzed by mixture linear model and singular value decomposition model. KEGG pathway annotations and GO enrichment analysis were performed to figure out the pathways involved in the synergic effects. We captured 1987 differential expression genes in combined therapy MG-63 cells. FAT1 gene was significantly upregulated in all three groups, which is a promising drug target as an important tumor suppressor analogue; meanwhile, HIST1H2BD gene was markedly downregulated in the As2O3 monotherapy group and the combined therapy group, which was found to be upregulated in prostatic cancer. These two genes might play critical roles in synergetic effects of AA and As2O3, although the exact mechanism needs further investigation. KEGG pathway analysis showed many DEGs were related with tight junction, and GO analysis also indicated that DEGs in the combined therapy cells gathered in occluding junction, apical junction complex, cell junction, and tight junction. AA potentiates the efficacy of As2O3 in MG-63 cells. Systems biology analysis showed the synergic effect on the DEGs.
Selection of reference genes for qRT-PCR analysis of gene expression in sea cucumber Apostichopus japonicus during aestivation

NASA Astrophysics Data System (ADS)

Zhao, Ye; Chen, Muyan; Wang, Tianming; Sun, Lina; Xu, Dongxue; Yang, Hongsheng

2014-11-01

Quantitative real-time reverse transcription-polymerase chain reaction (qRT-PCR) is a technique that is widely used for gene expression analysis, and its accuracy depends on the expression stability of the internal reference genes used as normalization factors. However, many applications of qRT-PCR used housekeeping genes as internal controls without validation. In this study, the expression stability of eight candidate reference genes in three tissues (intestine, respiratory tree, and muscle) of the sea cucumber Apostichopus japonicus was assessed during normal growth and aestivation using the geNorm, NormFinder, delta CT, and RefFinder algorithms. The results indicate that the reference genes exhibited significantly different expression patterns among the three tissues during aestivation. In general, the β-tubulin (TUBB) gene was relatively stable in the intestine and respiratory tree tissues. The optimal reference gene combination for intestine was 40S ribosomal protein S18 (RPS18), TUBB, and NADH dehydrogenase (NADH); for respiratory tree, it was β-actin (ACTB), TUBB, and succinate dehydrogenase cytochrome B small subunit (SDHC); and for muscle it was α-tubulin (TUBA) and NADH dehydrogenase [ubiquinone] 1 α subcomplex subunit 13 (NDUFA13). These combinations of internal control genes should be considered for use in further studies of gene expression in A. japonicus during aestivation.
Marker-assisted combination of major genes for pathogen resistance in potato.

PubMed

Gebhardt, C; Bellin, D; Henselewski, H; Lehmann, W; Schwarzfischer, J; Valkonen, J P T

2006-05-01

Closely linked PCR-based markers facilitate the tracing and combining of resistance factors that have been introgressed previously into cultivated potato from different sources. Crosses were performed to combine the Ry ( adg ) gene for extreme resistance to Potato virus Y (PVY) with the Gro1 gene for resistance to the root cyst nematode Globodera rostochiensis and the Rx1 gene for extreme resistance to Potato virus X (PVX), or with resistance to potato wart (Synchytrium endobioticum). Marker-assisted selection (MAS) using four PCR-based diagnostic assays was applied to 110 F1 hybrids resulting from four 2x by 4x cross-combinations. Thirty tetraploid plants having the appropriate marker combinations were selected and tested for presence of the corresponding resistance traits. All plants tested showed the expected resistant phenotype. Unexpectedly, the plants segregated for additional resistance to pathotypes 1, 2 and 6 of S. endobioticum, which was subsequently shown to be inherited from the PVY resistant parents of the crosses. The selected plants can be used as sources of multiple resistance traits in pedigree breeding and are available from a potato germplasm bank.
ExAtlas: An interactive online tool for meta-analysis of gene expression data.

PubMed

Sharov, Alexei A; Schlessinger, David; Ko, Minoru S H

2015-12-01

We have developed ExAtlas, an on-line software tool for meta-analysis and visualization of gene expression data. In contrast to existing software tools, ExAtlas compares multi-component data sets and generates results for all combinations (e.g. all gene expression profiles versus all Gene Ontology annotations). ExAtlas handles both users' own data and data extracted semi-automatically from the public repository (GEO/NCBI database). ExAtlas provides a variety of tools for meta-analyses: (1) standard meta-analysis (fixed effects, random effects, z-score, and Fisher's methods); (2) analyses of global correlations between gene expression data sets; (3) gene set enrichment; (4) gene set overlap; (5) gene association by expression profile; (6) gene specificity; and (7) statistical analysis (ANOVA, pairwise comparison, and PCA). ExAtlas produces graphical outputs, including heatmaps, scatter-plots, bar-charts, and three-dimensional images. Some of the most widely used public data sets (e.g. GNF/BioGPS, Gene Ontology, KEGG, GAD phenotypes, BrainScan, ENCODE ChIP-seq, and protein-protein interaction) are pre-loaded and can be used for functional annotations.
Association analysis of bovine Foxa2 gene single sequence variant and haplotype combinations with growth traits in Chinese cattle.

PubMed

Liu, Mei; Li, Mijie; Wang, Shaoqiang; Xu, Yao; Lan, Xianyong; Li, Zhuanjian; Lei, Chuzhao; Yang, Dongying; Jia, Yutang; Chen, Hong

2014-02-25

Forkhead box A2 (Foxa2) has been recognized as one of the most potent transcriptional activators that is implicated in the control of feeding behavior and energy homeostasis. However, similar researches about the effects of genetic variations of Foxa2 gene on growth traits are lacking. Therefore, this study detected Foxa2 gene polymorphisms by DNA pool sequencing, PCR-RFLP and PCR-ACRS methods in 822 individuals from three Chinese cattle breeds. The results showed that four sequence variants (SVs) were screened, including two mutations (SV1, g. 7005 C>T and SV2, g. 7044 C>G) in intron 4, one mutation (SV3, g. 8449 A>G) in exon 5 and one mutation (SV4, g. 8537 T>C) in the 3'UTR. Notably, association analysis of the single mutations with growth traits in total individuals (at 24months) revealed that significant statistical difference was found in four SVs, and SV4 locus was highly significantly associated with growth traits throughout all three breeds (P<0.05 or P<0.01). Meanwhile, haplotype combination CCCCAGTC also indicated remarkably associated to better chest girth and body weight in Jiaxian Red cattle (P<0.05). We herein described a comprehensive study on the variability of bovine Foxa2 gene that was predictive of molecular markers in cattle breeding for the first time. Copyright © 2013 Elsevier B.V. All rights reserved.
Time-Course Gene Set Analysis for Longitudinal Gene Expression Data

PubMed Central

Hejblum, Boris P.; Skinner, Jason; Thiébaut, Rodolphe

2015-01-01

Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA) introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR) measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial), and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA) for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package. PMID:26111374
A Comprehensive Analysis of Nuclear-Encoded Mitochondrial Genes in Schizophrenia.

PubMed

Gonçalves, Vanessa F; Cappi, Carolina; Hagen, Christian M; Sequeira, Adolfo; Vawter, Marquis P; Derkach, Andriy; Zai, Clement C; Hedley, Paula L; Bybjerg-Grauholm, Jonas; Pouget, Jennie G; Cuperfain, Ari B; Sullivan, Patrick F; Christiansen, Michael; Kennedy, James L; Sun, Lei

2018-05-01

The genetic risk factors of schizophrenia (SCZ), a severe psychiatric disorder, are not yet fully understood. Multiple lines of evidence suggest that mitochondrial dysfunction may play a role in SCZ, but comprehensive association studies are lacking. We hypothesized that variants in nuclear-encoded mitochondrial genes influence susceptibility to SCZ. We conducted gene-based and gene-set analyses using summary association results from the Psychiatric Genomics Consortium Schizophrenia Phase 2 (PGC-SCZ2) genome-wide association study comprising 35,476 cases and 46,839 control subjects. We applied the MAGMA method to three sets of nuclear-encoded mitochondrial genes: oxidative phosphorylation genes, other nuclear-encoded mitochondrial genes, and genes involved in nucleus-mitochondria crosstalk. Furthermore, we conducted a replication study using the iPSYCH SCZ sample of 2290 cases and 21,621 control subjects. In the PGC-SCZ2 sample, 1186 mitochondrial genes were analyzed, among which 159 had p values < .05 and 19 remained significant after multiple testing correction. A meta-analysis of 818 genes combining the PGC-SCZ2 and iPSYCH samples resulted in 104 nominally significant and nine significant genes, suggesting a polygenic model for the nuclear-encoded mitochondrial genes. Gene-set analysis, however, did not show significant results. In an in silico protein-protein interaction network analysis, 14 mitochondrial genes interacted directly with 158 SCZ risk genes identified in PGC-SCZ2 (permutation p = .02), and aldosterone signaling in epithelial cells and mitochondrial dysfunction pathways appeared to be overrepresented in this network of mitochondrial and SCZ risk genes. This study provides evidence that specific aspects of mitochondrial function may play a role in SCZ, but we did not observe its broad involvement even using a large sample. Copyright © 2018 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Bacterial and Pneumocystis Infections in the Lungs of Gene-Knockout Rabbits with Severe Combined Immunodeficiency

PubMed Central

Song, Jun; Wang, Guoshun; Hoenerhoff, Mark J.; Ruan, Jinxue; Yang, Dongshan; Zhang, Jifeng; Yang, Jibing; Lester, Patrick A.; Sigler, Robert; Bradley, Michael; Eckley, Samantha; Cornelius, Kelsey; Chen, Kong; Kolls, Jay K.; Peng, Li; Ma, Liang; Chen, Yuqing Eugene; Sun, Fei; Xu, Jie

2018-01-01

Using the CRISPR/Cas9 gene-editing technology, we recently produced a number of rabbits with mutations in immune function genes, including FOXN1, PRKDC, RAG1, RAG2, and IL2RG. Seven founder knockout rabbits (F0) and three male IL2RG null (−/y) F1 animals demonstrated severe combined immunodeficiency (SCID), characterized by absence or pronounced hypoplasia of the thymus and splenic white pulp, and absence of immature and mature T and B-lymphocytes in peripheral blood. Complete blood count analysis showed severe leukopenia and lymphocytopenia accompanied by severe neutrophilia. Without prophylactic antibiotics, the SCID rabbits universally succumbed to lung infections following weaning. Pathology examination revealed severe heterophilic bronchopneumonia caused by Bordetella bronchiseptica in several animals, but a consistent feature of lung lesions in all animals was a severe interstitial pneumonia caused by Pneumocystis oryctolagi, as confirmed by histological examination and PCR analysis of Pneumocystis genes. The results of this study suggest that these SCID rabbits could serve as a useful model for human SCID to investigate the disease pathogenesis and the development of gene and drug therapies. PMID:29593714
Molecular characterization and combined genotype association study of bovine cluster of differentiation 14 gene with clinical mastitis in crossbred dairy cattle

PubMed Central

Selvan, A. Sakthivel; Gupta, I. D.; Verma, A.; Chaudhari, M. V.; Magotra, A.

2016-01-01

Aim: The present study was undertaken with the objectives to characterize and to analyze combined genotypes of cluster of differentiation 14 (CD14) gene to explore its association with clinical mastitis in Karan Fries (KF) cows maintained in the National Dairy Research Institute herd, Karnal. Materials and Methods: Genomic DNA was extracted using blood of randomly selected 94 KF lactating cattle by phenol-chloroform method. After checking its quality and quantity, polymerase chain reaction (PCR) was carried out using six sets of reported gene-specific primers to amplify complete KF CD14 gene. The forward and reverse sequences for each PCR fragments were assembled to form complete sequence for the respective region of KF CD14 gene. The multiple sequence alignments of the edited sequence with the corresponding reference with reported Bos taurus sequence (EU148610.1) were performed with ClustalW software to identify single nucleotide polymorphisms (SNPs). Basic Local Alignment Search Tool analysis was performed to compare the sequence identity of KF CD14 gene with other species. The restriction fragment length polymorphism (RFLP) analysis was carried out in all KF cows using Helicobacter pylori 188I (Hpy188I) (contig 2) and Haemophilus influenzae I (HinfI) (contig 4) restriction enzyme (RE). Cows were assigned genotypes obtained by PCR-RFLP analysis, and association study was done using Chi-square (χ2) test. The genotypes of both contigs (loci) number 2 and 4 were combined with respect to each animal to construct combined genotype patterns. Results: Two types of sequences of KF were obtained: One with 2630 bp having one insertion at 616 nucleotide (nt) position and one deletion at 1117 nt position, and the another sequence was of 2629 bp having only one deletion at 615 nt position. ClustalW, multiple alignments of KF CD14 gene sequence with B. taurus cattle sequence (EU148610.1), revealed 24 nt changes (SNPs). Cows were also screened using PCR-RFLP with Hpy188I
Systematic analysis of microarray datasets to identify Parkinson's disease‑associated pathways and genes.

PubMed

Feng, Yinling; Wang, Xuefeng

2017-03-01

In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.
Gene-Gene Combination Effect and Interactions among ABCA1, APOA1, SR-B1, and CETP Polymorphisms for Serum High-Density Lipoprotein-Cholesterol in the Japanese Population

PubMed Central

Nakamura, Akihiko; Niimura, Hideshi; Kuwabara, Kazuyo; Takezaki, Toshiro; Morita, Emi; Wakai, Kenji; Hamajima, Nobuyuki; Nishida, Yuichiro; Turin, Tanvir Chowdhury; Suzuki, Sadao; Ohnaka, Keizo; Uemura, Hirokazu; Ozaki, Etsuko; Hosono, Satoyo; Mikami, Haruo; Kubo, Michiaki; Tanaka, Hideo

2013-01-01

Background/Objective Gene-gene interactions in the reverse cholesterol transport system for high-density lipoprotein-cholesterol (HDL-C) are poorly understood. The present study observed gene-gene combination effect and interactions between single nucleotide polymorphisms (SNPs) in ABCA1, APOA1, SR-B1, and CETP in serum HDL-C from a cross-sectional study in the Japanese population. Methods The study population comprised 1,535 men and 1,515 women aged 35–69 years who were enrolled in the Japan Multi-Institutional Collaborative Cohort (J-MICC) Study. We selected 13 SNPs in the ABCA1, APOA1, CETP, and SR-B1 genes in the reverse cholesterol transport system. The effects of genetic and environmental factors were assessed using general linear and logistic regression models after adjusting for age, sex, and region. Principal Findings Alcohol consumption and daily activity were positively associated with HDL-C levels, whereas smoking had a negative relationship. The T allele of CETP, rs3764261, was correlated with higher HDL-C levels and had the highest coefficient (2.93 mg/dL/allele) among the 13 SNPs, which was statistically significant after applying the Bonferroni correction (p<0.001). Gene-gene combination analysis revealed that CETP rs3764261 was associated with high HDL-C levels with any combination of SNPs from ABCA1, APOA1, and SR-B1, although no gene-gene interaction was apparent. An increasing trend for serum HDL-C was also observed with an increasing number of alleles (p<0.001). Conclusions The present study identified a multiplier effect from a polymorphism in CETP with ABCA1, APOA1, and SR-B1, as well as a dose-dependence according to the number of alleles present. PMID:24376512
Effect of the absolute statistic on gene-sampling gene-set analysis methods.

PubMed

Nam, Dougu

2017-06-01

Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.
Clinical and multiple gene expression variables in survival analysis of breast cancer: Analysis with the hypertabastic survival model

PubMed Central

2012-01-01

Background We explore the benefits of applying a new proportional hazard model to analyze survival of breast cancer patients. As a parametric model, the hypertabastic survival model offers a closer fit to experimental data than Cox regression, and furthermore provides explicit survival and hazard functions which can be used as additional tools in the survival analysis. In addition, one of our main concerns is utilization of multiple gene expression variables. Our analysis treats the important issue of interaction of different gene signatures in the survival analysis. Methods The hypertabastic proportional hazards model was applied in survival analysis of breast cancer patients. This model was compared, using statistical measures of goodness of fit, with models based on the semi-parametric Cox proportional hazards model and the parametric log-logistic and Weibull models. The explicit functions for hazard and survival were then used to analyze the dynamic behavior of hazard and survival functions. Results The hypertabastic model provided the best fit among all the models considered. Use of multiple gene expression variables also provided a considerable improvement in the goodness of fit of the model, as compared to use of only one. By utilizing the explicit survival and hazard functions provided by the model, we were able to determine the magnitude of the maximum rate of increase in hazard, and the maximum rate of decrease in survival, as well as the times when these occurred. We explore the influence of each gene expression variable on these extrema. Furthermore, in the cases of continuous gene expression variables, represented by a measure of correlation, we were able to investigate the dynamics with respect to changes in gene expression. Conclusions We observed that use of three different gene signatures in the model provided a greater combined effect and allowed us to assess the relative importance of each in determination of outcome in this data set. These
Validation of reference genes for gene expression analysis in olive (Olea europaea) mesocarp tissue by quantitative real-time RT-PCR

PubMed Central

2014-01-01

Background Gene expression analysis using quantitative reverse transcription PCR (qRT-PCR) is a robust method wherein the expression levels of target genes are normalised using internal control genes, known as reference genes, to derive changes in gene expression levels. Although reference genes have recently been suggested for olive tissues, combined/independent analysis on different cultivars has not yet been tested. Therefore, an assessment of reference genes was required to validate the recent findings and select stably expressed genes across different olive cultivars. Results A total of eight candidate reference genes [glyceraldehyde 3-phosphate dehydrogenase (GAPDH), serine/threonine-protein phosphatase catalytic subunit (PP2A), elongation factor 1 alpha (EF1-alpha), polyubiquitin (OUB2), aquaporin tonoplast intrinsic protein (TIP2), tubulin alpha (TUBA), 60S ribosomal protein L18-3 (60S RBP L18-3) and polypyrimidine tract-binding protein homolog 3 (PTB)] were chosen based on their stability in olive tissues as well as in other plants. Expression stability was examined by qRT-PCR across 12 biological samples, representing mesocarp tissues at various developmental stages in three different olive cultivars, Barnea, Frantoio and Picual, independently and together during the 2009 season with two software programs, GeNorm and BestKeeper. Both software packages identified GAPDH, EF1-alpha and PP2A as the three most stable reference genes across the three cultivars and in the cultivar, Barnea. GAPDH, EF1-alpha and 60S RBP L18-3 were found to be most stable reference genes in the cultivar Frantoio while 60S RBP L18-3, OUB2 and PP2A were found to be most stable reference genes in the cultivar Picual. Conclusions The analyses of expression stability of reference genes using qRT-PCR revealed that GAPDH, EF1-alpha, PP2A, 60S RBP L18-3 and OUB2 are suitable reference genes for expression analysis in developing Olea europaea mesocarp tissues, displaying the highest level
The combination of dimethoxycurcumin with DNA methylation inhibitor enhances gene re-expression of promoter-methylated genes and antagonizes their cytotoxic effect

PubMed Central

Hassan, Hazem E.; Keita, Jean-Arnaud; Narayan, Lawrence; Brady, Sean M.; Frederick, Richard; Carlson, Samuel; C. Glass, Karen; Natesan, Senthil; Buttolph, Thomm; Fandy, Tamer E.

2016-01-01

ABSTRACT Curcumin and its analogs exhibited antileukemic activity either as single agent or in combination therapy. Dimethoxycurcumin (DMC) is a more metabolically stable curcumin analog that was shown to induce the expression of promoter-methylated genes without reversing DNA methylation. Accordingly, co-treatment with DMC and DNA methyltransferase (DNMT) inhibitors could hypothetically enhance the re-expression of promoter-methylated tumor suppressor genes. In this study, we investigated the cytotoxic effects and epigenetic changes associated with the combination of DMC and the DNMT inhibitor decitabine (DAC) in primary leukemia samples and cell lines. The combination demonstrated antagonistic cytotoxic effects and was minimally cytotoxic to primary leukemia cells. The combination did not affect the metabolic stability of DMC. Although the combination enhanced the downregulation of nuclear DNMT proteins, the hypomethylating activity of the combination was not increased significantly compared to DAC alone. On the other hand, the combination significantly increased H3K27 acetylation (H3K27Ac) compared to the single agents near the promoter region of promoter-methylated genes. Furthermore, sequential chromatin immunoprecipitation (ChIP) and DNA pyrosequencing of the chromatin-enriched H3K27Ac did not show any significant decrease in DNA methylation compared to other regions. Consequently, the enhanced induction of promoter-methylated genes by the combination compared to DAC alone is mediated by a mechanism that involves increased histone acetylation and not through potentiation of the DNA hypomethylating activity of DAC. Collectively, our results provide the mechanistic basis for further characterization of this combination in leukemia animal models and early phase clinical trials. PMID:27588609

An effective fuzzy kernel clustering analysis approach for gene expression data.

PubMed

Sun, Lin; Xu, Jiucheng; Yin, Jiaojiao

2015-01-01

Fuzzy clustering is an important tool for analyzing microarray data. A major problem in applying fuzzy clustering method to microarray gene expression data is the choice of parameters with cluster number and centers. This paper proposes a new approach to fuzzy kernel clustering analysis (FKCA) that identifies desired cluster number and obtains more steady results for gene expression data. First of all, to optimize characteristic differences and estimate optimal cluster number, Gaussian kernel function is introduced to improve spectrum analysis method (SAM). By combining subtractive clustering with max-min distance mean, maximum distance method (MDM) is proposed to determine cluster centers. Then, the corresponding steps of improved SAM (ISAM) and MDM are given respectively, whose superiority and stability are illustrated through performing experimental comparisons on gene expression data. Finally, by introducing ISAM and MDM into FKCA, an effective improved FKCA algorithm is proposed. Experimental results from public gene expression data and UCI database show that the proposed algorithms are feasible for cluster analysis, and the clustering accuracy is higher than the other related clustering algorithms.
Assessment of the predictive accuracy of five in silico prediction tools, alone or in combination, and two metaservers to classify long QT syndrome gene mutations.

PubMed

Leong, Ivone U S; Stuckey, Alexander; Lai, Daniel; Skinner, Jonathan R; Love, Donald R

2015-05-13

Long QT syndrome (LQTS) is an autosomal dominant condition predisposing to sudden death from malignant arrhythmia. Genetic testing identifies many missense single nucleotide variants of uncertain pathogenicity. Establishing genetic pathogenicity is an essential prerequisite to family cascade screening. Many laboratories use in silico prediction tools, either alone or in combination, or metaservers, in order to predict pathogenicity; however, their accuracy in the context of LQTS is unknown. We evaluated the accuracy of five in silico programs and two metaservers in the analysis of LQTS 1-3 gene variants. The in silico tools SIFT, PolyPhen-2, PROVEAN, SNPs&GO and SNAP, either alone or in all possible combinations, and the metaservers Meta-SNP and PredictSNP, were tested on 312 KCNQ1, KCNH2 and SCN5A gene variants that have previously been characterised by either in vitro or co-segregation studies as either "pathogenic" (283) or "benign" (29). The accuracy, sensitivity, specificity and Matthews Correlation Coefficient (MCC) were calculated to determine the best combination of in silico tools for each LQTS gene, and when all genes are combined. The best combination of in silico tools for KCNQ1 is PROVEAN, SNPs&GO and SIFT (accuracy 92.7%, sensitivity 93.1%, specificity 100% and MCC 0.70). The best combination of in silico tools for KCNH2 is SIFT and PROVEAN or PROVEAN, SNPs&GO and SIFT. Both combinations have the same scores for accuracy (91.1%), sensitivity (91.5%), specificity (87.5%) and MCC (0.62). In the case of SCN5A, SNAP and PROVEAN provided the best combination (accuracy 81.4%, sensitivity 86.9%, specificity 50.0%, and MCC 0.32). When all three LQT genes are combined, SIFT, PROVEAN and SNAP is the combination with the best performance (accuracy 82.7%, sensitivity 83.0%, specificity 80.0%, and MCC 0.44). Both metaservers performed better than the single in silico tools; however, they did not perform better than the best performing combination of in silico
Combination Gene Therapy for Liver Metastasis of Colon Carcinoma in vivo

NASA Astrophysics Data System (ADS)

Chen, Shu-Hsai; Chen, X. H. Li; Wang, Yibin; Kosai, Ken-Ichiro; Finegold, Milton J.; Rich, Susan S.

1995-03-01

The efficacy of combination therapy with a "suicide gene" and a cytokine gene to treat metastatic colon carcinoma in the liver was investigated. Tumor in the liver was generated by intrahepatic injection of a colon carcinoma cell line (MCA-26) in syngeneic BALB/c mice. Recombinant adenoviral vectors containing various control and therapeutic genes were injected directly into the solid tumors, followed by treatment with ganciclovir. While the tumors continued to grow in all animals treated with a control vector or a mouse interleukin 2 vector, those treated with a herpes simplex virus thymidine kinase vector, with or without the coadministration of the mouse interleukin 2 vector, exhibited dramatic necrosis and regression. However, only animals treated with both vectors developed an effective systemic antitumoral immunity against challenges of tumorigenic doses of parental tumor cells inoculated at distant sites. The antitumoral immunity was associated with the presence of MCA-26 tumor-specific cytolytic CD8^+ T lymphocytes. The results suggest that combination suicide and cytokine gene therapy in vivo can be a powerful approach for treatment of metastatic colon carcinoma in the liver.
MAGMA: Generalized Gene-Set Analysis of GWAS Data

PubMed Central

de Leeuw, Christiaan A.; Mooij, Joris M.; Heskes, Tom; Posthuma, Danielle

2015-01-01

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn’s Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn’s Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn’s Disease data was found to be considerably faster as well. PMID:25885710
MAGMA: generalized gene-set analysis of GWAS data.

PubMed

de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

2015-04-01

By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.
PRGdb: a bioinformatics platform for plant resistance gene analysis

PubMed Central

Sanseverino, Walter; Roma, Guglielmo; De Simone, Marco; Faino, Luigi; Melito, Sara; Stupka, Elia; Frusciante, Luigi; Ercolano, Maria Raffaella

2010-01-01

PRGdb is a web accessible open-source (http://www.prgdb.org) database that represents the first bioinformatic resource providing a comprehensive overview of resistance genes (R-genes) in plants. PRGdb holds more than 16 000 known and putative R-genes belonging to 192 plant species challenged by 115 different pathogens and linked with useful biological information. The complete database includes a set of 73 manually curated reference R-genes, 6308 putative R-genes collected from NCBI and 10463 computationally predicted putative R-genes. Thanks to a user-friendly interface, data can be examined using different query tools. A home-made prediction pipeline called Disease Resistance Analysis and Gene Orthology (DRAGO), based on reference R-gene sequence data, was developed to search for plant resistance genes in public datasets such as Unigene and Genbank. New putative R-gene classes containing unknown domain combinations were discovered and characterized. The development of the PRG platform represents an important starting point to conduct various experimental tasks. The inferred cross-link between genomic and phenotypic information allows access to a large body of information to find answers to several biological questions. The database structure also permits easy integration with other data types and opens up prospects for future implementations. PMID:19906694
Network Analysis of Human Genes Influencing Susceptibility to Mycobacterial Infections

PubMed Central

Lipner, Ettie M.; Garcia, Benjamin J.; Strong, Michael

2016-01-01

Tuberculosis and nontuberculous mycobacterial infections constitute a high burden of pulmonary disease in humans, resulting in over 1.5 million deaths per year. Building on the premise that genetic factors influence the instance, progression, and defense of infectious disease, we undertook a systems biology approach to investigate relationships among genetic factors that may play a role in increased susceptibility or control of mycobacterial infections. We combined literature and database mining with network analysis and pathway enrichment analysis to examine genes, pathways, and networks, involved in the human response to Mycobacterium tuberculosis and nontuberculous mycobacterial infections. This approach allowed us to examine functional relationships among reported genes, and to identify novel genes and enriched pathways that may play a role in mycobacterial susceptibility or control. Our findings suggest that the primary pathways and genes influencing mycobacterial infection control involve an interplay between innate and adaptive immune proteins and pathways. Signaling pathways involved in autoimmune disease were significantly enriched as revealed in our networks. Mycobacterial disease susceptibility networks were also examined within the context of gene-chemical relationships, in order to identify putative drugs and nutrients with potential beneficial immunomodulatory or anti-mycobacterial effects. PMID:26751573
PCAN: phenotype consensus analysis to support disease-gene association.

PubMed

Godard, Patrice; Page, Matthew

2016-12-07

Bridging genotype and phenotype is a fundamental biomedical challenge that underlies more effective target discovery and patient-tailored therapy. Approaches that can flexibly and intuitively, integrate known gene-phenotype associations in the context of molecular signaling networks are vital to effectively prioritize and biologically interpret genes underlying disease traits of interest. We describe Phenotype Consensus Analysis (PCAN); a method to assess the consensus semantic similarity of phenotypes in a candidate gene's signaling neighborhood. We demonstrate that significant phenotype consensus (p < 0.05) is observable for ~67% of 4,549 OMIM disease-gene associations, using a combination of high quality String interactions + Metabase pathways and use Joubert Syndrome to demonstrate the ease with which a significant result can be interrogated to highlight discriminatory traits linked to mechanistically related genes. We advocate phenotype consensus as an intuitive and versatile method to aid disease-gene association, which naturally lends itself to the mechanistic deconvolution of diverse phenotypes. We provide PCAN to the community as an R package ( http://bioconductor.org/packages/PCAN/ ) to allow flexible configuration, extension and standalone use or integration to supplement existing gene prioritization workflows.
A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer.

PubMed

Yang, Mary Qu; Li, Dan; Yang, William; Zhang, Yifan; Liu, Jun; Tong, Weida

2017-01-01

Clear cell renal cell carcinoma (ccRCC) is the most common and most aggressive form of renal cell cancer (RCC). The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1 , as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways.
The limitations of simple gene set enrichment analysis assuming gene independence.

PubMed

Tamayo, Pablo; Steinhardt, George; Liberzon, Arthur; Mesirov, Jill P

2016-02-01

Since its first publication in 2003, the Gene Set Enrichment Analysis method, based on the Kolmogorov-Smirnov statistic, has been heavily used, modified, and also questioned. Recently a simplified approach using a one-sample t-test score to assess enrichment and ignoring gene-gene correlations was proposed by Irizarry et al. 2009 as a serious contender. The argument criticizes Gene Set Enrichment Analysis's nonparametric nature and its use of an empirical null distribution as unnecessary and hard to compute. We refute these claims by careful consideration of the assumptions of the simplified method and its results, including a comparison with Gene Set Enrichment Analysis's on a large benchmark set of 50 datasets. Our results provide strong empirical evidence that gene-gene correlations cannot be ignored due to the significant variance inflation they produced on the enrichment scores and should be taken into account when estimating gene set enrichment significance. In addition, we discuss the challenges that the complex correlation structure and multi-modality of gene sets pose more generally for gene set enrichment methods. © The Author(s) 2012.
Comparative genomic analysis of the PKS genes in five species and expression analysis in upland cotton

PubMed Central

Cheng, Xi; Wang, Yanan; Abdullah, Muhammad; Li, Manli; Li, Dahui; Gao, Junshan

2017-01-01

Plant type III polyketide synthase (PKS) can catalyse the formation of a series of secondary metabolites with different structures and different biological functions; the enzyme plays an important role in plant growth, development and resistance to stress. At present, the PKS gene has been identified and studied in a variety of plants. Here, we identified 11 PKS genes from upland cotton (Gossypium hirsutum) and compared them with 41 PKS genes in Populus tremula, Vitis vinifera, Malus domestica and Arabidopsis thaliana. According to the phylogenetic tree, a total of 52 PKS genes can be divided into four subfamilies (I–IV). The analysis of gene structures and conserved motifs revealed that most of the PKS genes were composed of two exons and one intron and there are two characteristic conserved domains (Chal_sti_synt_N and Chal_sti_synt_C) of the PKS gene family. In our study of the five species, gene duplication was found in addition to Arabidopsis thaliana and we determined that purifying selection has been of great significance in maintaining the function of PKS gene family. From qRT-PCR analysis and a combination of the role of the accumulation of proanthocyanidins (PAs) in brown cotton fibers, we concluded that five PKS genes are candidate genes involved in brown cotton fiber pigment synthesis. These results are important for the further study of brown cotton PKS genes. It not only reveals the relationship between PKS gene family and pigment in brown cotton, but also creates conditions for improving the quality of brown cotton fiber. PMID:29104824
Meta-analysis identifies a MECOM gene as a novel predisposing factor of osteoporotic fracture

PubMed Central

Hwang, Joo-Yeon; Lee, Seung Hun; Go, Min Jin; Kim, Beom-Jun; Kou, Ikuyo; Ikegawa, Shiro; Guo, Yan; Deng, Hong-Wen; Raychaudhuri, Soumya; Kim, Young Jin; Oh, Ji Hee; Kim, Youngdoe; Moon, Sanghoon; Kim, Dong-Joon; Koo, Heejo; Cha, My-Jung; Lee, Min Hye; Yun, Ji Young; Yoo, Hye-Sook; Kang, Young-Ah; Cho, Eun-Hee; Kim, Sang-Wook; Oh, Ki Won; Kang, Moo II; Son, Ho Young; Kim, Shin-Yoon; Kim, Ghi Su; Han, Bok-Ghee; Cho, Yoon Shin; Cho, Myeong-Chan; Lee, Jong-Young; Koh, Jung-Min

2014-01-01

Background Osteoporotic fracture (OF) as a clinical endpoint is a major complication of osteoporosis. To screen for OF susceptibility genes, we performed a genome-wide association study and carried out de novo replication analysis of an East Asian population. Methods Association was tested using a logistic regression analysis. A meta-analysis was performed on the combined results using effect size and standard errors estimated for each study. Results In a combined meta-analysis of a discovery cohort (288 cases and 1139 controls), three hospital based sets in replication stage I (462 cases and 1745 controls), and an independent ethnic group in replication stage II (369 cases and 560 for controls), we identified a new locus associated with OF (rs784288 in the MECOM gene) that showed genome-wide significance (p=3.59×10−8; OR 1.39). RNA interference revealed that a MECOM knockdown suppresses osteoclastogenesis. Conclusions Our findings provide new insights into the genetic architecture underlying OF in East Asians. PMID:23349225
Combined Chromatin and Expression Analysis Reveals Specific Regulatory Mechanisms within Cytokine Genes in the Macrophage Early Immune Response

PubMed Central

Emanuelsson, Olof; Sennblad, Bengt; Pirmoradian Najafabadi, Mohammad; Folkersen, Lasse; Mälarstig, Anders; Lagergren, Jens; Eriksson, Per; Hamsten, Anders; Odeberg, Jacob

2012-01-01

Macrophages play a critical role in innate immunity, and the expression of early response genes orchestrate much of the initial response of the immune system. Macrophages undergo extensive transcriptional reprogramming in response to inflammatory stimuli such as Lipopolysaccharide (LPS). To identify gene transcription regulation patterns involved in early innate immune responses, we used two genome-wide approaches - gene expression profiling and chromatin immunoprecipitation-sequencing (ChIP-seq) analysis. We examined the effect of 2 hrs LPS stimulation on early gene expression and its relation to chromatin remodeling (H3 acetylation; H3Ac) and promoter binding of Sp1 and RNA polymerase II phosphorylated at serine 5 (S5P RNAPII), which is a marker for transcriptional initiation. Our results indicate novel and alternative gene regulatory mechanisms for certain proinflammatory genes. We identified two groups of up-regulated inflammatory genes with respect to chromatin modification and promoter features. One group, including highly up-regulated genes such as tumor necrosis factor (TNF), was characterized by H3Ac, high CpG content and lack of TATA boxes. The second group, containing inflammatory mediators (interleukins and CCL chemokines), was up-regulated upon LPS stimulation despite lacking H3Ac in their annotated promoters, which were low in CpG content but did contain TATA boxes. Genome-wide analysis showed that few H3Ac peaks were unique to either +/−LPS condition. However, within these, an unpacking/expansion of already existing H3Ac peaks was observed upon LPS stimulation. In contrast, a significant proportion of S5P RNAPII peaks (approx 40%) was unique to either condition. Furthermore, data indicated a large portion of previously unannotated TSSs, particularly in LPS-stimulated macrophages, where only 28% of unique S5P RNAPII peaks overlap annotated promoters. The regulation of the inflammatory response appears to occur in a very specific manner at the
Analysis of differential gene expression by bead-based fiber-optic array in nonfunctioning pituitary adenomas.

PubMed

Jiang, Z; Gui, S; Zhang, Y

2011-05-01

Nonfunctioning pituitary adenomas (NFPAs) are relatively common, accounting for 30% of all pituitary adenomas; however, their pathogenesis remains enigmatic. To explore the possible pathogenesis of NFPAs, we used fiber-optic BeadArray to examine gene expression in 5 NFPAs compared with 3 normal pituitaries. 4 differentially expressed genes were chosen randomly for validation by reverse transcriptase-real time quantitative polymerase chain reaction (RT-qPCR). We then analyzed the differentially expressed gene profile with Kyoto Encyclopedia of Genes and Genomes (KEGG). The array analysis indentified significant increases in the expression of 1,402 genes and 383 expressed sequence tags (ESTs), and decreases in 1,697 genes and 113 ESTs in the NFPAs. Bioinformatic and pathway analysis showed that the genes HIGD1B, FAM5C, PMAIP1 and the pathway cell-cycle regulation may play an important role in tumorigenesis and progression of NFPAs. Our data suggest fiber-optic BeadArray combined with pathway analysis of differential gene expression profile appears to be a valid approach for investigating the pathogenesis of tumors. © Georg Thieme Verlag KG Stuttgart · New York.
rpb2 is a reliable reference gene for quantitative gene expression analysis in the dermatophyte Trichophyton rubrum.

PubMed

Jacob, Tiago R; Peres, Nalu T A; Persinoti, Gabriela F; Silva, Larissa G; Mazucato, Mendelson; Rossi, Antonio; Martinez-Rossi, Nilce M

2012-05-01

The selection of reference genes used for data normalization to quantify gene expression by real-time PCR amplifications (qRT-PCR) is crucial for the accuracy of this technique. In spite of this, little information regarding such genes for qRT-PCR is available for gene expression analyses in pathogenic fungi. Thus, we investigated the suitability of eight candidate reference genes in isolates of the human dermatophyte Trichophyton rubrum subjected to several environmental challenges, such as drug exposure, interaction with human nail and skin, and heat stress. The stability of these genes was determined by geNorm, NormFinder and Best-Keeper programs. The gene with the most stable expression in the majority of the conditions tested was rpb2 (DNA-dependent RNA polymerase II), which was validated in three T. rubrum strains. Moreover, the combination of rpb2 and chs1 (chitin synthase) genes provided for the most reliable qRT-PCR data normalization in T. rubrum under a broad range of biological conditions. To the best of our knowledge this is the first report on the selection of reference genes for qRT-PCR data normalization in dermatophytes and the results of these studies should permit further analysis of gene expression under several experimental conditions, with improved accuracy and reliability.
Bone Metastasis in Advanced Breast Cancer: Analysis of Gene Expression Microarray.

PubMed

Cosphiadi, Irawan; Atmakusumah, Tubagus D; Siregar, Nurjati C; Muthalib, Abdul; Harahap, Alida; Mansyur, Muchtarruddin

2018-03-08

Approximately 30% to 40% of breast cancer recurrences involve bone metastasis (BM). Certain genes have been linked to BM; however, none have been able to predict bone involvement. In this study, we analyzed gene expression profiles in advanced breast cancer patients to elucidate genes that can be used to predict BM. A total of 92 advanced breast cancer patients, including 46 patients with BM and 46 patients without BM, were identified for this study. Immunohistochemistry and gene expression analysis was performed on 81 formalin-fixed paraffin-embedded samples. Data were collected through medical records, and gene expression of 200 selected genes compiled from 6 previous studies was performed using NanoString nCounter. Genetic expression profiles showed that 22 genes were significantly differentially expressed between breast cancer patients with metastasis in bone and other organs (BM+) and non-BM, whereas subjects with only BM showed 17 significantly differentially expressed genes. The following genes were associated with an increasing incidence of BM in the BM+ group: estrogen receptor 1 (ESR1), GATA binding protein 3 (GATA3), and melanophilin with an area under the curve (AUC) of 0.804. In the BM group, the following genes were associated with an increasing incidence of BM: ESR1, progesterone receptor, B-cell lymphoma 2, Rab escort protein, N-acetyltransferase 1, GATA3, annexin A9, and chromosome 9 open reading frame 116. ESR1 and GATA3 showed an increased strength of association with an AUC of 0.928. A combination of the identified 3 genes in BM+ and 8 genes in BM showed better prediction than did each individual gene, and this combination can be used as a training set. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
A robust multifactor dimensionality reduction method for detecting gene-gene interactions with application to the genetic analysis of bladder cancer susceptibility

PubMed Central

Gui, Jiang; Andrew, Angeline S.; Andrews, Peter; Nelson, Heather M.; Kelsey, Karl T.; Karagas, Margaret R.; Moore, Jason H.

2010-01-01

A central goal of human genetics is to identify and characterize susceptibility genes for common complex human diseases. An important challenge in this endeavor is the modeling of gene-gene interaction or epistasis that can result in non-additivity of genetic effects. The multifactor dimensionality reduction (MDR) method was developed as machine learning alternative to parametric logistic regression for detecting interactions in absence of significant marginal effects. The goal of MDR is to reduce the dimensionality inherent in modeling combinations of polymorphisms using a computational approach called constructive induction. Here, we propose a Robust Multifactor Dimensionality Reduction (RMDR) method that performs constructive induction using a Fisher’s Exact Test rather than a predetermined threshold. The advantage of this approach is that only those genotype combinations that are determined to be statistically significant are considered in the MDR analysis. We use two simulation studies to demonstrate that this approach will increase the success rate of MDR when there are only a few genotype combinations that are significantly associated with case-control status. We show that there is no loss of success rate when this is not the case. We then apply the RMDR method to the detection of gene-gene interactions in genotype data from a population-based study of bladder cancer in New Hampshire. PMID:21091664
A Comprehensive Analysis of Transcript-Supported De Novo Genes in Saccharomyces sensu stricto Yeasts

PubMed Central

Lu, Tzu-Chiao; Leu, Jun-Yi; Lin, Wen-Chang

2017-01-01

Abstract Novel genes arising from random DNA sequences (de novo genes) have been suggested to be widespread in the genomes of different organisms. However, our knowledge about the origin and evolution of de novo genes is still limited. To systematically understand the general features of de novo genes, we established a robust pipeline to analyze >20,000 transcript-supported coding sequences (CDSs) from the budding yeast Saccharomyces cerevisiae. Our analysis pipeline combined phylogeny, synteny, and sequence alignment information to identify possible orthologs across 20 Saccharomycetaceae yeasts and discovered 4,340 S. cerevisiae-specific de novo genes and 8,871 S. sensu stricto-specific de novo genes. We further combine information on CDS positions and transcript structures to show that >65% of de novo genes arose from transcript isoforms of ancient genes, especially in the upstream and internal regions of ancient genes. Fourteen identified de novo genes with high transcript levels were chosen to verify their protein expressions. Ten of them, including eight transcript isoform-associated CDSs, showed translation signals and five proteins exhibited specific cytosolic localizations. Our results suggest that de novo genes frequently arise in the S. sensu stricto complex and have the potential to be quickly integrated into ancient cellular network. PMID:28981695
Simultaneous mutation detection of three homoeologous genes in wheat by High Resolution Melting analysis and Mutation Surveyor.

PubMed

Dong, Chongmei; Vincent, Kate; Sharp, Peter

2009-12-04

TILLING (Targeting Induced Local Lesions IN Genomes) is a powerful tool for reverse genetics, combining traditional chemical mutagenesis with high-throughput PCR-based mutation detection to discover induced mutations that alter protein function. The most popular mutation detection method for TILLING is a mismatch cleavage assay using the endonuclease CelI. For this method, locus-specific PCR is essential. Most wheat genes are present as three similar sequences with high homology in exons and low homology in introns. Locus-specific primers can usually be designed in introns. However, it is sometimes difficult to design locus-specific PCR primers in a conserved region with high homology among the three homoeologous genes, or in a gene lacking introns, or if information on introns is not available. Here we describe a mutation detection method which combines High Resolution Melting (HRM) analysis of mixed PCR amplicons containing three homoeologous gene fragments and sequence analysis using Mutation Surveyor software, aimed at simultaneous detection of mutations in three homoeologous genes. We demonstrate that High Resolution Melting (HRM) analysis can be used in mutation scans in mixed PCR amplicons containing three homoeologous gene fragments. Combining HRM scanning with sequence analysis using Mutation Surveyor is sensitive enough to detect a single nucleotide mutation in the heterozygous state in a mixed PCR amplicon containing three homoeoloci. The method was tested and validated in an EMS (ethylmethane sulfonate)-treated wheat TILLING population, screening mutations in the carboxyl terminal domain of the Starch Synthase II (SSII) gene. Selected identified mutations of interest can be further analysed by cloning to confirm the mutation and determine the genomic origin of the mutation. Polyploidy is common in plants. Conserved regions of a gene often represent functional domains and have high sequence similarity between homoeologous loci. The method described here
Genome-wide analysis of YY2 versus YY1 target genes

PubMed Central

Chen, Li; Shioda, Toshi; Coser, Kathryn R.; Lynch, Mary C.; Yang, Chuanwei; Schmidt, Emmett V.

2010-01-01

Yin Yang 1 (YY1) is a critical transcription factor controlling cell proliferation, development and DNA damage responses. Retrotranspositions have independently generated additional YY family members in multiple species. Although Drosophila YY1 [pleiohomeotic (Pho)] and its homolog [pleiohomeotic-like (Phol)] redundantly control homeotic gene expression, the regulatory contributions of YY1-homologs have not yet been examined in other species. Indeed, targets for the mammalian YY1 homolog YY2 are completely unknown. Using gene set enrichment analysis, we found that lentiviral constructs containing short hairpin loop inhibitory RNAs for human YY1 (shYY1) and its homolog YY2 (shYY2) caused significant changes in both shared and distinguishable gene sets in human cells. Ribosomal protein genes were the most significant gene set upregulated by both shYY1 and shYY2, although combined shYY1/2 knock downs were not additive. In contrast, shYY2 reversed the anti-proliferative effects of shYY1, and shYY2 particularly altered UV damage response, platelet-specific and mitochondrial function genes. We found that decreases in YY1 or YY2 caused inverse changes in UV sensitivity, and that their combined loss reversed their respective individual effects. Our studies show that human YY2 is not redundant to YY1, and YY2 is a significant regulator of genes previously identified as uniquely responding to YY1. PMID:20215434

A combined analysis of genome-wide expression profiling of bipolar disorder in human prefrontal cortex.

PubMed

Wang, Jinglu; Qu, Susu; Wang, Weixiao; Guo, Liyuan; Zhang, Kunlin; Chang, Suhua; Wang, Jing

2016-11-01

Numbers of gene expression profiling studies of bipolar disorder have been published. Besides different array chips and tissues, variety of the data processes in different cohorts aggravated the inconsistency of results of these genome-wide gene expression profiling studies. By searching the gene expression databases, we obtained six data sets for prefrontal cortex (PFC) of bipolar disorder with raw data and combinable platforms. We used standardized pre-processing and quality control procedures to analyze each data set separately and then combined them into a large gene expression matrix with 101 bipolar disorder subjects and 106 controls. A standard linear mixed-effects model was used to calculate the differentially expressed genes (DEGs). Multiple levels of sensitivity analyses and cross validation with genetic data were conducted. Functional and network analyses were carried out on basis of the DEGs. In the result, we identified 198 unique differentially expressed genes in the PFC of bipolar disorder and control. Among them, 115 DEGs were robust to at least three leave-one-out tests or different pre-processing methods; 51 DEGs were validated with genetic association signals. Pathway enrichment analysis showed these DEGs were related with regulation of neurological system, cell death and apoptosis, and several basic binding processes. Protein-protein interaction network further identified one key hub gene. We have contributed the most comprehensive integrated analysis of bipolar disorder expression profiling studies in PFC to date. The DEGs, especially those with multiple validations, may denote a common signature of bipolar disorder and contribute to the pathogenesis of disease. Copyright © 2016 Elsevier Ltd. All rights reserved.
Familial aggregation analysis of gene expressions

PubMed Central

Rao, Shao-Qi; Xu, Liang-De; Zhang, Guang-Mei; Li, Xia; Li, Lin; Shen, Gong-Qing; Jiang, Yang; Yang, Yue-Ying; Gong, Bin-Sheng; Jiang, Wei; Zhang, Fan; Xiao, Yun; Wang, Qing K

2007-01-01

Traditional studies of familial aggregation are aimed at defining the genetic (and non-genetic) causes of a disease from physiological or clinical traits. However, there has been little attempt to use genome-wide gene expressions, the direct phenotypic measures of genes, as the traits to investigate several extended issues regarding the distributions of familially aggregated genes on chromosomes or in functions. In this study we conducted a genome-wide familial aggregation analysis by using the in vitro cell gene expressions of 3300 human autosome genes (Problem 1 data provided to Genetic Analysis Workshop 15) in order to answer three basic genetics questions. First, we investigated how gene expressions aggregate among different types (degrees) of relative pairs. Second, we conducted a bioinformatics analysis of highly familially aggregated genes to see how they are distributed on chromosomes. Third, we performed a gene ontology enrichment test of familially aggregated genes to find evidence to support their functional consensus. The results indicated that 1) gene expressions did aggregate in families, especially between sibs. Of 3300 human genes analyzed, there were a total of 1105 genes with one or more significant (empirical p < 0.05) familial correlation; 2) there were several genomic hot spots where highly familially aggregated genes (e.g., the chromosome 6 HLA genes cluster) were clustered; 3) as we expected, gene ontology enrichment tests revealed that the 1105 genes were aggregating not only in families but also in functional categories. PMID:18466548
Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Raymond, Amy; Lovell, Scott; Lorimer, Don

2009-12-01

With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38{alpha}), viral polymerase (HCV NS5B), and bacterial structural protein (FtsZ) were expressed in both E. colimore » and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.« less
Gene delivery systems by the combination of lipid bubbles and ultrasound.

PubMed

Negishi, Yoichi; Endo-Takahashi, Yoko; Maruyama, Kazuo

2016-11-28

Gene therapy is promising for the treatment of many diseases including cancers and genetic diseases. From the viewpoint of safety, ultrasound (US)-mediated gene delivery with nano/ microbubbles was recently developed as a novel non-viral vector system. US-mediated gene delivery using nano/microbubbles are able to produce transient changes in the permeability of the cell membrane after US-induced cavitation while reducing cellular damage and enables the tissue-specific or the site-specific intracellular delivery of gene both in vitro and in vivo. We have recently developed novel lipid nanobubbles (Lipid Bubbles). These nanobubbles can also be used to enhance the efficacy of the US-mediated genes (plasmid DNA, siRNA, and miRNA etc.) delivery. In this review, we describe US-mediated delivery systems combined with nano/microbubbles and discuss their feasibility as non-viral vector systems.
Identification of suitable reference genes for quantitative gene expression analysis in rat adipose stromal cells induced to trilineage differentiation.

PubMed

Santos, Bruno Paiva Dos; da Costa Diesel, Luciana Fraga; da Silva Meirelles, Lindolfo; Nardi, Nance Beyer; Camassola, Melissa

2016-12-15

This study was designed to (i) identify stable reference genes for the analysis of gene expression during in vitro differentiation of rat adipose stromal cells (rASCs), (ii) recommend stable genes for individual treatment conditions, and (iii) validate these genes by comparison with normalization results from stable and unstable reference genes. On the basis of a literature review, eight genes were selected: Actb, B2m, Hprt1, Ppia, Rplp0, Rpl13a, Rpl5, and Ywhaz. Genes were ranked according to their stability under different culture conditions as assessed using GenNorm, NormFinder, and RefFinder algorithms. Although the employed algorithms returned different rankings, the most frequently top-ranked genes were: B2m and/or Ppia for all 28day treatments (ALL28); Ppia and Hprt1 (adipogenic differentiation; A28), B2m (chondrogenic differentiation; C28), Rpl5 (controls maintained in complete culture medium; CCM), Rplp0 (osteogenic differentiation for 3days; O3), Rpl13a and Actb (osteogenic differentiation for 7days; O7), Rplp0 and Ppia (osteogenic differentiation for 14days; O14), Hprt1 and Ppia (osteogenic differentiation for 28days; O28), as well as Actb (all osteogenesis time points combined; ALLOSTEO). The obtained results indicate that the performance of reference genes depends on the differentiation protocol and on the analysis time, thus providing valuable information for the design of RT-PCR experiments. Copyright © 2016. Published by Elsevier B.V.
Gene-based interaction analysis shows GABAergic genes interacting with parenting in adolescent depressive symptoms.

PubMed

Van Assche, Evelien; Moons, Tim; Cinar, Ozan; Viechtbauer, Wolfgang; Oldehinkel, Albertine J; Van Leeuwen, Karla; Verschueren, Karine; Colpin, Hilde; Lambrechts, Diether; Van den Noortgate, Wim; Goossens, Luc; Claes, Stephan; van Winkel, Ruud

2017-12-01

Most gene-environment interaction studies (G × E) have focused on single candidate genes. This approach is criticized for its expectations of large effect sizes and occurrence of spurious results. We describe an approach that accounts for the polygenic nature of most psychiatric phenotypes and reduces the risk of false-positive findings. We apply this method focusing on the role of perceived parental support, psychological control, and harsh punishment in depressive symptoms in adolescence. Analyses were conducted on 982 adolescents of Caucasian origin (M age (SD) = 13.78 (.94) years) genotyped for 4,947 SNPs in 263 genes, selected based on a literature survey. The Leuven Adolescent Perceived Parenting Scale (LAPPS) and the Parental Behavior Scale (PBS) were used to assess perceived parental psychological control, harsh punishment, and support. The Center for Epidemiologic Studies Depression Scale (CES-D) was the outcome. We used gene-based testing taking into account linkage disequilibrium to identify genes containing SNPs exhibiting an interaction with environmental factors yielding a p-value per single gene. Significant results at the corrected p-value of p < 1.90 × 10 -4 were examined in an independent replication sample of Dutch adolescents (N = 1354). Two genes showed evidence for interaction with perceived support: GABRR1 (p = 4.62 × 10 -5 ) and GABRR2 (p = 9.05 × 10 -6 ). No genes interacted significantly with psychological control or harsh punishment. Gene-based analysis was unable to confirm the interaction of GABRR1 or GABRR2 with support in the replication sample. However, for GABRR2, but not GABRR1, the correlation of the estimates between the two datasets was significant (r (46) = .32; p = .027) and a gene-based analysis of the combined datasets supported GABRR2 × support interaction (p = 1.63 × 10 -4 ). We present a gene-based method for gene-environment interactions in a polygenic context and show that genes
Effect of p27 gene combined with Pientzehuang ([characters: see text]) on tumor growth in osteosarcoma-bearing nude mice.

PubMed

Ren, Shou-song; Yuan, Fang; Liu, Ying-hong; Zhou, Le-tian; Li, Jun

2015-11-01

To observe the effect of p27 gene recombinant adenovirus combined with Chinese medicine Pientzehuang ([characters: see text]) on the growth of xenografted human osteosarcoma in nude mice. Tissue transplantation was used to construct the orthotopic model of human osteosarcoma Saos-2 cell in nude mice. Thirty tumor-bearing nude mice were randomly divided into 5 groups with 6 mice in each group: blank control group (model of osteosarcoma), empty vector group (recombinant adeno-associated virus-multiple cloning site), Pientzehuang group, p27 gene group and combined treatment group (p27 gene combined with Pientzehuang). The effect of combined treatment on human osteosarcoma was analyzed through the tumor formation, tumor volume and inhibition rate of tumor growth. The expression of p27 was measured by immunohistochemical staining and Western blot. The orthotopic model of osteosarcoma in nude mice was successfully constructed. The general appearance of tumor-bearing nude mice in Pientzehuang and p27 gene groups was markedly improved compared with the blank control group; and in the combined treatment group it was significantly improved compared with the Pientzehuang and p27 gene groups. The tumor growth in the Pientzehuang and p27 gene groups was significantly inhibited compared with the blank control group P<0.05); while in the combined treatment group it was markedly inhibited compared with the Pientzehuang and p27 gene groups (P<0.05). The rates of tumor growth inhibition were 34.1%, 56.5% and 63.8% in the Pientzehuang, p27 gene and combined treatment groups, respectively. Meanwhile, the protein expression of p27 gene in the p27 gene group was significantly increased compared with the blank control group (P<0.05); and it was significantly increased in the combined treatment group compared with the p27 gene and Pientzehuang groups (P<0.05). p27 gene introduced by adenovirus combined with Pientzehuang can inhibit the growth of human osteosarcoma cell Saos-2 in nude mice.
Diagnostic value of immunoglobulin κ light chain gene rearrangement analysis in B-cell lymphomas.

PubMed

Kokovic, Ira; Jezersek Novakovic, Barbara; Novakovic, Srdjan

2015-03-01

Analysis of the immunoglobulin κ light chain (IGK) gene is an alternative method for B-cell clonality assessment in the diagnosis of mature B-cell proliferations in which the detection of clonal immunoglobulin heavy chain (IGH) gene rearrangements fails. The aim of the present study was to evaluate the added value of standardized BIOMED-2 assay for the detection of clonal IGK gene rearrangements in the diagnostic setting of suspected B-cell lymphomas. With this purpose, 92 specimens from 80 patients with the final diagnosis of mature B-cell lymphoma (37 specimens), mature T-cell lymphoma (26 specimens) and reactive lymphoid proliferation (29 specimens) were analyzed for B-cell clonality. B-cell clonality analysis was performed using the BIOMED-2 IGH and IGK gene clonality assays. The determined sensitivity of the IGK assay was 67.6%, while the determined sensitivity of the IGH assay was 75.7%. The sensitivity of combined IGH+IGK assay was 81.1%. The determined specificity of the IGK assay was 96.2% in the group of T-cell lymphomas and 96.6% in the group of reactive lesions. The determined specificity of the IGH assay was 84.6% in the group of lymphomas and 86.2% in the group of reactive lesions. The comparison of GeneScan (GS) and heteroduplex pretreatment-polyacrylamide gel electrophoresis (HD-PAGE) methods for the analysis of IGK gene rearrangements showed a higher efficacy of GS analysis in a series of 27 B-cell lymphomas analyzed by both methods. In the present study, we demonstrated that by applying the combined IGH+IGK clonality assay the overall detection rate of B-cell clonality was increased by 5.4%. Thus, we confirmed the added value of the standardized BIOMED-2 IGK assay for assessment of B-cell clonality in suspected B-cell lymphomas with inconclusive clinical and cyto/histological diagnosis.
Bioinformatics analysis of differentially expressed gene profiles associated with systemic lupus erythematosus

PubMed Central

Wu, Chengjiang; Zhao, Yangjing; Lin, Yu; Yang, Xinxin; Yan, Meina; Min, Yujiao; Pan, Zihui; Xia, Sheng; Shao, Qixiang

2018-01-01

DNA microarray and high-throughput sequencing have been widely used to identify the differentially expressed genes (DEGs) in systemic lupus erythematosus (SLE). However, the big data from gene microarrays are also challenging to work with in terms of analysis and processing. The presents study combined data from the microarray expression profile (GSE65391) and bioinformatics analysis to identify the key genes and cellular pathways in SLE. Gene ontology (GO) and cellular pathway enrichment analyses of DEGs were performed to investigate significantly enriched pathways. A protein-protein interaction network was constructed to determine the key genes in the occurrence and development of SLE. A total of 310 DEGs were identified in SLE, including 193 upregulated genes and 117 downregulated genes. GO analysis revealed that the most significant biological process of DEGs was immune system process. Kyoto Encyclopedia of Genes and Genome pathway analysis showed that these DEGs were enriched in signaling pathways associated with the immune system, including the RIG-I-like receptor signaling pathway, intestinal immune network for IgA production, antigen processing and presentation and the toll-like receptor signaling pathway. The current study screened the top 10 genes with higher degrees as hub genes, which included 2′-5′-oligoadenylate synthetase 1, MX dynamin like GTPase 2, interferon induced protein with tetratricopeptide repeats 1, interferon regulatory factor 7, interferon induced with helicase C domain 1, signal transducer and activator of transcription 1, ISG15 ubiquitin-like modifier, DExD/H-box helicase 58, interferon induced protein with tetratricopeptide repeats 3 and 2′-5′-oligoadenylate synthetase 2. Module analysis revealed that these hub genes were also involved in the RIG-I-like receptor signaling, cytosolic DNA-sensing, toll-like receptor signaling and ribosome biogenesis pathways. In addition, these hub genes, from different probe sets, exhibited
Reporter gene bioassays in environmental analysis.

PubMed

Köhler, S; Belkin, S; Schmid, R D

2000-01-01

In parallel to the continuous development of increasingly more sophisticated physical and chemical analytical technologies for the detection of environmental pollutants, there is a progressively more urgent need also for bioassays which report not only on the presence of a chemical but also on its bioavailability and its biological effects. As a partial fulfillment of that need, there has been a rapid development of biosensors based on genetically engineered bacteria. Such microorganisms typically combine a promoter-operator, which acts as the sensing element, with reporter gene(s) coding for easily detectable proteins. These sensors have the ability to detect global parameters such as stress conditions, toxicity or DNA-damaging agents as well as specific organic and inorganic compounds. The systems described in this review, designed to detect different groups of target chemicals, vary greatly in their detection limits, specificity, response times and more. These variations reflect on their potential applicability which, for most of the constructs described, is presently rather limited. Nevertheless, present trends promise that additional improvements will make microbial biosensors an important tool for future environmental analysis.
Avirulence Genes in Cereal Powdery Mildews: The Gene-for-Gene Hypothesis 2.0.

PubMed

Bourras, Salim; McNally, Kaitlin E; Müller, Marion C; Wicker, Thomas; Keller, Beat

2016-01-01

The gene-for-gene hypothesis states that for each gene controlling resistance in the host, there is a corresponding, specific gene controlling avirulence in the pathogen. Allelic series of the cereal mildew resistance genes Pm3 and Mla provide an excellent system for genetic and molecular analysis of resistance specificity. Despite this opportunity for molecular research, avirulence genes in mildews remain underexplored. Earlier work in barley powdery mildew (B.g. hordei) has shown that the reaction to some Mla resistance alleles is controlled by multiple genes. Similarly, several genes are involved in the specific interaction of wheat mildew (B.g. tritici) with the Pm3 allelic series. We found that two mildew genes control avirulence on Pm3f: one gene is involved in recognition by the resistance protein as demonstrated by functional studies in wheat and the heterologous host Nicotiana benthamiana. A second gene is a suppressor, and resistance is only observed in mildew genotypes combining the inactive suppressor and the recognized Avr. We propose that such suppressor/avirulence gene combinations provide the basis of specificity in mildews. Depending on the particular gene combinations in a mildew race, different genes will be genetically identified as the "avirulence" gene. Additionally, the observation of two LINE retrotransposon-encoded avirulence genes in B.g. hordei further suggests that the control of avirulence in mildew is more complex than a canonical gene-for-gene interaction. To fully understand the mildew-cereal interactions, more knowledge on avirulence determinants is needed and we propose ways how this can be achieved based on recent advances in the field.
Comparative modular analysis of gene expression in vertebrate organs.

PubMed

Piasecka, Barbara; Kutalik, Zoltán; Roux, Julien; Bergmann, Sven; Robinson-Rechavi, Marc

2012-03-29

The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.
Establishment of Functional Genomics Pipeline in Mouse Epiblast-Like Tissue by Combining Transcriptomic Analysis and Gene Knockdown/Knockin/Knockout, Using RNA Interference and CRISPR/Cas9.

PubMed

Takata, Nozomu; Sakakura, Eriko; Kasukawa, Takeya; Sakuma, Tetsushi; Yamamoto, Takashi; Sasai, Yoshiki

2016-06-01

The epiblast (foremost embryonic ectoderm) generates all three germ layers and therefore has crucial roles in the formation of all mammalian body cells. However, regulation of epiblast gene expression is poorly understood because of the difficulty of manipulating epiblast tissues in vivo. In the present study, using the self-organizing properties of mouse embryonic stem cell (ESC), we generated and characterized epiblast-like tissue in three-dimensional culture. We identified significant genome-wide gene expression changes in this epiblast-like tissue by transcriptomic analysis. In addition, we identified the particular significance of the Erk/Mapk and integrin-linked kinase pathways, and genes related to ectoderm/epithelial formation, using the bioinformatics resources IPA and DAVID. Here, we focused on Fgf5, which ranked in the top 10 among the discovered genes. To develop a functional analysis of Fgf5, we created an efficient method combining CRISPR/Cas9-mediated genome engineering and RNA interference (RNAi). Notably, we show one-step generation of various Fgf5 reporter lines including heterozygous and homozygous knockins (the GET method). For time- and dose-dependent depletion of fgf5 over the course of development, we generated an ESC line harboring Tol2 transposon-mediated integration of an inducible short hairpin RNA interference system (pdiRNAi). Our findings raised the possibility that Fgf/Erk signaling and apicobasal epithelial integrity are important factors in epiblast development. In addition, our methods provide a framework for a broad array of applications in the areas of mammalian genetics and molecular biology to understand development and to improve future therapeutics.
The Rice B-Box Zinc Finger Gene Family: Genomic Identification, Characterization, Expression Profiling and Diurnal Analysis

PubMed Central

Huang, Jianyan; Zhao, Xiaobo; Weng, Xiaoyu; Wang, Lei; Xie, Weibo

2012-01-01

Background The B-box (BBX) -containing proteins are a class of zinc finger proteins that contain one or two B-box domains and play important roles in plant growth and development. The Arabidopsis BBX gene family has recently been re-identified and renamed. However, there has not been a genome-wide survey of the rice BBX (OsBBX) gene family until now. Methodology/Principal Findings In this study, we identified 30 rice BBX genes through a comprehensive bioinformatics analysis. Each gene was assigned a uniform nomenclature. We described the chromosome localizations, gene structures, protein domains, phylogenetic relationship, whole life-cycle expression profile and diurnal expression patterns of the OsBBX family members. Based on the phylogeny and domain constitution, the OsBBX gene family was classified into five subfamilies. The gene duplication analysis revealed that only chromosomal segmental duplication contributed to the expansion of the OsBBX gene family. The expression profile of the OsBBX genes was analyzed by Affymetrix GeneChip microarrays throughout the entire life-cycle of rice cultivar Zhenshan 97 (ZS97). In addition, microarray analysis was performed to obtain the expression patterns of these genes under light/dark conditions and after three phytohormone treatments. This analysis revealed that the expression patterns of the OsBBX genes could be classified into eight groups. Eight genes were regulated under the light/dark treatments, and eleven genes showed differential expression under at least one phytohormone treatment. Moreover, we verified the diurnal expression of the OsBBX genes using the data obtained from the Diurnal Project and qPCR analysis, and the results indicated that many of these genes had a diurnal expression pattern. Conclusions/Significance The combination of the genome-wide identification and the expression and diurnal analysis of the OsBBX gene family should facilitate additional functional studies of the OsBBX genes. PMID:23118960
[The mutation analysis of PAH gene and prenatal diagnosis in classical phenylketonuria family].

PubMed

Yan, Yousheng; Hao, Shengju; Yao, Fengxia; Sun, Qingmei; Zheng, Lei; Zhang, Qinghua; Zhang, Chuan; Yang, Tao; Huang, Shangzhi

2014-12-01

To characterize the mutation spectrum of phenylalanine hydroxylase (PAH) gene and perform prenatal diagnosis for families with classical phenylketonuria. By stratified sequencing, mutations were detected in the exons and flaking introns of PAH gene of 44 families with classical phenylketonuria. 47 fetuses were diagnosed by combined sequencing with linkage analysis of three common short tandem repeats (STR) (PAH-STR, PAH-26 and PAH-32) in the PAH gene. Thirty-one types of mutations were identified. A total of 84 mutations were identified in 88 alleles (95.45%), in which the most common mutation have been R243Q (21.59%), EX6-96A>G (6.82%), IVS4-1G>A (5.86%) and IVS7+2T>A (5.86%). Most mutations were found in exons 3, 5, 6, 7, 11 and 12. The polymorphism information content (PIC) of these three STR markers was 0.71 (PAH-STR), 0.48 (PAH-26) and 0.40 (PAH-32), respectively. Prenatal diagnosis was performed successfully with the combined method in 47 fetuses of 44 classical phenylketonuria families. Among them, 11 (23.4%) were diagnosed as affected, 24 (51.1%) as carriers, and 12 (25.5%) as unaffected. Prenatal diagnosis can be achieved efficiently and accurately by stratified sequencing of PAH gene and linkage analysis of STR for classical phenylketonuria families.
Analysis of differential gene expression by bead-based fiber-optic array in growth-hormone-secreting pituitary adenomas.

PubMed

Jiang, Zhiquan; Gui, Songbo; Zhang, Yazhuo

2010-09-01

Growth-hormone-secreting pituitary adenomas (GHomas) account for approximately 20% of all pituitary neoplasms. However, the pathogenesis of GHomas remains to be elucidated. To explore the possible pathogenesis of GHomas, we used bead-based fiber-optic arrays to examine the gene expression in five GHomas and compared them to three healthy pituitaries. Four differentially expressed genes were chosen randomly for validation by quantitative real-time reverse transcription-polymerase chain reaction. We then performed pathway analysis on the identified differentially expressed genes using the Kyoto Encyclopedia of Genes and Genomes. Array analysis showed significant increases in the expression of 353 genes and 206 expressed sequence tags (ESTs) and decreases in 565 genes and 29 ESTs. Bioinformatic analysis showed that the genes HIGD1B, HOXB2, ANGPT2, HPGD and BTG2 may play an important role in the tumorigenesis and progression of GHomas. Pathway analysis showed that the wingless-type signaling pathway and extracellular-matrix receptor interactions may play a key role in the tumorigenesis and progression of GHomas. Our data suggested that there are numerous aberrantly expressed genes and pathways involved in the pathogenesis of GHomas. Bead-based fiber-optic arrays combined with pathway analysis of differentially expressed genes appear to be a valid method for investigating the pathogenesis of tumors.
Analysis of differential gene expression by bead-based fiber-optic array in growth-hormone-secreting pituitary adenomas

PubMed Central

JIANG, ZHIQUAN; GUI, SONGBO; ZHANG, YAZHUO

2010-01-01

Growth-hormone-secreting pituitary adenomas (GHomas) account for approximately 20% of all pituitary neoplasms. However, the pathogenesis of GHomas remains to be elucidated. To explore the possible pathogenesis of GHomas, we used bead-based fiber-optic arrays to examine the gene expression in five GHomas and compared them to three healthy pituitaries. Four differentially expressed genes were chosen randomly for validation by quantitative real-time reverse transcription-polymerase chain reaction. We then performed pathway analysis on the identified differentially expressed genes using the Kyoto Encyclopedia of Genes and Genomes. Array analysis showed significant increases in the expression of 353 genes and 206 expressed sequence tags (ESTs) and decreases in 565 genes and 29 ESTs. Bioinformatic analysis showed that the genes HIGD1B, HOXB2, ANGPT2, HPGD and BTG2 may play an important role in the tumorigenesis and progression of GHomas. Pathway analysis showed that the wingless-type signaling pathway and extracellular-matrix receptor interactions may play a key role in the tumorigenesis and progression of GHomas. Our data suggested that there are numerous aberrantly expressed genes and pathways involved in the pathogenesis of GHomas. Bead-based fiber-optic arrays combined with pathway analysis of differentially expressed genes appear to be a valid method for investigating the pathogenesis of tumors. PMID:22993617
DEIVA: a web application for interactive visual analysis of differential gene expression profiles.

PubMed

Harshbarger, Jayson; Kratz, Anton; Carninci, Piero

2017-01-07

Differential gene expression (DGE) analysis is a technique to identify statistically significant differences in RNA abundance for genes or arbitrary features between different biological states. The result of a DGE test is typically further analyzed using statistical software, spreadsheets or custom ad hoc algorithms. We identified a need for a web-based system to share DGE statistical test results, and locate and identify genes in DGE statistical test results with a very low barrier of entry. We have developed DEIVA, a free and open source, browser-based single page application (SPA) with a strong emphasis on being user friendly that enables locating and identifying single or multiple genes in an immediate, interactive, and intuitive manner. By design, DEIVA scales with very large numbers of users and datasets. Compared to existing software, DEIVA offers a unique combination of design decisions that enable inspection and analysis of DGE statistical test results with an emphasis on ease of use.
Improved score statistics for meta-analysis in single-variant and gene-level association studies.

PubMed

Yang, Jingjing; Chen, Sai; Abecasis, Gonçalo

2018-06-01

Meta-analysis is now an essential tool for genetic association studies, allowing them to combine large studies and greatly accelerating the pace of genetic discovery. Although the standard meta-analysis methods perform equivalently as the more cumbersome joint analysis under ideal settings, they result in substantial power loss under unbalanced settings with various case-control ratios. Here, we investigate the power loss problem by the standard meta-analysis methods for unbalanced studies, and further propose novel meta-analysis methods performing equivalently to the joint analysis under both balanced and unbalanced settings. We derive improved meta-score-statistics that can accurately approximate the joint-score-statistics with combined individual-level data, for both linear and logistic regression models, with and without covariates. In addition, we propose a novel approach to adjust for population stratification by correcting for known population structures through minor allele frequencies. In the simulated gene-level association studies under unbalanced settings, our method recovered up to 85% power loss caused by the standard methods. We further showed the power gain of our methods in gene-level tests with 26 unbalanced studies of age-related macular degeneration . In addition, we took the meta-analysis of three unbalanced studies of type 2 diabetes as an example to discuss the challenges of meta-analyzing multi-ethnic samples. In summary, our improved meta-score-statistics with corrections for population stratification can be used to construct both single-variant and gene-level association studies, providing a useful framework for ensuring well-powered, convenient, cross-study analyses. © 2018 WILEY PERIODICALS, INC.
Computational Gene Expression Modeling Identifies Salivary Biomarker Analysis that Predict Oral Feeding Readiness in the Newborn

PubMed Central

Maron, Jill L.; Hwang, Jooyeon S.; Pathak, Subash; Ruthazer, Robin; Russell, Ruby L.; Alterovitz, Gil

2014-01-01

Objective To combine mathematical modeling of salivary gene expression microarray data and systems biology annotation with RT-qPCR amplification to identify (phase I) and validate (phase II) salivary biomarker analysis for the prediction of oral feeding readiness in preterm infants. Study design Comparative whole transcriptome microarray analysis from 12 preterm newborns pre- and post-oral feeding success was used for computational modeling and systems biology analysis to identify potential salivary transcripts associated with oral feeding success (phase I). Selected gene expression biomarkers (15 from computational modeling; 6 evidence-based; and 3 reference) were evaluated by RT-qPCR amplification on 400 salivary samples from successful (n=200) and unsuccessful (n=200) oral feeders (phase II). Genes, alone and in combination, were evaluated by a multivariate analysis controlling for sex and post-conceptional age (PCA) to determine the probability that newborns achieved successful oral feeding. Results Advancing post-conceptional age (p < 0.001) and female sex (p = 0.05) positively predicted an infant’s ability to feed orally. A combination of five genes, NPY2R (hunger signaling), AMPK (energy homeostasis), PLXNA1 (olfactory neurogenesis), NPHP4 (visual behavior) and WNT3 (facial development), in addition to PCA and sex, demonstrated good accuracy for determining feeding success (AUROC = 0.78). Conclusions We have identified objective and biologically relevant salivary biomarkers that noninvasively assess a newborn’s developing brain, sensory and facial development as they relate to oral feeding success. Understanding the mechanisms that underlie the development of oral feeding readiness through translational and computational methods may improve clinical decision making while decreasing morbidities and health care costs. PMID:25620512

Combination of gene expression patterns in whole blood discriminate between tuberculosis infection states

PubMed Central

2014-01-01

Background Genetic factors are involved in susceptibility or protection to tuberculosis (TB). Apart from gene polymorphisms and mutations, changes in levels of gene expression, induced by non-genetic factors, may also determine whether individuals progress to active TB. Methods We analysed the expression level of 45 genes in a total of 47 individuals (23 healthy household contacts and 24 new smear-positive pulmonary TB patients) in Addis Ababa using a dual colour multiplex ligation-dependent probe amplification (dcRT-MLPA) technique to assess gene expression profiles that may be used to distinguish TB cases and their contacts and also latently infected (LTBI) and uninfected household contacts. Results The gene expression level of BLR1, Bcl2, IL4d2, IL7R, FCGR1A, MARCO, MMP9, CCL19, and LTF had significant discriminatory power between sputum smear-positive TB cases and household contacts, with AUCs of 0.84, 0.81, 0.79, 0.79, 0.78, 0.76, 0.75, 0.75 and 0.68 respectively. The combination of Bcl2, BLR1, FCGR1A, IL4d2 and MARCO identified 91.66% of active TB cases and 95.65% of household contacts without active TB. The expression of CCL19, TGFB1, and Foxp3 showed significant difference between LTBI and uninfected contacts, with AUCs of 0.85, 0.82, and 0.75, respectively, whereas the combination of BPI, CCL19, FoxP3, FPR1 and TGFB1 identified 90.9% of QFT- and 91.6% of QFT+ household contacts. Conclusions Expression of single and especially combinations of host genes can accurately differentiate between active TB cases and healthy individuals as well as between LTBI and uninfected contacts. PMID:24885723
Avirulence Genes in Cereal Powdery Mildews: The Gene-for-Gene Hypothesis 2.0

PubMed Central

Bourras, Salim; McNally, Kaitlin E.; Müller, Marion C.; Wicker, Thomas; Keller, Beat

2016-01-01

The gene-for-gene hypothesis states that for each gene controlling resistance in the host, there is a corresponding, specific gene controlling avirulence in the pathogen. Allelic series of the cereal mildew resistance genes Pm3 and Mla provide an excellent system for genetic and molecular analysis of resistance specificity. Despite this opportunity for molecular research, avirulence genes in mildews remain underexplored. Earlier work in barley powdery mildew (B.g. hordei) has shown that the reaction to some Mla resistance alleles is controlled by multiple genes. Similarly, several genes are involved in the specific interaction of wheat mildew (B.g. tritici) with the Pm3 allelic series. We found that two mildew genes control avirulence on Pm3f: one gene is involved in recognition by the resistance protein as demonstrated by functional studies in wheat and the heterologous host Nicotiana benthamiana. A second gene is a suppressor, and resistance is only observed in mildew genotypes combining the inactive suppressor and the recognized Avr. We propose that such suppressor/avirulence gene combinations provide the basis of specificity in mildews. Depending on the particular gene combinations in a mildew race, different genes will be genetically identified as the “avirulence” gene. Additionally, the observation of two LINE retrotransposon-encoded avirulence genes in B.g. hordei further suggests that the control of avirulence in mildew is more complex than a canonical gene-for-gene interaction. To fully understand the mildew–cereal interactions, more knowledge on avirulence determinants is needed and we propose ways how this can be achieved based on recent advances in the field. PMID:26973683
Robust extraction of functional signals from gene set analysis using a generalized threshold free scoring function

PubMed Central

2009-01-01

Background A central task in contemporary biosciences is the identification of biological processes showing response in genome-wide differential gene expression experiments. Two types of analysis are common. Either, one generates an ordered list based on the differential expression values of the probed genes and examines the tail areas of the list for over-representation of various functional classes. Alternatively, one monitors the average differential expression level of genes belonging to a given functional class. So far these two types of method have not been combined. Results We introduce a scoring function, Gene Set Z-score (GSZ), for the analysis of functional class over-representation that combines two previous analysis methods. GSZ encompasses popular functions such as correlation, hypergeometric test, Max-Mean and Random Sets as limiting cases. GSZ is stable against changes in class size as well as across different positions of the analysed gene list in tests with randomized data. GSZ shows the best overall performance in a detailed comparison to popular functions using artificial data. Likewise, GSZ stands out in a cross-validation of methods using split real data. A comparison of empirical p-values further shows a strong difference in favour of GSZ, which clearly reports better p-values for top classes than the other methods. Furthermore, GSZ detects relevant biological themes that are missed by the other methods. These observations also hold when comparing GSZ with popular program packages. Conclusion GSZ and improved versions of earlier methods are a useful contribution to the analysis of differential gene expression. The methods and supplementary material are available from the website http://ekhidna.biocenter.helsinki.fi/users/petri/public/GSZ/GSZscore.html. PMID:19775443
Effect of NET-1 siRNA conjugated sub-micron bubble complex combined with low-frequency ultrasound exposure in gene transfection

PubMed Central

Wu, Bolin; Liang, Xitian; Jing, Hui; Han, Xue; Sun, Yixin; Guo, Cunli; Liu, Ying; Cheng, Wen

2018-01-01

The present study evaluated the effect of NET-1 siRNA-conjugated sub-micron bubble (SMB) complexes combined with low-frequency ultrasound exposure in gene transfection. The NET-1 gene was highly expressed level in SMMC-7721 human hepatocellular carcinoma cell line. The cells were divided into seven groups and treated with different conditions. The groups with or without low-frequency ultrasound exposure, groups of adherent cells, and suspension cells were separated. The NET-1 siRNA-conjugated SMB complexes were made in the laboratory and tested by Zetasizer Nano ZS90 analyzer. Flow cytometry was used to estimate the transfection efficiency and cellular apoptosis. Western blot and quantitative real-time polymerase chain reaction (qPCR) were used for the estimation of the protein and mRNA expressions, respectively. Transwell analysis determined the migration and invasion capacities of the tumor cells. The results did not show any difference in the transfection efficiency between adherent and suspension cells. However, the NET-1 siRNA-SMB complexes combined with low-frequency ultrasound exposure could enhance the gene transfection effectively. In summary, the NET-1 siRNA-SMB complexes appeared to be promising gene vehicle. PMID:29423111
Methods to increase reproducibility in differential gene expression via meta-analysis

PubMed Central

Sweeney, Timothy E.; Haynes, Winston A.; Vallania, Francesco; Ioannidis, John P.; Khatri, Purvesh

2017-01-01

Findings from clinical and biological studies are often not reproducible when tested in independent cohorts. Due to the testing of a large number of hypotheses and relatively small sample sizes, results from whole-genome expression studies in particular are often not reproducible. Compared to single-study analysis, gene expression meta-analysis can improve reproducibility by integrating data from multiple studies. However, there are multiple choices in designing and carrying out a meta-analysis. Yet, clear guidelines on best practices are scarce. Here, we hypothesized that studying subsets of very large meta-analyses would allow for systematic identification of best practices to improve reproducibility. We therefore constructed three very large gene expression meta-analyses from clinical samples, and then examined meta-analyses of subsets of the datasets (all combinations of datasets with up to N/2 samples and K/2 datasets) compared to a ‘silver standard’ of differentially expressed genes found in the entire cohort. We tested three random-effects meta-analysis models using this procedure. We showed relatively greater reproducibility with more-stringent effect size thresholds with relaxed significance thresholds; relatively lower reproducibility when imposing extraneous constraints on residual heterogeneity; and an underestimation of actual false positive rate by Benjamini–Hochberg correction. In addition, multivariate regression showed that the accuracy of a meta-analysis increased significantly with more included datasets even when controlling for sample size. PMID:27634930
Literature mining, gene-set enrichment and pathway analysis for target identification in Behçet's disease.

PubMed

Wilson, Paul; Larminie, Christopher; Smith, Rona

2016-01-01

To use literature mining to catalogue Behçet's associated genes, and advanced computational methods to improve the understanding of the pathways and signalling mechanisms that lead to the typical clinical characteristics of Behçet's patients. To extend this technique to identify potential treatment targets for further experimental validation. Text mining methods combined with gene enrichment tools, pathway analysis and causal analysis algorithms. This approach identified 247 human genes associated with Behçet's disease and the resulting disease map, comprising 644 nodes and 19220 edges, captured important details of the relationships between these genes and their associated pathways, as described in diverse data repositories. Pathway analysis has identified how Behçet's associated genes are likely to participate in innate and adaptive immune responses. Causal analysis algorithms have identified a number of potential therapeutic strategies for further investigation. Computational methods have captured pertinent features of the prominent disease characteristics presented in Behçet's disease and have highlighted NOD2, ICOS and IL18 signalling as potential therapeutic strategies.
Abrogation of Microsatellite-instable Tumors Using a Highly Selective Suicide Gene/Prodrug Combination

PubMed Central

Ferrás, Cristina; Oude Vrielink, Joachim AF; Verspuy, Johan WA; te Riele, Hein; Tsaalbi-Shtylik, Anastasia; de Wind, Niels

2009-01-01

A substantial fraction of sporadic and inherited colorectal and endometrial cancers in humans is deficient in DNA mismatch repair (MMR). These cancers are characterized by length alterations in ubiquitous simple sequence repeats, a phenotype called microsatellite instability. Here we have exploited this phenotype by developing a novel approach for the highly selective gene therapy of MMR-deficient tumors. To achieve this selectivity, we mutated the VP22FCU1 suicide gene by inserting an out-of-frame microsatellite within its coding region. We show that in a significant fraction of microsatellite-instable (MSI) cells carrying the mutated suicide gene, full-length protein becomes expressed within a few cell doublings, presumably resulting from a reverting frameshift within the inserted microsatellite. Treatment of these cells with the innocuous prodrug 5-fluorocytosine (5-FC) induces strong cytotoxicity and we demonstrate that this owes to multiple bystander effects conferred by the suicide gene/prodrug combination. In a mouse model, MMR-deficient tumors that contained the out-of-frame VP22FCU1 gene displayed strong remission after treatment with 5-FC, without any obvious adverse systemic effects to the mouse. By virtue of its high selectivity and potency, this conditional enzyme/prodrug combination may hold promise for the treatment or prevention of MMR-deficient cancer in humans. PMID:19471249
Phylogeny Inference of Closely Related Bacterial Genomes: Combining the Features of Both Overlapping Genes and Collinear Genomic Regions

PubMed Central

Zhang, Yan-Cong; Lin, Kui

2015-01-01

Overlapping genes (OGs) represent one type of widespread genomic feature in bacterial genomes and have been used as rare genomic markers in phylogeny inference of closely related bacterial species. However, the inference may experience a decrease in performance for phylogenomic analysis of too closely or too distantly related genomes. Another drawback of OGs as phylogenetic markers is that they usually take little account of the effects of genomic rearrangement on the similarity estimation, such as intra-chromosome/genome translocations, horizontal gene transfer, and gene losses. To explore such effects on the accuracy of phylogeny reconstruction, we combine phylogenetic signals of OGs with collinear genomic regions, here called locally collinear blocks (LCBs). By putting these together, we refine our previous metric of pairwise similarity between two closely related bacterial genomes. As a case study, we used this new method to reconstruct the phylogenies of 88 Enterobacteriale genomes of the class Gammaproteobacteria. Our results demonstrated that the topological accuracy of the inferred phylogeny was improved when both OGs and LCBs were simultaneously considered, suggesting that combining these two phylogenetic markers may reduce, to some extent, the influence of gene loss on phylogeny inference. Such phylogenomic studies, we believe, will help us to explore a more effective approach to increasing the robustness of phylogeny reconstruction of closely related bacterial organisms. PMID:26715828
Rapid detection of pathological mutations and deletions of the haemoglobin beta gene (HBB) by High Resolution Melting (HRM) analysis and Gene Ratio Analysis Copy Enumeration PCR (GRACE-PCR).

PubMed

Turner, Andrew; Sasse, Jurgen; Varadi, Aniko

2016-10-19

Inherited disorders of haemoglobin are the world's most common genetic diseases, resulting in significant morbidity and mortality. The large number of mutations associated with the haemoglobin beta gene (HBB) makes gene scanning by High Resolution Melting (HRM) PCR an attractive diagnostic approach. However, existing HRM-PCR assays are not able to detect all common point mutations and have only a very limited ability to detect larger gene rearrangements. The aim of the current study was to develop a HBB assay, which can be used as a screening test in highly heterogeneous populations, for detection of both point mutations and larger gene rearrangements. The assay is based on a combination of conventional HRM-PCR and a novel Gene Ratio Analysis Copy Enumeration (GRACE) PCR method. HRM-PCR was extensively optimised, which included the use of an unlabelled probe and incorporation of universal bases into primers to prevent interference from common non-pathological polymorphisms. GRACE-PCR was employed to determine HBB gene copy numbers relative to a reference gene using melt curve analysis to detect rearrangements in the HBB gene. The performance of the assay was evaluated by analysing 410 samples. A total of 44 distinct pathological genotypes were detected. In comparison with reference methods, the assay has a sensitivity of 100 % and a specificity of 98 %. We have developed an assay that detects both point mutations and larger rearrangements of the HBB gene. This assay is quick, sensitive, specific and cost effective making it suitable as an initial screening test that can be used for highly heterogeneous cohorts.
Combined sequence and sequence-structure-based methods for analyzing RAAS gene SNPs: a computational approach.

PubMed

Singh, Kh Dhanachandra; Karthikeyan, Muthusamy

2014-12-01

The renin-angiotensin-aldosterone system (RAAS) plays a key role in the regulation of blood pressure (BP). Mutations on the genes that encode components of the RAAS have played a significant role in genetic susceptibility to hypertension and have been intensively scrutinized. The identification of such probably causal mutations not only provides insight into the RAAS but may also serve as antihypertensive therapeutic targets and diagnostic markers. The methods for analyzing the SNPs from the huge dataset of SNPs, containing both functional and neutral SNPs is challenging by the experimental approach on every SNPs to determine their biological significance. To explore the functional significance of genetic mutation (SNPs), we adopted combined sequence and sequence-structure-based SNP analysis algorithm. Out of 3864 SNPs reported in dbSNP, we found 108 missense SNPs in the coding region and remaining in the non-coding region. In this study, we are reporting only those SNPs in coding region to be deleterious when three or more tools are predicted to be deleterious and which have high RMSD from the native structure. Based on these analyses, we have identified two SNPs of REN gene, eight SNPs of AGT gene, three SNPs of ACE gene, two SNPs of AT1R gene, three SNPs of CYP11B2 gene and three SNPs of CMA1 gene in the coding region were found to be deleterious. Further this type of study will be helpful in reducing the cost and time for identification of potential SNP and also helpful in selecting potential SNP for experimental study out of SNP pool.
Isolated and combined dystonia syndromes - an update on new genes and their phenotypes.

PubMed

Balint, B; Bhatia, K P

2015-04-01

Recent consensus on the definition, phenomenology and classification of dystonia centres around phenomenology and guides our diagnostic approach for the heterogeneous group of dystonias. Current terminology classifies conditions where dystonia is the sole motor feature (apart from tremor) as 'isolated dystonia', while 'combined dystonia' refers to dystonias with other accompanying movement disorders. This review highlights recent advances in the genetics of some isolated and combined dystonic syndromes. Some genes, such as ANO3, GNAL and CIZ1, have been discovered for isolated dystonia, but they are probably not a common cause of classic cervical dystonia. Conversely, the phenotype associated with TUBB4A mutations expanded from that of isolated dystonia to a syndrome of hypomyelination with atrophy of the basal ganglia and cerebellum (H-ABC syndrome). Similarly, ATP1A3 mutations cause a wide phenotypic spectrum ranging from rapid-onset dystonia-parkinsonism to alternating hemiplegia of childhood. Other entities entailing dystonia-parkinsonism include dopamine transporter deficiency syndrome (SLC63 mutations); dopa-responsive dystonias; young-onset parkinsonism (PARKIN, PINK1 and DJ-1 mutations); PRKRA mutations; and X-linked TAF1 mutations, which rarely can also manifest in women. Clinical and genetic heterogeneity also characterizes myoclonus-dystonia, which includes not only the classical phenotype associated with epsilon-sarcoglycan mutations but rarely also presentation of ANO3 gene mutations, TITF1 gene mutations typically underlying benign hereditary chorea, and some dopamine synthesis pathway conditions due to GCH1 and TH mutations. Thus, new genes are being recognized for isolated dystonia, and the phenotype of known genes is broadening and now involves different combined dystonia syndromes. © 2015 EAN.
Gene expression profile analysis of Ligon lintless-1 (Li1) mutant reveals important genes and pathways in cotton leaf and fiber development.

PubMed

Ding, Mingquan; Jiang, Yurong; Cao, Yuefen; Lin, Lifeng; He, Shae; Zhou, Wei; Rong, Junkang

2014-02-10

Ligon lintless-1 (Li1) is a monogenic dominant mutant of Gossypium hirsutum (upland cotton) with a phenotype of impaired vegetative growth and short lint fibers. Despite years of research involving genetic mapping and gene expression profile analysis of Li1 mutant ovule tissues, the gene remains uncloned and the underlying pathway of cotton fiber elongation is still unclear. In this study, we report the whole genome-level deep-sequencing analysis of leaf tissues of the Li1 mutant. Differentially expressed genes in leaf tissues of mutant versus wild-type (WT) plants are identified, and the underlying pathways and potential genes that control leaf and fiber development are inferred. The results show that transcription factors AS2, YABBY5, and KANDI-like are significantly differentially expressed in mutant tissues compared with WT ones. Interestingly, several fiber development-related genes are found in the downregulated gene list of the mutant leaf transcriptome. These genes include heat shock protein family, cytoskeleton arrangement, cell wall synthesis, energy, H2O2 metabolism-related genes, and WRKY transcription factors. This finding suggests that the genes are involved in leaf morphology determination and fiber elongation. The expression data are also compared with the previously published microarray data of Li1 ovule tissues. Comparative analysis of the ovule transcriptomes of Li1 and WT reveals that a number of pathways important for fiber elongation are enriched in the downregulated gene list at different fiber development stages (0, 6, 9, 12, 15, 18dpa). Differentially expressed genes identified in both leaf and fiber samples are aligned with cotton whole genome sequences and combined with the genetic fine mapping results to identify a list of candidate genes for Li1. Copyright © 2013 Elsevier B.V. All rights reserved.
Multilevel Regulation of Bacterial Gene Expression with the Combined STAR and Antisense RNA System.

PubMed

Lee, Young Je; Kim, Soo-Jung; Moon, Tae Seok

2018-03-16

Synthetic small RNA regulators have emerged as a versatile tool to predictably control bacterial gene expression. Owing to their simple design principles, small size, and highly orthogonal behavior, these engineered genetic parts have been incorporated into genetic circuits. However, efforts to achieve more sophisticated cellular functions using RNA regulators have been hindered by our limited ability to integrate different RNA regulators into complex circuits. Here, we present a combined RNA regulatory system in Escherichia coli that uses small transcription activating RNA (STAR) and antisense RNA (asRNA) to activate or deactivate target gene expression in a programmable manner. Specifically, we demonstrated that the activated target output by the STAR system can be deactivated by expressing two different types of asRNAs: one binds to and sequesters the STAR regulator, affecting the transcription process, while the other binds to the target mRNA, affecting the translation process. We improved deactivation efficiencies (up to 96%) by optimizing each type of asRNA and then integrating the two optimized asRNAs into a single circuit. Furthermore, we demonstrated that the combined STAR and asRNA system can control gene expression in a reversible way and can regulate expression of a gene in the genome. Lastly, we constructed and simultaneously tested two A AND NOT B logic gates in the same cell to show sophisticated multigene regulation by the combined system. Our approach establishes a methodology for integrating multiple RNA regulators to rationally control multiple genes.
Analysis of bHLH coding genes using gene co-expression network approach.

PubMed

Srivastava, Swati; Sanchita; Singh, Garima; Singh, Noopur; Srivastava, Gaurava; Sharma, Ashok

2016-07-01

Network analysis provides a powerful framework for the interpretation of data. It uses novel reference network-based metrices for module evolution. These could be used to identify module of highly connected genes showing variation in co-expression network. In this study, a co-expression network-based approach was used for analyzing the genes from microarray data. Our approach consists of a simple but robust rank-based network construction. The publicly available gene expression data of Solanum tuberosum under cold and heat stresses were considered to create and analyze a gene co-expression network. The analysis provide highly co-expressed module of bHLH coding genes based on correlation values. Our approach was to analyze the variation of genes expression, according to the time period of stress through co-expression network approach. As the result, the seed genes were identified showing multiple connections with other genes in the same cluster. Seed genes were found to be vary in different time periods of stress. These analyzed seed genes may be utilized further as marker genes for developing the stress tolerant plant species.
Gene ARMADA: an integrated multi-analysis platform for microarray data implemented in MATLAB.

PubMed

Chatziioannou, Aristotelis; Moulos, Panagiotis; Kolisis, Fragiskos N

2009-10-27

The microarray data analysis realm is ever growing through the development of various tools, open source and commercial. However there is absence of predefined rational algorithmic analysis workflows or batch standardized processing to incorporate all steps, from raw data import up to the derivation of significantly differentially expressed gene lists. This absence obfuscates the analytical procedure and obstructs the massive comparative processing of genomic microarray datasets. Moreover, the solutions provided, heavily depend on the programming skills of the user, whereas in the case of GUI embedded solutions, they do not provide direct support of various raw image analysis formats or a versatile and simultaneously flexible combination of signal processing methods. We describe here Gene ARMADA (Automated Robust MicroArray Data Analysis), a MATLAB implemented platform with a Graphical User Interface. This suite integrates all steps of microarray data analysis including automated data import, noise correction and filtering, normalization, statistical selection of differentially expressed genes, clustering, classification and annotation. In its current version, Gene ARMADA fully supports 2 coloured cDNA and Affymetrix oligonucleotide arrays, plus custom arrays for which experimental details are given in tabular form (Excel spreadsheet, comma separated values, tab-delimited text formats). It also supports the analysis of already processed results through its versatile import editor. Besides being fully automated, Gene ARMADA incorporates numerous functionalities of the Statistics and Bioinformatics Toolboxes of MATLAB. In addition, it provides numerous visualization and exploration tools plus customizable export data formats for seamless integration by other analysis tools or MATLAB, for further processing. Gene ARMADA requires MATLAB 7.4 (R2007a) or higher and is also distributed as a stand-alone application with MATLAB Component Runtime. Gene ARMADA provides a
Random forests-based differential analysis of gene sets for gene expression data.

PubMed

Hsueh, Huey-Miin; Zhou, Da-Wei; Tsai, Chen-An

2013-04-10

In DNA microarray studies, gene-set analysis (GSA) has become the focus of gene expression data analysis. GSA utilizes the gene expression profiles of functionally related gene sets in Gene Ontology (GO) categories or priori-defined biological classes to assess the significance of gene sets associated with clinical outcomes or phenotypes. Many statistical approaches have been proposed to determine whether such functionally related gene sets express differentially (enrichment and/or deletion) in variations of phenotypes. However, little attention has been given to the discriminatory power of gene sets and classification of patients. In this study, we propose a method of gene set analysis, in which gene sets are used to develop classifications of patients based on the Random Forest (RF) algorithm. The corresponding empirical p-value of an observed out-of-bag (OOB) error rate of the classifier is introduced to identify differentially expressed gene sets using an adequate resampling method. In addition, we discuss the impacts and correlations of genes within each gene set based on the measures of variable importance in the RF algorithm. Significant classifications are reported and visualized together with the underlying gene sets and their contribution to the phenotypes of interest. Numerical studies using both synthesized data and a series of publicly available gene expression data sets are conducted to evaluate the performance of the proposed methods. Compared with other hypothesis testing approaches, our proposed methods are reliable and successful in identifying enriched gene sets and in discovering the contributions of genes within a gene set. The classification results of identified gene sets can provide an valuable alternative to gene set testing to reveal the unknown, biologically relevant classes of samples or patients. In summary, our proposed method allows one to simultaneously assess the discriminatory ability of gene sets and the importance of genes for
Large-scale gene function analysis with the PANTHER classification system.

PubMed

Mi, Huaiyu; Muruganujan, Anushya; Casagrande, John T; Thomas, Paul D

2013-08-01

The PANTHER (protein annotation through evolutionary relationship) classification system (http://www.pantherdb.org/) is a comprehensive system that combines gene function, ontology, pathways and statistical analysis tools that enable biologists to analyze large-scale, genome-wide data from sequencing, proteomics or gene expression experiments. The system is built with 82 complete genomes organized into gene families and subfamilies, and their evolutionary relationships are captured in phylogenetic trees, multiple sequence alignments and statistical models (hidden Markov models or HMMs). Genes are classified according to their function in several different ways: families and subfamilies are annotated with ontology terms (Gene Ontology (GO) and PANTHER protein class), and sequences are assigned to PANTHER pathways. The PANTHER website includes a suite of tools that enable users to browse and query gene functions, and to analyze large-scale experimental data with a number of statistical tests. It is widely used by bench scientists, bioinformaticians, computer scientists and systems biologists. In the 2013 release of PANTHER (v.8.0), in addition to an update of the data content, we redesigned the website interface to improve both user experience and the system's analytical capability. This protocol provides a detailed description of how to analyze genome-wide experimental data with the PANTHER classification system.
Unique Physiological and Transcriptional Shifts under Combinations of Salinity, Drought, and Heat.

PubMed

Shaar-Moshe, Lidor; Blumwald, Eduardo; Peleg, Zvi

2017-05-01

Climate-change-driven stresses such as extreme temperatures, water deficit, and ion imbalance are projected to exacerbate and jeopardize global food security. Under field conditions, these stresses usually occur simultaneously and cause damages that exceed single stresses. Here, we investigated the transcriptional patterns and morpho-physiological acclimations of Brachypodium dystachion to single salinity, drought, and heat stresses, as well as their double and triple stress combinations. Hierarchical clustering analysis of morpho-physiological acclimations showed that several traits exhibited a gradually aggravating effect as plants were exposed to combined stresses. On the other hand, other morphological traits were dominated by salinity, while some physiological traits were shaped by heat stress. Response patterns of differentially expressed genes, under single and combined stresses (i.e. common stress genes), were maintained only among 37% of the genes, indicating a limited expression consistency among partially overlapping stresses. A comparison between common stress genes and genes that were uniquely expressed only under combined stresses (i.e. combination unique genes) revealed a significant shift from increased intensity to antagonistic responses, respectively. The different transcriptional signatures imply an alteration in the mode of action under combined stresses and limited ability to predict plant responses as different stresses are combined. Coexpression analysis coupled with enrichment analysis revealed that each gene subset was enriched with different biological processes. Common stress genes were enriched with known stress response pathways, while combination unique-genes were enriched with unique processes and genes with unknown functions that hold the potential to improve stress tolerance and enhance cereal productivity under suboptimal field conditions. © 2017 American Society of Plant Biologists. All Rights Reserved.
Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach.

PubMed

Chowdhury, Nilotpal; Sapru, Shantanu

2015-01-01

Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and interesting
Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach

PubMed Central

Chowdhury, Nilotpal; Sapru, Shantanu

2015-01-01

Introduction Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. Aim The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Methods Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate – adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Results Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. Conclusion To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This

Combining Evidence of Preferential Gene-Tissue Relationships from Multiple Sources

PubMed Central

Guo, Jing; Hammar, Mårten; Öberg, Lisa; Padmanabhuni, Shanmukha S.; Bjäreland, Marcus; Dalevi, Daniel

2013-01-01

An important challenge in drug discovery and disease prognosis is to predict genes that are preferentially expressed in one or a few tissues, i.e. showing a considerably higher expression in one tissue(s) compared to the others. Although several data sources and methods have been published explicitly for this purpose, they often disagree and it is not evident how to retrieve these genes and how to distinguish true biological findings from those that are due to choice-of-method and/or experimental settings. In this work we have developed a computational approach that combines results from multiple methods and datasets with the aim to eliminate method/study-specific biases and to improve the predictability of preferentially expressed human genes. A rule-based score is used to merge and assign support to the results. Five sets of genes with known tissue specificity were used for parameter pruning and cross-validation. In total we identify 3434 tissue-specific genes. We compare the genes of highest scores with the public databases: PaGenBase (microarray), TiGER (EST) and HPA (protein expression data). The results have 85% overlap to PaGenBase, 71% to TiGER and only 28% to HPA. 99% of our predictions have support from at least one of these databases. Our approach also performs better than any of the databases on identifying drug targets and biomarkers with known tissue-specificity. PMID:23950964
Polymorphisms in the methylene tetrahydrofolate reductase gene and their unique combinations are associated with an increased susceptibility to the renal cancers.

PubMed

Ajaz, Sadia; Khaliq, Shagufta; Hashmi, Altaf; Naqvi, Syed Ali Anwar; Rizvi, Syed Adib-ul-Hassan; Mehdi, Syed Qasim

2012-05-01

Two single nucleotide polymorphisms in the methylene tetrahydrofolate reductase (MTHFR) gene, 677C/T and 1298A/C, encode the thermolabile isoforms of the MTHFR enzyme that adversely affect the folic acid metabolic pathway. In the present study, these polymorphisms were investigated for their associations with the risk and prognosis of the renal cell carcinomas (RCCs) in Pakistani patients. The study included 168 RCC patients and 178 controls. The polymorphisms were analyzed by the polymerase chain reaction-restriction fragment length polymorphism method. Statistical analysis revealed that the C-allele and homozygous C genotype of the MTHFR 1298A/C polymorphism were significantly correlated with the risk of RCCs (odds ratio [OR]=1.60; 95% confidence interval [CI]=1.1-2.34 and OR=3.26; 95% CI=1.27-8.37, respectively). The combined genotype analysis showed that the 677CC+1298CC combination greatly increased the susceptibility to RCCs (OR=8.34; 95% CI=2.7-25.7). The 677CT+1298AA and 677CC+1298CA combinations were also associated with an increased risk of RCC (OR=3.21; 95% CI=1.3-7.8 and OR=2.45; 95% CI=1.3-4.6, respectively). The combined genotype effects were also evident in a semiparametric expectation-maximization-based haplotype analysis. The results presented here indicate that the two MTHFR gene polymorphisms are significantly associated with the risk of RCCs in a cohort of Pakistani patients and may be useful as susceptibility markers in other populations of the world as well.
Meta-analysis methods for combining multiple expression profiles: comparisons, statistical characterization and an application guideline

PubMed Central

2013-01-01

Background As high-throughput genomic technologies become accurate and affordable, an increasing number of data sets have been accumulated in the public domain and genomic information integration and meta-analysis have become routine in biomedical research. In this paper, we focus on microarray meta-analysis, where multiple microarray studies with relevant biological hypotheses are combined in order to improve candidate marker detection. Many methods have been developed and applied in the literature, but their performance and properties have only been minimally investigated. There is currently no clear conclusion or guideline as to the proper choice of a meta-analysis method given an application; the decision essentially requires both statistical and biological considerations. Results We performed 12 microarray meta-analysis methods for combining multiple simulated expression profiles, and such methods can be categorized for different hypothesis setting purposes: (1) HS A : DE genes with non-zero effect sizes in all studies, (2) HS B : DE genes with non-zero effect sizes in one or more studies and (3) HS r : DE gene with non-zero effect in "majority" of studies. We then performed a comprehensive comparative analysis through six large-scale real applications using four quantitative statistical evaluation criteria: detection capability, biological association, stability and robustness. We elucidated hypothesis settings behind the methods and further apply multi-dimensional scaling (MDS) and an entropy measure to characterize the meta-analysis methods and data structure, respectively. Conclusions The aggregated results from the simulation study categorized the 12 methods into three hypothesis settings (HS A , HS B , and HS r ). Evaluation in real data and results from MDS and entropy analyses provided an insightful and practical guideline to the choice of the most suitable method in a given application. All source files for simulation and real data are available on
Meta-analysis methods for combining multiple expression profiles: comparisons, statistical characterization and an application guideline.

PubMed

Chang, Lun-Ching; Lin, Hui-Min; Sibille, Etienne; Tseng, George C

2013-12-21

As high-throughput genomic technologies become accurate and affordable, an increasing number of data sets have been accumulated in the public domain and genomic information integration and meta-analysis have become routine in biomedical research. In this paper, we focus on microarray meta-analysis, where multiple microarray studies with relevant biological hypotheses are combined in order to improve candidate marker detection. Many methods have been developed and applied in the literature, but their performance and properties have only been minimally investigated. There is currently no clear conclusion or guideline as to the proper choice of a meta-analysis method given an application; the decision essentially requires both statistical and biological considerations. We performed 12 microarray meta-analysis methods for combining multiple simulated expression profiles, and such methods can be categorized for different hypothesis setting purposes: (1) HS(A): DE genes with non-zero effect sizes in all studies, (2) HS(B): DE genes with non-zero effect sizes in one or more studies and (3) HS(r): DE gene with non-zero effect in "majority" of studies. We then performed a comprehensive comparative analysis through six large-scale real applications using four quantitative statistical evaluation criteria: detection capability, biological association, stability and robustness. We elucidated hypothesis settings behind the methods and further apply multi-dimensional scaling (MDS) and an entropy measure to characterize the meta-analysis methods and data structure, respectively. The aggregated results from the simulation study categorized the 12 methods into three hypothesis settings (HS(A), HS(B), and HS(r)). Evaluation in real data and results from MDS and entropy analyses provided an insightful and practical guideline to the choice of the most suitable method in a given application. All source files for simulation and real data are available on the author's publication website.
Combining suppressive subtractive hybridization and cDNA microarrays to identify dietary phosphorus-responsive genes of the rainbow trout (Oncorhynchus mykiss) kidney.

PubMed

Lake, Jennifer; Gravel, Catherine; Koko, Gabriel Koffi D; Robert, Claude; Vandenberg, Grant W

2010-03-01

Phosphorus (P)-responsive genes and how they regulate renal adaptation to phosphorous-deficient diets in animals, including fish, are not well understood. RNA abundance profiling using cDNA microarrays is an efficient approach to study nutrient-gene interactions and identify these dietary P-responsive genes. To test the hypothesis that dietary P-responsive genes are differentially expressed in fish fed varying P levels, rainbow trout were fed a practical high-P diet (R20: 0.96% P) or a low-P diet (R0: 0.38% P) for 7 weeks. The differentially-expressed genes between dietary groups were identified and compared from the kidney by combining suppressive subtractive hybridization (SSH) with cDNA microarray analysis. A number of genes were confirmed by real-time PCR, and correlated with plasma and bone P concentrations. Approximately 54 genes were identified as potential dietary P-responsive after 7 weeks on a diet deficient in P according to cDNA microarray analysis. Of 18 selected genes, 13 genes were confirmed to be P-responsive at 7 weeks by real-time PCR analysis, including: iNOS, cytochrome b, cytochrome c oxidase subunit II , alpha-globin I, beta-globin, ATP synthase, hyperosmotic protein 21, COL1A3, Nkef, NDPK, glucose phosphate isomerase 1, Na+/H+ exchange protein and GDP dissociation inhibitor 2. Many of these dietary P-responsive genes responded in a moderate way (R0/R20 ratio: <2-3 or >0.5) and in a transient manner to dietary P limitation. In summary, renal adaptation to dietary P deficiency in trout involves changes in the expression of several genes, suggesting a profile of metabolic stress, since many of these differentially-expressed candidates are associated with the cellular adaptative responses. Crown Copyright 2009. Published by Elsevier Inc. All rights reserved.
Principles of gene microarray data analysis.

PubMed

Mocellin, Simone; Rossi, Carlo Riccardo

2007-01-01

The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
Gene-Gene and Gene-Environment Interactions in Ulcerative Colitis

PubMed Central

Wang, Ming-Hsi; Fiocchi, Claudio; Zhu, Xiaofeng; Ripke, Stephan; Kamboh, M. Ilyas; Rebert, Nancy; Duerr, Richard H.; Achkar, Jean-Paul

2014-01-01

Genome-wide association studies (GWAS) have identified at least 133 ulcerative colitis (UC) associated loci. The role of genetic factors in clinical practice is not clearly defined. The relevance of genetic variants to disease pathogenesis is still uncertain because of not characterized gene-gene and gene-environment interactions. We examined the predictive value of combining the 133 UC risk loci with genetic interactions in an ongoing inflammatory bowel disease (IBD) GWAS. The Wellcome Trust Case-Control Consortium (WTCCC) IBD GWAS was used as a replication cohort. We applied logic regression (LR), a novel adaptive regression methodology, to search for high order interactions. Exploratory genotype correlations with UC sub-phenotypes (extent of disease, need of surgery, age of onset, extra-intestinal manifestations and primary sclerosing cholangitis (PSC)) were conducted. The combination of 133 UC loci yielded good UC risk predictability (area under the curve [AUC] of 0.86). A higher cumulative allele score predicted higher UC risk. Through LR, several lines of evidence for genetic interactions were identified and successfully replicated in the WTCCC cohort. The genetic interactions combined with the gene-smoking interaction significantly improved predictability in the model (AUC, from 0.86 to 0.89, P=3.26E-05). Explained UC variance increased from 37% to 42% after adding the interaction terms. A within case analysis found suggested genetic association with PSC. Our study demonstrates that the LR methodology allows the identification and replication of high order genetic interactions in UC GWAS datasets. UC risk can be predicted by a 133 loci and improved by adding gene-gene and gene-environment interactions. PMID:24241240
Network Analysis Reveals Putative Genes Affecting Meat Quality in Angus Cattle.

PubMed

Mateescu, Raluca G; Garrick, Dorian J; Reecy, James M

2017-01-01

Improvements in eating satisfaction will benefit consumers and should increase beef demand which is of interest to the beef industry. Tenderness, juiciness, and flavor are major determinants of the palatability of beef and are often used to reflect eating satisfaction. Carcass qualities are used as indicator traits for meat quality, with higher quality grade carcasses expected to relate to more tender and palatable meat. However, meat quality is a complex concept determined by many component traits making interpretation of genome-wide association studies (GWAS) on any one component challenging to interpret. Recent approaches combining traditional GWAS with gene network interactions theory could be more efficient in dissecting the genetic architecture of complex traits. Phenotypic measures of 23 traits reflecting carcass characteristics, components of meat quality, along with mineral and peptide concentrations were used along with Illumina 54k bovine SNP genotypes to derive an annotated gene network associated with meat quality in 2,110 Angus beef cattle. The efficient mixed model association (EMMAX) approach in combination with a genomic relationship matrix was used to directly estimate the associations between 54k SNP genotypes and each of the 23 component traits. Genomic correlated regions were identified by partial correlations which were further used along with an information theory algorithm to derive gene network clusters. Correlated SNP across 23 component traits were subjected to network scoring and visualization software to identify significant SNP. Significant pathways implicated in the meat quality complex through GO term enrichment analysis included angiogenesis, inflammation, transmembrane transporter activity, and receptor activity. These results suggest that network analysis using partial correlations and annotation of significant SNP can reveal the genetic architecture of complex traits and provide novel information regarding biological mechanisms
Meta-analysis of human gene expression in response to Mycobacterium tuberculosis infection reveals potential therapeutic targets.

PubMed

Wang, Zhang; Arat, Seda; Magid-Slav, Michal; Brown, James R

2018-01-10

With the global emergence of multi-drug resistant strains of Mycobacterium tuberculosis, new strategies to treat tuberculosis are urgently needed such as therapeutics targeting potential human host factors. Here we performed a statistical meta-analysis of human gene expression in response to both latent and active pulmonary tuberculosis infections from nine published datasets. We found 1655 genes that were significantly differentially expressed during active tuberculosis infection. In contrast, no gene was significant for latent tuberculosis. Pathway enrichment analysis identified 90 significant canonical human pathways, including several pathways more commonly related to non-infectious diseases such as the LRRK2 pathway in Parkinson's disease, and PD-1/PD-L1 signaling pathway important for new immuno-oncology therapies. The analysis of human genome-wide association studies datasets revealed tuberculosis-associated genetic variants proximal to several genes in major histocompatibility complex for antigen presentation. We propose several new targets and drug-repurposing opportunities including intravenous immunoglobulin, ion-channel blockers and cancer immuno-therapeutics for development as combination therapeutics with anti-mycobacterial agents. Our meta-analysis provides novel insights into host genes and pathways important for tuberculosis and brings forth potential drug repurposing opportunities for host-directed therapies.
Turning publicly available gene expression data into discoveries using gene set context analysis.

PubMed

Ji, Zhicheng; Vokes, Steven A; Dang, Chi V; Ji, Hongkai

2016-01-08

Gene Set Context Analysis (GSCA) is an open source software package to help researchers use massive amounts of publicly available gene expression data (PED) to make discoveries. Users can interactively visualize and explore gene and gene set activities in 25,000+ consistently normalized human and mouse gene expression samples representing diverse biological contexts (e.g. different cells, tissues and disease types, etc.). By providing one or multiple genes or gene sets as input and specifying a gene set activity pattern of interest, users can query the expression compendium to systematically identify biological contexts associated with the specified gene set activity pattern. In this way, researchers with new gene sets from their own experiments may discover previously unknown contexts of gene set functions and hence increase the value of their experiments. GSCA has a graphical user interface (GUI). The GUI makes the analysis convenient and customizable. Analysis results can be conveniently exported as publication quality figures and tables. GSCA is available at https://github.com/zji90/GSCA. This software significantly lowers the bar for biomedical investigators to use PED in their daily research for generating and screening hypotheses, which was previously difficult because of the complexity, heterogeneity and size of the data. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Haplotype combination of the bovine CFL2 gene sequence variants and association with growth traits in Qinchuan cattle.

PubMed

Sun, Yujia; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Chen, Hong

2015-06-01

The aim of this study was to examine the association of cofilin2 (CFL2) gene polymorphisms with growth traits in Chinese Qinchuan cattle. Three single nucleotide polymorphisms (SNPs) were identified in the bovine CFL2 gene using DNA sequencing and (forced) PCR-RFLP methods. These polymorphisms included a missense mutation (NC_007319.5: g. C 2213 G) in exon 4, one synonymous mutation (NC_007319.5: g. T 1694 A) in exon 4, and a mutation (NC_007319.5: g. G 1500 A) in intron 2, respectively. In addition, we evaluated the haplotype frequency and linkage disequilibrium coefficient of three sequence variants in 488 individuals in QC cattle. All the three SNPs in QC cattle belonged to an intermediate level of genetic diversity (0.25analysis of three SNPs showed that 8 different haplotypes were identified in all, but only 5 haplotypes were listed except for those with a frequency of <0.03. Hap4 (-GTC-) had the highest haplotype frequencies (34.70%). However in the three SNPs there were no significant associations between the 13 combined genotypes of the CFL2 gene and growth traits. LD analysis showed that the SNP T 1694 A and C 2213 G loci had a strong linkage (r(2)>0.33). Association analysis indicated that SNP G 1500 A, T 1694 A and C 2213 G were significantly associated with growth traits in the QC population. The results of our study suggest that the CFL2 gene may be a strong candidate gene that affects growth traits in the QC cattle breeding program. Copyright © 2015 Elsevier B.V. All rights reserved.
Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

PubMed

Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

2015-01-27

Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.
GARNET--gene set analysis with exploration of annotation relations.

PubMed

Rho, Kyoohyoung; Kim, Bumjin; Jang, Youngjun; Lee, Sanghyun; Bae, Taejeong; Seo, Jihae; Seo, Chaehwa; Lee, Jihyun; Kang, Hyunjung; Yu, Ungsik; Kim, Sunghoon; Lee, Sanghyuk; Kim, Wan Kyu

2011-02-15

Gene set analysis is a powerful method of deducing biological meaning for an a priori defined set of genes. Numerous tools have been developed to test statistical enrichment or depletion in specific pathways or gene ontology (GO) terms. Major difficulties towards biological interpretation are integrating diverse types of annotation categories and exploring the relationships between annotation terms of similar information. GARNET (Gene Annotation Relationship NEtwork Tools) is an integrative platform for gene set analysis with many novel features. It includes tools for retrieval of genes from annotation database, statistical analysis & visualization of annotation relationships, and managing gene sets. In an effort to allow access to a full spectrum of amassed biological knowledge, we have integrated a variety of annotation data that include the GO, domain, disease, drug, chromosomal location, and custom-defined annotations. Diverse types of molecular networks (pathways, transcription and microRNA regulations, protein-protein interaction) are also included. The pair-wise relationship between annotation gene sets was calculated using kappa statistics. GARNET consists of three modules--gene set manager, gene set analysis and gene set retrieval, which are tightly integrated to provide virtually automatic analysis for gene sets. A dedicated viewer for annotation network has been developed to facilitate exploration of the related annotations. GARNET (gene annotation relationship network tools) is an integrative platform for diverse types of gene set analysis, where complex relationships among gene annotations can be easily explored with an intuitive network visualization tool (http://garnet.isysbio.org/ or http://ercsb.ewha.ac.kr/garnet/).
Preferential Allele Expression Analysis Identifies Shared Germline and Somatic Driver Genes in Advanced Ovarian Cancer

PubMed Central

Halabi, Najeeb M.; Martinez, Alejandra; Al-Farsi, Halema; Mery, Eliane; Puydenus, Laurence; Pujol, Pascal; Khalak, Hanif G.; McLurcan, Cameron; Ferron, Gwenael; Querleu, Denis; Al-Azwani, Iman; Al-Dous, Eman; Mohamoud, Yasmin A.; Malek, Joel A.; Rafii, Arash

2016-01-01

Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. PMID:26735499
Transcriptome analysis of Petunia axillaris flowers reveals genes involved in morphological differentiation and metabolite transport

PubMed Central

Amano, Ikuko; Kitajima, Sakihito; Suzuki, Hideyuki; Koeduka, Takao

2018-01-01

The biosynthesis of plant secondary metabolites is associated with morphological and metabolic differentiation. As a consequence, gene expression profiles can change drastically, and primary and secondary metabolites, including intermediate and end-products, move dynamically within and between cells. However, little is known about the molecular mechanisms underlying differentiation and transport mechanisms. In this study, we performed a transcriptome analysis of Petunia axillaris subsp. parodii, which produces various volatiles in its corolla limbs and emits metabolites to attract pollinators. RNA-sequencing from leaves, buds, and limbs identified 53,243 unigenes. Analysis of differentially expressed genes, combined with gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses, showed that many biological processes were highly enriched in limbs. These included catabolic processes and signaling pathways of hormones, such as gibberellins, and metabolic pathways, including phenylpropanoids and fatty acids. Moreover, we identified five transporter genes that showed high expression in limbs, and we performed spatiotemporal expression analyses and homology searches to infer their putative functions. Our systematic analysis provides comprehensive transcriptomic information regarding morphological differentiation and metabolite transport in the Petunia flower and lays the foundation for establishing the specific mechanisms that control secondary metabolite biosynthesis in plants. PMID:29902274
A combined analysis of genetically correlated traits identifies 187 loci and a role for neurogenesis and myelination in intelligence.

PubMed

Hill, W D; Marioni, R E; Maghzian, O; Ritchie, S J; Hagenaars, S P; McIntosh, A M; Gale, C R; Davies, G; Deary, I J

2018-01-11

Intelligence, or general cognitive function, is phenotypically and genetically correlated with many traits, including a wide range of physical, and mental health variables. Education is strongly genetically correlated with intelligence (r g = 0.70). We used these findings as foundations for our use of a novel approach-multi-trait analysis of genome-wide association studies (MTAG; Turley et al. 2017)-to combine two large genome-wide association studies (GWASs) of education and intelligence, increasing statistical power and resulting in the largest GWAS of intelligence yet reported. Our study had four goals: first, to facilitate the discovery of new genetic loci associated with intelligence; second, to add to our understanding of the biology of intelligence differences; third, to examine whether combining genetically correlated traits in this way produces results consistent with the primary phenotype of intelligence; and, finally, to test how well this new meta-analytic data sample on intelligence predicts phenotypic intelligence in an independent sample. By combining datasets using MTAG, our functional sample size increased from 199,242 participants to 248,482. We found 187 independent loci associated with intelligence, implicating 538 genes, using both SNP-based and gene-based GWAS. We found evidence that neurogenesis and myelination-as well as genes expressed in the synapse, and those involved in the regulation of the nervous system-may explain some of the biological differences in intelligence. The results of our combined analysis demonstrated the same pattern of genetic correlations as those from previous GWASs of intelligence, providing support for the meta-analysis of these genetically-related phenotypes.
RAMONA: a Web application for gene set analysis on multilevel omics data.

PubMed

Sass, Steffen; Buettner, Florian; Mueller, Nikola S; Theis, Fabian J

2015-01-01

Decreasing costs of modern high-throughput experiments allow for the simultaneous analysis of altered gene activity on various molecular levels. However, these multi-omics approaches lead to a large amount of data, which is hard to interpret for a non-bioinformatician. Here, we present the remotely accessible multilevel ontology analysis (RAMONA). It offers an easy-to-use interface for the simultaneous gene set analysis of combined omics datasets and is an extension of the previously introduced MONA approach. RAMONA is based on a Bayesian enrichment method for the inference of overrepresented biological processes among given gene sets. Overrepresentation is quantified by interpretable term probabilities. It is able to handle data from various molecular levels, while in parallel coping with redundancies arising from gene set overlaps and related multiple testing problems. The comprehensive output of RAMONA is easy to interpret and thus allows for functional insight into the affected biological processes. With RAMONA, we provide an efficient implementation of the Bayesian inference problem such that ontologies consisting of thousands of terms can be processed in the order of seconds. RAMONA is implemented as ASP.NET Web application and publicly available at http://icb.helmholtz-muenchen.de/ramona. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Identification of gene expression profiles and key genes in subchondral bone of osteoarthritis using weighted gene coexpression network analysis.

PubMed

Guo, Sheng-Min; Wang, Jian-Xiong; Li, Jin; Xu, Fang-Yuan; Wei, Quan; Wang, Hai-Ming; Huang, Hou-Qiang; Zheng, Si-Lin; Xie, Yu-Jie; Zhang, Chi

2018-06-15

Osteoarthritis (OA) significantly influences the quality life of people around the world. It is urgent to find an effective way to understand the genetic etiology of OA. We used weighted gene coexpression network analysis (WGCNA) to explore the key genes involved in the subchondral bone pathological process of OA. Fifty gene expression profiles of GSE51588 were downloaded from the Gene Expression Omnibus database. The OA-associated genes and gene ontologies were acquired from JuniorDoc. Weighted gene coexpression network analysis was used to find disease-related networks based on 21756 gene expression correlation coefficients, hub-genes with the highest connectivity in each module were selected, and the correlation between module eigengene and clinical traits was calculated. The genes in the traits-related gene coexpression modules were subject to functional annotation and pathway enrichment analysis using ClusterProfiler. A total of 73 gene modules were identified, of which, 12 modules were found with high connectivity with clinical traits. Five modules were found with enriched OA-associated genes. Moreover, 310 OA-associated genes were found, and 34 of them were among hub-genes in each module. Consequently, enrichment results indicated some key metabolic pathways, such as extracellular matrix (ECM)-receptor interaction (hsa04512), focal adhesion (hsa04510), the phosphatidylinositol 3'-kinase (PI3K)-Akt signaling pathway (PI3K-AKT) (hsa04151), transforming growth factor beta pathway, and Wnt pathway. We intended to identify some core genes, collagen (COL)6A3, COL6A1, ITGA11, BAMBI, and HCK, which could influence downstream signaling pathways once they were activated. In this study, we identified important genes within key coexpression modules, which associate with a pathological process of subchondral bone in OA. Functional analysis results could provide important information to understand the mechanism of OA. © 2018 Wiley Periodicals, Inc.
Nonviral vectors for cancer gene therapy: prospects for integrating vectors and combination therapies.

PubMed

Ohlfest, John R; Freese, Andrew B; Largaespada, David A

2005-12-01

Gene therapy has the potential to improve the clinical outcome of many cancers by transferring therapeutic genes into tumor cells or normal host tissue. Gene transfer into tumor cells or tumor-associated stroma is being employed to induce tumor cell death, stimulate anti-tumor immune response, inhibit angiogenesis, and control tumor cell growth. Viral vectors have been used to achieve this proof of principle in animal models and, in select cases, in human clinical trials. Nevertheless, there has been considerable interest in developing nonviral vectors for cancer gene therapy. Nonviral vectors are simpler, more amenable to large-scale manufacture, and potentially safer for clinical use. Nonviral vectors were once limited by low gene transfer efficiency and transient or steadily declining gene expression. However, recent improvements in plasmid-based vectors and delivery methods are showing promise in circumventing these obstacles. This article reviews the current status of nonviral cancer gene therapy, with an emphasis on combination strategies, long-term gene transfer using transposons and bacteriophage integrases, and future directions.
MALDI-TOF mass spectrometry for quantitative gene expression analysis of acid responses in Staphylococcus aureus.

PubMed

Rode, Tone Mari; Berget, Ingunn; Langsrud, Solveig; Møretrø, Trond; Holck, Askild

2009-07-01

Microorganisms are constantly exposed to new and altered growth conditions, and respond by changing gene expression patterns. Several methods for studying gene expression exist. During the last decade, the analysis of microarrays has been one of the most common approaches applied for large scale gene expression studies. A relatively new method for gene expression analysis is MassARRAY, which combines real competitive-PCR and MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry. In contrast to microarray methods, MassARRAY technology is suitable for analysing a larger number of samples, though for a smaller set of genes. In this study we compare the results from MassARRAY with microarrays on gene expression responses of Staphylococcus aureus exposed to acid stress at pH 4.5. RNA isolated from the same stress experiments was analysed using both the MassARRAY and the microarray methods. The MassARRAY and microarray methods showed good correlation. Both MassARRAY and microarray estimated somewhat lower fold changes compared with quantitative real-time PCR (qRT-PCR). The results confirmed the up-regulation of the urease genes in acidic environments, and also indicated the importance of metal ion regulation. This study shows that the MassARRAY technology is suitable for gene expression analysis in prokaryotes, and has advantages when a set of genes is being analysed for an organism exposed to many different environmental conditions.

Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

PubMed

Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

2015-11-01

Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.
Genetic analysis of the ADGF multigene family by homologous recombination and gene conversion in Drosophila.

PubMed

Dolezal, Tomas; Gazi, Michal; Zurovec, Michal; Bryant, Peter J

2003-10-01

Many Drosophila genes exist as members of multigene families and within each family the members can be functionally redundant, making it difficult to identify them by classical mutagenesis techniques based on phenotypic screening. We have addressed this problem in a genetic analysis of a novel family of six adenosine deaminase-related growth factors (ADGFs). We used ends-in targeting to introduce mutations into five of the six ADGF genes, taking advantage of the fact that five of the family members are encoded by a three-gene cluster and a two-gene cluster. We used two targeting constructs to introduce loss-of-function mutations into all five genes, as well as to isolate different combinations of multiple mutations, independent of phenotypic consequences. The results show that (1) it is possible to use ends-in targeting to disrupt gene clusters; (2) gene conversion, which is usually considered a complication in gene targeting, can be used to help recover different mutant combinations in a single screening procedure; (3) the reduction of duplication to a single copy by induction of a double-strand break is better explained by the single-strand annealing mechanism than by simple crossing over between repeats; and (4) loss of function of the most abundantly expressed family member (ADGF-A) leads to disintegration of the fat body and the development of melanotic tumors in mutant larvae.
A single center analysis of nucleophosmin in acute myeloid leukemia: value of combining immunohistochemistry with molecular mutation analysis.

PubMed

Woolthuis, Carolien M; Mulder, André B; Verkaik-Schakel, Rikst Nynke; Rosati, Stefano; Diepstra, Arjan; van den Berg, Eva; Schuringa, Jan Jacob; Vellenga, Edo; Kluin, Philip M; Huls, Gerwin

2013-10-01

Mutations of nucleophosmin 1 are frequently found in acute myeloid leukemia and lead to aberrant cytoplasmic accumulation of nucleophosmin protein. Immunohistochemical staining is therefore recommended as the technique of choice in front-line screening. In this study, we assessed the sensitivity and specificity of immunohistochemistry on formalin-fixed bone marrow biopsies compared with gold standard molecular analysis to predict nucleophosmin 1 mutation status in 119 patients with acute myeloid leukemia. Discrepant cases were further characterized by gene expression analyses and fluorescence in situ hybridization. A large overlap between both methods was observed. Nevertheless, nine patients demonstrated discordant results at initial screening. Five cases demonstrated nuclear staining of nucleophosmin 1 by immunohistochemistry, but a nucleophosmin 1 mutation by molecular analysis. In two cases this could be attributed to technical issues and in three cases minor subpopulations of myeloblasts had not been discovered initially. All tested cases exhibited the characteristic nucleophosmin-mutated gene expression pattern. Four cases had cytoplasmic nucleophosmin 1 staining and a nucleophosmin-mutated gene expression pattern without a detectable nucleophosmin 1 mutation. In two of these cases we found the chromosomal translocation t(3;5)(q25;q35) encoding the NPM-MLF1 fusion protein. In the other discrepant cases the aberrant cytoplasmic nucleophosmin staining and gene expression could not be explained. In total six patients (5%) had true discordant results between immunohistochemistry and mutation analysis. We conclude that cytoplasmic nucleophosmin localization is not always caused by a conventional nucleophosmin 1 mutation and that in the screening for nucleophosmin 1 abnormalities, most information will be obtained by combining immunohistochemistry with molecular analysis.
A single center analysis of nucleophosmin in acute myeloid leukemia: value of combining immunohistochemistry with molecular mutation analysis

PubMed Central

Woolthuis, Carolien M.; Mulder, André B.; Verkaik-Schakel, Rikst Nynke; Rosati, Stefano; Diepstra, Arjan; van den Berg, Eva; Schuringa, Jan Jacob; Vellenga, Edo; Kluin, Philip M.; Huls, Gerwin

2013-01-01

Mutations of nucleophosmin 1 are frequently found in acute myeloid leukemia and lead to aberrant cytoplasmic accumulation of nucleophosmin protein. Immunohistochemical staining is therefore recommended as the technique of choice in front-line screening. In this study, we assessed the sensitivity and specificity of immunohistochemistry on formalin-fixed bone marrow biopsies compared with gold standard molecular analysis to predict nucleophosmin 1 mutation status in 119 patients with acute myeloid leukemia. Discrepant cases were further characterized by gene expression analyses and fluorescence in situ hybridization. A large overlap between both methods was observed. Nevertheless, nine patients demonstrated discordant results at initial screening. Five cases demonstrated nuclear staining of nucleophosmin 1 by immunohistochemistry, but a nucleophosmin 1 mutation by molecular analysis. In two cases this could be attributed to technical issues and in three cases minor subpopulations of myeloblasts had not been discovered initially. All tested cases exhibited the characteristic nucleophosmin-mutated gene expression pattern. Four cases had cytoplasmic nucleophosmin 1 staining and a nucleophosmin-mutated gene expression pattern without a detectable nucleophosmin 1 mutation. In two of these cases we found the chromosomal translocation t(3;5)(q25;q35) encoding the NPM-MLF1 fusion protein. In the other discrepant cases the aberrant cytoplasmic nucleophosmin staining and gene expression could not be explained. In total six patients (5%) had true discordant results between immunohistochemistry and mutation analysis. We conclude that cytoplasmic nucleophosmin localization is not always caused by a conventional nucleophosmin 1 mutation and that in the screening for nucleophosmin 1 abnormalities, most information will be obtained by combining immunohistochemistry with molecular analysis. PMID:23716555
Model-based gene set analysis for Bioconductor.

PubMed

Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien

2011-07-01

Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.
GSCALite: A Web Server for Gene Set Cancer Analysis.

PubMed

Liu, Chun-Jie; Hu, Fei-Fei; Xia, Mengxuan; Han, Leng; Zhang, Qiong; Guo, An-Yuan

2018-05-22

The availability of cancer genomic data makes it possible to analyze genes related to cancer. Cancer is usually the result of a set of genes and the signal of a single gene could be covered by background noise. Here, we present a web server named Gene Set Cancer Analysis (GSCALite) to analyze a set of genes in cancers with the following functional modules. (i) Differential expression in tumor vs normal, and the survival analysis; (ii) Genomic variations and their survival analysis; (iii) Gene expression associated cancer pathway activity; (iv) miRNA regulatory network for genes; (v) Drug sensitivity for genes; (vi) Normal tissue expression and eQTL for genes. GSCALite is a user-friendly web server for dynamic analysis and visualization of gene set in cancer and drug sensitivity correlation, which will be of broad utilities to cancer researchers. GSCALite is available on http://bioinfo.life.hust.edu.cn/web/GSCALite/. guoay@hust.edu.cn or zhangqiong@hust.edu.cn. Supplementary data are available at Bioinformatics online.
Predicting gene regulatory networks by combining spatial and temporal gene expression data in Arabidopsis root stem cells

PubMed Central

de Luis Balaguer, Maria Angels; Fisher, Adam P.; Clark, Natalie M.; Fernandez-Espinosa, Maria Guadalupe; Möller, Barbara K.; Weijers, Dolf; Williams, Cranos; Lorenzo, Oscar; Sozzani, Rosangela

2017-01-01

Identifying the transcription factors (TFs) and associated networks involved in stem cell regulation is essential for understanding the initiation and growth of plant tissues and organs. Although many TFs have been shown to have a role in the Arabidopsis root stem cells, a comprehensive view of the transcriptional signature of the stem cells is lacking. In this work, we used spatial and temporal transcriptomic data to predict interactions among the genes involved in stem cell regulation. To accomplish this, we transcriptionally profiled several stem cell populations and developed a gene regulatory network inference algorithm that combines clustering with dynamic Bayesian network inference. We leveraged the topology of our networks to infer potential major regulators. Specifically, through mathematical modeling and experimental validation, we identified PERIANTHIA (PAN) as an important molecular regulator of quiescent center function. The results presented in this work show that our combination of molecular biology, computational biology, and mathematical modeling is an efficient approach to identify candidate factors that function in the stem cells. PMID:28827319
Combining R gene and quantitative resistance increases effectiveness of cultivar resistance against Leptosphaeria maculans in Brassica napus in different environments

PubMed Central

Mitrousia, Georgia K.; Sidique, Siti Nordahliawate M.; Qi, Aiming; Fitt, Bruce D. L.

2018-01-01

Using cultivar resistance against pathogens is one of the most economical and environmentally friendly methods for control of crop diseases. However, cultivar resistance can be easily rendered ineffective due to changes in pathogen populations or environments. To test the hypothesis that combining R gene-mediated resistance and quantitative resistance (QR) in one cultivar can provide more effective resistance than use of either type of resistance on its own, effectiveness of resistance in eight oilseed rape (Brassica napus) cultivars with different R genes and/or QR against Leptosphaeria maculans (phoma stem canker) was investigated in 13 different environments/sites over three growing seasons (2010/2011, 2011/2012 and 2012/2013). Cultivar Drakkar with no R genes and no QR was used as susceptible control and for sampling L. maculans populations. Isolates of L. maculans were obtained from the 13 sites in 2010/2011 to assess frequencies of avirulent alleles of different effector genes (AvrLm1, AvrLm4 or AvrLm7) corresponding to the resistance genes (Rlm1, Rlm4 or Rlm7) used in the field experiments. Results of field experiments showed that cultivars DK Cabernet (Rlm1 + QR) and Adriana (Rlm4 + QR) had significantly less severe phoma stem canker than cultivars Capitol (Rlm1) and Bilbao (Rlm4), respectively. Results of controlled environment experiments confirmed the presence of Rlm genes and/or QR in these four cultivars. Analysis of L. maculans populations from different sites showed that the mean frequencies of AvrLm1 (10%) and AvrLm4 (41%) were less than that of AvrLm7 (100%), suggesting that Rlm1 and Rlm4 gene-mediated resistances were partially rendered ineffective while Rlm7 resistance was still effective. Cultivar Excel (Rlm7 + QR) had less severe canker than cultivar Roxet (Rlm7), but the difference between them was not significant due to influence of the effective resistance gene Rlm7. For the two cultivars with only QR, Es-Astrid (QR) had less severe stem
Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.

PubMed

Tintle, Nathan L; Sitarik, Alexandra; Boerema, Benjamin; Young, Kylie; Best, Aaron A; Dejongh, Matthew

2012-08-08

Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.
Toxicogenomic analysis in the combined effect of tributyltin and benzo[a]pyrene on the development of zebrafish embryos.

PubMed

Huang, Lixing; Zuo, Zhenghong; Zhang, Youyu; Wang, Chonggang

2015-01-01

There is a growing recognition that the toxic effects of chemical mixtures are been an important issue in toxicological sciences. Tributyltin (TBT) and benzo[a]pyrene (BaP) are widespread pollutants that occur simultaneously in the aquatic environments. This study was designed to examine comprehensively the combined effects of TBT and BaP on zebrafish (Danio rerio) embryos using toxicogenomic approach combined with biochemical detection and morphological analysis, and tried to gain insight into the mechanisms underlying the combined effects of TBT and BaP. The results of toxicogenomic data indicated that: (1) TBT cotreatment rescued the embryos from decreased hatching ratio caused by BaP alone, while the alteration of gene expression (in this article the phrase gene expression is used as a synonym to gene transcription, although in is acknowledged that gene expression can also be regulated by, e.g., translation and mRNA or protein stability) relative to zebrafish hatching in the BaP groups was resumed by the cotreatment with TBT; (2) BaP cotreatment decreased TBT-mediated dorsal curvature, and alleviated the perturbation of Notch pathway caused by TBT alone; (3) cotreatment with TBT decreased BaP-mediated bradycardia, which might be due to that TBT cotreatment alleviated the perturbation in expression of genes related to cardiac muscle cell development and calcium handling caused by BaP alone; 4) TBT cotreatment brought an antagonistic effect on the BaP-mediated oxidative stress and DNA damage. These results suggested that toxicogenomic approach was available for analyzing combined toxicity with high sensitivity and accuracy, which might improve our understanding and predictability for the combined effects of chemicals. Copyright © 2014 Elsevier B.V. All rights reserved.
Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool

PubMed Central

Clark, Neil R.; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D.; Jones, Matthew R.; Ma’ayan, Avi

2016-01-01

Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community. PMID:26848405
Combining Cytotoxic and Immune-Mediated Gene Therapy to Treat Brain Tumors

PubMed Central

Curtin, James F.; King, Gwendalyn D.; Candolfi, Marianela; Greeno, Remy B.; Kroeger, Kurt M.; Lowenstein, Pedro R.; Castro, Maria G.

2006-01-01

Glioblastoma (GBM) is a type of intracranial brain tumor, for which there is no cure. In spite of advances in surgery, chemotherapy and radiotherapy, patients die within a year of diagnosis. Therefore, there is a critical need to develop novel therapeutic approaches for this disease. Gene therapy, which is the use of genes or other nucleic acids as drugs, is a powerful new treatment strategy which can be developed to treat GBM. Several treatment modalities are amenable for gene therapy implementation, e.g. conditional cytotoxic approaches, targeted delivery of toxins into the tumor mass, immune stimulatory strategies, and these will all be the focus of this review. Both conditional cytotoxicity and targeted toxin mediated tumor death, are aimed at eliminating an established tumor mass and preventing further growth. Tumors employ several defensive strategies that suppress and inhibit anti-tumor immune responses. A better understanding of the mechanisms involved in eliciting anti-tumor immune responses has identified promising targets for immunotherapy. Immunotherapy is designed to aid the immune system to recognize and destroy tumor cells in order to eliminate the tumor burden. Also, immune-therapeutic strategies have the added advantage that an activated immune system has the capability of recognizing tumor cells at distant sites from the primary tumor, therefore targeting metastasis distant from the primary tumor locale. Pre-clinical models and clinical trials have demonstrated that in spite of their location within the central nervous system (CNS), a tissue described as ‘immune privileged’, brain tumors can be effectively targeted by the activated immune system following various immunotherapeutic strategies. This review will highlight recent advances in brain tumor immunotherapy, with particular emphasis on advances made using gene therapy strategies, as well as reviewing other novel therapies that can be used in combination with immunotherapy. Another
Treatment of Diabetic Mice with a Combination of Ketogenic Diet and Aerobic Exercise via Modulations of PPARs Gene Programs

PubMed Central

Xu, Lingyan; Xia, Jie; Wang, Dongmei; Qian, Min

2018-01-01

Type 2 diabetes is a prevalent chronic disease arising as a serious public health problem worldwide. Diet intervention is considered to be a critical strategy in glycemic control of diabetic patients. Recently, the low-carbohydrate ketogenic diet is shown to be effective in glycemic control and weight loss. However, hepatic lipid accumulation could be observed in mice treated with ketogenic diet. On the other hand, exercise is a well-known approach for treating nonalcoholic fatty liver disease. We thus hypothesize that the combination of ketogenic diet and exercise could improve insulin sensitivity, while minimizing adverse effect of hepatic steatosis. In order to test this hypothesis, we established diabetic mice model with streptozotocin (STZ) and divided them into control group, ketogenic diet group, and ketogenic diet with aerobic exercise group. We found that after six weeks of intervention, mice treated with ketogenic diet and ketogenic diet combined with exercise both have lower body weights, HbAlc level, HOMA index, and improvements in insulin sensitivity, compared with diabetes group. In addition, mice in ketogenic diet intervention exhibited hepatic steatosis shown by serum and hepatic parameters, as well as histochemistry staining in the liver, which could be largely relieved by exercise. Furthermore, gene analysis revealed that ketogenic diet in combination with exercise reduced PPARγ and lipid synthetic genes, as well as enhancing PPARα and lipid β-oxidation gene program in the liver compared to those in ketogenic diet without exercise. Overall, the present study demonstrated that the combination of ketogenic diet and a moderate-intensity aerobic exercise intervention improved insulin sensitivity in diabetic mice, while avoiding hepatic steatosis, which provided a novel strategy in the combat of diabetes. PMID:29743883
Treatment of Diabetic Mice with a Combination of Ketogenic Diet and Aerobic Exercise via Modulations of PPARs Gene Programs.

PubMed

Zhang, Qiang; Xu, Lingyan; Xia, Jie; Wang, Dongmei; Qian, Min; Ding, Shuzhe

2018-01-01

Type 2 diabetes is a prevalent chronic disease arising as a serious public health problem worldwide. Diet intervention is considered to be a critical strategy in glycemic control of diabetic patients. Recently, the low-carbohydrate ketogenic diet is shown to be effective in glycemic control and weight loss. However, hepatic lipid accumulation could be observed in mice treated with ketogenic diet. On the other hand, exercise is a well-known approach for treating nonalcoholic fatty liver disease. We thus hypothesize that the combination of ketogenic diet and exercise could improve insulin sensitivity, while minimizing adverse effect of hepatic steatosis. In order to test this hypothesis, we established diabetic mice model with streptozotocin (STZ) and divided them into control group, ketogenic diet group, and ketogenic diet with aerobic exercise group. We found that after six weeks of intervention, mice treated with ketogenic diet and ketogenic diet combined with exercise both have lower body weights, HbAlc level, HOMA index, and improvements in insulin sensitivity, compared with diabetes group. In addition, mice in ketogenic diet intervention exhibited hepatic steatosis shown by serum and hepatic parameters, as well as histochemistry staining in the liver, which could be largely relieved by exercise. Furthermore, gene analysis revealed that ketogenic diet in combination with exercise reduced PPAR γ and lipid synthetic genes, as well as enhancing PPAR α and lipid β -oxidation gene program in the liver compared to those in ketogenic diet without exercise. Overall, the present study demonstrated that the combination of ketogenic diet and a moderate-intensity aerobic exercise intervention improved insulin sensitivity in diabetic mice, while avoiding hepatic steatosis, which provided a novel strategy in the combat of diabetes.
Graphite Web: web tool for gene set analysis exploiting pathway topology

PubMed Central

Sales, Gabriele; Calura, Enrica; Martini, Paolo; Romualdi, Chiara

2013-01-01

Graphite web is a novel web tool for pathway analyses and network visualization for gene expression data of both microarray and RNA-seq experiments. Several pathway analyses have been proposed either in the univariate or in the global and multivariate context to tackle the complexity and the interpretation of expression results. These methods can be further divided into ‘topological’ and ‘non-topological’ methods according to their ability to gain power from pathway topology. Biological pathways are, in fact, not only gene lists but can be represented through a network where genes and connections are, respectively, nodes and edges. To this day, the most used approaches are non-topological and univariate although they miss the relationship among genes. On the contrary, topological and multivariate approaches are more powerful, but difficult to be used by researchers without bioinformatic skills. Here we present Graphite web, the first public web server for pathway analysis on gene expression data that combines topological and multivariate pathway analyses with an efficient system of interactive network visualizations for easy results interpretation. Specifically, Graphite web implements five different gene set analyses on three model organisms and two pathway databases. Graphite Web is freely available at http://graphiteweb.bio.unipd.it/. PMID:23666626
Human microRNA target analysis and gene ontology clustering by GOmir, a novel stand-alone application

PubMed Central

Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia

2009-01-01

Background microRNAs (miRNAs) are single-stranded RNA molecules of about 20–23 nucleotides length found in a wide variety of organisms. miRNAs regulate gene expression, by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. Results GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from TargetScan, miRanda, RNAhybrid and PicTar computational tools as well as the experimentally supported targets from TarBase and also providing a full gene description and functional analysis for each target gene. On the other hand, TAGGO application is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir represents a new tool incorporating two separate Java applications integrated into one stand-alone Java application. Conclusion GOmir (by using up to five different databases) introduces miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can freely be downloaded BRFAA. PMID:19534746
Human microRNA target analysis and gene ontology clustering by GOmir, a novel stand-alone application.

PubMed

Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia

2009-06-16

microRNAs (miRNAs) are single-stranded RNA molecules of about 20-23 nucleotides length found in a wide variety of organisms. miRNAs regulate gene expression, by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from TargetScan, miRanda, RNAhybrid and PicTar computational tools as well as the experimentally supported targets from TarBase and also providing a full gene description and functional analysis for each target gene. On the other hand, TAGGO application is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir represents a new tool incorporating two separate Java applications integrated into one stand-alone Java application. GOmir (by using up to five different databases) introduces miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can freely be downloaded BRFAA.
MAVTgsa: An R Package for Gene Set (Enrichment) Analysis

DOE PAGES

Chien, Chih-Yi; Chang, Ching-Wei; Tsai, Chen-An; ...

2014-01-01

Gene semore » t analysis methods aim to determine whether an a priori defined set of genes shows statistically significant difference in expression on either categorical or continuous outcomes. Although many methods for gene set analysis have been proposed, a systematic analysis tool for identification of different types of gene set significance modules has not been developed previously. This work presents an R package, called MAVTgsa, which includes three different methods for integrated gene set enrichment analysis. (1) The one-sided OLS (ordinary least squares) test detects coordinated changes of genes in gene set in one direction, either up- or downregulation. (2) The two-sided MANOVA (multivariate analysis variance) detects changes both up- and downregulation for studying two or more experimental conditions. (3) A random forests-based procedure is to identify gene sets that can accurately predict samples from different experimental conditions or are associated with the continuous phenotypes. MAVTgsa computes the P values and FDR (false discovery rate) q -value for all gene sets in the study. Furthermore, MAVTgsa provides several visualization outputs to support and interpret the enrichment results. This package is available online.« less
[Differential gene expression in incompatible interaction between Lilium regale Wilson and Fusarium oxysporum f. sp. lilii revealed by combined SSH and microarray analysis].

PubMed

Rao, J; Liu, D; Zhang, N; He, H; Ge, F; Chen, C

2014-01-01

Fusarium wilt, caused by a soilborne pathogen Fusarium oxysporum f. sp. lilii, is the major disease of lily (Lilium L.). In order to isolate the genes differentially expressed in a resistant reaction to F. oxysporum in L. regale Wilson, a cDNA library was constructed with L. regale root during F. oxysporum infection using the suppression subtractive hybridization (SSH), and a total of 585 unique expressed sequence tags (ESTs) were obtained. Furthermore, the gene expression profiles in the incompatible interaction between L. regale and F. oxysporum were revealed by oligonucleotide microarray analysis of 585 unique ESTs comparison to the compatible interaction between a susceptible Lilium Oriental Hybrid 'Siberia' and F. oxysporum. The result of expression profile analysis indicated that the genes encoding pathogenesis-related proteins (PRs), antioxidative stress enzymes, secondary metabolism enzymes, transcription factors, signal transduction proteins as well as a large number of unknown genes were involved in early defense response of L. regale to F. oxysporum infection. Moreover, the following quantitative reverse transcription PCR (QRT-PCR) analysis confirmed reliability of the oligonucleotide microarray data. In the present study, isolation of differentially expressed genes in L. regale during response to F. oxysporum helped to uncover the molecular mechanism associated with the resistance of L. regale against F. oxysporum.
Separate enrichment analysis of pathways for up- and downregulated genes.

PubMed

Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng

2014-03-06

Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.

Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

PubMed

Guo, Yong; Qiu, Li-Juan

2013-01-01

The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.
Gene-based rare allele analysis identified a risk gene of Alzheimer's disease.

PubMed

Kim, Jong Hun; Song, Pamela; Lim, Hyunsun; Lee, Jae-Hyung; Lee, Jun Hong; Park, Sun Ah

2014-01-01

Alzheimer's disease (AD) has a strong propensity to run in families. However, the known risk genes excluding APOE are not clinically useful. In various complex diseases, gene studies have targeted rare alleles for unsolved heritability. Our study aims to elucidate previously unknown risk genes for AD by targeting rare alleles. We used data from five publicly available genetic studies from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and the database of Genotypes and Phenotypes (dbGaP). A total of 4,171 cases and 9,358 controls were included. The genotype information of rare alleles was imputed using 1,000 genomes. We performed gene-based analysis of rare alleles (minor allele frequency≤3%). The genome-wide significance level was defined as meta P<1.8×10(-6) (0.05/number of genes in human genome = 0.05/28,517). ZNF628, which is located at chromosome 19q13.42, showed a genome-wide significant association with AD. The association of ZNF628 with AD was not dependent on APOE ε4. APOE and TREM2 were also significantly associated with AD, although not at genome-wide significance levels. Other genes identified by targeting common alleles could not be replicated in our gene-based rare allele analysis. We identified that rare variants in ZNF628 are associated with AD. The protein encoded by ZNF628 is known as a transcription factor. Furthermore, the associations of APOE and TREM2 with AD were highly significant, even in gene-based rare allele analysis, which implies that further deep sequencing of these genes is required in AD heritability studies.
The Use of a Combined Bioinformatics Approach to Locate Antibiotic Resistance Genes on Plasmids From Whole Genome Sequences of Salmonella enterica Serovars From Humans in Ghana.

PubMed

Kudirkiene, Egle; Andoh, Linda A; Ahmed, Shahana; Herrero-Fresno, Ana; Dalsgaard, Anders; Obiri-Danso, Kwasi; Olsen, John E

2018-01-01

In the current study, we identified plasmids carrying antimicrobial resistance genes in draft whole genome sequences of 16 selected Salmonella enterica isolates representing six different serovars from humans in Ghana. The plasmids and the location of resistance genes in the genomes were predicted using a combination of PlasmidFinder, ResFinder, plasmidSPAdes and BLAST genomic analysis tools. Subsequently, S1-PFGE was employed for analysis of plasmid profiles. Whole genome sequencing confirmed the presence of antimicrobial resistance genes in Salmonella isolates showing multidrug resistance phenotypically. ESBL, either bla TEM52-B or bla CTX-M15 were present in two cephalosporin resistant isolates of S . Virchow and S . Poona, respectively. The systematic genome analysis revealed the presence of different plasmids in different serovars, with or without insertion of antimicrobial resistance genes. In S . Enteritidis, resistance genes were carried predominantly on plasmids of IncN type, in S . Typhimurium on plasmids of IncFII(S)/IncFIB(S)/IncQ1 type. In S . Virchow and in S . Poona, resistance genes were detected on plasmids of IncX1 and TrfA/IncHI2/IncHI2A type, respectively. The latter two plasmids were described for the first time in these serovars. The combination of genomic analytical tools allowed nearly full mapping of the resistance plasmids in all Salmonella strains analyzed. The results suggest that the improved analytical approach used in the current study may be used to identify plasmids that are specifically associated with resistance phenotypes in whole genome sequences. Such knowledge would allow the development of rapid multidrug resistance tracking tools in Salmonella populations using WGS.
[Analysis of horizontal transfer gene of Bombyx mori NPV].

PubMed

Duan, Hai-Rong; Qiu, De-Bin; Gong, Cheng-Liang; Huang, Mo-Li

2011-06-01

For research on genetic characters and evolutionary origin of the genome of baculoviruses, a comprehensive homology search and phylogenetic analysis of the complete genomes of Bombyx mori NPV and Bombyx mori were used. Three horizontally transferred genes (inhibitor of apoptosis, chitinase, and UDP-glucosyltransferase) were identified, and there was evidence that all of these genes were derived from the insect host. The results of analysis showed lots of differences between the features of horizontal transferred genes and the ones of whole genomic genes, such as nucleotide composition, codon usagebias and selection pressure. These results reconfirmed that the horizontally transferred genes are exogenous. The analysis of gene function suggested that horizontally transferred genes acquired from an ancestral host insect can increase the efficiency of baculoviruses transmission.
Combined magnetic and gravity analysis

NASA Technical Reports Server (NTRS)

Hinze, W. J.; Braile, L. W.; Chandler, V. W.; Mazella, F. E.

1975-01-01

Efforts are made to identify methods of decreasing magnetic interpretation ambiguity by combined gravity and magnetic analysis, to evaluate these techniques in a preliminary manner, to consider the geologic and geophysical implications of correlation, and to recommend a course of action to evaluate methods of correlating gravity and magnetic anomalies. The major thrust of the study was a search and review of the literature. The literature of geophysics, geology, geography, and statistics was searched for articles dealing with spatial correlation of independent variables. An annotated bibliography referencing the Germane articles and books is presented. The methods of combined gravity and magnetic analysis techniques are identified and reviewed. A more comprehensive evaluation of two types of techniques is presented. Internal correspondence of anomaly amplitudes is examined and a combined analysis is done utilizing Poisson's theorem. The geologic and geophysical implications of gravity and magnetic correlation based on both theoretical and empirical relationships are discussed.
Combined DNA methylation and gene expression profiling in gastrointestinal stromal tumors reveals hypomethylation of SPP1 as an independent prognostic factor.

PubMed

Haller, Florian; Zhang, Jitao David; Moskalev, Evgeny A; Braun, Alexander; Otto, Claudia; Geddert, Helene; Riazalhosseini, Yasser; Ward, Aoife; Balwierz, Aleksandra; Schaefer, Inga-Marie; Cameron, Silke; Ghadimi, B Michael; Agaimy, Abbas; Fletcher, Jonathan A; Hoheisel, Jörg; Hartmann, Arndt; Werner, Martin; Wiemann, Stefan; Sahin, Ozgür

2015-03-01

Gastrointestinal stromal tumors (GISTs) have distinct gene expression patterns according to localization, genotype and aggressiveness. DNA methylation at CpG dinucleotides is an important mechanism for regulation of gene expression. We performed targeted DNA methylation analysis of 1.505 CpG loci in 807 cancer-related genes in a cohort of 76 GISTs, combined with genome-wide mRNA expression analysis in 22 GISTs, to identify signatures associated with clinicopathological parameters and prognosis. Principal component analysis revealed distinct DNA methylation patterns associated with anatomical localization, genotype, mitotic counts and clinical follow-up. Methylation of a single CpG dinucleotide in the non-CpG island promoter of SPP1 was significantly correlated with shorter disease-free survival. Hypomethylation of this CpG was an independent prognostic parameter in a multivariate analysis compared to anatomical localization, genotype, tumor size and mitotic counts in a cohort of 141 GISTs with clinical follow-up. The epigenetic regulation of SPP1 was confirmed in vitro, and the functional impact of SPP1 protein on tumorigenesis-related signaling pathways was demonstrated. In summary, SPP1 promoter methylation is a novel and independent prognostic parameter in GISTs, and might be helpful in estimating the aggressiveness of GISTs from the intermediate-risk category. © 2014 UICC.
Prioritization of candidate disease genes by combining topological similarity and semantic similarity.

PubMed

Liu, Bin; Jin, Min; Zeng, Pan

2015-10-01

The identification of gene-phenotype relationships is very important for the treatment of human diseases. Studies have shown that genes causing the same or similar phenotypes tend to interact with each other in a protein-protein interaction (PPI) network. Thus, many identification methods based on the PPI network model have achieved good results. However, in the PPI network, some interactions between the proteins encoded by candidate gene and the proteins encoded by known disease genes are very weak. Therefore, some studies have combined the PPI network with other genomic information and reported good predictive performances. However, we believe that the results could be further improved. In this paper, we propose a new method that uses the semantic similarity between the candidate gene and known disease genes to set the initial probability vector of a random walk with a restart algorithm in a human PPI network. The effectiveness of our method was demonstrated by leave-one-out cross-validation, and the experimental results indicated that our method outperformed other methods. Additionally, our method can predict new causative genes of multifactor diseases, including Parkinson's disease, breast cancer and obesity. The top predictions were good and consistent with the findings in the literature, which further illustrates the effectiveness of our method. Copyright © 2015 Elsevier Inc. All rights reserved.
Global gene expression analysis combined with a genomics approach for the identification of signal transduction networks involved in postnatal mouse myocardial proliferation and development.

PubMed

Wang, Ruoxin; Su, Chao; Wang, Xinting; Fu, Qiang; Gao, Xingjie; Zhang, Chunyan; Yang, Jie; Yang, Xi; Wei, Minxin

2018-01-01

Mammalian cardiomyocytes may permanently lose their ability to proliferate after birth. Therefore, studying the proliferation and growth arrest of cardiomyocytes during the postnatal period may enhance the current understanding regarding this molecular mechanism. The present study identified the differentially expressed genes in hearts obtained from 24 h‑old mice, which contain proliferative cardiomyocytes; 7‑day‑old mice, in which the cardiomyocytes are undergoing a proliferative burst; and 10‑week‑old mice, which contain growth‑arrested cardiomyocytes, using global gene expression analysis. Furthermore, myocardial proliferation and growth arrest were analyzed from numerous perspectives, including Gene Ontology annotation, cluster analysis, pathway enrichment and network construction. The results of a Gene Ontology analysis indicated that, with increasing age, enriched gene function was not only associated with cell cycle, cell division and mitosis, but was also associated with metabolic processes and protein synthesis. In the pathway analysis, 'cell cycle', proliferation pathways, such as the 'PI3K‑AKT signaling pathway', and 'metabolic pathways' were well represented. Notably, the cluster analysis revealed that bone morphogenetic protein (BMP)1, BMP10, cyclin E2, E2F transcription factor 1 and insulin like growth factor 1 exhibited increased expression in hearts obtained from 7‑day‑old mice. In addition, the signal transduction pathway associated with the cell cycle was identified. The present study primarily focused on genes with altered expression, including downregulated anaphase promoting complex subunit 1, cell division cycle (CDC20), cyclin dependent kinase 1, MYC proto-oncogene, bHLH transcription factor and CDC25C, and upregulated growth arrest and DNA damage inducible α in 10-week group, which may serve important roles in postnatal myocardial cell cycle arrest. In conclusion, these data may provide important information
Combining evidence, biomedical literature and statistical dependence: new insights for functional annotation of gene sets

PubMed Central

Aubry, Marc; Monnier, Annabelle; Chicault, Celine; de Tayrac, Marie; Galibert, Marie-Dominique; Burgun, Anita; Mosser, Jean

2006-01-01

Background Large-scale genomic studies based on transcriptome technologies provide clusters of genes that need to be functionally annotated. The Gene Ontology (GO) implements a controlled vocabulary organised into three hierarchies: cellular components, molecular functions and biological processes. This terminology allows a coherent and consistent description of the knowledge about gene functions. The GO terms related to genes come primarily from semi-automatic annotations made by trained biologists (annotation based on evidence) or text-mining of the published scientific literature (literature profiling). Results We report an original functional annotation method based on a combination of evidence and literature that overcomes the weaknesses and the limitations of each approach. It relies on the Gene Ontology Annotation database (GOA Human) and the PubGene biomedical literature index. We support these annotations with statistically associated GO terms and retrieve associative relations across the three GO hierarchies to emphasise the major pathways involved by a gene cluster. Both annotation methods and associative relations were quantitatively evaluated with a reference set of 7397 genes and a multi-cluster study of 14 clusters. We also validated the biological appropriateness of our hybrid method with the annotation of a single gene (cdc2) and that of a down-regulated cluster of 37 genes identified by a transcriptome study of an in vitro enterocyte differentiation model (CaCo-2 cells). Conclusion The combination of both approaches is more informative than either separate approach: literature mining can enrich an annotation based only on evidence. Text-mining of the literature can also find valuable associated MEDLINE references that confirm the relevance of the annotation. Eventually, GO terms networks can be built with associative relations in order to highlight cooperative and competitive pathways and their connected molecular functions. PMID:16674810
A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis

PubMed Central

Fitzpatrick, David A; Logue, Mary E; Stajich, Jason E; Butler, Geraldine

2006-01-01

Background To date, most fungal phylogenies have been derived from single gene comparisons, or from concatenated alignments of a small number of genes. The increase in fungal genome sequencing presents an opportunity to reconstruct evolutionary events using entire genomes. As a tool for future comparative, phylogenomic and phylogenetic studies, we used both supertrees and concatenated alignments to infer relationships between 42 species of fungi for which complete genome sequences are available. Results A dataset of 345,829 genes was extracted from 42 publicly available fungal genomes. Supertree methods were employed to derive phylogenies from 4,805 single gene families. We found that the average consensus supertree method may suffer from long-branch attraction artifacts, while matrix representation with parsimony (MRP) appears to be immune from these. A genome phylogeny was also reconstructed from a concatenated alignment of 153 universally distributed orthologs. Our MRP supertree and concatenated phylogeny are highly congruent. Within the Ascomycota, the sub-phyla Pezizomycotina and Saccharomycotina were resolved. Both phylogenies infer that the Leotiomycetes are the closest sister group to the Sordariomycetes. There is some ambiguity regarding the placement of Stagonospora nodurum, the sole member of the class Dothideomycetes present in the dataset. Within the Saccharomycotina, a monophyletic clade containing organisms that translate CTG as serine instead of leucine is evident. There is also strong support for two groups within the CTG clade, one containing the fully sexual species Candida lusitaniae, Candida guilliermondii and Debaryomyces hansenii, and the second group containing Candida albicans, Candida dubliniensis, Candida tropicalis, Candida parapsilosis and Lodderomyces elongisporus. The second major clade within the Saccharomycotina contains species whose genomes have undergone a whole genome duplication (WGD), and their close relatives. We could not
Bacteriophage Mediates Efficient Gene Transfer in Combination with Conventional Transfection Reagents

PubMed Central

Donnelly, Amanda; Yata, Teerapong; Bentayebi, Kaoutar; Suwan, Keittisak; Hajitou, Amin

2015-01-01

The development of commercially available transfection reagents for gene transfer applications has revolutionized the field of molecular biology and scientific research. However, the challenge remains in ensuring that they are efficient, safe, reproducible and cost effective. Bacteriophage (phage)-based viral vectors have the potential to be utilized for general gene transfer applications within research and industry. Yet, they require adaptations in order to enable them to efficiently enter cells and overcome mammalian cellular barriers, as they infect bacteria only; furthermore, limited progress has been made at increasing their efficiency. The production of a novel hybrid nanocomplex system consisting of two different nanomaterial systems, phage vectors and conventional transfection reagents, could overcome these limitations. Here we demonstrate that the combination of cationic lipids, cationic polymers or calcium phosphate with M13 bacteriophage-derived vectors, engineered to carry a mammalian transgene cassette, resulted in increased cellular attachment, entry and improved transgene expression in human cells. Moreover, addition of a targeting ligand into the nanocomplex system, through genetic engineering of the phage capsid further increased gene expression and was effective in a stable cell line generation application. Overall, this new hybrid nanocomplex system (i) provides enhanced phage-mediated gene transfer; (ii) is applicable for laboratory transfection processes and (iii) shows promise within industry for large-scale gene transfer applications. PMID:26670247
Bacteriophage Mediates Efficient Gene Transfer in Combination with Conventional Transfection Reagents.

PubMed

Donnelly, Amanda; Yata, Teerapong; Bentayebi, Kaoutar; Suwan, Keittisak; Hajitou, Amin

2015-12-08

The development of commercially available transfection reagents for gene transfer applications has revolutionized the field of molecular biology and scientific research. However, the challenge remains in ensuring that they are efficient, safe, reproducible and cost effective. Bacteriophage (phage)-based viral vectors have the potential to be utilized for general gene transfer applications within research and industry. Yet, they require adaptations in order to enable them to efficiently enter cells and overcome mammalian cellular barriers, as they infect bacteria only; furthermore, limited progress has been made at increasing their efficiency. The production of a novel hybrid nanocomplex system consisting of two different nanomaterial systems, phage vectors and conventional transfection reagents, could overcome these limitations. Here we demonstrate that the combination of cationic lipids, cationic polymers or calcium phosphate with M13 bacteriophage-derived vectors, engineered to carry a mammalian transgene cassette, resulted in increased cellular attachment, entry and improved transgene expression in human cells. Moreover, addition of a targeting ligand into the nanocomplex system, through genetic engineering of the phage capsid further increased gene expression and was effective in a stable cell line generation application. Overall, this new hybrid nanocomplex system (i) provides enhanced phage-mediated gene transfer; (ii) is applicable for laboratory transfection processes and (iii) shows promise within industry for large-scale gene transfer applications.
Rapid Communication: MiR-92a as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.

PubMed

Lai, Y C; Fujikawa, T; Ando, T; Kitahara, G; Koiwa, M; Kubota, C; Miura, N

2017-06-01

Our aim was to identify a suitable microRNA housekeeping gene for real-time PCR analysis of bovine mastitis-related microRNA in milk. We identified , , and as housekeeping gene candidates on the basis of previous Solexa sequencing results. Threshold cycle (CT) values for , , and did not differ between milk from control cows and milk from mastitis-affected cows. NormFinder software identified as the most stable single housekeeping gene. We evaluated the suitability of the housekeeping gene candidates by using them to assess expression levels of the inflammation-related gene . Regardless of the housekeeping gene candidates used for normalization, relative expression levels of were significantly higher in mastitis-affected samples than in control samples. However, of all the housekeeping genes and gene combinations investigated, normalization with alone generated the difference in relative expression between mastitis-affected and control samples with the highest significance. These results suggest that is suitable for use as a housekeeping gene for analysis of bovine mastitis-related microRNA in milk.
Microbubble-assisted p53, RB, and p130 gene transfer in combination with radiation therapy in prostate cancer.

PubMed

Nande, Rounak; Greco, Adelaide; Gossman, Michael S; Lopez, Jeffrey P; Claudio, Luigi; Salvatore, Marco; Brunetti, Arturo; Denvir, James; Howard, Candace M; Claudio, Pier Paolo

2013-06-01

Combining radiation therapy and direct intratumoral (IT) injection of adenoviral vectors has been explored as a means to enhance the therapeutic potential of gene transfer. A major challenge for gene transfer is systemic delivery of nucleic acids directly into an affected tissue. Ultrasound (US) contrast agents (microbubbles) are viable candidates to enhance targeted delivery of systemically administered genes. Here we show that p53, pRB, and p130 gene transfer mediated by US cavitation of microbubbles at the tumor site resulted in targeted gene transduction and increased reduction in tumor growth compared to DU-145 prostate cancer cell xenografts treated intratumorally with adenovirus (Ad) or radiation alone. Microbubble-assisted/US-mediated Ad.p53 and Ad.RB treated tumors showed significant reduction in tumor volume compared to Ad.p130 treated tumors (p<0.05). Additionally, US mediated microbubble delivery of p53 and RB combined with external beam radiation resulted in the most profound tumor reduction in DU-145 xenografted nude mice (p<0.05) compared to radiation alone. These findings highlight the potential therapeutic applications of this novel image-guided gene transfer technology in combination with external beam radiation for prostate cancer patients with therapy resistant disease.
An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods.

PubMed

Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E; Re, Matteo

2014-06-01

In the context of "network medicine", gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different "informativeness" embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both
Dynamic association rules for gene expression data analysis.

PubMed

Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

2015-10-14

The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed
[Familial male-limited precocious puberty due to Asp578His mutations in the LHCGR gene: clinical characteristics and gene analysis in an infant].

PubMed

Wang, Min; Li, Min; Liu, Yue-Sheng; Lei, Si-Min; Xiao, Yan-Feng

2017-11-01

The aim of the study was to provide a descriptive analysis of familial male-limited precocious puberty (FMPP), which is a rare inherited disease caused by heterozygous constitutively activating mutations of the luteinizing hormone/choriogonadotropin receptor gene (LHCGR). The patient was a ten-month-old boy, presenting with penile enlargement, pubic hair formation, and spontaneous erections. Based on the clinical manifestations and laboratory data, including sexual characteristics, serum testosterone levels, GnRH stimulation test, and bone age, this boy was diagnosed with peripheral precocious puberty. Subsequently the precocious puberty-related genes were analyzed by direct DNA sequencing of amplified PCR products from the patient and his parents. Genetic analysis revealed a novel heterozygous missense mutation c.1732G>C (Asp578His) of the LHCGR gene exon11 in the patient, which had never been reported. His parents had no mutations. After combined treatment with aromatase inhibitor letrozole and anti-androgen spironolactone for six months, the patient's symptoms were controlled. The findings in this study expand the mutation spectrum of the LHCGR gene, and provide molecular evidence for the etiologic diagnosis as well as for the genetic counseling and prenatal diagnosis in the family.
Programmable control of bacterial gene expression with the combined CRISPR and antisense RNA system

PubMed Central

Lee, Young Je; Hoynes-O'Connor, Allison; Leong, Matthew C.; Moon, Tae Seok

2016-01-01

A central goal of synthetic biology is to implement diverse cellular functions by predictably controlling gene expression. Though research has focused more on protein regulators than RNA regulators, recent advances in our understanding of RNA folding and functions have motivated the use of RNA regulators. RNA regulators provide an advantage because they are easier to design and engineer than protein regulators, potentially have a lower burden on the cell and are highly orthogonal. Here, we combine the CRISPR system from Streptococcus pyogenes and synthetic antisense RNAs (asRNAs) in Escherichia coli strains to repress or derepress a target gene in a programmable manner. Specifically, we demonstrate for the first time that the gene target repressed by the CRISPR system can be derepressed by expressing an asRNA that sequesters a small guide RNA (sgRNA). Furthermore, we demonstrate that tunable levels of derepression can be achieved (up to 95%) by designing asRNAs that target different regions of a sgRNA and by altering the hybridization free energy of the sgRNA–asRNA complex. This new system, which we call the combined CRISPR and asRNA system, can be used to reversibly repress or derepress multiple target genes simultaneously, allowing for rational reprogramming of cellular functions. PMID:26837577
Biochemical and molecular characterization of thyroid tissue by micro-Raman spectroscopy and gene expression analysis

NASA Astrophysics Data System (ADS)

Neto, Lázaro P. M.; Martin, Aírton A.; Soto, Claudio A. T.; Santos, André B. O.; Mello, Evandro S.; Pereira, Marina A.; Cernea, Cláudio R.; Brandão, Lenine G.; Canevari, Renata A.

2016-02-01

Thyroid carcinomas represent the main endocrine malignancy and their diagnosis may produce inconclusive results. Raman spectroscopy and gene expression analysis have shown excellent results on the differentiation of carcinomas. This study aimed to improve the discrimination between different thyroid pathologies combining of both analyses. A total of 35 thyroid tissues samples including normal tissue (n=10), goiter (n=10), papillary (n=10) and follicular carcinomas (n=5) were analyzed. Confocal Raman spectra was obtain by using a Rivers Diagnostic System, 785 nm laser excitation and CCD detector. The data was processed by the software Labspec5 and Origin 8.5 and analyzed by Minitab® program. The gene expression analysis was performed by qRT-PCR technique for TG, TPO, PDGFB, SERPINA1, LGALS3 and TFF3 genes and statistically analyzed by Mann-Whitney test. The confocal Raman spectroscopy allowed a maximum discrimination of 91.1% between normal and tumor tissues, 84.8% between benign and malignant pathologies and 84.6% among carcinomas analyzed. Significant differences was observed for TG, LGALS3, SERPINA1 and TFF3 genes between benign lesions and carcinomas, and SERPINA1 and TFF3 genes between papillary and follicular carcinomas. Principal component analysis was performed using PC1 and PC2 in the papillary carcinoma samples that showed over gene expression when compared with normal sample, where 90% of discrimination was observed at the Amide 1 (1655 cm-1), and at the tyrosine spectra region (856 cm-1). The discrimination of tissues thyroid carried out by confocal Raman spectroscopy and gene expression analysis indicate that these techniques are promising tools to be used in the diagnosis of thyroid lesions.
Construction and analysis of gene-gene dynamics influence networks based on a Boolean model.

PubMed

Mazaya, Maulida; Trinh, Hung-Cuong; Kwon, Yung-Keun

2017-12-21

Identification of novel gene-gene relations is a crucial issue to understand system-level biological phenomena. To this end, many methods based on a correlation analysis of gene expressions or structural analysis of molecular interaction networks have been proposed. They have a limitation in identifying more complicated gene-gene dynamical relations, though. To overcome this limitation, we proposed a measure to quantify a gene-gene dynamical influence (GDI) using a Boolean network model and constructed a GDI network to indicate existence of a dynamical influence for every ordered pair of genes. It represents how much a state trajectory of a target gene is changed by a knockout mutation subject to a source gene in a gene-gene molecular interaction (GMI) network. Through a topological comparison between GDI and GMI networks, we observed that the former network is denser than the latter network, which implies that there exist many gene pairs of dynamically influencing but molecularly non-interacting relations. In addition, a larger number of hub genes were generated in the GDI network. On the other hand, there was a correlation between these networks such that the degree value of a node was positively correlated to each other. We further investigated the relationships of the GDI value with structural properties and found that there are negative and positive correlations with the length of a shortest path and the number of paths, respectively. In addition, a GDI network could predict a set of genes whose steady-state expression is affected in E. coli gene-knockout experiments. More interestingly, we found that the drug-targets with side-effects have a larger number of outgoing links than the other genes in the GDI network, which implies that they are more likely to influence the dynamics of other genes. Finally, we found biological evidences showing that the gene pairs which are not molecularly interacting but dynamically influential can be considered for novel gene-gene

Combined Bisulfite Restriction Analysis for brain tissue identification.

PubMed

Samsuwan, Jarunya; Muangsub, Tachapol; Yanatatsaneejit, Pattamawadee; Mutirangura, Apiwat; Kitkumthorn, Nakarin

2018-05-01

According to the tissue-specific methylation database (doi: 10.1016/j.gene.2014.09.060), methylation at CpG locus cg03096975 in EML2 has been preliminarily proven to be specific to brain tissue. In this study, we enlarged sample size and developed a technique for identifying brain tissue in aged samples. Combined Bisulfite Restriction Analysis-for EML2 (COBRA-EML2) technique was established and validated in various organ samples obtained from 108 autopsies. In addition, this technique was also tested for its reliability, minimal DNA concentration detected, and use in aged samples and in samples obtained from specific brain compartments and spinal cord. COBRA-EML2 displayed 100% sensitivity and specificity for distinguishing brain tissue from other tissues, showed high reliability, was capable of detecting minimal DNA concentration (0.015ng/μl), could be used for identifying brain tissue in aged samples. In summary, COBRA-EML2 is a technique to identify brain tissue. This analysis is useful in criminal cases since it can identify the vital organ tissues from small samples acquired from criminal scenes. The results from this analysis can be counted as a medical and forensic marker supporting criminal investigations, and as one of the evidences in court rulings. Copyright © 2018 Elsevier B.V. All rights reserved.
RNA-Seq analysis of yak ovary: improving yak gene structure information and mining reproduction-related genes.

PubMed

Lan, DaoLiang; Xiong, XianRong; Wei, YanLi; Xu, Tong; Zhong, JinCheng; Zhi, XiangDong; Wang, Yong; Li, Jian

2014-09-01

RNA-Seq, a high-throughput (HT) sequencing technique, has been used effectively in large-scale transcriptomic studies, and is particularly useful for improving gene structure information and mining of new genes. In this study, RNA-Seq HT technology was employed to analyze the transcriptome of yak ovary. After Illumina-Solexa deep sequencing, 26826516 clean reads with a total of 4828772880 bp were obtained from the ovary library. Alignment analysis showed that 16992 yak genes mapped to the yak genome and 3734 of these genes were involved in alternative splicing. Gene structure refinement analysis showed that 7340 genes that were annotated in the yak genome could be extended at the 5' or 3' ends based on the alignments been the transcripts and the genome sequence. Novel transcript prediction analysis identified 6321 new transcripts with lengths ranging from 180 to 14884 bp, and 2267 of them were predicted to code proteins. BLAST analysis of the new transcripts showed that 1200?4933 mapped to the non-redundant (nr), nucleotide (nt) and/or SwissProt sequence databases. Comparative statistical analysis of the new mapped transcripts showed that the majority of them were similar to genes in Bos taurus (41.4%), Bos grunniens mutus (33.0%), Ovis aries (6.3%), Homo sapiens (2.8%), Mus musculus (1.6%) and other species. Functional analysis showed that these expressed genes were involved in various Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes pathways. GO analysis of the new transcripts found that the largest proportion of them was associated with reproduction. The results of this study will provide a basis for describing the normal transcriptome map of yak ovary and for future studies on yak breeding performance. Moreover, the results confirmed that RNA-Seq HT technology is highly advantageous in improving gene structure information and mining of new genes, as well as in providing valuable data to expand the yak genome information.
Meta-type analysis of dopaminergic effects on gene expression in the neuroendocrine brain of female goldfish.

PubMed

Popesku, Jason T; Martyniuk, Christopher J; Trudeau, Vance L

2012-01-01

Dopamine (DA) is a major neurotransmitter important for neuroendocrine control and recent studies have described genomic signaling pathways activated and inhibited by DA agonists and antagonists in the goldfish brain. Here we perform a meta-type analysis using microarray datasets from experiments conducted with female goldfish to characterize the gene expression responses that underlie dopaminergic signaling. Sexually mature, pre-spawning [gonadosomatic index (GSI) = 4.5 ± 1.3%] or sexually regressing (GSI = 3 ± 0.4%) female goldfish (15-40 g) injected intraperitoneally with either SKF 38393, LY 171555, SCH 23390, sulpiride, or a combination of 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine and α-methyl-p-tyrosine. Microarray meta-type analysis identified 268 genes in the telencephalon and hypothalamus as having reciprocal (i.e., opposite between agonism and antagonism/depletion) fold change responses, suggesting that these transcripts are likely targets for DA-mediated regulation. Noteworthy genes included ependymin, vimentin, and aromatase, genes that support the significance of DA in neuronal plasticity and tissue remodeling. Sub-network enrichment analysis (SNEA) was used to identify common gene regulators and binding proteins associated with the differentially expressed genes mediated by DA. SNEA analysis identified gene expression targets that were related to three major categories that included cell signaling (STAT3, SP1, SMAD, Jun/Fos), immune response (IL-6, IL-1β, TNFs, cytokine, NF-κB), and cell proliferation and growth (IGF1, TGFβ1). These gene networks are also known to be associated with neurodegenerative disorders such as Parkinsons' disease, well-known to be associated with loss of dopaminergic neurons. This study identifies genes and networks that underlie DA signaling in the vertebrate CNS and provides targets that may be key neuroendocrine regulators. The results provide a foundation for future work on dopaminergic
T-cell lymphomas associated gene expression signature: Bioinformatics analysis based on gene expression Omnibus.

PubMed

Zhou, Lei-Lei; Xu, Xiao-Yue; Ni, Jie; Zhao, Xia; Zhou, Jian-Wei; Feng, Ji-Feng

2018-06-01

Due to the low incidence and the heterogeneity of subtypes, the biological process of T-cell lymphomas is largely unknown. Although many genes have been detected in T-cell lymphomas, the role of these genes in biological process of T-cell lymphomas was not further analyzed. Two qualified datasets were downloaded from Gene Expression Omnibus database. The biological functions of differentially expressed genes were evaluated by gene ontology enrichment and KEGG pathway analysis. The network for intersection genes was constructed by the cytoscape v3.0 software. Kaplan-Meier survival curves and log-rank test were employed to assess the association between differentially expressed genes and clinical characters. The intersection mRNAs were proved to be associated with fundamental processes of T-cell lymphoma cells. These intersection mRNAs were involved in the activation of some cancer-related pathways, including PI3K/AKT, Ras, JAK-STAT, and NF-kappa B signaling pathway. PDGFRA, CXCL12, and CCL19 were the most significant central genes in the signal-net analysis. The results of survival analysis are not entirely credible. Our findings uncovered aberrantly expressed genes and a complex RNA signal network in T-cell lymphomas and indicated cancer-related pathways involved in disease initiation and progression, providing a new insight for biotargeted therapy in T-cell lymphomas. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

PubMed

Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

2010-10-07

PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out
Gene set analysis using variance component tests.

PubMed

Huang, Yen-Tsung; Lin, Xihong

2013-06-28

Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.
Meta-analysis of gene-level associations for rare variants based on single-variant statistics.

PubMed

Hu, Yi-Juan; Berndt, Sonja I; Gustafsson, Stefan; Ganna, Andrea; Hirschhorn, Joel; North, Kari E; Ingelsson, Erik; Lin, Dan-Yu

2013-08-08

Meta-analysis of genome-wide association studies (GWASs) has led to the discoveries of many common variants associated with complex human diseases. There is a growing recognition that identifying "causal" rare variants also requires large-scale meta-analysis. The fact that association tests with rare variants are performed at the gene level rather than at the variant level poses unprecedented challenges in the meta-analysis. First, different studies may adopt different gene-level tests, so the results are not compatible. Second, gene-level tests require multivariate statistics (i.e., components of the test statistic and their covariance matrix), which are difficult to obtain. To overcome these challenges, we propose to perform gene-level tests for rare variants by combining the results of single-variant analysis (i.e., p values of association tests and effect estimates) from participating studies. This simple strategy is possible because of an insight that multivariate statistics can be recovered from single-variant statistics, together with the correlation matrix of the single-variant test statistics, which can be estimated from one of the participating studies or from a publicly available database. We show both theoretically and numerically that the proposed meta-analysis approach provides accurate control of the type I error and is as powerful as joint analysis of individual participant data. This approach accommodates any disease phenotype and any study design and produces all commonly used gene-level tests. An application to the GWAS summary results of the Genetic Investigation of ANthropometric Traits (GIANT) consortium reveals rare and low-frequency variants associated with human height. The relevant software is freely available. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines

PubMed Central

Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

2016-01-01

Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours’ biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription–quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes. PMID:29263807
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines.

PubMed

Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J

2016-01-01

Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.
GFD-Net: A novel semantic similarity methodology for the analysis of gene networks.

PubMed

Díaz-Montaña, Juan J; Díaz-Díaz, Norberto; Gómez-Vela, Francisco

2017-04-01

Since the popularization of biological network inference methods, it has become crucial to create methods to validate the resulting models. Here we present GFD-Net, the first methodology that applies the concept of semantic similarity to gene network analysis. GFD-Net combines the concept of semantic similarity with the use of gene network topology to analyze the functional dissimilarity of gene networks based on Gene Ontology (GO). The main innovation of GFD-Net lies in the way that semantic similarity is used to analyze gene networks taking into account the network topology. GFD-Net selects a functionality for each gene (specified by a GO term), weights each edge according to the dissimilarity between the nodes at its ends and calculates a quantitative measure of the network functional dissimilarity, i.e. a quantitative value of the degree of dissimilarity between the connected genes. The robustness of GFD-Net as a gene network validation tool was demonstrated by performing a ROC analysis on several network repositories. Furthermore, a well-known network was analyzed showing that GFD-Net can also be used to infer knowledge. The relevance of GFD-Net becomes more evident in Section "GFD-Net applied to the study of human diseases" where an example of how GFD-Net can be applied to the study of human diseases is presented. GFD-Net is available as an open-source Cytoscape app which offers a user-friendly interface to configure and execute the algorithm as well as the ability to visualize and interact with the results(http://apps.cytoscape.org/apps/gfdnet). Copyright © 2017 Elsevier Inc. All rights reserved.
Comprehensive detection of genes causing a phenotype using phenotype sequencing and pathway analysis.

PubMed

Harper, Marc; Gronenberg, Luisa; Liao, James; Lee, Christopher

2014-01-01

Discovering all the genetic causes of a phenotype is an important goal in functional genomics. We combine an experimental design for detecting independent genetic causes of a phenotype with a high-throughput sequencing analysis that maximizes sensitivity for comprehensively identifying them. Testing this approach on a set of 24 mutant strains generated for a metabolic phenotype with many known genetic causes, we show that this pathway-based phenotype sequencing analysis greatly improves sensitivity of detection compared with previous methods, and reveals a wide range of pathways that can cause this phenotype. We demonstrate our approach on a metabolic re-engineering phenotype, the PEP/OAA metabolic node in E. coli, which is crucial to a substantial number of metabolic pathways and under renewed interest for biofuel research. Out of 2157 mutations in these strains, pathway-phenoseq discriminated just five gene groups (12 genes) as statistically significant causes of the phenotype. Experimentally, these five gene groups, and the next two high-scoring pathway-phenoseq groups, either have a clear connection to the PEP metabolite level or offer an alternative path of producing oxaloacetate (OAA), and thus clearly explain the phenotype. These high-scoring gene groups also show strong evidence of positive selection pressure, compared with strictly neutral selection in the rest of the genome.
Genome-wide identification and analysis of the MADS-box gene family in bread wheat (Triticum aestivum L.)

PubMed Central

Yang, Congcong; Ding, Puyang; Liu, Yaxi; Qiao, Linyi; Chang, Zhijian; Geng, Hongwei; Wang, Penghao; Jiang, Qiantao; Wang, Jirui; Chen, Guoyue; Wei, Yuming; Zheng, Youliang; Lan, Xiujin

2017-01-01

The MADS-box genes encode transcription factors with key roles in plant growth and development. A comprehensive analysis of the MADS-box gene family in bread wheat (Triticum aestivum) has not yet been conducted, and our understanding of their roles in stress is rather limited. Here, we report the identification and characterization of the MADS-box gene family in wheat. A total of 180 MADS-box genes classified as 32 Mα, 5 Mγ, 5 Mδ, and 138 MIKC types were identified. Evolutionary analysis of the orthologs among T. urartu, Aegilops tauschii and wheat as well as homeologous sequences analysis among the three sub-genomes in wheat revealed that gene loss and chromosomal rearrangements occurred during and/or after the origin of bread wheat. Forty wheat MADS-box genes that were expressed throughout the investigated tissues and development stages were identified. The genes that were regulated in response to both abiotic stresses (i.e., phosphorus deficiency, drought, heat, and combined drought and heat) and biotic stresses (i.e., Fusarium graminearum, Septoria tritici, stripe rust and powdery mildew) were detected as well. A few notable MADS-box genes were specifically expressed in a single tissue and those showed relatively higher expression differences between the stress and control treatment. The expression patterns of considerable MADS-box genes differed from those of their orthologs in Brachypodium, rice, and Arabidopsis. Collectively, the present study provides new insights into the possible roles of MADS-box genes in response to stresses and will be valuable for further functional studies of important candidate MADS-box genes. PMID:28742823
Genome-wide identification and analysis of the MADS-box gene family in bread wheat (Triticum aestivum L.).

PubMed

Ma, Jian; Yang, Yujie; Luo, Wei; Yang, Congcong; Ding, Puyang; Liu, Yaxi; Qiao, Linyi; Chang, Zhijian; Geng, Hongwei; Wang, Penghao; Jiang, Qiantao; Wang, Jirui; Chen, Guoyue; Wei, Yuming; Zheng, Youliang; Lan, Xiujin

2017-01-01

The MADS-box genes encode transcription factors with key roles in plant growth and development. A comprehensive analysis of the MADS-box gene family in bread wheat (Triticum aestivum) has not yet been conducted, and our understanding of their roles in stress is rather limited. Here, we report the identification and characterization of the MADS-box gene family in wheat. A total of 180 MADS-box genes classified as 32 Mα, 5 Mγ, 5 Mδ, and 138 MIKC types were identified. Evolutionary analysis of the orthologs among T. urartu, Aegilops tauschii and wheat as well as homeologous sequences analysis among the three sub-genomes in wheat revealed that gene loss and chromosomal rearrangements occurred during and/or after the origin of bread wheat. Forty wheat MADS-box genes that were expressed throughout the investigated tissues and development stages were identified. The genes that were regulated in response to both abiotic stresses (i.e., phosphorus deficiency, drought, heat, and combined drought and heat) and biotic stresses (i.e., Fusarium graminearum, Septoria tritici, stripe rust and powdery mildew) were detected as well. A few notable MADS-box genes were specifically expressed in a single tissue and those showed relatively higher expression differences between the stress and control treatment. The expression patterns of considerable MADS-box genes differed from those of their orthologs in Brachypodium, rice, and Arabidopsis. Collectively, the present study provides new insights into the possible roles of MADS-box genes in response to stresses and will be valuable for further functional studies of important candidate MADS-box genes.
Association analysis of nine candidate gene polymorphisms in Indian patients with type 2 diabetic retinopathy.

PubMed

Balasubbu, Suganthalakshmi; Sundaresan, Periasamy; Rajendran, Anand; Ramasamy, Kim; Govindarajan, Gowthaman; Perumalsamy, Namperumalsamy; Hejtmancik, J Fielding

2010-11-10

Diabetic retinopathy (DR) is classically defined as a microvasculopathy that primarily affects the small blood vessels of the inner retina as a complication of diabetes mellitus (DM).It is a multifactorial disease with a strong genetic component. The aim of this study is to investigate the association of a set of nine candidate genes with the development of diabetic retinopathy in a South Indian cohort who have type 2 diabetes mellitus (T2DM). Seven candidate genes (RAGE, PEDF, AKR1B1, EPO, HTRA1, ICAM and HFE) were chosen based on reported association with DR in the literature. Two more, CFH and ARMS2, were chosen based on their roles in biological pathways previously implicated in DR. Fourteen single nucleotide polymorphisms (SNPs) and one dinucleotide repeat polymorphism, previously reported to show association with DR or other related diseases, were genotyped in 345 DR and 356 diabetic patients without retinopathy (DNR). The genes which showed positive association in this screening set were tested further in additional sets of 100 DR and 90 DNR additional patients from the Aravind Eye Hospital. Those which showed association in the secondary screen were subjected to a combined analysis with the 100 DR and 100 DNR subjects previously recruited and genotyped through the Sankara Nethralaya Hospital, India. Genotypes were evaluated using a combination of direct sequencing, TaqMan SNP genotyping, RFLP analysis, and SNaPshot PCR assays. Chi-square and Fisher exact tests were used to analyze the genotype and allele frequencies. Among the nine loci (15 polymorphisms) screened, SNP rs2070600 (G82S) in the RAGE gene, showed significant association with DR (allelic P = 0.016, dominant model P = 0.012), compared to DNR. SNP rs2070600 further showed significant association with DR in the confirmation cohort (P = 0.035, dominant model P = 0.032). Combining the two cohorts gave an allelic P < 0.003 and dominant P = 0.0013). Combined analysis with the Sankara Nethralaya cohort
Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

PubMed

Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

2017-08-01

This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
A Novel Combination of Homeobox Genes Is Expressed in Mesenchymal Chorionic Stem/Stromal Cells in First Trimester and Term Pregnancies

PubMed Central

Liu, Haiying; Murthi, Padma; Qin, Sharon; Kusuma, Gina D.; Borg, Anthony J.; Knöfler, Martin; Haslinger, Peter; Manuelpillai, Ursula; Pertile, Mark D.; Abumaree, Mohamed

2014-01-01

Human chorionic mesenchymal stem/stromal cells (CMSCs) derived from the placenta are similar to adult tissue-derived MSCs. The aim of this study was to investigate the role of these cells in normal placental development. Transcription factors, particularly members of the homeobox gene family, play crucial roles in maintaining stem cell proliferation and lineage specification in embryonic tissues. In adult tissues and organs, stem cells proliferate at low levels in their niche until they receive cues from the microenvironment to differentiate. The homeobox genes that are expressed in the CMSC niche in placental tissues have not been identified. We used the novel strategy of laser capture microdissection to isolate the stromal component of first trimester villi and excluded the cytotrophoblast and syncytiotrophoblast layers that comprise the outer layer of the chorionic villi. Microarray analysis was then used to screen for homeobox genes in the microdissected tissue. Candidate homeobox genes were selected for further RNA analysis. Immunohistochemistry of candidate genes in first trimester placental villous stromal tissue revealed homeobox genes Meis1, myeloid ectropic viral integration site 1 homolog 2 (MEIS2), H2.0-like Drosophila (HLX), transforming growth factor β-induced factor (TGIF), and distal-less homeobox 5 (DLX5) were expressed in the vascular niche where CMSCs have been shown to reside. Expression of MEIS2, HLX, TGIF, and DLX5 was also detected in scattered stromal cells. Real-time polymerase chain reaction and immunocytochemistry verified expression of MEIS2, HLX, TGIF, and DLX5 homeobox genes in first trimester and term CMSCs. These data suggest a combination of regulatory homeobox genes is expressed in CMSCs from early placental development to term, which may be required for stem cell proliferation and differentiation. PMID:24692208
The link between the microbial ecology, gene expression, and biokinetics of denitrifying polyphosphate-accumulating systems under different electron acceptor combinations.

PubMed

Vieira, A; Ribera-Guardia, A; Marques, R; Barreto Crespo, M T; Oehmen, A; Carvalho, G

2018-06-02

The emission of the greenhouse gas nitrous oxide (N 2 O) can occur during biological nutrient removal. Denitrifying enhanced biological phosphorus removal (d-EBPR) systems are an efficient means of removing phosphate and nitrogen, performed by denitrifying polyphosphate-accumulating organisms (d-PAOs). The aim of this work was to study the effect of various combinations of electron acceptors, nitrate (NO 3 - ), nitrite (NO 2 - ), and N 2 O, on the denitrification pathway of a d-EBPR system. Batch tests were performed with different electron acceptor combinations, to explore the denitrification pathway. Reverse transcriptase-qPCR (RT-qPCR) and high-throughput sequencing, combined with chemical analysis, were used to study gene expression, microbial diversity, and denitrification kinetics. The potential for N 2 O production was greater than the potential for its reduction in most tests. A strong correlation was observed between the N 2 O reduction rate and the relative gene expression of nitrous oxide reductase per nitrite reductase (nosZ/(nirS + nirK)), suggesting that the expression of denitrifying marker genes is a strong predictor of the N 2 O reduction rate. The d-EBPR community maintained a core population with low variations throughout the study. Furthermore, phylogenetic analyses of the studied marker genes revealed that the organisms actively involved in denitrification were closely related to Thauera sp., Candidatus Accumulibacter phosphatis, and Candidatus Competibacter denitrificans. Moreover, Competibacter-related OTUs seem to be important contributors to the N 2 O reduction capacity of the system, likely scavenging the N 2 O produced by other organisms. Overall, this study contributes to a better understanding of the microbial biochemistry and the genetics involving biological denitrification removal, important to minimize N 2 O emissions in wastewater treatment plants.
Citrate Accumulation-Related Gene Expression and/or Enzyme Activity Analysis Combined With Metabolomics Provide a Novel Insight for an Orange Mutant

PubMed Central

Guo, Ling-Xia; Shi, Cai-Yun; Liu, Xiao; Ning, Dong-Yuan; Jing, Long-Fei; Yang, Huan; Liu, Yong-Zhong

2016-01-01

‘Hong Anliu’ (HAL, Citrus sinensis cv. Hong Anliu) is a bud mutant of ‘Anliu’ (AL), characterized by a comprehensive metabolite alteration, such as lower accumulation of citrate, high accumulation of lycopene and soluble sugars in fruit juice sacs. Due to carboxylic acid metabolism connects other metabolite biosynthesis and/or catabolism networks, we therefore focused analyzing citrate accumulation-related gene expression profiles and/or enzyme activities, along with metabolic fingerprinting between ‘HAL’ and ‘AL’. Compared with ‘AL’, the transcript levels of citrate biosynthesis- and utilization-related genes and/or the activities of their respective enzymes such as citrate synthase, cytosol aconitase and ATP-citrate lyase were significantly higher in ‘HAL’. Nevertheless, the mitochondrial aconitase activity, the gene transcript levels of proton pumps, including vacuolar H+-ATPase, vacuolar H+-PPase, and the juice sac-predominant p-type proton pump gene (CsPH8) were significantly lower in ‘HAL’. These results implied that ‘HAL’ has higher abilities for citrate biosynthesis and utilization, but lower ability for the citrate uptake into vacuole compared with ‘AL’. Combined with the metabolites-analyzing results, a model was then established and suggested that the reduction in proton pump activity is the key factor for the low citrate accumulation and the comprehensive metabolite alterations as well in ‘HAL’. PMID:27385485
ADAGE signature analysis: differential expression analysis with data-defined gene sets.

PubMed

Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S

2017-11-22

Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed
SigEMD: A powerful method for differential gene expression analysis in single-cell RNA sequencing data.

PubMed

Wang, Tianyu; Nabavi, Sheida

2018-04-24

Differential gene expression analysis is one of the significant efforts in single cell RNA sequencing (scRNAseq) analysis to discover the specific changes in expression levels of individual cell types. Since scRNAseq exhibits multimodality, large amounts of zero counts, and sparsity, it is different from the traditional bulk RNA sequencing (RNAseq) data. The new challenges of scRNAseq data promote the development of new methods for identifying differentially expressed (DE) genes. In this study, we proposed a new method, SigEMD, that combines a data imputation approach, a logistic regression model and a nonparametric method based on the Earth Mover's Distance, to precisely and efficiently identify DE genes in scRNAseq data. The regression model and data imputation are used to reduce the impact of large amounts of zero counts, and the nonparametric method is used to improve the sensitivity of detecting DE genes from multimodal scRNAseq data. By additionally employing gene interaction network information to adjust the final states of DE genes, we further reduce the false positives of calling DE genes. We used simulated datasets and real datasets to evaluate the detection accuracy of the proposed method and to compare its performance with those of other differential expression analysis methods. Results indicate that the proposed method has an overall powerful performance in terms of precision in detection, sensitivity, and specificity. Copyright © 2018 Elsevier Inc. All rights reserved.

Transcriptome analysis of Brassica napus pod using RNA-Seq and identification of lipid-related candidate genes.

PubMed

Xu, Hai-Ming; Kong, Xiang-Dong; Chen, Fei; Huang, Ji-Xiang; Lou, Xiang-Yang; Zhao, Jian-Yi

2015-10-24

Brassica napus is an important oilseed crop. Dissection of the genetic architecture underlying oil-related biological processes will greatly facilitates the genetic improvement of rapeseed. The differential gene expression during pod development offers a snapshot on the genes responsible for oil accumulation in. To identify candidate genes in the linkage peaks reported previously, we used RNA sequencing (RNA-Seq) technology to analyze the pod transcriptomes of German cultivar Sollux and Chinese inbred line Gaoyou. The RNA samples were collected for RNA-Seq at 5-7, 15-17 and 25-27 days after flowering (DAF). Bioinformatics analysis was performed to investigate differentially expressed genes (DEGs). Gene annotation analysis was integrated with QTL mapping and Brassica napus pod transcriptome profiling to detect potential candidate genes in oilseed. Four hundred sixty five and two thousand, one hundred fourteen candidate DEGs were identified, respectively, between two varieties at the same stages and across different periods of each variety. Then, 33 DEGs between Sollux and Gaoyou were identified as the candidate genes affecting seed oil content by combining those DEGs with the quantitative trait locus (QTL) mapping results, of which, one was found to be homologous to Arabidopsis thaliana lipid-related genes. Intervarietal DEGs of lipid pathways in QTL regions represent important candidate genes for oil-related traits. Integrated analysis of transcriptome profiling, QTL mapping and comparative genomics with other relative species leads to efficient identification of most plausible functional genes underlying oil-content related characters, offering valuable resources for bettering breeding program of Brassica napus. This study provided a comprehensive overview on the pod transcriptomes of two varieties with different oil-contents at the three developmental stages.
Identification of candidate genes involved in neuroblastoma progression by combining genomic and expression microarrays with survival data.

PubMed

Łastowska, M; Viprey, V; Santibanez-Koref, M; Wappler, I; Peters, H; Cullinane, C; Roberts, P; Hall, A G; Tweddle, D A; Pearson, A D J; Lewis, I; Burchill, S A; Jackson, M S

2007-11-22

Identifying genes, whose expression is consistently altered by chromosomal gains or losses, is an important step in defining genes of biological relevance in a wide variety of tumour types. However, additional criteria are needed to discriminate further among the large number of candidate genes identified. This is particularly true for neuroblastoma, where multiple genomic copy number changes of proven prognostic value exist. We have used Affymetrix microarrays and a combination of fluorescent in situ hybridization and single nucleotide polymorphism (SNP) microarrays to establish expression profiles and delineate copy number alterations in 30 primary neuroblastomas. Correlation of microarray data with patient survival and analysis of expression within rodent neuroblastoma cell lines were then used to define further genes likely to be involved in the disease process. Using this approach, we identify >1000 genes within eight recurrent genomic alterations (loss of 1p, 3p, 4p, 10q and 11q, 2p gain, 17q gain, and the MYCN amplicon) whose expression is consistently altered by copy number change. Of these, 84 correlate with patient survival, with the minimal regions of 17q gain and 4p loss being enriched significantly for such genes. These include genes involved in RNA and DNA metabolism, and apoptosis. Orthologues of all but one of these genes on 17q are overexpressed in rodent neuroblastoma cell lines. A significant excess of SNPs whose copy number correlates with survival is also observed on proximal 4p in stage 4 tumours, and we find that deletion of 4p is associated with improved outcome in an extended cohort of tumours. These results define the major impact of genomic copy number alterations upon transcription within neuroblastoma, and highlight genes on distal 17q and proximal 4p for downstream analyses. They also suggest that integration of discriminators, such as survival and comparative gene expression, with microarray data may be useful in the identification of
Identification of functional enolase genes of the silkworm Bombyx mori from public databases with a combination of dry and wet bench processes.

PubMed

Kikuchi, Akira; Nakazato, Takeru; Ito, Katsuhiko; Nojima, Yosui; Yokoyama, Takeshi; Iwabuchi, Kikuo; Bono, Hidemasa; Toyoda, Atsushi; Fujiyama, Asao; Sato, Ryoichi; Tabunoki, Hiroko

2017-01-13

Various insect species have been added to genomic databases over the years. Thus, researchers can easily obtain online genomic information on invertebrates and insects. However, many incorrectly annotated genes are included in these databases, which can prevent the correct interpretation of subsequent functional analyses. To address this problem, we used a combination of dry and wet bench processes to select functional genes from public databases. Enolase is an important glycolytic enzyme in all organisms. We used a combination of dry and wet bench processes to identify functional enolases in the silkworm Bombyx mori (BmEno). First, we detected five annotated enolases from public databases using a Hidden Markov Model (HMM) search, and then through cDNA cloning, Northern blotting, and RNA-seq analysis, we revealed three functional enolases in B. mori: BmEno1, BmEno2, and BmEnoC. BmEno1 contained a conserved key amino acid residue for metal binding and substrate binding in other species. However, BmEno2 and BmEnoC showed a change in this key amino acid. Phylogenetic analysis showed that BmEno2 and BmEnoC were distinct from BmEno1 and other enolases, and were distributed only in lepidopteran clusters. BmEno1 was expressed in all of the tissues used in our study. In contrast, BmEno2 was mainly expressed in the testis with some expression in the ovary and suboesophageal ganglion. BmEnoC was weakly expressed in the testis. Quantitative RT-PCR showed that the mRNA expression of BmEno2 and BmEnoC correlated with testis development; thus, BmEno2 and BmEnoC may be related to lepidopteran-specific spermiogenesis. We identified and characterized three functional enolases from public databases with a combination of dry and wet bench processes in the silkworm B. mori. In addition, we determined that BmEno2 and BmEnoC had species-specific functions. Our strategy could be helpful for the detection of minor genes and functional genes in non-model organisms from public databases.
GeneSigDB: a manually curated database and resource for analysis of gene expression signatures

PubMed Central

Culhane, Aedín C.; Schröder, Markus S.; Sultana, Razvan; Picard, Shaita C.; Martinelli, Enzo N.; Kelly, Caroline; Haibe-Kains, Benjamin; Kapushesky, Misha; St Pierre, Anne-Alyssa; Flahive, William; Picard, Kermshlise C.; Gusenleitner, Daniel; Papenhausen, Gerald; O'Connor, Niall; Correll, Mick; Quackenbush, John

2012-01-01

GeneSigDB (http://www.genesigdb.org or http://compbio.dfci.harvard.edu/genesigdb/) is a database of gene signatures that have been extracted and manually curated from the published literature. It provides a standardized resource of published prognostic, diagnostic and other gene signatures of cancer and related disease to the community so they can compare the predictive power of gene signatures or use these in gene set enrichment analysis. Since GeneSigDB release 1.0, we have expanded from 575 to 3515 gene signatures, which were collected and transcribed from 1604 published articles largely focused on gene expression in cancer, stem cells, immune cells, development and lung disease. We have made substantial upgrades to the GeneSigDB website to improve accessibility and usability, including adding a tag cloud browse function, facetted navigation and a ‘basket’ feature to store genes or gene signatures of interest. Users can analyze GeneSigDB gene signatures, or upload their own gene list, to identify gene signatures with significant gene overlap and results can be viewed on a dynamic editable heatmap that can be downloaded as a publication quality image. All data in GeneSigDB can be downloaded in numerous formats including .gmt file format for gene set enrichment analysis or as a R/Bioconductor data file. GeneSigDB is available from http://www.genesigdb.org. PMID:22110038
MINER: exploratory analysis of gene interaction networks by machine learning from expression data.

PubMed

Kadupitige, Sidath Randeni; Leung, Kin Chun; Sellmeier, Julia; Sivieng, Jane; Catchpoole, Daniel R; Bain, Michael E; Gaëta, Bruno A

2009-12-03

The reconstruction of gene regulatory networks from high-throughput "omics" data has become a major goal in the modelling of living systems. Numerous approaches have been proposed, most of which attempt only "one-shot" reconstruction of the whole network with no intervention from the user, or offer only simple correlation analysis to infer gene dependencies. We have developed MINER (Microarray Interactive Network Exploration and Representation), an application that combines multivariate non-linear tree learning of individual gene regulatory dependencies, visualisation of these dependencies as both trees and networks, and representation of known biological relationships based on common Gene Ontology annotations. MINER allows biologists to explore the dependencies influencing the expression of individual genes in a gene expression data set in the form of decision, model or regression trees, using their domain knowledge to guide the exploration and formulate hypotheses. Multiple trees can then be summarised in the form of a gene network diagram. MINER is being adopted by several of our collaborators and has already led to the discovery of a new significant regulatory relationship with subsequent experimental validation. Unlike most gene regulatory network inference methods, MINER allows the user to start from genes of interest and build the network gene-by-gene, incorporating domain expertise in the process. This approach has been used successfully with RNA microarray data but is applicable to other quantitative data produced by high-throughput technologies such as proteomics and "next generation" DNA sequencing.
Lentiviral gene ontology (LeGO) vectors equipped with novel drug-selectable fluorescent proteins: new building blocks for cell marking and multi-gene analysis.

PubMed

Weber, K; Mock, U; Petrowitz, B; Bartsch, U; Fehse, B

2010-04-01

Vector-encoded fluorescent proteins (FPs) facilitate unambiguous identification or sorting of gene-modified cells by fluorescence-activated cell sorting (FACS). Exploiting this feature, we have recently developed lentiviral gene ontology (LeGO) vectors (www.LentiGO-Vectors.de) for multi-gene analysis in different target cells. In this study, we extend the LeGO principle by introducing 10 different drug-selectable FPs created by fusing one of the five selection marker (protecting against blasticidin, hygromycin, neomycin, puromycin and zeocin) and one of the five FP genes (Cerulean, eGFP, Venus, dTomato and mCherry). All tested fusion proteins allowed both fluorescence-mediated detection and drug-mediated selection of LeGO-transduced cells. Newly generated codon-optimized hygromycin- and neomycin-resistance genes showed improved expression as compared with their ancestors. New LeGO constructs were produced at titers >10(6) per ml (for non-concentrated supernatants). We show efficient combinatorial marking and selection of various cells, including mesenchymal stem cells, simultaneously transduced with different LeGO constructs. Inclusion of the cytomegalovirus early enhancer/chicken beta-actin promoter into LeGO vectors facilitated robust transgene expression in and selection of neural stem cells and their differentiated progeny. We suppose that the new drug-selectable markers combining advantages of FACS and drug selection are well suited for numerous applications and vector systems. Their inclusion into LeGO vectors opens new possibilities for (stem) cell tracking and functional multi-gene analysis.
Thermal tolerance in the keystone species Daphnia magna-a candidate gene and an outlier analysis approach.

PubMed

Jansen, M; Geerts, A N; Rago, A; Spanier, K I; Denis, C; De Meester, L; Orsini, L

2017-04-01

Changes in temperature have occurred throughout Earth's history. However, current warming trends exacerbated by human activities impose severe and rapid loss of biodiversity. Although understanding the mechanisms orchestrating organismal response to climate change is important, remarkably few studies document their role in nature. This is because only few systems enable the combined analysis of genetic and plastic responses to environmental change over long time spans. Here, we characterize genetic and plastic responses to temperature increase in the aquatic keystone grazer Daphnia magna combining a candidate gene and an outlier analysis approach. We capitalize on the short generation time of our species, facilitating experimental evolution, and the production of dormant eggs enabling the analysis of long-term response to environmental change through a resurrection ecology approach. We quantify plasticity in the expression of 35 candidate genes in D. magna populations resurrected from a lake that experienced changes in average temperature over the past century and from experimental populations differing in thermal tolerance isolated from a selection experiment. By measuring expression in multiple genotypes from each of these populations in control and heat treatments, we assess plastic responses to extreme temperature events. By measuring evolutionary changes in gene expression between warm- and cold-adapted populations, we assess evolutionary response to temperature changes. Evolutionary response to temperature increase is also assessed via an outlier analysis using EST-linked microsatellite loci. This study provides the first insights into the role of plasticity and genetic adaptation in orchestrating adaptive responses to environmental change in D. magna. © 2017 John Wiley & Sons Ltd.
Programmable control of bacterial gene expression with the combined CRISPR and antisense RNA system.

PubMed

Lee, Young Je; Hoynes-O'Connor, Allison; Leong, Matthew C; Moon, Tae Seok

2016-03-18

A central goal of synthetic biology is to implement diverse cellular functions by predictably controlling gene expression. Though research has focused more on protein regulators than RNA regulators, recent advances in our understanding of RNA folding and functions have motivated the use of RNA regulators. RNA regulators provide an advantage because they are easier to design and engineer than protein regulators, potentially have a lower burden on the cell and are highly orthogonal. Here, we combine the CRISPR system from Streptococcus pyogenes and synthetic antisense RNAs (asRNAs) in Escherichia coli strains to repress or derepress a target gene in a programmable manner. Specifically, we demonstrate for the first time that the gene target repressed by the CRISPR system can be derepressed by expressing an asRNA that sequesters a small guide RNA (sgRNA). Furthermore, we demonstrate that tunable levels of derepression can be achieved (up to 95%) by designing asRNAs that target different regions of a sgRNA and by altering the hybridization free energy of the sgRNA-asRNA complex. This new system, which we call the combined CRISPR and asRNA system, can be used to reversibly repress or derepress multiple target genes simultaneously, allowing for rational reprogramming of cellular functions. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Gene expression meta-analysis identifies chromosomal regions and candidate genes involved in breast cancer metastasis.

PubMed

Thomassen, Mads; Tan, Qihua; Kruse, Torben A

2009-01-01

Breast cancer cells exhibit complex karyotypic alterations causing deregulation of numerous genes. Some of these genes are probably causal for cancer formation and local growth whereas others are causal for the various steps of metastasis. In a fraction of tumors deregulation of the same genes might be caused by epigenetic modulations, point mutations or the influence of other genes. We have investigated the relation of gene expression and chromosomal position, using eight datasets including more than 1200 breast tumors, to identify chromosomal regions and candidate genes possibly causal for breast cancer metastasis. By use of "Gene Set Enrichment Analysis" we have ranked chromosomal regions according to their relation to metastasis. Overrepresentation analysis identified regions with increased expression for chromosome 1q41-42, 8q24, 12q14, 16q22, 16q24, 17q12-21.2, 17q21-23, 17q25, 20q11, and 20q13 among metastasizing tumors and reduced gene expression at 1p31-21, 8p22-21, and 14q24. By analysis of genes with extremely imbalanced expression in these regions we identified DIRAS3 at 1p31, PSD3, LPL, EPHX2 at 8p21-22, and FOS at 14q24 as candidate metastasis suppressor genes. Potential metastasis promoting genes includes RECQL4 at 8q24, PRMT7 at 16q22, GINS2 at 16q24, and AURKA at 20q13.
Genes for seed longevity in barley identified by genomic analysis on Near Isogenic Lines.

PubMed

Wozny, Dorothee; Kramer, Katharina; Finkemeier, Iris; Acosta, Ivan F; Koornneef, Maarten

2018-05-09

Genes controlling differences in seed longevity between two barley (Hordeum vulgare) accessions were identified by combining quantitative genetics 'omics' technologies in Near Isogenic Lines (NILs). The NILs were derived from crosses between the spring barley landraces L94 from Ethiopia and Cebada Capa from Argentina. A combined transcriptome and proteome analysis on mature, non-aged seeds of the two parental lines and the L94 NILs by RNA-sequencing and total seed proteomic profiling identified the UDP-glycosyltransferase MLOC_11661.1 as candidate gene for the QTL on 2H, and the NADP-dependent malic enzyme (NADP-ME) MLOC_35785.1 as possible downstream target gene. To validate these candidates, they were expressed in Arabidopsis under the control of constitutive promoters to attempt complementing the T-DNA knock-out line nadp-me1. Both the NADP-ME MLOC_35785.1 and the UDP-glycosyltransferase MLOC_11661.1 were able to rescue the nadp-me1 seed longevity phenotype. In the case of the UDP-glycosyltransferase, with high accumulation in NILs, only the coding sequence of Cebada Capa had a rescue effect. This article is protected by copyright. All rights reserved.
Mapping autosomal recessive intellectual disability: combined microarray and exome sequencing identifies 26 novel candidate genes in 192 consanguineous families.

PubMed

Harripaul, R; Vasli, N; Mikhailov, A; Rafiq, M A; Mittal, K; Windpassinger, C; Sheikh, T I; Noor, A; Mahmood, H; Downey, S; Johnson, M; Vleuten, K; Bell, L; Ilyas, M; Khan, F S; Khan, V; Moradi, M; Ayaz, M; Naeem, F; Heidari, A; Ahmed, I; Ghadami, S; Agha, Z; Zeinali, S; Qamar, R; Mozhdehipanah, H; John, P; Mir, A; Ansar, M; French, L; Ayub, M; Vincent, J B

2018-04-01

Approximately 1% of the global population is affected by intellectual disability (ID), and the majority receive no molecular diagnosis. Previous studies have indicated high levels of genetic heterogeneity, with estimates of more than 2500 autosomal ID genes, the majority of which are autosomal recessive (AR). Here, we combined microarray genotyping, homozygosity-by-descent (HBD) mapping, copy number variation (CNV) analysis, and whole exome sequencing (WES) to identify disease genes/mutations in 192 multiplex Pakistani and Iranian consanguineous families with non-syndromic ID. We identified definite or candidate mutations (or CNVs) in 51% of families in 72 different genes, including 26 not previously reported for ARID. The new ARID genes include nine with loss-of-function mutations (ABI2, MAPK8, MPDZ, PIDD1, SLAIN1, TBC1D23, TRAPPC6B, UBA7 and USP44), and missense mutations include the first reports of variants in BDNF or TET1 associated with ID. The genes identified also showed overlap with de novo gene sets for other neuropsychiatric disorders. Transcriptional studies showed prominent expression in the prenatal brain. The high yield of AR mutations for ID indicated that this approach has excellent clinical potential and should inform clinical diagnostics, including clinical whole exome and genome sequencing, for populations in which consanguinity is common. As with other AR disorders, the relevance will also apply to outbred populations.
VIZARD: analysis of Affymetrix Arabidopsis GeneChip data

NASA Technical Reports Server (NTRS)

Moseyko, Nick; Feldman, Lewis J.

2002-01-01

SUMMARY: The Affymetrix GeneChip Arabidopsis genome array has proved to be a very powerful tool for the analysis of gene expression in Arabidopsis thaliana, the most commonly studied plant model organism. VIZARD is a Java program created at the University of California, Berkeley, to facilitate analysis of Arabidopsis GeneChip data. It includes several integrated tools for filtering, sorting, clustering and visualization of gene expression data as well as tools for the discovery of regulatory motifs in upstream sequences. VIZARD also includes annotation and upstream sequence databases for the majority of genes represented on the Affymetrix Arabidopsis GeneChip array. AVAILABILITY: VIZARD is available free of charge for educational, research, and not-for-profit purposes, and can be downloaded at http://www.anm.f2s.com/research/vizard/ CONTACT: moseyko@uclink4.berkeley.edu.
In silico analysis of miRNA-mediated gene regulation in OCA and OA genes.

PubMed

Kamaraj, Balu; Gopalakrishnan, Chandrasekhar; Purohit, Rituraj

2014-12-01

Albinism is an autosomal recessive genetic disorder due to low secretion of melanin. The oculocutaneous albinism (OCA) and ocular albinism (OA) genes are responsible for melanin production and also act as a potential targets for miRNAs. The role of miRNA is to inhibit the protein synthesis partially or completely by binding with the 3'UTR of the mRNA thus regulating gene expression. In this analysis, we predicted the genetic variation that occurred in 3'UTR of the transcript which can be a reason for low melanin production thus causing albinism. The single nucleotide polymorphisms (SNPs) in 3'UTR cause more new binding sites for miRNA which binds with mRNA which leads to inhibit the translation process either partially or completely. The SNPs in the mRNA of OCA and OA genes can create new binding sites for miRNA which may control the gene expression and lead to hypopigmentation. We have developed a computational procedure to determine the SNPs in the 3'UTR region of mRNA of OCA (TYR, OCA2, TYRP1 and SLC45A2) and OA (GPR143) genes which will be a potential cause for albinism. We identified 37 SNPs in five genes that are predicted to create 87 new binding sites on mRNA, which may lead to abrogation of the translation process. Expression analysis confirms that these genes are highly expressed in skin and eye regions. It is well supported by enrichment analysis that these genes are mainly involved in eye pigmentation and melanin biosynthesis process. The network analysis also shows how the genes are interacting and expressing in a complex network. This insight provides clue to wet-lab researches to understand the expression pattern of OCA and OA genes and binding phenomenon of mRNA and miRNA upon mutation, which is responsible for inhibition of translation process at genomic levels.
Molecular characterization and expression analysis of Triticum aestivum squamosa-promoter binding protein-box genes involved in ear development.

PubMed

Zhang, Bin; Liu, Xia; Zhao, Guangyao; Mao, Xinguo; Li, Ang; Jing, Ruilian

2014-06-01

Wheat (Triticum aestivum L.) is one of the most important crops in the world. Squamosa-promoter binding protein (SBP)-box genes play a critical role in regulating flower and fruit development. In this study, 10 novel SBP-box genes (TaSPL genes) were isolated from wheat ((Triticum aestivum L.) cultivar Yanzhan 4110). Phylogenetic analysis classified the TaSPL genes into five groups (G1-G5). The motif combinations and expression patterns of the TaSPL genes varied among the five groups with each having own distinctive characteristics: TaSPL20/21 in G1 and TaSPL17 in G2 mainly expressed in the shoot apical meristem and the young ear, and their expression levels responded to development of the ear; TaSPL6/15 belonging to G3 were upregulated and TaSPL1/23 in G4 were downregulated during grain development; the gene in G5 (TaSPL3) expressed constitutively. Thus, the consistency of the phylogenetic analysis, motif compositions, and expression patterns of the TaSPL genes revealed specific gene structures and functions. On the other hand, the diverse gene structures and different expression patterns suggested that wheat SBP-box genes have a wide range of functions. The results also suggest a potential role for wheat SBP-box genes in ear development. This study provides a significant beginning of functional analysis of SBP-box genes in wheat. © 2014 The Authors. Journal of Integrative Plant Biology Published by Wiley Publishing Asia Pty Ltd on behalf of Institute of Botany, Chinese Academy of Sciences.
Meta-analysis identifies gene-by-environment interactions as demonstrated in a study of 4,965 mice.

PubMed

Kang, Eun Yong; Han, Buhm; Furlotte, Nicholas; Joo, Jong Wha J; Shih, Diana; Davis, Richard C; Lusis, Aldons J; Eskin, Eleazar

2014-01-01

Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta-analysis
Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 Mice

PubMed Central

Joo, Jong Wha J.; Shih, Diana; Davis, Richard C.; Lusis, Aldons J.; Eskin, Eleazar

2014-01-01

Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta-analysis
Combining lipophilic dye, in situ hybridization, immunohistochemistry, and histology.

PubMed

Duncan, Jeremy; Kersigo, Jennifer; Gray, Brian; Fritzsch, Bernd

2011-03-17

Going beyond single gene function to cut deeper into gene regulatory networks requires multiple mutations combined in a single animal. Such analysis of two or more genes needs to be complemented with in situ hybridization of other genes, or immunohistochemistry of their proteins, both in whole mounted developing organs or sections for detailed resolution of the cellular and tissue expression alterations. Combining multiple gene alterations requires the use of cre or flipase to conditionally delete genes and avoid embryonic lethality. Required breeding schemes dramatically enhance effort and cost proportional to the number of genes mutated, with an outcome of very few animals with the full repertoire of genetic modifications desired. Amortizing the vast amount of effort and time to obtain these few precious specimens that are carrying multiple mutations necessitates tissue optimization. Moreover, investigating a single animal with multiple techniques makes it easier to correlate gene deletion defects with expression profiles. We have developed a technique to obtain a more thorough analysis of a given animal; with the ability to analyze several different histologically recognizable structures as well as gene and protein expression all from the same specimen in both whole mounted organs and sections. Although mice have been utilized to demonstrate the effectiveness of this technique it can be applied to a wide array of animals. To do this we combine lipophilic dye tracing, whole mount in situ hybridization, immunohistochemistry, and histology to extract the maximal possible amount of data.
Combining Lipophilic dye, in situ Hybridization, Immunohistochemistry, and Histology

PubMed Central

Duncan, Jeremy; Kersigo, Jennifer; Gray, Brian; Fritzsch, Bernd

2011-01-01

Going beyond single gene function to cut deeper into gene regulatory networks requires multiple mutations combined in a single animal. Such analysis of two or more genes needs to be complemented with in situ hybridization of other genes, or immunohistochemistry of their proteins, both in whole mounted developing organs or sections for detailed resolution of the cellular and tissue expression alterations. Combining multiple gene alterations requires the use of cre or flipase to conditionally delete genes and avoid embryonic lethality. Required breeding schemes dramatically enhance effort and cost proportional to the number of genes mutated, with an outcome of very few animals with the full repertoire of genetic modifications desired. Amortizing the vast amount of effort and time to obtain these few precious specimens that are carrying multiple mutations necessitates tissue optimization. Moreover, investigating a single animal with multiple techniques makes it easier to correlate gene deletion defects with expression profiles. We have developed a technique to obtain a more thorough analysis of a given animal; with the ability to analyze several different histologically recognizable structures as well as gene and protein expression all from the same specimen in both whole mounted organs and sections. Although mice have been utilized to demonstrate the effectiveness of this technique it can be applied to a wide array of animals. To do this we combine lipophilic dye tracing, whole mount in situ hybridization, immunohistochemistry, and histology to extract the maximal possible amount of data. PMID:21445047
Genome-wide analysis of the WRKY gene family in cotton.

PubMed

Dou, Lingling; Zhang, Xiaohong; Pang, Chaoyou; Song, Meizhen; Wei, Hengling; Fan, Shuli; Yu, Shuxun

2014-12-01

WRKY proteins are major transcription factors involved in regulating plant growth and development. Although many studies have focused on the functional identification of WRKY genes, our knowledge concerning many areas of WRKY gene biology is limited. For example, in cotton, the phylogenetic characteristics, global expression patterns, molecular mechanisms regulating expression, and target genes/pathways of WRKY genes are poorly characterized. Therefore, in this study, we present a genome-wide analysis of the WRKY gene family in cotton (Gossypium raimondii and Gossypium hirsutum). We identified 116 WRKY genes in G. raimondii from the completed genome sequence, and we cloned 102 WRKY genes in G. hirsutum. Chromosomal location analysis indicated that WRKY genes in G. raimondii evolved mainly from segmental duplication followed by tandem amplifications. Phylogenetic analysis of alga, bryophyte, lycophyta, monocot and eudicot WRKY domains revealed family member expansion with increasing complexity of the plant body. Microarray, expression profiling and qRT-PCR data revealed that WRKY genes in G. hirsutum may regulate the development of fibers, anthers, tissues (roots, stems, leaves and embryos), and are involved in the response to stresses. Expression analysis showed that most group II and III GhWRKY genes are highly expressed under diverse stresses. Group I members, representing the ancestral form, seem to be insensitive to abiotic stress, with low expression divergence. Our results indicate that cotton WRKY genes might have evolved by adaptive duplication, leading to sensitivity to diverse stresses. This study provides fundamental information to inform further analysis and understanding of WRKY gene functions in cotton species.
Peptide micelle-mediated delivery of tissue-specific suicide gene and combined therapy with avastin in a glioblastoma model.

PubMed

Oh, Binna; Han, Jaesik; Choi, Eunji; Tan, Xiaonan; Lee, Minhyung

2015-04-01

Bevacizumab (Avastin) is an angiogenesis inhibitor used as a treatment for various cancers. In this study, the combination therapy of Avastin and glioblastoma-specific thymidine kinase gene [pEpo-NI2-SV-herpes simplex virus thymidine kinase(HSVtk)] was evaluated in a glioblastoma animal model. The R7L10 peptide was used as a gene carrier of pEpo-NI2-SV-HSVtk. Gel retardation assays confirmed that R7L10 formed stable complexes with pEpo-NI2-SV-HSVtk. R7L10 protected DNA from nuclease digestion. R7L10 had lower transfection efficiency than polyethylenimine (PEI; 25 kDa). However, the in vitro and in vivo toxicity assays showed that R7L10 had lower cytotoxicity than PEI, suggesting that R7L10 is safer than PEI. For the combination therapy, Avastin was injected intravenously and the pEpo-NI2-SV-HSVtk/R7L10 complexes were injected intratumorally in the glioblastoma animal model. Tumor growth was most effectively inhibited by the combination therapy of Avastin and the gene. The immunostaining results confirmed that the HSVtk genes were expressed in the groups with the pEpo-NI2-SV-HSVtk/R7L10 complex. The terminal deoxynucleotidyl transferase dUTP nick end labeling assay showed a higher level of apoptotic cells in the combination group than the pEpo-NI2-SV-HSVtk/R7L10 complex or Avastin group. In conclusion, the combination of Avastin and the glioblastoma-specific HSVtk gene has a higher antitumor effect than single therapy of Avastin or HSVtk after intratumoral administration in glioblastoma animal model. © 2015 Wiley Periodicals, Inc. and the American Pharmacists Association.

Phylogenetic analysis at deep timescales: unreliable gene trees, bypassed hidden support, and the coalescence/concatalescence conundrum.

PubMed

Gatesy, John; Springer, Mark S

2014-11-01

Large datasets are required to solve difficult phylogenetic problems that are deep in the Tree of Life. Currently, two divergent systematic methods are commonly applied to such datasets: the traditional supermatrix approach (= concatenation) and "shortcut" coalescence (= coalescence methods wherein gene trees and the species tree are not co-estimated). When applied to ancient clades, these contrasting frameworks often produce congruent results, but in recent phylogenetic analyses of Placentalia (placental mammals), this is not the case. A recent series of papers has alternatively disputed and defended the utility of shortcut coalescence methods at deep phylogenetic scales. Here, we examine this exchange in the context of published phylogenomic data from Mammalia; in particular we explore two critical issues - the delimitation of data partitions ("genes") in coalescence analysis and hidden support that emerges with the combination of such partitions in phylogenetic studies. Hidden support - increased support for a clade in combined analysis of all data partitions relative to the support evident in separate analyses of the various data partitions, is a hallmark of the supermatrix approach and a primary rationale for concatenating all characters into a single matrix. In the most extreme cases of hidden support, relationships that are contradicted by all gene trees are supported when all of the genes are analyzed together. A valid fear is that shortcut coalescence methods might bypass or distort character support that is hidden in individual loci because small gene fragments are analyzed in isolation. Given the extensive systematic database for Mammalia, the assumptions and applicability of shortcut coalescence methods can be assessed with rigor to complement a small but growing body of simulation work that has directly compared these methods to concatenation. We document several remarkable cases of hidden support in both supermatrix and coalescence paradigms and argue
GenePattern | Informatics Technology for Cancer Research (ITCR)

Cancer.gov

GenePattern is a genomic analysis platform that provides access to hundreds of tools for the analysis and visualization of multiple data types. A web-based interface provides easy access to these tools and allows the creation of multi-step analysis pipelines that enable reproducible in silico research. A new GenePattern Notebook environment allows users to combine GenePattern analyses with text, graphics, and code to create complete reproducible research narratives.
Microarray Meta-Analysis Focused on the Response of Genes Involved in Redox Homeostasis to Diverse Abiotic Stresses in Rice

PubMed Central

de Abreu Neto, Joao B.; Frei, Michael

2016-01-01

Plants are exposed to a wide range of abiotic stresses (AS), which often occur in combination. Because physiological investigations typically focus on one stress, our understanding of unspecific stress responses remains limited. The plant redox homeostasis, i.e., the production and removal of reactive oxygen species (ROS), may be involved in many environmental stress conditions. Therefore, this study intended to identify genes, which are activated in diverse AS, focusing on ROS-related pathways. We conducted a meta-analysis (MA) of microarray experiments, focusing on rice. Transcriptome data were mined from public databases and fellow researchers, which represented 36 different experiments and investigated diverse AS, including ozone stress, drought, heat, cold, salinity, and mineral deficiencies/toxicities. To overcome the inherent artifacts of different MA methods, data were processed using Fisher, rOP, REM, and product of rank (GeneSelector), and genes identified by most approaches were considered as shared differentially expressed genes (DEGs). Two MA strategies were adopted: first, datasets were separated into shoot, root, and seedling experiments, and these tissues were analyzed separately to identify shared DEGs. Second, shoot and seedling experiments were classed into oxidative stress (OS), i.e., ozone and hydrogen peroxide treatments directly producing ROS in plant tissue, and other AS, in which ROS production is indirect. In all tissues and stress conditions, genes a priori considered as ROS-related were overrepresented among the DEGs, as they represented 4% of all expressed genes but 7–10% of the DEGs. The combined MA approach was substantially more conservative than individual MA methods and identified 1001 shared DEGs in shoots, 837 shared DEGs in root, and 1172 shared DEGs in seedlings. Within the OS and AS groups, 990 and 1727 shared DEGs were identified, respectively. In total, 311 genes were shared between OS and AS, including many regulatory
Assessing the utility of gene co-expression stability in combination with correlation in the analysis of protein-protein interaction networks

PubMed Central

2011-01-01

Background Gene co-expression, in the form of a correlation coefficient, has been valuable in the analysis, classification and prediction of protein-protein interactions. However, it is susceptible to bias from a few samples having a large effect on the correlation coefficient. Gene co-expression stability is a means of quantifying this bias, with high stability indicating robust, unbiased co-expression correlation coefficients. We assess the utility of gene co-expression stability as an additional measure to support the co-expression correlation in the analysis of protein-protein interaction networks. Results We studied the patterns of co-expression correlation and stability in interacting proteins with respect to their interaction promiscuity, levels of intrinsic disorder, and essentiality or disease-relatedness. Co-expression stability, along with co-expression correlation, acts as a better classifier of hub proteins in interaction networks, than co-expression correlation alone, enabling the identification of a class of hubs that are functionally distinct from the widely accepted transient (date) and obligate (party) hubs. Proteins with high levels of intrinsic disorder have low co-expression correlation and high stability with their interaction partners suggesting their involvement in transient interactions, except for a small group that have high co-expression correlation and are typically subunits of stable complexes. Similar behavior was seen for disease-related and essential genes. Interacting proteins that are both disordered have higher co-expression stability than ordered protein pairs. Using co-expression correlation and stability, we found that transient interactions are more likely to occur between an ordered and a disordered protein while obligate interactions primarily occur between proteins that are either both ordered, or disordered. Conclusions We observe that co-expression stability shows distinct patterns in structurally and functionally
When is hub gene selection better than standard meta-analysis?

PubMed

Langfelder, Peter; Mischel, Paul S; Horvath, Steve

2013-01-01

Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to
[From gene cloning to expressional analysis--practice and experience from educational reform of experimental gene engineering].

PubMed

Wu, Yan-Hua; Guo, Bin; Lou, Hui-Ling; Cui, Yu-Liang; Gu, Hui-Juan; Qiao, Shou-Yi

2012-02-01

Experimental gene engineering is a laboratory course focusing on the molecular structure, expression pattern and biological function of genes. Providing our students with a solid knowledge base and correct ways to conduct research is very important for high-quality education of genetic engineering. Inspired by recent progresses in this field, we improved the experimental gene engineering course by adding more updated knowledge and technologies and emphasizing on the combination of teaching and research, with the aim of offering our students a good start in their scientific careers.
Derivation of an artificial gene to improve classification accuracy upon gene selection.

PubMed

Seo, Minseok; Oh, Sejong

2012-02-01

Classification analysis has been developed continuously since 1936. This research field has advanced as a result of development of classifiers such as KNN, ANN, and SVM, as well as through data preprocessing areas. Feature (gene) selection is required for very high dimensional data such as microarray before classification work. The goal of feature selection is to choose a subset of informative features that reduces processing time and provides higher classification accuracy. In this study, we devised a method of artificial gene making (AGM) for microarray data to improve classification accuracy. Our artificial gene was derived from a whole microarray dataset, and combined with a result of gene selection for classification analysis. We experimentally confirmed a clear improvement of classification accuracy after inserting artificial gene. Our artificial gene worked well for popular feature (gene) selection algorithms and classifiers. The proposed approach can be applied to any type of high dimensional dataset. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
Gene expression analysis of rheumatoid arthritis synovial lining regions by cDNA microarray combined with laser microdissection: up-regulation of inflammation-associated STAT1, IRF1, CXCL9, CXCL10, and CCL5

PubMed Central

Yoshida, S; Arakawa, F; Higuchi, F; Ishibashi, Y; Goto, M; Sugita, Y; Nomura, Y; Niino, D; Shimizu, K; Aoki, R; Hashikawa, K; Kimura, Y; Yasuda, K; Tashiro, K; Kuhara, S; Nagata, K; Ohshima, K

2012-01-01

Objectives The main histological change in rheumatoid arthritis (RA) is the villous proliferation of synovial lining cells, an important source of cytokines and chemokines, which are associated with inflammation. The aim of this study was to evaluate gene expression in the microdissected synovial lining cells of RA patients, using those of osteoarthritis (OA) patients as the control. Methods Samples were obtained during total joint replacement from 11 RA and five OA patients. Total RNA from the synovial lining cells was derived from selected specimens by laser microdissection (LMD) for subsequent cDNA microarray analysis. In addition, the expression of significant genes was confirmed immunohistochemically. Results The 14 519 genes detected by cDNA microarray were used to compare gene expression levels in synovial lining cells from RA with those from OA patients. Cluster analysis indicated that RA cells, including low- and high-expression subgroups, and OA cells were stored in two main clusters. The molecular activity of RA was statistically consistent with its clinical and histological activity. Expression levels of signal transducer and activator of transcription 1 (STAT1), interferon regulatory factor 1 (IRF1), and the chemokines CXCL9, CXCL10, and CCL5 were statistically significantly higher in the synovium of RA than in that of OA. Immunohistochemically, the lining synovium of RA, but not that of OA, clearly expressed STAT1, IRF1, and chemokines, as was seen in microarray analysis combined with LMD. Conclusions Our findings indicate an important role for lining synovial cells in the inflammatory and proliferative processes of RA. Further understanding of the local signalling in structural components is important in rheumatology. PMID:22401175
Combinations of mutant FAD2 and FAD3 genes to produce high oleic acid and low linolenic acid soybean oil.

PubMed

Pham, Anh-Tung; Shannon, J Grover; Bilyeu, Kristin D

2012-08-01

High oleic acid soybeans were produced by combining mutant FAD2-1A and FAD2-1B genes. Despite having a high oleic acid content, the linolenic acid content of these soybeans was in the range of 4-6 %, which may be high enough to cause oxidative instability of the oil. Therefore, a study was conducted to incorporate one or two mutant FAD3 genes into the high oleic acid background to further reduce the linolenic acid content. As a result, soybean lines with high oleic acid and low linolenic acid (HOLL) content were produced using different sources of mutant FAD2-1A genes. While oleic acid content of these HOLL lines was stable across two testing environments, the reduction of linolenic acid content varied depending on the number of mutant FAD3 genes combined with mutant FAD2-1 genes, on the severity of mutation in the FAD2-1A gene, and on the testing environment. Combination of two mutant FAD2-1 genes and one mutant FAD3 gene resulted in less than 2 % linolenic acid content in Portageville, Missouri (MO) while four mutant genes were needed to achieve the same linolenic acid in Columbia, MO. This study generated non-transgenic soybeans with the highest oleic acid content and lowest linolenic acid content reported to date, offering a unique alternative to produce a fatty acid profile similar to olive oil.
Gene Ontology-Based Analysis of Zebrafish Omics Data Using the Web Tool Comparative Gene Ontology.

PubMed

Ebrahimie, Esmaeil; Fruzangohar, Mario; Moussavi Nik, Seyyed Hani; Newman, Morgan

2017-10-01

Gene Ontology (GO) analysis is a powerful tool in systems biology, which uses a defined nomenclature to annotate genes/proteins within three categories: "Molecular Function," "Biological Process," and "Cellular Component." GO analysis can assist in revealing functional mechanisms underlying observed patterns in transcriptomic, genomic, and proteomic data. The already extensive and increasing use of zebrafish for modeling genetic and other diseases highlights the need to develop a GO analytical tool for this organism. The web tool Comparative GO was originally developed for GO analysis of bacterial data in 2013 ( www.comparativego.com ). We have now upgraded and elaborated this web tool for analysis of zebrafish genetic data using GOs and annotations from the Gene Ontology Consortium.
Pathway-based factor analysis of gene expression data produces highly heritable phenotypes that associate with age.

PubMed

Anand Brown, Andrew; Ding, Zhihao; Viñuela, Ana; Glass, Dan; Parts, Leopold; Spector, Tim; Winn, John; Durbin, Richard

2015-03-09

Statistical factor analysis methods have previously been used to remove noise components from high-dimensional data prior to genetic association mapping and, in a guided fashion, to summarize biologically relevant sources of variation. Here, we show how the derived factors summarizing pathway expression can be used to analyze the relationships between expression, heritability, and aging. We used skin gene expression data from 647 twins from the MuTHER Consortium and applied factor analysis to concisely summarize patterns of gene expression to remove broad confounding influences and to produce concise pathway-level phenotypes. We derived 930 "pathway phenotypes" that summarized patterns of variation across 186 KEGG pathways (five phenotypes per pathway). We identified 69 significant associations of age with phenotype from 57 distinct KEGG pathways at a stringent Bonferroni threshold ([Formula: see text]). These phenotypes are more heritable ([Formula: see text]) than gene expression levels. On average, expression levels of 16% of genes within these pathways are associated with age. Several significant pathways relate to metabolizing sugars and fatty acids; others relate to insulin signaling. We have demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases our power to discover biologically relevant associations. These phenotypes could also be applied to discover associations with other environmental factors. Copyright © 2015 Brown et al.
Pathway-Based Factor Analysis of Gene Expression Data Produces Highly Heritable Phenotypes That Associate with Age

PubMed Central

Anand Brown, Andrew; Ding, Zhihao; Viñuela, Ana; Glass, Dan; Parts, Leopold; Spector, Tim; Winn, John; Durbin, Richard

2015-01-01

Statistical factor analysis methods have previously been used to remove noise components from high-dimensional data prior to genetic association mapping and, in a guided fashion, to summarize biologically relevant sources of variation. Here, we show how the derived factors summarizing pathway expression can be used to analyze the relationships between expression, heritability, and aging. We used skin gene expression data from 647 twins from the MuTHER Consortium and applied factor analysis to concisely summarize patterns of gene expression to remove broad confounding influences and to produce concise pathway-level phenotypes. We derived 930 “pathway phenotypes” that summarized patterns of variation across 186 KEGG pathways (five phenotypes per pathway). We identified 69 significant associations of age with phenotype from 57 distinct KEGG pathways at a stringent Bonferroni threshold (P<5.38×10−5). These phenotypes are more heritable (h2=0.32) than gene expression levels. On average, expression levels of 16% of genes within these pathways are associated with age. Several significant pathways relate to metabolizing sugars and fatty acids; others relate to insulin signaling. We have demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases our power to discover biologically relevant associations. These phenotypes could also be applied to discover associations with other environmental factors. PMID:25758824
Meta-Type Analysis of Dopaminergic Effects on Gene Expression in the Neuroendocrine Brain of Female Goldfish

PubMed Central

Popesku, Jason T.; Martyniuk, Christopher J.; Trudeau, Vance L.

2012-01-01

Dopamine (DA) is a major neurotransmitter important for neuroendocrine control and recent studies have described genomic signaling pathways activated and inhibited by DA agonists and antagonists in the goldfish brain. Here we perform a meta-type analysis using microarray datasets from experiments conducted with female goldfish to characterize the gene expression responses that underlie dopaminergic signaling. Sexually mature, pre-spawning [gonadosomatic index (GSI) = 4.5 ± 1.3%] or sexually regressing (GSI = 3 ± 0.4%) female goldfish (15–40 g) injected intraperitoneally with either SKF 38393, LY 171555, SCH 23390, sulpiride, or a combination of 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine and α-methyl-p-tyrosine. Microarray meta-type analysis identified 268 genes in the telencephalon and hypothalamus as having reciprocal (i.e., opposite between agonism and antagonism/depletion) fold change responses, suggesting that these transcripts are likely targets for DA-mediated regulation. Noteworthy genes included ependymin, vimentin, and aromatase, genes that support the significance of DA in neuronal plasticity and tissue remodeling. Sub-network enrichment analysis (SNEA) was used to identify common gene regulators and binding proteins associated with the differentially expressed genes mediated by DA. SNEA analysis identified gene expression targets that were related to three major categories that included cell signaling (STAT3, SP1, SMAD, Jun/Fos), immune response (IL-6, IL-1β, TNFs, cytokine, NF-κB), and cell proliferation and growth (IGF1, TGFβ1). These gene networks are also known to be associated with neurodegenerative disorders such as Parkinsons’ disease, well-known to be associated with loss of dopaminergic neurons. This study identifies genes and networks that underlie DA signaling in the vertebrate CNS and provides targets that may be key neuroendocrine regulators. The results provide a foundation for future work on dopaminergic
Identification of pathogenic genes related to rheumatoid arthritis through integrated analysis of DNA methylation and gene expression profiling.

PubMed

Zhang, Lei; Ma, Shiyun; Wang, Huailiang; Su, Hang; Su, Ke; Li, Longjie

2017-11-15

The purpose of our study was to identify new pathogenic genes used for exploring the pathogenesis of rheumatoid arthritis (RA). To screen pathogenic genes of RA, an integrated analysis was performed by using the microarray datasets in RA derived from the Gene Expression Omnibus (GEO) database. The functional annotation and potential pathways of differentially expressed genes (DEGs) were further discovered by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. Afterwards, the integrated analysis of DNA methylation and gene expression profiling was used to screen crucial genes. In addition, we used RT-PCR and MSP to verify the expression levels and methylation status of these crucial genes in 20 synovial biopsy samples obtained from 10 RA model mice and 10 normal mice. BCL11B, CCDC88C, FCRLA and APOL6 were both up-regulated and hypomethylated in RA according to integrated analysis, RT-PCR and MSP verification. Four crucial genes (BCL11B, CCDC88C, FCRLA and APOL6) identified and analyzed in this study might be closely connected with the pathogenesis of RA. Copyright © 2017. Published by Elsevier B.V.
Genome-Wide Comparative Analysis of the Phospholipase D Gene Families among Allotetraploid Cotton and Its Diploid Progenitors

PubMed Central

Tang, Kai; Dong, Chun-Juan; Liu, Jin-Yuan

2016-01-01

In this study, 40 phospholipase D (PLD) genes were identified from allotetraploid cotton Gossypium hirsutum, and 20 PLD genes were examined in diploid cotton Gossypium raimondii. Combining with 19 previously identified Gossypium arboreum PLD genes, a comparative analysis was performed among the PLD gene families among allotetraploid and two diploid cottons. Based on the orthologous relationships, we found that almost each G. hirsutum PLD had a corresponding homolog in the G. arboreum and G. raimondii genomes, except for GhPLDβ3A, whose homolog GaPLDβ3 may have been lost during the evolution of G. arboreum after the interspecific hybridization. Phylogenetic analysis showed that all of the cotton PLDs were unevenly classified into six numbered subgroups: α, β/γ, δ, ε, ζ and φ. An N-terminal C2 domain was found in the α, β/γ, δ and ε subgroups, while phox homology (PX) and pleckstrin homology (PH) domains were identified in the ζ subgroup. The subgroup φ possessed a single peptide instead of a functional domain. In each phylogenetic subgroup, the PLDs showed high conservation in gene structure and amino acid sequences in functional domains. The expansion of GhPLD and GrPLD gene families were mainly attributed to segmental duplication and partly attributed to tandem duplication. Furthermore, purifying selection played a critical role in the evolution of PLD genes in cotton. Quantitative RT-PCR documented that allotetraploid cotton PLD genes were broadly expressed and each had a unique spatial and developmental expression pattern, indicating their functional diversification in cotton growth and development. Further analysis of cis-regulatory elements elucidated transcriptional regulations and potential functions. Our comparative analysis provided valuable information for understanding the putative functions of the PLD genes in cotton fiber. PMID:27213891
A combination test for detection of gene-environment interaction in cohort studies.

PubMed

Coombes, Brandon; Basu, Saonli; McGue, Matt

2017-07-01

Identifying gene-environment (G-E) interactions can contribute to a better understanding of disease etiology, which may help researchers develop disease prevention strategies and interventions. One big criticism of studying G-E interaction is the lack of power due to sample size. Studies often restrict the interaction search to the top few hundred hits from a genome-wide association study or focus on potential candidate genes. In this paper, we test interactions between a candidate gene and an environmental factor to improve power by analyzing multiple variants within a gene. We extend recently developed score statistic based genetic association testing approaches to the G-E interaction testing problem. We also propose tests for interaction using gene-based summary measures that pool variants together. Although it has recently been shown that these summary measures can be biased and may lead to inflated type I error, we show that under several realistic scenarios, we can still provide valid tests of interaction. These tests use significantly less degrees of freedom and thus can have much higher power to detect interaction. Additionally, we demonstrate that the iSeq-aSum-min test, which combines a gene-based summary measure test, iSeq-aSum-G, and an interaction-based summary measure test, iSeq-aSum-I, provides a powerful alternative to test G-E interaction. We demonstrate the performance of these approaches using simulation studies and illustrate their performance to study interaction between the SNPs in several candidate genes and family climate environment on alcohol consumption using the Minnesota Center for Twin and Family Research dataset. © 2017 WILEY PERIODICALS, INC.
Comprehensive Analysis of the Soybean (Glycine max) GmLAX Auxin Transporter Gene Family

PubMed Central

Chai, Chenglin; Wang, Yongqin; Valliyodan, Babu; Nguyen, Henry T.

2016-01-01

The phytohormone auxin plays a critical role in regulation of plant growth and development as well as plant responses to abiotic stresses. This is mainly achieved through its uneven distribution in plant via a polar auxin transport process. Auxin transporters are major players in polar auxin transport. The AUXIN RESISTENT 1/LIKE AUX1 (AUX/LAX) auxin influx carriers belong to the amino acid permease family of proton-driven transporters and function in the uptake of indole-3-acetic acid (IAA). In this study, genome-wide comprehensive analysis of the soybean AUX/LAX (GmLAX) gene family, including phylogenic relationships, chromosome localization, and gene structure, was carried out. A total of 15 GmLAX genes, including seven duplicated gene pairs, were identified in the soybean genome. They were distributed on 10 chromosomes. Despite their higher percentage identities at the protein level, GmLAXs exhibited versatile tissue-specific expression patterns, indicating coordinated functioning during plant growth and development. Most GmLAXs were responsive to drought and dehydration stresses and auxin and abscisic acid (ABA) stimuli, in a tissue- and/or time point- sensitive mode. Several GmLAX members were involved in responding to salt stress. Sequence analysis revealed that promoters of GmLAXs contained different combinations of stress-related cis-regulatory elements. These studies suggest that the soybean GmLAXs were under control of a very complex regulatory network, responding to various internal and external signals. This study helps to identity candidate GmLAXs for further analysis of their roles in soybean development and adaption to adverse environments. PMID:27014306
Analysis of MHC class I genes across horse MHC haplotypes

PubMed Central

Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.

2010-01-01

The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063
Warehousing re-annotated cancer genes for biomarker meta-analysis.

PubMed

Orsini, M; Travaglione, A; Capobianco, E

2013-07-01

Translational research in cancer genomics assigns a fundamental role to bioinformatics in support of candidate gene prioritization with regard to both biomarker discovery and target identification for drug development. Efforts in both such directions rely on the existence and constant update of large repositories of gene expression data and omics records obtained from a variety of experiments. Users who interactively interrogate such repositories may have problems in retrieving sample fields that present limited associated information, due for instance to incomplete entries or sometimes unusable files. Cancer-specific data sources present similar problems. Given that source integration usually improves data quality, one of the objectives is keeping the computational complexity sufficiently low to allow an optimal assimilation and mining of all the information. In particular, the scope of integrating intraomics data can be to improve the exploration of gene co-expression landscapes, while the scope of integrating interomics sources can be that of establishing genotype-phenotype associations. Both integrations are relevant to cancer biomarker meta-analysis, as the proposed study demonstrates. Our approach is based on re-annotating cancer-specific data available at the EBI's ArrayExpress repository and building a data warehouse aimed to biomarker discovery and validation studies. Cancer genes are organized by tissue with biomedical and clinical evidences combined to increase reproducibility and consistency of results. For better comparative evaluation, multiple queries have been designed to efficiently address all types of experiments and platforms, and allow for retrieval of sample-related information, such as cell line, disease state and clinical aspects. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Multiscale Embedded Gene Co-expression Network Analysis

PubMed Central

Song, Won-Min; Zhang, Bin

2015-01-01

Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma. PMID:26618778

Multiscale Embedded Gene Co-expression Network Analysis.

PubMed

Song, Won-Min; Zhang, Bin

2015-11-01

Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.
Genome-Wide Detection and Analysis of Multifunctional Genes

PubMed Central

Pritykin, Yuri; Ghersi, Dario; Singh, Mona

2015-01-01

Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655
Phylogenic analysis of the M genes of influenza viruses isolated from free-flying water birds from their Northern Territory to Hokkaido, Japan.

PubMed

Manzoor, Rashid; Sakoda, Yoshihiro; Mweene, Aaron; Tsuda, Yoshimi; Kishida, Noriko; Bai, Gui-Rong; Kameyama, Ken-Ichiro; Isoda, Norikazu; Soda, Kosuke; Naito, Michiko; Kida, Hiroshi

2008-10-01

During 2000-2007, 218 influenza viruses of 28 different combinations of HA (H1-H13) and NA (N1-N9) subtypes were isolated from fecal samples of free-flying water birds at two distant lakes in Hokkaido, Japan. Phylogenic analysis of the matrix (M) genes of 67 strains, selected on the basis of their subtype combinations, revealed that A/duck/Hokkaido/W95/2006 (H10N8) was a reassortant whose M and NA genes [corrected] belonged to North American non-gull-avian and the other six [corrected] genes to Eurasian non-gull-avian lineages. The M genes of other 65 strains belonged to Eurasian non-gull-avian and the one to Eurasian-gull lineages. The M genes of 65 strains were grouped into three different sublineages, indicating that influenza viruses circulating in different populations of free-flying water birds have evolved independently in nature.
Gene therapy for adenosine deaminase-deficient severe combined immune deficiency: clinical comparison of retroviral vectors and treatment plans.

PubMed

Candotti, Fabio; Shaw, Kit L; Muul, Linda; Carbonaro, Denise; Sokolic, Robert; Choi, Christopher; Schurman, Shepherd H; Garabedian, Elizabeth; Kesserwan, Chimene; Jagadeesh, G Jayashree; Fu, Pei-Yu; Gschweng, Eric; Cooper, Aaron; Tisdale, John F; Weinberg, Kenneth I; Crooks, Gay M; Kapoor, Neena; Shah, Ami; Abdel-Azim, Hisham; Yu, Xiao-Jin; Smogorzewska, Monika; Wayne, Alan S; Rosenblatt, Howard M; Davis, Carla M; Hanson, Celine; Rishi, Radha G; Wang, Xiaoyan; Gjertson, David; Yang, Otto O; Balamurugan, Arumugam; Bauer, Gerhard; Ireland, Joanna A; Engel, Barbara C; Podsakoff, Gregory M; Hershfield, Michael S; Blaese, R Michael; Parkman, Robertson; Kohn, Donald B

2012-11-01

We conducted a gene therapy trial in 10 patients with adenosine deaminase (ADA)-deficient severe combined immunodeficiency using 2 slightly different retroviral vectors for the transduction of patients' bone marrow CD34(+) cells. Four subjects were treated without pretransplantation cytoreduction and remained on ADA enzyme-replacement therapy (ERT) throughout the procedure. Only transient (months), low-level (< 0.01%) gene marking was observed in PBMCs of 2 older subjects (15 and 20 years of age), whereas some gene marking of PBMC has persisted for the past 9 years in 2 younger subjects (4 and 6 years). Six additional subjects were treated using the same gene transfer protocol, but after withdrawal of ERT and administration of low-dose busulfan (65-90 mg/m(2)). Three of these remain well, off ERT (5, 4, and 3 years postprocedure), with gene marking in PBMC of 1%-10%, and ADA enzyme expression in PBMC near or in the normal range. Two subjects were restarted on ERT because of poor gene marking and immune recovery, and one had a subsequent allogeneic hematopoietic stem cell transplantation. These studies directly demonstrate the importance of providing nonmyeloablative pretransplantation conditioning to achieve therapeutic benefits with gene therapy for ADA-deficient severe combined immunodeficiency.
Gene therapy for adenosine deaminase–deficient severe combined immune deficiency: clinical comparison of retroviral vectors and treatment plans

PubMed Central

Candotti, Fabio; Shaw, Kit L.; Muul, Linda; Carbonaro, Denise; Sokolic, Robert; Choi, Christopher; Schurman, Shepherd H.; Garabedian, Elizabeth; Kesserwan, Chimene; Jagadeesh, G. Jayashree; Fu, Pei-Yu; Gschweng, Eric; Cooper, Aaron; Tisdale, John F.; Weinberg, Kenneth I.; Crooks, Gay M.; Kapoor, Neena; Shah, Ami; Abdel-Azim, Hisham; Yu, Xiao-Jin; Smogorzewska, Monika; Wayne, Alan S.; Rosenblatt, Howard M.; Davis, Carla M.; Hanson, Celine; Rishi, Radha G.; Wang, Xiaoyan; Gjertson, David; Yang, Otto O.; Balamurugan, Arumugam; Bauer, Gerhard; Ireland, Joanna A.; Engel, Barbara C.; Podsakoff, Gregory M.; Hershfield, Michael S.; Blaese, R. Michael; Parkman, Robertson

2012-01-01

We conducted a gene therapy trial in 10 patients with adenosine deaminase (ADA)–deficient severe combined immunodeficiency using 2 slightly different retroviral vectors for the transduction of patients' bone marrow CD34+ cells. Four subjects were treated without pretransplantation cytoreduction and remained on ADA enzyme-replacement therapy (ERT) throughout the procedure. Only transient (months), low-level (< 0.01%) gene marking was observed in PBMCs of 2 older subjects (15 and 20 years of age), whereas some gene marking of PBMC has persisted for the past 9 years in 2 younger subjects (4 and 6 years). Six additional subjects were treated using the same gene transfer protocol, but after withdrawal of ERT and administration of low-dose busulfan (65-90 mg/m2). Three of these remain well, off ERT (5, 4, and 3 years postprocedure), with gene marking in PBMC of 1%-10%, and ADA enzyme expression in PBMC near or in the normal range. Two subjects were restarted on ERT because of poor gene marking and immune recovery, and one had a subsequent allogeneic hematopoietic stem cell transplantation. These studies directly demonstrate the importance of providing nonmyeloablative pretransplantation conditioning to achieve therapeutic benefits with gene therapy for ADA-deficient severe combined immunodeficiency. PMID:22968453
Evaluation and selection of reliable reference genes for gene expression under abiotic stress in cotton (Gossypium hirsutum L.).

PubMed

Wang, Min; Wang, Qinglian; Zhang, Baohong

2013-11-01

Reference genes are critical for normalization of the gene expression level of target genes. The widely used housekeeping genes may change their expression levels at different tissue under different treatment or stress conditions. Therefore, systematical evaluation on the housekeeping genes is required for gene expression analysis. Up to date, no work was performed to evaluate the housekeeping genes in cotton under stress treatment. In this study, we chose 10 housekeeping genes to systematically assess their expression levels at two different tissues (leaves and roots) under two different abiotic stresses (salt and drought) with three different concentrations. Our results show that there is no best reference gene for all tissues at all stress conditions. The reliable reference gene should be selected based on a specific condition. For example, under salt stress, UBQ7, GAPDH and EF1A8 are better reference genes in leaves; TUA10, UBQ7, CYP1, GAPDH and EF1A8 were better in roots. Under drought stress, UBQ7, EF1A8, TUA10, and GAPDH showed less variety of expression level in leaves and roots. Thus, it is better to identify reliable reference genes first before performing any gene expression analysis. However, using a combination of housekeeping genes as reference gene may provide a new strategy for normalization of gene expression. In this study, we found that combination of four housekeeping genes worked well as reference genes under all the stress conditions. © 2013.
GeneBuilder: interactive in silico prediction of gene structure.

PubMed

Milanesi, L; D'Angelo, D; Rogozin, I B

1999-01-01

Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene a the URL: http://www.itba.mi. cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
Selection of reference genes for transcriptional analysis of edible tubers of potato (Solanum tuberosum L.).

PubMed

Mariot, Roberta Fogliatto; de Oliveira, Luisa Abruzzi; Voorhuijzen, Marleen M; Staats, Martijn; Hutten, Ronald C B; Van Dijk, Jeroen P; Kok, Esther; Frazzon, Jeverson

2015-01-01

Potato (Solanum tuberosum) yield has increased dramatically over the last 50 years and this has been achieved by a combination of improved agronomy and biotechnology efforts. Gene studies are taking place to improve new qualities and develop new cultivars. Reverse transcriptase quantitative polymerase chain reaction (RT-qPCR) is a bench-marking analytical tool for gene expression analysis, but its accuracy is highly dependent on a reliable normalization strategy of an invariant reference genes. For this reason, the goal of this work was to select and validate reference genes for transcriptional analysis of edible tubers of potato. To do so, RT-qPCR primers were designed for ten genes with relatively stable expression in potato tubers as observed in RNA-Seq experiments. Primers were designed across exon boundaries to avoid genomic DNA contamination. Differences were observed in the ranking of candidate genes identified by geNorm, NormFinder and BestKeeper algorithms. The ranks determined by geNorm and NormFinder were very similar and for all samples the most stable candidates were C2, exocyst complex component sec3 (SEC3) and ATCUL3/ATCUL3A/CUL3/CUL3A (CUL3A). According to BestKeeper, the importin alpha and ubiquitin-associated/ts-n genes were the most stable. Three genes were selected as reference genes for potato edible tubers in RT-qPCR studies. The first one, called C2, was selected in common by NormFinder and geNorm, the second one is SEC3, selected by NormFinder, and the third one is CUL3A, selected by geNorm. Appropriate reference genes identified in this work will help to improve the accuracy of gene expression quantification analyses by taking into account differences that may be observed in RNA quality or reverse transcription efficiency across the samples.
Breast cancer prognosis by combinatorial analysis of gene expression data.

PubMed

Alexe, Gabriela; Alexe, Sorin; Axelrod, David E; Bonates, Tibérius O; Lozina, Irina I; Reiss, Michael; Hammer, Peter L

2006-01-01

The potential of applying data analysis tools to microarray data for diagnosis and prognosis is illustrated on the recent breast cancer dataset of van 't Veer and coworkers. We re-examine that dataset using the novel technique of logical analysis of data (LAD), with the double objective of discovering patterns characteristic for cases with good or poor outcome, using them for accurate and justifiable predictions; and deriving novel information about the role of genes, the existence of special classes of cases, and other factors. Data were analyzed using the combinatorics and optimization-based method of LAD, recently shown to provide highly accurate diagnostic and prognostic systems in cardiology, cancer proteomics, hematology, pulmonology, and other disciplines. LAD identified a subset of 17 of the 25,000 genes, capable of fully distinguishing between patients with poor, respectively good prognoses. An extensive list of 'patterns' or 'combinatorial biomarkers' (that is, combinations of genes and limitations on their expression levels) was generated, and 40 patterns were used to create a prognostic system, shown to have 100% and 92.9% weighted accuracy on the training and test sets, respectively. The prognostic system uses fewer genes than other methods, and has similar or better accuracy than those reported in other studies. Out of the 17 genes identified by LAD, three (respectively, five) were shown to play a significant role in determining poor (respectively, good) prognosis. Two new classes of patients (described by similar sets of covering patterns, gene expression ranges, and clinical features) were discovered. As a by-product of the study, it is shown that the training and the test sets of van 't Veer have differing characteristics. The study shows that LAD provides an accurate and fully explanatory prognostic system for breast cancer using genomic data (that is, a system that, in addition to predicting good or poor prognosis, provides an individualized
GeneMesh: a web-based microarray analysis tool for relating differentially expressed genes to MeSH terms.

PubMed

Jani, Saurin D; Argraves, Gary L; Barth, Jeremy L; Argraves, W Scott

2010-04-01

An important objective of DNA microarray-based gene expression experimentation is determining inter-relationships that exist between differentially expressed genes and biological processes, molecular functions, cellular components, signaling pathways, physiologic processes and diseases. Here we describe GeneMesh, a web-based program that facilitates analysis of DNA microarray gene expression data. GeneMesh relates genes in a query set to categories available in the Medical Subject Headings (MeSH) hierarchical index. The interface enables hypothesis driven relational analysis to a specific MeSH subcategory (e.g., Cardiovascular System, Genetic Processes, Immune System Diseases etc.) or unbiased relational analysis to broader MeSH categories (e.g., Anatomy, Biological Sciences, Disease etc.). Genes found associated with a given MeSH category are dynamically linked to facilitate tabular and graphical depiction of Entrez Gene information, Gene Ontology information, KEGG metabolic pathway diagrams and intermolecular interaction information. Expression intensity values of groups of genes that cluster in relation to a given MeSH category, gene ontology or pathway can be displayed as heat maps of Z score-normalized values. GeneMesh operates on gene expression data derived from a number of commercial microarray platforms including Affymetrix, Agilent and Illumina. GeneMesh is a versatile web-based tool for testing and developing new hypotheses through relating genes in a query set (e.g., differentially expressed genes from a DNA microarray experiment) to descriptors making up the hierarchical structure of the National Library of Medicine controlled vocabulary thesaurus, MeSH. The system further enhances the discovery process by providing links between sets of genes associated with a given MeSH category to a rich set of html linked tabular and graphic information including Entrez Gene summaries, gene ontologies, intermolecular interactions, overlays of genes onto KEGG
Antibiotic Combinations That Enable One-Step, Targeted Mutagenesis of Chromosomal Genes.

PubMed

Lee, Wonsik; Do, Truc; Zhang, Ge; Kahne, Daniel; Meredith, Timothy C; Walker, Suzanne

2018-06-08

Targeted modification of bacterial chromosomes is necessary to understand new drug targets, investigate virulence factors, elucidate cell physiology, and validate results of -omics-based approaches. For some bacteria, reverse genetics remains a major bottleneck to progress in research. Here, we describe a compound-centric strategy that combines new negative selection markers with known positive selection markers to achieve simple, efficient one-step genome engineering of bacterial chromosomes. The method was inspired by the observation that certain nonessential metabolic pathways contain essential late steps, suggesting that antibiotics targeting a late step can be used to select for the absence of genes that control flux into the pathway. Guided by this hypothesis, we have identified antibiotic/counterselectable markers to accelerate reverse engineering of two increasingly antibiotic-resistant pathogens, Staphylococcus aureus and Acinetobacter baumannii. For S. aureus, we used wall teichoic acid biosynthesis inhibitors to select for the absence of tarO and for A. baumannii, we used colistin to select for the absence of lpxC. We have obtained desired gene deletions, gene fusions, and promoter swaps in a single plating step with perfect efficiency. Our method can also be adapted to generate markerless deletions of genes using FLP recombinase. The tools described here will accelerate research on two important pathogens, and the concept we outline can be readily adapted to any organism for which a suitable target pathway can be identified.
Functional Analysis of the Arabidopsis TETRASPANIN Gene Family in Plant Growth and Development.

PubMed

Wang, Feng; Muto, Antonella; Van de Velde, Jan; Neyt, Pia; Himanen, Kristiina; Vandepoele, Klaas; Van Lijsebettens, Mieke

2015-11-01

TETRASPANIN (TET) genes encode conserved integral membrane proteins that are known in animals to function in cellular communication during gamete fusion, immunity reaction, and pathogen recognition. In plants, functional information is limited to one of the 17 members of the Arabidopsis (Arabidopsis thaliana) TET gene family and to expression data in reproductive stages. Here, the promoter activity of all 17 Arabidopsis TET genes was investigated by pAtTET::NUCLEAR LOCALIZATION SIGNAL-GREEN FLUORESCENT PROTEIN/β-GLUCURONIDASE reporter lines throughout the life cycle, which predicted functional divergence in the paralogous genes per clade. However, partial overlap was observed for many TET genes across the clades, correlating with few phenotypes in single mutants and, therefore, requiring double mutant combinations for functional investigation. Mutational analysis showed a role for TET13 in primary root growth and lateral root development and redundant roles for TET5 and TET6 in leaf and root growth through negative regulation of cell proliferation. Strikingly, a number of TET genes were expressed in embryonic and seedling progenitor cells and remained expressed until the differentiation state in the mature plant, suggesting a dynamic function over developmental stages. The cis-regulatory elements together with transcription factor-binding data provided molecular insight into the sites, conditions, and perturbations that affect TET gene expression and positioned the TET genes in different molecular pathways; the data represent a hypothesis-generating resource for further functional analyses. © 2015 American Society of Plant Biologists. All Rights Reserved.
Multi-membership gene regulation in pathway based microarray analysis

PubMed Central

2011-01-01

Background Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. Results We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. Conclusions We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes. PMID:21939531
Multi-membership gene regulation in pathway based microarray analysis.

PubMed

Pavlidis, Stelios P; Payne, Annette M; Swift, Stephen M

2011-09-22

Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes.
A gene network bioinformatics analysis for pemphigoid autoimmune blistering diseases.

PubMed

Barone, Antonio; Toti, Paolo; Giuca, Maria Rita; Derchi, Giacomo; Covani, Ugo

2015-07-01

In this theoretical study, a text mining search and clustering analysis of data related to genes potentially involved in human pemphigoid autoimmune blistering diseases (PAIBD) was performed using web tools to create a gene/protein interaction network. The Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database was employed to identify a final set of PAIBD-involved genes and to calculate the overall significant interactions among genes: for each gene, the weighted number of links, or WNL, was registered and a clustering procedure was performed using the WNL analysis. Genes were ranked in class (leader, B, C, D and so on, up to orphans). An ontological analysis was performed for the set of 'leader' genes. Using the above-mentioned data network, 115 genes represented the final set; leader genes numbered 7 (intercellular adhesion molecule 1 (ICAM-1), interferon gamma (IFNG), interleukin (IL)-2, IL-4, IL-6, IL-8 and tumour necrosis factor (TNF)), class B genes were 13, whereas the orphans were 24. The ontological analysis attested that the molecular action was focused on extracellular space and cell surface, whereas the activation and regulation of the immunity system was widely involved. Despite the limited knowledge of the present pathologic phenomenon, attested by the presence of 24 genes revealing no protein-protein direct or indirect interactions, the network showed significant pathways gathered in several subgroups: cellular components, molecular functions, biological processes and the pathologic phenomenon obtained from the Kyoto Encyclopaedia of Genes and Genomes (KEGG) database. The molecular basis for PAIBD was summarised and expanded, which will perhaps give researchers promising directions for the identification of new therapeutic targets.
Candidate Loci for Yield-Related Traits in Maize Revealed by a Combination of MetaQTL Analysis and Regional Association Mapping

PubMed Central

Chen, Lin; An, Yixin; Li, Yong-xiang; Li, Chunhui; Shi, Yunsu; Song, Yanchun; Zhang, Dengfeng; Wang, Tianyu; Li, Yu

2017-01-01

Maize grain yield and related traits are complex and are controlled by a large number of genes of small effect or quantitative trait loci (QTL). Over the years, a large number of yield-related QTLs have been identified in maize and deposited in public databases. However, integrating and re-analyzing these data and mining candidate loci for yield-related traits has become a major issue in maize. In this study, we collected information on QTLs conferring maize yield-related traits from 33 published studies. Then, 999 of these QTLs were iteratively projected and subjected to meta-analysis to obtain metaQTLs (MQTLs). A total of 76 MQTLs were found across the maize genome. Based on a comparative genomics strategy, several maize orthologs of rice yield-related genes were identified in these MQTL regions. Furthermore, three potential candidate genes (Gene ID: GRMZM2G359974, GRMZM2G301884, and GRMZM2G083894) associated with kernel size and weight within three MQTL regions were identified using regional association mapping, based on the results of the meta-analysis. This strategy, combining MQTL analysis and regional association mapping, is helpful for functional marker development and rapid identification of candidate genes or loci. PMID:29312420
Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies.

PubMed

Martini, Paolo; Risso, Davide; Sales, Gabriele; Romualdi, Chiara; Lanfranchi, Gerolamo; Cagnin, Stefano

2011-04-11

In the last decades, microarray technology has spread, leading to a dramatic increase of publicly available datasets. The first statistical tools developed were focused on the identification of significant differentially expressed genes. Later, researchers moved toward the systematic integration of gene expression profiles with additional biological information, such as chromosomal location, ontological annotations or sequence features. The analysis of gene expression linked to physical location of genes on chromosomes allows the identification of transcriptionally imbalanced regions, while, Gene Set Analysis focuses on the detection of coordinated changes in transcriptional levels among sets of biologically related genes. In this field, meta-analysis offers the possibility to compare different studies, addressing the same biological question to fully exploit public gene expression datasets. We describe STEPath, a method that starts from gene expression profiles and integrates the analysis of imbalanced region as an a priori step before performing gene set analysis. The application of STEPath in individual studies produced gene set scores weighted by chromosomal activation. As a final step, we propose a way to compare these scores across different studies (meta-analysis) on related biological issues. One complication with meta-analysis is batch effects, which occur because molecular measurements are affected by laboratory conditions, reagent lots and personnel differences. Major problems occur when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. We evaluated the power of combining chromosome mapping and gene set enrichment analysis, performing the analysis on a dataset of leukaemia (example of individual study) and on a dataset of skeletal muscle diseases (meta-analysis approach). In leukaemia, we identified the Hox gene set, a gene set closely related to the pathology that other algorithms of gene set analysis do not
Genome-Wide Identification and Evaluation of Reference Genes for Quantitative RT-PCR Analysis during Tomato Fruit Development.

PubMed

Cheng, Yuan; Bian, Wuying; Pang, Xin; Yu, Jiahong; Ahammed, Golam J; Zhou, Guozhi; Wang, Rongqing; Ruan, Meiying; Li, Zhimiao; Ye, Qingjing; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

2017-01-01

Gene expression analysis in tomato fruit has drawn increasing attention nowadays. Quantitative real-time PCR (qPCR) is a routine technique for gene expression analysis. In qPCR operation, reliability of results largely depends on the choice of appropriate reference genes (RGs). Although tomato is a model for fruit biology study, few RGs for qPCR analysis in tomato fruit had yet been developed. In this study, we initially identified 38 most stably expressed genes based on tomato transcriptome data set, and their expression stabilities were further determined in a set of tomato fruit samples of four different fruit developmental stages (Immature, mature green, breaker, mature red) using qPCR analysis. Two statistical algorithms, geNorm and Normfinder, concordantly determined the superiority of these identified putative RGs. Notably, SlFRG05 (Solyc01g104170), SlFRG12 (Solyc04g009770), SlFRG16 (Solyc10g081190), SlFRG27 (Solyc06g007510), and SlFRG37 (Solyc11g005330) were proved to be suitable RGs for tomato fruit development study. Further analysis using geNorm indicate that the combined use of SlFRG03 (Solyc02g063070) and SlFRG27 would provide more reliable normalization results in qPCR experiments. The identified RGs in this study will be beneficial for future qPCR analysis of tomato fruit developmental study, as well as for the potential identification of optimal normalization controls in other plant species.
Genome-Wide Identification and Evaluation of Reference Genes for Quantitative RT-PCR Analysis during Tomato Fruit Development

PubMed Central

Cheng, Yuan; Bian, Wuying; Pang, Xin; Yu, Jiahong; Ahammed, Golam J.; Zhou, Guozhi; Wang, Rongqing; Ruan, Meiying; Li, Zhimiao; Ye, Qingjing; Yao, Zhuping; Yang, Yuejian; Wan, Hongjian

2017-01-01

Gene expression analysis in tomato fruit has drawn increasing attention nowadays. Quantitative real-time PCR (qPCR) is a routine technique for gene expression analysis. In qPCR operation, reliability of results largely depends on the choice of appropriate reference genes (RGs). Although tomato is a model for fruit biology study, few RGs for qPCR analysis in tomato fruit had yet been developed. In this study, we initially identified 38 most stably expressed genes based on tomato transcriptome data set, and their expression stabilities were further determined in a set of tomato fruit samples of four different fruit developmental stages (Immature, mature green, breaker, mature red) using qPCR analysis. Two statistical algorithms, geNorm and Normfinder, concordantly determined the superiority of these identified putative RGs. Notably, SlFRG05 (Solyc01g104170), SlFRG12 (Solyc04g009770), SlFRG16 (Solyc10g081190), SlFRG27 (Solyc06g007510), and SlFRG37 (Solyc11g005330) were proved to be suitable RGs for tomato fruit development study. Further analysis using geNorm indicate that the combined use of SlFRG03 (Solyc02g063070) and SlFRG27 would provide more reliable normalization results in qPCR experiments. The identified RGs in this study will be beneficial for future qPCR analysis of tomato fruit developmental study, as well as for the potential identification of optimal normalization controls in other plant species. PMID:28900431
In vivo genome-wide analysis of multiple tissues identifies gene regulatory networks, novel functions and downstream regulatory genes for Bapx1 and its co-regulation with Sox9 in the mammalian vertebral column.

PubMed

Chatterjee, Sumantra; Sivakamasundari, V; Yap, Sook Peng; Kraus, Petra; Kumar, Vibhor; Xing, Xing; Lim, Siew Lan; Sng, Joel; Prabhakar, Shyam; Lufkin, Thomas

2014-12-05

Vertebrate organogenesis is a highly complex process involving sequential cascades of transcription factor activation or repression. Interestingly a single developmental control gene can occasionally be essential for the morphogenesis and differentiation of tissues and organs arising from vastly disparate embryological lineages. Here we elucidated the role of the mammalian homeobox gene Bapx1 during the embryogenesis of five distinct organs at E12.5 - vertebral column, spleen, gut, forelimb and hindlimb - using expression profiling of sorted wildtype and mutant cells combined with genome wide binding site analysis. Furthermore we analyzed the development of the vertebral column at the molecular level by combining transcriptional profiling and genome wide binding data for Bapx1 with similarly generated data sets for Sox9 to assemble a detailed gene regulatory network revealing genes previously not reported to be controlled by either of these two transcription factors. The gene regulatory network appears to control cell fate decisions and morphogenesis in the vertebral column along with the prevention of premature chondrocyte differentiation thus providing a detailed molecular view of vertebral column development.

Identification and analysis of novel genes involved in gravitropism of Arabidopsis thaliana.

NASA Astrophysics Data System (ADS)

Morita, Miyo T.; Tasaka, Masao; Masatoshi Taniguchi, .

2012-07-01

Gravitropism is a continuous control with regard to the orientation and juxtaposition of the various parts of the plant body in response to gravity. In higher plants, the relative directional change of gravity is mainly suscepted in specialized cells called statocytes, followed by signal conversion from physical information into physiological information within the statocytes. We have studied the early process of shoot gravitropism, gravity sensing and signaling process, mainly by molecular genetic approach. In Arabidopsis shoot, statocytes are the endodermal cells. sgr1/scarcrow (scr) and sgr7/short-root (shr) mutants fail to form the endodermis and to respond to gravity in their inflorescence stems. Since both SGR1/SCR and SGR7/SHR are transcriptional factors, at least a subset of their downstream genes can be expected to be involved in gravitropism. In addition, eal1 (endodermal-amyloplast less 1), which exhibits no gravitropism in inflorescence stem but retains ability to form endodermis, is a hypomorphic allele of sgr7/shr. Take advantage of these mutants, we performed DNA microarray analysis and compared gene expression profiles between wild type and the mutants. We found that approx. 40 genes were commonly down-regulated in these mutants and termed them DGE (DOWN-REGULATED GENE IN EAL1) genes. DGE1 has sequence similarity to Oryza sativa LAZY1 that is involved in shoot gravitropism of rice. DGE2 has a short region homologous to DGE1. DTL (DGE TWO-LIKE}) that has 54% identity to DGE2 is found in Arabidopsis genome. All three genes are conserved in angiosperm but have no known functional domains or motifs. We analyzed T-DNA insertion for these genes in single or multiple combinations. In dge1 dge2 dtl triple mutant, gravitropic response of shoot, hypocotyl and root dramatically reduced. Now we are carrying out further physiological and molecular genetic analysis of the triple mutant.
Identification of candidate genes in osteoporosis by integrated microarray analysis.

PubMed

Li, J J; Wang, B Q; Fei, Q; Yang, Y; Li, D

2016-12-01

In order to screen the altered gene expression profile in peripheral blood mononuclear cells of patients with osteoporosis, we performed an integrated analysis of the online microarray studies of osteoporosis. We searched the Gene Expression Omnibus (GEO) database for microarray studies of peripheral blood mononuclear cells in patients with osteoporosis. Subsequently, we integrated gene expression data sets from multiple microarray studies to obtain differentially expressed genes (DEGs) between patients with osteoporosis and normal controls. Gene function analysis was performed to uncover the functions of identified DEGs. A total of three microarray studies were selected for integrated analysis. In all, 1125 genes were found to be signiﬁcantly differentially expressed between osteoporosis patients and normal controls, with 373 upregulated and 752 downregulated genes. Positive regulation of the cellular amino metabolic process (gene ontology (GO): 0033240, false discovery rate (FDR) = 1.00E + 00) was significantly enriched under the GO category for biological processes, while for molecular functions, flavin adenine dinucleotide binding (GO: 0050660, FDR = 3.66E-01) and androgen receptor binding (GO: 0050681, FDR = 6.35E-01) were significantly enriched. DEGs were enriched in many osteoporosis-related signalling pathways, including those of mitogen-activated protein kinase (MAPK) and calcium. Protein-protein interaction (PPI) network analysis showed that the significant hub proteins contained ubiquitin specific peptidase 9, X-linked (Degree = 99), ubiquitin specific peptidase 19 (Degree = 57) and ubiquitin conjugating enzyme E2 B (Degree = 57). Analysis of gene function of identified differentially expressed genes may expand our understanding of fundamental mechanisms leading to osteoporosis. Moreover, significantly enriched pathways, such as MAPK and calcium, may involve in osteoporosis through osteoblastic differentiation and bone formation.Cite this article: J. J
LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights

PubMed Central

Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

2016-01-01

Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher’s exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO’s usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher. PMID:26750448
LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

PubMed

Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

2016-01-11

Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.
A combination of PhP typing and β-d-glucuronidase gene sequence variation analysis for differentiation of Escherichia coli from humans and animals.

PubMed

Masters, N; Christie, M; Katouli, M; Stratton, H

2015-06-01

We investigated the usefulness of the β-d-glucuronidase gene variance in Escherichia coli as a microbial source tracking tool using a novel algorithm for comparison of sequences from a prescreened set of host-specific isolates using a high-resolution PhP typing method. A total of 65 common biochemical phenotypes belonging to 318 E. coli strains isolated from humans and domestic and wild animals were analysed for nucleotide variations at 10 loci along a 518 bp fragment of the 1812 bp β-d-glucuronidase gene. Neighbour-joining analysis of loci variations revealed 86 (76.8%) human isolates and 91.2% of animal isolates were correctly identified. Pairwise hierarchical clustering improved assignment; where 92 (82.1%) human and 204 (99%) animal strains were assigned to their respective cluster. Our data show that initial typing of isolates and selection of common types from different hosts prior to analysis of the β-d-glucuronidase gene sequence improves source identification. We also concluded that numerical profiling of the nucleotide variations can be used as a valuable approach to differentiate human from animal E. coli. This study signifies the usefulness of the β-d-glucuronidase gene as a marker for differentiating human faecal pollution from animal sources.
Expression analysis in response to drought stress in soybean: Shedding light on the regulation of metabolic pathway genes.

PubMed

Guimarães-Dias, Fábia; Neves-Borges, Anna Cristina; Viana, Antonio Americo Barbosa; Mesquita, Rosilene Oliveira; Romano, Eduardo; de Fátima Grossi-de-Sá, Maria; Nepomuceno, Alexandre Lima; Loureiro, Marcelo Ehlers; Alves-Ferreira, Márcio

2012-06-01

Metabolomics analysis of wild type Arabidopsis thaliana plants, under control and drought stress conditions revealed several metabolic pathways that are induced under water deficit. The metabolic response to drought stress is also associated with ABA dependent and independent pathways, allowing a better understanding of the molecular mechanisms in this model plant. Through combining an in silico approach and gene expression analysis by quantitative real-time PCR, the present work aims at identifying genes of soybean metabolic pathways potentially associated with water deficit. Digital expression patterns of Arabidopsis genes, which were selected based on the basis of literature reports, were evaluated under drought stress condition by Genevestigator. Genes that showed strong induction under drought stress were selected and used as bait to identify orthologs in the soybean genome. This allowed us to select 354 genes of putative soybean orthologs of 79 Arabidopsis genes belonging to 38 distinct metabolic pathways. The expression pattern of the selected genes was verified in the subtractive libraries available in the GENOSOJA project. Subsequently, 13 genes from different metabolic pathways were selected for validation by qPCR experiments. The expression of six genes was validated in plants undergoing drought stress in both pot-based and hydroponic cultivation systems. The results suggest that the metabolic response to drought stress is conserved in Arabidopsis and soybean plants.
Effects of blood-activating and stasis-removing drugs combined with VEGF gene transfer on angiogenesis in ischemic necrosis of the femoral head.

PubMed

Li, Jun-Hui; Wu, Ya-Ling; Ye, Jian-Hong; Ning, Ya-Gong; Yu, Hai-Ying; Peng, Zhong-Jie; Luan, Xiao-Wen

2009-09-01

To observe the promoting effects of blood-activating and stasis-removing Chinese drugs combined with vascular endothelial growth factor (VEGF) gene transfer on angiogenesis in ischemic necrosis of the femoral head. Forty Japanese giant-ear rabbits were randomly divided into a control group, a model group, a Chinese drug group, a gene group, and a combined group. After 8 weeks of treatment, the rate of VEGF positive cell expression in the synovium of the femoral head was measured using the immunohistochemical method, and the number of blood vessels in the femoral head was measured by digital subtraction angiography. The rate of VEGF positive cell expression in the model group was significantly lower than that in the Chinese drug group (P < 0.05) and very significantly lower than those in other groups (P < 0.01); but in the combined group it was significantly higher than in the Chinese drug group (P < 0.05). The differences in the number of blood vessels in area A between the model group and other groups were not statistically significant. However, in area B, the number of blood vessels significantly increased in the control group, the gene group and the combined group as compared with the model group (P < 0.05), and in the combined group the number of blood vessels was significantly more than in the gene group (P < 0.05); but in the Chinese drug group it was not significantly different than the model group (P > 0.05). Either the blood-activating and stasis-removing Chinese drugs or VEGF gene transfer can promote the angiogenesis and building of collateral circulation for femoral head ischemic necrosis, and the combined therapy with Chinese drugs or VEGF gene transfer may show a better therapeutic effect. The present study provides an experimental basis for clinical application of the combined therapy with the blood-activating and stasis-removing Chinese drugs and VEGF gene transfer.
Meta-analysis of pathway enrichment: combining independent and dependent omics data sets.

PubMed

Kaever, Alexander; Landesfeind, Manuel; Feussner, Kirstin; Morgenstern, Burkhard; Feussner, Ivo; Meinicke, Peter

2014-01-01

A major challenge in current systems biology is the combination and integrative analysis of large data sets obtained from different high-throughput omics platforms, such as mass spectrometry based Metabolomics and Proteomics or DNA microarray or RNA-seq-based Transcriptomics. Especially in the case of non-targeted Metabolomics experiments, where it is often impossible to unambiguously map ion features from mass spectrometry analysis to metabolites, the integration of more reliable omics technologies is highly desirable. A popular method for the knowledge-based interpretation of single data sets is the (Gene) Set Enrichment Analysis. In order to combine the results from different analyses, we introduce a methodical framework for the meta-analysis of p-values obtained from Pathway Enrichment Analysis (Set Enrichment Analysis based on pathways) of multiple dependent or independent data sets from different omics platforms. For dependent data sets, e.g. obtained from the same biological samples, the framework utilizes a covariance estimation procedure based on the nonsignificant pathways in single data set enrichment analysis. The framework is evaluated and applied in the joint analysis of Metabolomics mass spectrometry and Transcriptomics DNA microarray data in the context of plant wounding. In extensive studies of simulated data set dependence, the introduced correlation could be fully reconstructed by means of the covariance estimation based on pathway enrichment. By restricting the range of p-values of pathways considered in the estimation, the overestimation of correlation, which is introduced by the significant pathways, could be reduced. When applying the proposed methods to the real data sets, the meta-analysis was shown not only to be a powerful tool to investigate the correlation between different data sets and summarize the results of multiple analyses but also to distinguish experiment-specific key pathways.
Functional Analysis With a Barcoder Yeast Gene Overexpression System

PubMed Central

Douglas, Alison C.; Smith, Andrew M.; Sharifpoor, Sara; Yan, Zhun; Durbic, Tanja; Heisler, Lawrence E.; Lee, Anna Y.; Ryan, Owen; Göttert, Hendrikje; Surendra, Anu; van Dyk, Dewald; Giaever, Guri; Boone, Charles; Nislow, Corey; Andrews, Brenda J.

2012-01-01

Systematic analysis of gene overexpression phenotypes provides an insight into gene function, enzyme targets, and biological pathways. Here, we describe a novel functional genomics platform that enables a highly parallel and systematic assessment of overexpression phenotypes in pooled cultures. First, we constructed a genome-level collection of ~5100 yeast barcoder strains, each of which carries a unique barcode, enabling pooled fitness assays with a barcode microarray or sequencing readout. Second, we constructed a yeast open reading frame (ORF) galactose-induced overexpression array by generating a genome-wide set of yeast transformants, each of which carries an individual plasmid-born and sequence-verified ORF derived from the Saccharomyces cerevisiae full-length EXpression-ready (FLEX) collection. We combined these collections genetically using synthetic genetic array methodology, generating ~5100 strains, each of which is barcoded and overexpresses a specific ORF, a set we termed “barFLEX.” Additional synthetic genetic array allows the barFLEX collection to be moved into different genetic backgrounds. As a proof-of-principle, we describe the properties of the barFLEX overexpression collection and its application in synthetic dosage lethality studies under different environmental conditions. PMID:23050238
A Method for Gene-Based Pathway Analysis Using Genomewide Association Study Summary Statistics Reveals Nine New Type 1 Diabetes Associations

PubMed Central

Evangelou, Marina; Smyth, Deborah J; Fortune, Mary D; Burren, Oliver S; Walker, Neil M; Guo, Hui; Onengut-Gumuscu, Suna; Chen, Wei-Min; Concannon, Patrick; Rich, Stephen S; Todd, John A; Wallace, Chris

2014-01-01

Pathway analysis can complement point-wise single nucleotide polymorphism (SNP) analysis in exploring genomewide association study (GWAS) data to identify specific disease-associated genes that can be candidate causal genes. We propose a straightforward methodology that can be used for conducting a gene-based pathway analysis using summary GWAS statistics in combination with widely available reference genotype data. We used this method to perform a gene-based pathway analysis of a type 1 diabetes (T1D) meta-analysis GWAS (of 7,514 cases and 9,045 controls). An important feature of the conducted analysis is the removal of the major histocompatibility complex gene region, the major genetic risk factor for T1D. Thirty-one of the 1,583 (2%) tested pathways were identified to be enriched for association with T1D at a 5% false discovery rate. We analyzed these 31 pathways and their genes to identify SNPs in or near these pathway genes that showed potentially novel association with T1D and attempted to replicate the association of 22 SNPs in additional samples. Replication P-values were skewed () with 12 of the 22 SNPs showing . Support, including replication evidence, was obtained for nine T1D associated variants in genes ITGB7 (rs11170466, ), NRP1 (rs722988, ), BAD (rs694739, ), CTSB (rs1296023, ), FYN (rs11964650, ), UBE2G1 (rs9906760, ), MAP3K14 (rs17759555, ), ITGB1 (rs1557150, ), and IL7R (rs1445898, ). The proposed methodology can be applied to other GWAS datasets for which only summary level data are available. PMID:25371288
The drug target genes show higher evolutionary conservation than non-target genes.

PubMed

Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie

2016-01-26

Although evidence indicates that drug target genes share some common evolutionary features, there have been few studies analyzing evolutionary features of drug targets from an overall level. Therefore, we conducted an analysis which aimed to investigate the evolutionary characteristics of drug target genes. We compared the evolutionary conservation between human drug target genes and non-target genes by combining both the evolutionary features and network topological properties in human protein-protein interaction network. The evolution rate, conservation score and the percentage of orthologous genes of 21 species were included in our study. Meanwhile, four topological features including the average shortest path length, betweenness centrality, clustering coefficient and degree were considered for comparison analysis. Then we got four results as following: compared with non-drug target genes, 1) drug target genes had lower evolutionary rates; 2) drug target genes had higher conservation scores; 3) drug target genes had higher percentages of orthologous genes and 4) drug target genes had a tighter network structure including higher degrees, betweenness centrality, clustering coefficients and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in evolutionary conservation of drug targets.
Exome Sequencing and Linkage Analysis Identified Novel Candidate Genes in Recessive Intellectual Disability Associated with Ataxia.

PubMed

Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia

2015-10-01

Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).
Comparative and evolutionary analysis of the HES/HEY gene family reveal exon/intron loss and teleost specific duplication events.

PubMed

Zhou, Mi; Yan, Jun; Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan

2012-01-01

HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and duplication.
Comprehensive Gene expression meta-analysis and integrated bioinformatic approaches reveal shared signatures between thrombosis and myeloproliferative disorders

PubMed Central

Jha, Prabhash Kumar; Vijay, Aatira; Sahu, Anita; Ashraf, Mohammad Zahid

2016-01-01

Thrombosis is a leading cause of morbidity and mortality in patients with myeloproliferative disorders (MPDs), particularly polycythemia vera (PV) and essential thrombocythemia (ET). Despite the attempts to establish a link between them, the shared biological mechanisms are yet to be characterized. An integrated gene expression meta-analysis of five independent publicly available microarray data of the three diseases was conducted to identify shared gene expression signatures and overlapping biological processes. Using INMEX bioinformatic tool, based on combined Effect Size (ES) approaches, we identified a total of 1,157 differentially expressed genes (DEGs) (697 overexpressed and 460 underexpressed genes) shared between the three diseases. EnrichR tool’s rich library was used for comprehensive functional enrichment and pathway analysis which revealed “mRNA Splicing” and “SUMO E3 ligases SUMOylate target proteins” among the most enriched terms. Network based meta-analysis identified MYC and FN1 to be the most highly ranked hub genes. Our results reveal that the alterations in biomarkers of the coagulation cascade like F2R, PROS1, SELPLG and ITGB2 were common between the three diseases. Interestingly, the study has generated a novel database of candidate genetic markers, pathways and transcription factors shared between thrombosis and MPDs, which might aid in the development of prognostic therapeutic biomarkers. PMID:27892526
ribB and ribBA genes from Acidithiobacillus ferrooxidans: expression levels under different growth conditions and phylogenetic analysis.

PubMed

Knegt, Fábio H P; Mello, Luciane V; Reis, Fernanda C; Santos, Marcos T; Vicentini, Renato; Ferraz, Lúcio F C; Ottoboni, Laura M M

2008-01-01

Acidithiobacillus ferrooxidans is a Gram-negative, chemolithoautotrophic bacterium involved in metal bioleaching. Using the RNA arbitrarily primed polymerase chain reaction (RAP-PCR), we have identified several cDNAs that were differentially expressed when A. ferrooxidans LR was submitted to potassium- and phosphate-limiting conditions. One of these cDNAs showed similarity with ribB. An analysis of the A. ferrooxidans ATCC 23270 genome, made available by The Institute for Genomic Research, showed that the ribB gene was not located in the rib operon, but a ribBA gene was present in this operon instead. The ribBA gene was isolated from A. ferrooxidans LR and expression of both ribB and ribBA was investigated. Transcript levels of both genes were enhanced in cells grown in the absence of K2HPO4, in the presence of zinc and copper sulfate and in different pHs. Transcript levels decreased upon exposure to a temperature higher than the ideal 30 degrees C and at pH 1.2. A comparative genomic analysis using the A. ferrooxidans ATCC 23270 genome revealed similar putative regulatory elements for both genes. Moreover, an RFN element was identified upstream from the ribB gene. Phylogenetic analysis of the distribution of RibB and RibBA in bacteria showed six different combinations. We suggest that the presence of duplicated riboflavin synthesis genes in bacteria must provide their host with some benefit in certain stressful situations.
Gene Expression Profile Analysis is Directly Affected by the Selected Reference Gene: The Case of Leaf-Cutting Atta Sexdens

PubMed Central

Máximo, Wesley P. F.; Zanetti, Ronald; Paiva, Luciano V.

2018-01-01

Although several ant species are important targets for the development of molecular control strategies, only a few studies focus on identifying and validating reference genes for quantitative reverse transcription polymerase chain reaction (RT-qPCR) data normalization. We provide here an extensive study to identify and validate suitable reference genes for gene expression analysis in the ant Atta sexdens, a threatening agricultural pest in South America. The optimal number of reference genes varies according to each sample and the result generated by RefFinder differed about which is the most suitable reference gene. Results suggest that the RPS16, NADH and SDHB genes were the best reference genes in the sample pool according to stability values. The SNF7 gene expression pattern was stable in all evaluated sample set. In contrast, when using less stable reference genes for normalization a large variability in SNF7 gene expression was recorded. There is no universal reference gene suitable for all conditions under analysis, since these genes can also participate in different cellular functions, thus requiring a systematic validation of possible reference genes for each specific condition. The choice of reference genes on SNF7 gene normalization confirmed that unstable reference genes might drastically change the expression profile analysis of target candidate genes. PMID:29419794
A pathway-based network analysis of hypertension-related genes

NASA Astrophysics Data System (ADS)

Wang, Huan; Hu, Jing-Bo; Xu, Chuan-Yun; Zhang, De-Hai; Yan, Qian; Xu, Ming; Cao, Ke-Fei; Zhang, Xu-Sheng

2016-02-01

Complex network approach has become an effective way to describe interrelationships among large amounts of biological data, which is especially useful in finding core functions and global behavior of biological systems. Hypertension is a complex disease caused by many reasons including genetic, physiological, psychological and even social factors. In this paper, based on the information of biological pathways, we construct a network model of hypertension-related genes of the salt-sensitive rat to explore the interrelationship between genes. Statistical and topological characteristics show that the network has the small-world but not scale-free property, and exhibits a modular structure, revealing compact and complex connections among these genes. By the threshold of integrated centrality larger than 0.71, seven key hub genes are found: Jun, Rps6kb1, Cycs, Creb312, Cdk4, Actg1 and RT1-Da. These genes should play an important role in hypertension, suggesting that the treatment of hypertension should focus on the combination of drugs on multiple genes.
Pan- and core- network analysis of co-expression genes in a model plant

DOE PAGES

He, Fei; Maslov, Sergei

2016-12-16

Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less
Pan- and core- network analysis of co-expression genes in a model plant

DOE Office of Scientific and Technical Information (OSTI.GOV)

He, Fei; Maslov, Sergei

Genome-wide gene expression experiments have been performed using the model plant Arabidopsis during the last decade. Some studies involved construction of coexpression networks, a popular technique used to identify groups of co-regulated genes, to infer unknown gene functions. One approach is to construct a single coexpression network by combining multiple expression datasets generated in different labs. We advocate a complementary approach in which we construct a large collection of 134 coexpression networks based on expression datasets reported in individual publications. To this end we reanalyzed public expression data. To describe this collection of networks we introduced concepts of ‘pan-network’ andmore » ‘core-network’ representing union and intersection between a sizeable fractions of individual networks, respectively. Here, we showed that these two types of networks are different both in terms of their topology and biological function of interacting genes. For example, the modules of the pan-network are enriched in regulatory and signaling functions, while the modules of the core-network tend to include components of large macromolecular complexes such as ribosomes and photosynthetic machinery. Our analysis is aimed to help the plant research community to better explore the information contained within the existing vast collection of gene expression data in Arabidopsis.« less
Multifunctional Poly(L-lactide)-Polyethylene Glycol-Grafted Graphene Quantum Dots for Intracellular MicroRNA Imaging and Combined Specific-Gene-Targeting Agents Delivery for Improved Therapeutics.

PubMed

Dong, Haifeng; Dai, Wenhao; Ju, Huangxian; Lu, Huiting; Wang, Shiyan; Xu, Liping; Zhou, Shu-Feng; Zhang, Yue; Zhang, Xueji

2015-05-27

Photoluminescent (PL) graphene quantum dots (GQDs) with large surface area and superior mechanical flexibility exhibit fascinating optical and electronic properties and possess great promising applications in biomedical engineering. Here, a multifunctional nanocomposite of poly(l-lactide) (PLA) and polyethylene glycol (PEG)-grafted GQDs (f-GQDs) was proposed for simultaneous intracellular microRNAs (miRNAs) imaging analysis and combined gene delivery for enhanced therapeutic efficiency. The functionalization of GQDs with PEG and PLA imparts the nanocomposite with super physiological stability and stable photoluminescence over a broad pH range, which is vital for cell imaging. Cell experiments demonstrate the f-GQDs excellent biocompatibility, lower cytotoxicity, and protective properties. Using the HeLa cell as a model, we found the f-GQDs effectively delivered a miRNA probe for intracellular miRNA imaging analysis and regulation. Notably, the large surface of GQDs was capable of simultaneous adsorption of agents targeting miRNA-21 and survivin, respectively. The combined conjugation of miRNA-21-targeting and survivin-targeting agents induced better inhibition of cancer cell growth and more apoptosis of cancer cells, compared with conjugation of agents targeting miRNA-21 or survivin alone. These findings highlight the promise of the highly versatile multifunctional nanocomposite in biomedical application of intracellular molecules analysis and clinical gene therapeutics.

Validation of reference genes for quantitative gene expression analysis in experimental epilepsy.

PubMed

Sadangi, Chinmaya; Rosenow, Felix; Norwood, Braxton A

2017-12-01

To grasp the molecular mechanisms and pathophysiology underlying epilepsy development (epileptogenesis) and epilepsy itself, it is important to understand the gene expression changes that occur during these phases. Quantitative real-time polymerase chain reaction (qPCR) is a technique that rapidly and accurately determines gene expression changes. It is crucial, however, that stable reference genes are selected for each experimental condition to ensure that accurate values are obtained for genes of interest. If reference genes are unstably expressed, this can lead to inaccurate data and erroneous conclusions. To date, epilepsy studies have used mostly single, nonvalidated reference genes. This is the first study to systematically evaluate reference genes in male Sprague-Dawley rat models of epilepsy. We assessed 15 potential reference genes in hippocampal tissue obtained from 2 different models during epileptogenesis, 1 model during chronic epilepsy, and a model of noninjurious seizures. Reference gene ranking varied between models and also differed between epileptogenesis and chronic epilepsy time points. There was also some variance between the four mathematical models used to rank reference genes. Notably, we found novel reference genes to be more stably expressed than those most often used in experimental epilepsy studies. The consequence of these findings is that reference genes suitable for one epilepsy model may not be appropriate for others and that reference genes can change over time. It is, therefore, critically important to validate potential reference genes before using them as normalizing factors in expression analysis in order to ensure accurate, valid results. © 2017 Wiley Periodicals, Inc.
Cloning, characterization, and physical mapping of the canine Prop-1 gene (PROP1): exclusion as a candidate for combined pituitary hormone deficiency in German shepherd dogs.

PubMed

Lantinga-van Leeuwen, I S; Kooistra, H S; Mol, J A; Renier, C; Breen, M; van Oost, B A

2000-01-01

Abnormalities in the genes encoding Pit-1 and Prop-1 have been reported to cause combined pituitary hormone deficiency (CPHD) in mice and humans. In dogs, a similar phenotype has been described in the German shepherd breed. We have previously reported that the Pit-1 gene (POU1F1) is not mutated in affected German shepherd dogs. In this study, we report the isolation and mapping of the canine Prop-1 gene (PROP1), and we assessed the involvement of PROP1 in German shepherd dog dwarfism. The canine PROP1 gene was found to contain three exons, encoding a 226 amino acid protein. The deduced amino acid sequence was 79% and 84% homologous with the mouse and human Prop-1 protein, respectively. Using fluorescence in situ hybridization, PROP1 was mapped to canine chromosome 11. Further mapping with a canine radiation hybrid panel showed co-localization with the polymorphic DNA marker AHT137. Sequence analysis of genomic DNA from dwarf German shepherd dogs revealed no alterations in the PROP1 gene. Moreover, linkage analysis of AHT137 revealed no co-segregation between the PROP1 locus and the CPHD phenotype, excluding this gene as candidate for canine CPHD and providing a new spontaneous model of hypopituitarism. Copyright 2000 S. Karger AG, Basel
Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.

PubMed

Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M

2011-11-01

Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.
Gene expression profiles analysis identifies key genes for acute lung injury in patients with sepsis.

PubMed

Guo, Zhiqiang; Zhao, Chuncheng; Wang, Zheng

2014-09-26

To identify critical genes and biological pathways in acute lung injury (ALI), a comparative analysis of gene expression profiles of patients with ALI + sepsis compared with patients with sepsis alone were performed with bioinformatic tools. GSE10474 was downloaded from Gene Expression Omnibus, including a collective of 13 whole blood samples with ALI + sepsis and 21 whole blood samples with sepsis alone. After pre-treatment with robust multichip averaging (RMA) method, differential analysis was conducted using simpleaffy package based upon t-test and fold change. Hierarchical clustering was also performed using function hclust from package stats. Beisides, functional enrichment analysis was conducted using iGepros. Moreover, the gene regulatory network was constructed with information from Kyoto Encyclopedia of Genes and Genomes (KEGG) and then visualized by Cytoscape. A total of 128 differentially expressed genes (DEGs) were identified, including 47 up- and 81 down-regulated genes. The significantly enriched functions included negative regulation of cell proliferation, regulation of response to stimulus and cellular component morphogenesis. A total of 27 DEGs were significantly enriched in 16 KEGG pathways, such as protein digestion and absorption, fatty acid metabolism, amoebiasis, etc. Furthermore, the regulatory network of these 27 DEGs was constructed, which involved several key genes, including protein tyrosine kinase 2 (PTK2), v-src avian sarcoma (SRC) and Caveolin 2 (CAV2). PTK2, SRC and CAV2 may be potential markers for diagnosis and treatment of ALI. The virtual slide(s) for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/5865162912987143.
The de novo Q167K mutation in the POU1F1 gene leads to combined pituitary hormone deficiency in an Italian patient.

PubMed

Malvagia, Sabrina; Poggi, Giovanni Maria; Pasquini, Elisabetta; Donati, Maria Alice; Pela, Ivana; Morrone, Amelia; Zammarchi, Enrico

2003-11-01

The POU1F1 gene encodes a transcription factor that is important for the development and differentiation of the cells producing GH, prolactin, and TSH in the anterior pituitary gland. Patients with POU1F1 mutations show a combined pituitary hormone deficiency with low or absent levels of GH, prolactin, and TSH. Fourteen mutations have been reported in the POU1F1 gene up to now. These genetic lesions can be inherited either in an autosomal dominant or an autosomal recessive mode. We report on the first Italian patient, a girl, affected by combined pituitary hormone deficiency. The patient was found to be positive for congenital hypothyroidism (with low TSH levels) at neonatal screening. Substitutive therapy was started, but subsequent growth was very poor, although psychomotor development was substantially normal. Hospitalized at 10 mo she showed hypotonic crises, growth retardation, delayed bone age, and facial dysmorphism. In addition to congenital hypothyroidism, GH and prolactin deficiencies were found. Mutation DNA analysis of the patient's POU1F1 gene identified the novel Q167K amino acid change at the heterozygous level. The highly conserved Q167 residue is located in the POU-specific domain. No mutation was detected in the other allele. DNA analysis in the proband's parents did not identify this amino acid substitution, suggesting a de novo genetic lesion. From these data it can be hypothesized that the Q167K mutation has a dominant negative effect.
Oligoamine analogues in combination with 2-difluoromethylornithine synergistically induce re-expression of aberrantly silenced tumour-suppressor genes

PubMed Central

Wu, Yu; Steinbergs, Nora; Murray-Stewart, Tracy; Marton, Laurence J.; Casero, Robert A.

2011-01-01

Epigenetic gene silencing is an important mechanism in the initiation and progression of cancer. Abnormal DNA CpG island hypermethylation and histone modifications are involved in aberrant silencing of tumour-suppressor genes. LSD1 (lysine-specific demethylase 1) was the first enzyme identified to specifically demethylate H3K4 (Lys4 of histone H3). Methylated H3K4 is an important mark associated with transcriptional activation. The flavin adenine dinucleotide-binding amine oxidase domain of LSD1 is homologous with two polyamine oxidases, SMO (spermine oxidase) and APAO (N1-acetylpolyamine oxidase). We have demonstrated previously that long-chain polyamine analogues, the oligoamines, are inhibitors of LSD1. In the present paper we report the synergistic effects of specific oligoamines in combination with DFMO (2-difluoromethylornithine), an inhibitor of ornithine decarboxylase, in human colorectal cancer cells. DFMO treatment depletes natural polyamines and increases the uptake of exogenous polyamines. The combination of oligoamines and DFMO results in a synergistic re-expression of aberrantly silenced tumour-suppressor genes, including SFRP2 (secreted frizzled-related protein 2), which encodes a Wnt signalling pathway antagonist and plays an anti-tumorigenic role in colorectal cancer. The treatment-induced re-expression of SFRP2 is associated with increased H3K4me2 (di-methyl H3K4) in the gene promoter. The combination of LSD1-inhibiting oligoamines and DFMO represents a novel approach to epigenetic therapy of cancer. PMID:22132744
Validation of Reference Genes for Robust qRT-PCR Gene Expression Analysis in the Rice Blast Fungus Magnaporthe oryzae.

PubMed

Che Omar, Sarena; Bentley, Michael A; Morieri, Giulia; Preston, Gail M; Gurr, Sarah J

2016-01-01

The rice blast fungus causes significant annual harvest losses. It also serves as a genetically-tractable model to study fungal ingress. Whilst pathogenicity determinants have been unmasked and changes in global gene expression described, we know little about Magnaporthe oryzae cell wall remodelling. Our interests, in wall remodelling genes expressed during infection, vegetative growth and under exogenous wall stress, demand robust choice of reference genes for quantitative Real Time-PCR (qRT-PCR) data normalisation. We describe the expression stability of nine candidate reference genes profiled by qRT-PCR with cDNAs derived during asexual germling development, from sexual stage perithecia and from vegetative mycelium grown under various exogenous stressors. Our Minimum Information for Publication of qRT-PCR Experiments (MIQE) compliant analysis reveals a set of robust reference genes used to track changes in the expression of the cell wall remodelling gene MGG_Crh2 (MGG_00592). We ranked nine candidate reference genes by their expression stability (M) and report the best gene combination needed for reliable gene expression normalisation, when assayed in three tissue groups (Infective, Vegetative, and Global) frequently used in M. oryzae expression studies. We found that MGG_Actin (MGG_03982) and the 40S 27a ribosomal subunit MGG_40s (MGG_02872) proved to be robust reference genes for the Infection group and MGG_40s and MGG_Ef1 (Elongation Factor1-α) for both Vegetative and Global groups. Using the above validated reference genes, M. oryzae MGG_Crh2 expression was found to be significantly (p<0.05) elevated three-fold during vegetative growth as compared with dormant spores and two fold higher under cell wall stress (Congo Red) compared to growth under optimal conditions. We recommend the combinatorial use of two reference genes, belonging to the cytoskeleton and ribosomal synthesis functional groups, MGG_Actin, MGG_40s, MGG_S8 (Ribosomal subunit 40S S8) or MGG
Effects of single and combined low frequency electromagnetic fields and simulated microgravity on gene expression of human mesenchymal stem cells during chondrogenesis

PubMed Central

Hammerschmid, Florian; Blum, Helmut; Krebs, Stefan; Redeker, Julia I.; Holzapfel, Boris M.; Jansson, Volkmar; Müller, Peter E.

2016-01-01

Introduction Low frequency electromagnetic fields (LF-EMF) and simulated microgravity (SMG) have been observed to affect chondrogenesis. A controlled bioreactor system was developed to apply LF-EMF and SMG singly or combined during chondrogenic differentiation of human mesenchymal stem cells (hMSCs) in 3D culture. Material and methods An external motor gear SMG bioreactor was combined with magnetic Helmholtz coils for EMF (5 mT; 15 Hz). Pellets of hMSCs (±TGF-β3) were cultured (P5) under SMG, LF-EMF, LF-EMF/SMG and control (1 g) conditions for 3 weeks. Sections were stained with safranin-O and collagen type II. Gene expression was evaluated by microarray and real-time polymerase chain reaction analysis. Results Simulated microgravity application significantly changed gene expression; specifically, COLXA1 but also COL2A1, which represents the chondrogenic potential, were reduced (p < 0.05). Low frequency electromagnetic fields application showed no gene expression changes on a microarray basis. LF-EMF/SMG application obtained significant different expression values from cultures obtained under SMG conditions with a re-increase of COL2A1, therefore rescuing the chondrogenic potential, which had been lowered by SMG. Conclusions Simulated microgravity lowered hypertrophy but also the chondrogenic potential of hMSCs. Combined LF-EMF/SMG provided a rescue effect of the chondrogenic potential of hMSCs although no LF-EMF effect was observed under optimal conditions. The study provides new insights into how LF-EMF and SMG affect chondrogenesis of hMSCs and how they generate interdependent effects. PMID:29765449
[Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

PubMed

Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

2012-07-01

In order to screen out important genes from large gene data of gene microarray after nerve injury, we combine gene ontology (GO) method and computer pattern recognition technology to find key genes responding to nerve injury, and then verify one of these screened-out genes. Data mining and gene ontology analysis of gene chip data GSE26350 was carried out through MATLAB software. Cd44 was selected from screened-out key gene molecular spectrum by comparing genes' different GO terms and positions on score map of principal component. Function interferences were employed to influence the normal binding of Cd44 and one of its ligands, chondroitin sulfate C (CSC), to observe neurite extension. Gene ontology analysis showed that the first genes on score map (marked by red *) mainly distributed in molecular transducer activity, receptor activity, protein binding et al molecular function GO terms. Cd44 is one of six effector protein genes, and attracted us with its function diversity. After adding different reagents into the medium to interfere the normal binding of CSC and Cd44, varying-degree remissions of CSC's inhibition on neurite extension were observed. CSC can inhibit neurite extension through binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.
Gene-centric Meta-analysis in 87,736 Individuals of European Ancestry Identifies Multiple Blood-Pressure-Related Loci

PubMed Central

Tragante, Vinicius; Barnes, Michael R.; Ganesh, Santhi K.; Lanktree, Matthew B.; Guo, Wei; Franceschini, Nora; Smith, Erin N.; Johnson, Toby; Holmes, Michael V.; Padmanabhan, Sandosh; Karczewski, Konrad J.; Almoguera, Berta; Barnard, John; Baumert, Jens; Chang, Yen-Pei Christy; Elbers, Clara C.; Farrall, Martin; Fischer, Mary E.; Gaunt, Tom R.; Gho, Johannes M.I.H.; Gieger, Christian; Goel, Anuj; Gong, Yan; Isaacs, Aaron; Kleber, Marcus E.; Leach, Irene Mateo; McDonough, Caitrin W.; Meijs, Matthijs F.L.; Melander, Olle; Nelson, Christopher P.; Nolte, Ilja M.; Pankratz, Nathan; Price, Tom S.; Shaffer, Jonathan; Shah, Sonia; Tomaszewski, Maciej; van der Most, Peter J.; Van Iperen, Erik P.A.; Vonk, Judith M.; Witkowska, Kate; Wong, Caroline O.L.; Zhang, Li; Beitelshees, Amber L.; Berenson, Gerald S.; Bhatt, Deepak L.; Brown, Morris; Burt, Amber; Cooper-DeHoff, Rhonda M.; Connell, John M.; Cruickshanks, Karen J.; Curtis, Sean P.; Davey-Smith, George; Delles, Christian; Gansevoort, Ron T.; Guo, Xiuqing; Haiqing, Shen; Hastie, Claire E.; Hofker, Marten H.; Hovingh, G. Kees; Kim, Daniel S.; Kirkland, Susan A.; Klein, Barbara E.; Klein, Ronald; Li, Yun R.; Maiwald, Steffi; Newton-Cheh, Christopher; O’Brien, Eoin T.; Onland-Moret, N. Charlotte; Palmas, Walter; Parsa, Afshin; Penninx, Brenda W.; Pettinger, Mary; Vasan, Ramachandran S.; Ranchalis, Jane E.; M Ridker, Paul; Rose, Lynda M.; Sever, Peter; Shimbo, Daichi; Steele, Laura; Stolk, Ronald P.; Thorand, Barbara; Trip, Mieke D.; van Duijn, Cornelia M.; Verschuren, W. Monique; Wijmenga, Cisca; Wyatt, Sharon; Young, J. Hunter; Zwinderman, Aeilko H.; Bezzina, Connie R.; Boerwinkle, Eric; Casas, Juan P.; Caulfield, Mark J.; Chakravarti, Aravinda; Chasman, Daniel I.; Davidson, Karina W.; Doevendans, Pieter A.; Dominiczak, Anna F.; FitzGerald, Garret A.; Gums, John G.; Fornage, Myriam; Hakonarson, Hakon; Halder, Indrani; Hillege, Hans L.; Illig, Thomas; Jarvik, Gail P.; Johnson, Julie A.; Kastelein, John J.P.; Koenig, Wolfgang; Kumari, Meena; März, Winfried; Murray, Sarah S.; O’Connell, Jeffery R.; Oldehinkel, Albertine J.; Pankow, James S.; Rader, Daniel J.; Redline, Susan; Reilly, Muredach P.; Schadt, Eric E.; Kottke-Marchant, Kandice; Snieder, Harold; Snyder, Michael; Stanton, Alice V.; Tobin, Martin D.; Uitterlinden, André G.; van der Harst, Pim; van der Schouw, Yvonne T.; Samani, Nilesh J.; Watkins, Hugh; Johnson, Andrew D.; Reiner, Alex P.; Zhu, Xiaofeng; de Bakker, Paul I.W.; Levy, Daniel; Asselbergs, Folkert W.; Munroe, Patricia B.; Keating, Brendan J.

2014-01-01

Blood pressure (BP) is a heritable risk factor for cardiovascular disease. To investigate genetic associations with systolic BP (SBP), diastolic BP (DBP), mean arterial pressure (MAP), and pulse pressure (PP), we genotyped ∼50,000 SNPs in up to 87,736 individuals of European ancestry and combined these in a meta-analysis. We replicated findings in an independent set of 68,368 individuals of European ancestry. Our analyses identified 11 previously undescribed associations in independent loci containing 31 genes including PDE1A, HLA-DQB1, CDK6, PRKAG2, VCL, H19, NUCB2, RELA, HOXC@ complex, FBN1, and NFAT5 at the Bonferroni-corrected array-wide significance threshold (p < 6 × 10−7) and confirmed 27 previously reported associations. Bioinformatic analysis of the 11 loci provided support for a putative role in hypertension of several genes, such as CDK6 and NUCB2. Analysis of potential pharmacological targets in databases of small molecules showed that ten of the genes are predicted to be a target for small molecules. In summary, we identified previously unknown loci associated with BP. Our findings extend our understanding of genes involved in BP regulation, which may provide new targets for therapeutic intervention or drug response stratification. PMID:24560520
GenePublisher: Automated analysis of DNA microarray data.

PubMed

Knudsen, Steen; Workman, Christopher; Sicheritz-Ponten, Thomas; Friis, Carsten

2003-07-01

GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with a specification of the data. The server performs normalization, statistical analysis and visualization of the data. The results are run against databases of signal transduction pathways, metabolic pathways and promoter sequences in order to extract more information. The results of the entire analysis are summarized in report form and returned to the user.
Functional analysis of neuronal microRNAs in Caenorhabditis elegans dauer formation by combinational genetics and Neuronal miRISC immunoprecipitation.

PubMed

Than, Minh T; Kudlow, Brian A; Han, Min

2013-06-01

Identifying the physiological functions of microRNAs (miRNAs) is often challenging because miRNAs commonly impact gene expression under specific physiological conditions through complex miRNA::mRNA interaction networks and in coordination with other means of gene regulation, such as transcriptional regulation and protein degradation. Such complexity creates difficulties in dissecting miRNA functions through traditional genetic methods using individual miRNA mutations. To investigate the physiological functions of miRNAs in neurons, we combined a genetic "enhancer" approach complemented by biochemical analysis of neuronal miRNA-induced silencing complexes (miRISCs) in C. elegans. Total miRNA function can be compromised by mutating one of the two GW182 proteins (AIN-1), an important component of miRISC. We found that combining an ain-1 mutation with a mutation in unc-3, a neuronal transcription factor, resulted in an inappropriate entrance into the stress-induced, alternative larval stage known as dauer, indicating a role of miRNAs in preventing aberrant dauer formation. Analysis of this genetic interaction suggests that neuronal miRNAs perform such a role partly by regulating endogenous cyclic guanosine monophosphate (cGMP) signaling, potentially influencing two other dauer-regulating pathways. Through tissue-specific immunoprecipitations of miRISC, we identified miRNAs and their likely target mRNAs within neuronal tissue. We verified the biological relevance of several of these miRNAs and found that many miRNAs likely regulate dauer formation through multiple dauer-related targets. Further analysis of target mRNAs suggests potential miRNA involvement in various neuronal processes, but the importance of these miRNA::mRNA interactions remains unclear. Finally, we found that neuronal genes may be more highly regulated by miRNAs than intestinal genes. Overall, our study identifies miRNAs and their targets, and a physiological function of these miRNAs in neurons. It
RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes.

PubMed

Ono, Hiromasa; Ogasawara, Osamu; Okubo, Kosaku; Bono, Hidemasa

2017-08-29

Gene expression data are exponentially accumulating; thus, the functional annotation of such sequence data from metadata is urgently required. However, life scientists have difficulty utilizing the available data due to its sheer magnitude and complicated access. We have developed a web tool for browsing reference gene expression pattern of mammalian tissues and cell lines measured using different methods, which should facilitate the reuse of the precious data archived in several public databases. The web tool is called Reference Expression dataset (RefEx), and RefEx allows users to search by the gene name, various types of IDs, chromosomal regions in genetic maps, gene family based on InterPro, gene expression patterns, or biological categories based on Gene Ontology. RefEx also provides information about genes with tissue-specific expression, and the relative gene expression values are shown as choropleth maps on 3D human body images from BodyParts3D. Combined with the newly incorporated Functional Annotation of Mammals (FANTOM) dataset, RefEx provides insight regarding the functional interpretation of unfamiliar genes. RefEx is publicly available at http://refex.dbcls.jp/.
Combined radiation and p53 gene therapy of malignant glioma cells.

PubMed

Badie, B; Goh, C S; Klaver, J; Herweijer, H; Boothman, D A

1999-01-01

More than half of malignant gliomas reportedly have alterations in the p53 tumor suppressor gene. Because p53 plays a key role in the cellular response to DNA-damaging agents, we investigated the role of p53 gene therapy before ionizing radiation in cultured human glioma cells containing normal or mutated p53. Three established human glioma cell lines expressing the wild-type (U87 MG, p53wt) or mutant (A172 and U373 MG, p53mut) p53 gene were transduced by recombinant adenoviral vectors bearing human p53 (Adp53) and Escherichia coli beta-galactosidase genes (AdLacZ, control virus) before radiation (0-20 Gy). Changes in p53, p21, and Bax expression were studied by Western immunoblotting, whereas cell cycle alterations and apoptosis were investigated by flow cytometry and nuclear staining. Survival was assessed by clonogenic assays. Within 48 hours of Adp53 exposure, all three cell lines demonstrated p53 expression at a viral multiplicity of infection of 100. p21, which is a p53-inducible downstream effector gene, was overexpressed, and cells were arrested in the G1 phase. Bax expression, which is thought to play a role in p53-induced apoptosis, did not change with either radiation or Adp53. Apoptosis and survival after p53 gene therapy varied. U87 MG (p53wt) cells showed minimal apoptosis after Adp53, irradiation, or combined treatments. U373 MG (p53mut) cells underwent massive apoptosis and died within 48 hours of Adp53 treatment, independent of irradiation. Surprisingly, A172 (p53mut) cells demonstrated minimal apoptosis after Adp53 exposure; however, unlike U373 MG cells, apoptosis increased with radiation dose. Survival of all three cell lines was reduced dramatically after >10 Gy. Although Adp53 transduction significantly reduced the survival of U373 MG cells and inhibited A172 growth, it had no effect on the U87 MG cell line. Transduction with AdLacZ did not affect apoptosis or cell cycle progression and only minimally affected survival in all cell lines. We
Comprehensive analysis of the codon usage patterns in the envelope glycoprotein E2 gene of the classical swine fever virus

PubMed Central

Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong

2017-01-01

The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV. PMID:28880881
Comprehensive analysis of the codon usage patterns in the envelope glycoprotein E2 gene of the classical swine fever virus.

PubMed

Chen, Ye; Li, Xinxin; Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong

2017-01-01

The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV.
Combined serial analysis of gene expression and transcription factor binding site prediction identifies novel-candidate-target genes of Nr2e1 in neocortex development.

PubMed

Schmouth, Jean-François; Arenillas, David; Corso-Díaz, Ximena; Xie, Yuan-Yun; Bohacec, Slavita; Banks, Kathleen G; Bonaguro, Russell J; Wong, Siaw H; Jones, Steven J M; Marra, Marco A; Simpson, Elizabeth M; Wasserman, Wyeth W

2015-07-24

Nr2e1 (nuclear receptor subfamily 2, group e, member 1) encodes a transcription factor important in neocortex development. Previous work has shown that nuclear receptors can have hundreds of target genes, and bind more than 300 co-interacting proteins. However, recognition of the critical role of Nr2e1 in neural stem cells and neocortex development is relatively recent, thus the molecular mechanisms involved for this nuclear receptor are only beginning to be understood. Serial analysis of gene expression (SAGE), has given researchers both qualitative and quantitative information pertaining to biological processes. Thus, in this work, six LongSAGE mouse libraries were generated from laser microdissected tissue samples of dorsal VZ/SVZ (ventricular zone and subventricular zone) from the telencephalon of wild-type (Wt) and Nr2e1-null embryos at the critical development ages E13.5, E15.5, and E17.5. We then used a novel approach, implementing multiple computational methods followed by biological validation to further our understanding of Nr2e1 in neocortex development. In this work, we have generated a list of 1279 genes that are differentially expressed in response to altered Nr2e1 expression during in vivo neocortex development. We have refined this list to 64 candidate direct-targets of NR2E1. Our data suggested distinct roles for Nr2e1 during different neocortex developmental stages. Most importantly, our results suggest a possible novel pathway by which Nr2e1 regulates neurogenesis, which includes Lhx2 as one of the candidate direct-target genes, and SOX9 as a co-interactor. In conclusion, we have provided new candidate interacting partners and numerous well-developed testable hypotheses for understanding the pathways by which Nr2e1 functions to regulate neocortex development.
Rare Frequency of Mutations in Pituitary Transcription Factor Genes in Combined Pituitary Hormone or Isolated Growth Hormone Deficiencies in Korea.

PubMed

Choi, Jin Ho; Jung, Chang Woo; Kang, Eungu; Kim, Yoon Myung; Heo, Sun Hee; Lee, Beom Hee; Kim, Gu Hwan; Yoo, Han Wook

2017-05-01

Congenital hypopituitarism is caused by mutations in pituitary transcription factors involved in the development of the hypothalamic-pituitary axis. Mutation frequencies of genes involved in congenital hypopituitarism are extremely low and vary substantially between ethnicities. This study was undertaken to compare the clinical, endocrinological, and radiological features of patients with an isolated growth hormone deficiency (IGHD) or combined pituitary hormone deficiency (CPHD). This study included 27 patients with sporadic IGHD and CPHD. A mutation analysis of the POU1F1, PROP1, LHX3, LHX4, and HESX1 genes was performed using genomic DNA from peripheral blood leukocytes. IGHD and CPHD were observed in 4 and 23 patients, respectively. Mean age at diagnosis was 8.28±7.25 years for IGHD and 13.48±10.46 years for CPHD (p=0.37). Serum insulin-like growth factor-1 and peak growth hormone (GH) levels following GH stimulation tests were significantly lower in patients with CPHD than in those with IGHD (p<0.05). Sellar MRI findings revealed structural abnormalities in 3 patients with IGHD (75%) and 21 patients with CPHD (91.3%) (p=0.62). A mutation analysis identified homozygous p.R109Q mutations in HESX1 in a patient with CPHD. Patients with CPHD had more severe GHD than those with IGHD. The frequency of defects in the genes encoding pituitary transcription factors was extremely low in Korean patients with congenital hypopituitarism. Environmental factors and the impact of other causative genes may contribute to this clinical phenotype. © Copyright: Yonsei University College of Medicine 2017
Rare Frequency of Mutations in Pituitary Transcription Factor Genes in Combined Pituitary Hormone or Isolated Growth Hormone Deficiencies in Korea

PubMed Central

Choi, Jin-Ho; Jung, Chang-Woo; Kang, Eungu; Kim, Yoon-Myung; Heo, Sun Hee; Lee, Beom Hee; Kim, Gu-Hwan

2017-01-01

Purpose Congenital hypopituitarism is caused by mutations in pituitary transcription factors involved in the development of the hypothalamic-pituitary axis. Mutation frequencies of genes involved in congenital hypopituitarism are extremely low and vary substantially between ethnicities. This study was undertaken to compare the clinical, endocrinological, and radiological features of patients with an isolated growth hormone deficiency (IGHD) or combined pituitary hormone deficiency (CPHD). Materials and Methods This study included 27 patients with sporadic IGHD and CPHD. A mutation analysis of the POU1F1, PROP1, LHX3, LHX4, and HESX1 genes was performed using genomic DNA from peripheral blood leukocytes. Results IGHD and CPHD were observed in 4 and 23 patients, respectively. Mean age at diagnosis was 8.28±7.25 years for IGHD and 13.48±10.46 years for CPHD (p=0.37). Serum insulin-like growth factor-1 and peak growth hormone (GH) levels following GH stimulation tests were significantly lower in patients with CPHD than in those with IGHD (p<0.05). Sellar MRI findings revealed structural abnormalities in 3 patients with IGHD (75%) and 21 patients with CPHD (91.3%) (p=0.62). A mutation analysis identified homozygous p.R109Q mutations in HESX1 in a patient with CPHD. Patients with CPHD had more severe GHD than those with IGHD. Conclusion The frequency of defects in the genes encoding pituitary transcription factors was extremely low in Korean patients with congenital hypopituitarism. Environmental factors and the impact of other causative genes may contribute to this clinical phenotype. PMID:28332357
Gene expression analysis of a Helicobacter pylori-infected and high-salt diet-treated mouse gastric tumor model: identification of CD177 as a novel prognostic factor in patients with gastric cancer

PubMed Central

2013-01-01

Background Helicobacter pylori (H. pylori) infection and excessive salt intake are known as important risk factors for stomach cancer in humans. However, interactions of these two factors with gene expression profiles during gastric carcinogenesis remain unclear. In the present study, we investigated the global gene expression associated with stomach carcinogenesis and prognosis of human gastric cancer using a mouse model. Methods To find candidate genes involved in stomach carcinogenesis, we firstly constructed a carcinogen-induced mouse gastric tumor model combined with H. pylori infection and high-salt diet. C57BL/6J mice were given N-methyl-N-nitrosourea in their drinking water and sacrificed after 40 weeks. Animals of a combination group were inoculated with H. pylori and fed a high-salt diet. Gene expression profiles in glandular stomach of the mice were investigated by oligonucleotide microarray. Second, we examined an availability of the candidate gene as prognostic factor for human patients. Immunohistochemical analysis of CD177, one of the up-regulated genes, was performed in human advanced gastric cancer specimens to evaluate the association with prognosis. Results The multiplicity of gastric tumor in carcinogen-treated mice was significantly increased by combination of H. pylori infection and high-salt diet. In the microarray analysis, 35 and 31 more than two-fold up-regulated and down-regulated genes, respectively, were detected in the H. pylori-infection and high-salt diet combined group compared with the other groups. Quantitative RT-PCR confirmed significant over-expression of two candidate genes including Cd177 and Reg3g. On immunohistochemical analysis of CD177 in human advanced gastric cancer specimens, over-expression was evident in 33 (60.0%) of 55 cases, significantly correlating with a favorable prognosis (P = 0.0294). Multivariate analysis including clinicopathological factors as covariates revealed high expression of CD177 to be an

Novel liposomal combination treatments using dual genes knockdown in oral cancer treatment

NASA Astrophysics Data System (ADS)

Wu, Jyun-Sian; Yeh, Chia-Hsien; Huang, Leaf; Hsu, Yih-Chih

2018-02-01

Small interfering RNA (siRNA) can be used to treat tumor because it can effectively knockdown target oncoprotein expression and it leads to cancer cell death and apoptosis. Hypoxia-inducible factors-1 (HIF-1) is a transcription factor gene. Its high expression of tumor hypoxia cells, activation of transcription factor HIF-1α and angiogenesis found in most cancerous tissues. HIF-1α protein in cancer cells are critical to cell survival, tumor growth and proliferation. Epidermal growth factor receptor (EGFR) gene is another common head and neck oncogene. The dual self-designed siRNA sequences were encapsulated in the lipid-calcium-phosphate (LCP) and targeted to sigma receptors on the surface of cancer cells via binding to amino ethyl anisamide (AEAA). We used human oral cancer cells to establish the xenograft animal model to study the combination therapy for therapeutic results.
Novel insights into the lipidome of glioblastoma cells based on a combined PLSR and DD-HDS computational analysis

NASA Astrophysics Data System (ADS)

Lespinats, S.; Meyer-Bäse, Anke; He, Huan; Marshall, Alan G.; Conrad, Charles A.; Emmett, Mark R.

2009-05-01

Partial Least Square Regression (PLSR) and Data-Driven High Dimensional Scaling (DD-HDS) are employed for the prediction and the visualization of changes in polar lipid expression induced by different combinations of wild-type (wt) p53 gene therapy and SN38 chemotherapy of U87 MG glioblastoma cells. A very detailed analysis of the gangliosides reveals that certain gangliosides of GM3 or GD1-type have unique properties not shared by the others. In summary, this preliminary work shows that data mining techniques are able to determine the modulation of gangliosides by different treatment combinations.
Blood Transcriptomic Comparison of Individuals with and without Autism Spectrum Disorder: A Combined-Samples Mega-Analysis

PubMed Central

Tylee, Daniel S.; Hess, Jonathan L.; Quinn, Thomas P.; Barve, Rahul; Huang, Hailiang; Zhang-James, Yanli; Chang, Jeffrey; Stamova, Boryana S.; Sharp, Frank R.; Hertz-Picciotto, Irva; Faraone, Stephen V.; Kong, Sek Won; Glatt, Stephen J.

2017-01-01

Blood-based microarray studies comparing individuals affected with autism spectrum disorder (ASD) and typically developing individuals help characterize differences in circulating immune cell functions and offer potential biomarker signal. We sought to combine the subject-level data from previously published studies by mega-analysis to increase the statistical power. We identified studies that compared ex-vivo blood or lymphocytes from ASD-affected individuals and unrelated comparison subjects using Affymetrix or Illumina array platforms. Raw microarray data and clinical meta-data were obtained from seven studies, totaling 626 affected and 447 comparison subjects. Microarray data were processed using uniform methods. Covariate-controlled mixed-effect linear models were used to identify gene transcripts and co-expression network modules that were significantly associated with diagnostic status. Permutation-based gene-set analysis was used to identify functionally related sets of genes that were over- and under-expressed among ASD samples. Our results were consistent with diminished interferon-, EGF-, PDGF-, PI3K-AKT-mTOR-, and RAS-MAPK-signaling cascades, and increased ribosomal translation and NK-cell related activity in ASD. We explored evidence for sex-differences in the ASD-related transcriptomic signature. We also demonstrated that machine-learning classifiers using blood transcriptome data perform with moderate accuracy when data are combined across studies. Comparing our results with those from blood-based studies of protein biomarkers (e.g., cytokines and trophic factors), we propose that ASD may feature decoupling between certain circulating signaling proteins (higher in ASD samples) and the transcriptional cascades which they typically elicit within circulating immune cells (lower in ASD samples). These findings provide insight into ASD-related transcriptional differences in circulating immune cells. PMID:27862943
Differential gene expression profiles of peripheral blood mononuclear cells in childhood asthma.

PubMed

Kong, Qian; Li, Wen-Jing; Huang, Hua-Rong; Zhong, Ying-Qiang; Fang, Jian-Pei

2015-05-01

Asthma is a common childhood disease with strong genetic components. This study compared whole-genome expression differences between asthmatic young children and healthy controls to identify gene signatures of childhood asthma. Total RNA extracted from peripheral blood mononuclear cells (PBMC) was subjected to microarray analysis. QRT-PCR was performed to verify the microarray results. Classification and functional characterization of differential genes were illustrated by hierarchical clustering and gene ontology analysis. Multiple logistic regression (MLR) analysis, receiver operating characteristic (ROC) curve analysis, and discriminate power were used to scan asthma-specific diagnostic markers. For fold-change>2 and p < 0.05, there were 758 named differential genes. The results of QRT-PCR confirmed successfully the array data. Hierarchical clustering divided 29 highly possible genes into seven categories and the genes in the same cluster were likely to possess similar expression patterns or functions. Gene ontology analysis presented that differential genes primarily enriched in immune response, response to stress or stimulus, and regulation of apoptosis in biological process. MLR and ROC curve analysis revealed that the combination of ADAM33, Smad7, and LIGHT possessed excellent discriminating power. The combination of ADAM33, Smad7, and LIGHT would be a reliable and useful childhood asthma model for prediction and diagnosis.
FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data.

PubMed

Manijak, Mieszko P; Nielsen, Henrik B

2011-06-11

Although, systematic analysis of gene annotation is a powerful tool for interpreting gene expression data, it sometimes is blurred by incomplete gene annotation, missing expression response of key genes and secondary gene expression responses. These shortcomings may be partially circumvented by instead matching gene expression signatures to signatures of other experiments. To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700 Arabidopsis microarray experiments. Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/.
Identifying osteosarcoma metastasis associated genes by weighted gene co-expression network analysis (WGCNA).

PubMed

Tian, Honglai; Guan, Donghui; Li, Jianmin

2018-06-01

Osteosarcoma (OS), the most common malignant bone tumor, accounts for the heavy healthy threat in the period of children and adolescents. OS occurrence usually correlates with early metastasis and high death rate. This study aimed to better understand the mechanism of OS metastasis.Based on Gene Expression Omnibus (GEO) database, we downloaded 4 expression profile data sets associated with OS metastasis, and selected differential expressed genes. Weighted gene co-expression network analysis (WGCNA) approach allowed us to investigate the most OS metastasis-correlated module. Gene Ontology functional and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were used to give annotation of selected OS metastasis-associated genes.We select 897 differential expressed genes from OS metastasis and OS non-metastasis groups. Based on these selected genes, WGCNA further explored 142 genes included in the most OS metastasis-correlated module. Gene Ontology functional and KEGG pathway enrichment analyses showed that significantly OS metastasis-associated genes were involved in pathway correlated with insulin-like growth factor binding.Our research figured out several potential molecules participating in metastasis process and factors acting as biomarker. With this study, we could better explore the mechanism of OS metastasis and further discover more therapy targets.
Identification of Proteins Using iTRAQ and Virus-Induced Gene Silencing Reveals Three Bread Wheat Proteins Involved in the Response to Combined Osmotic-Cold Stress.

PubMed

Zhang, Ning; Zhang, Lingran; Shi, Chaonan; Zhao, Lei; Cui, Dangqun; Chen, Feng

2018-05-25

Crops are often subjected to a combination of stresses in the field. To date, studies on the physiological and molecular responses of common wheat to a combination of osmotic and cold stresses, however, remain unknown. In this study, wheat seedlings exposed to osmotic-cold stress for 24 h showed inhibited growth, as well as increased lipid peroxidation, relative electrolyte leakage, and soluble sugar contents. iTRAQ-based quantitative proteome method was employed to determine the proteomic profiles of the roots and leaves of wheat seedlings exposed to osmotic-cold stress conditions. A total of 250 and 258 proteins with significantly altered abundance in the roots and leaves were identified, respectively, and the majority of these proteins displayed differential abundance, thereby revealing organ-specific differences in adaptation to osmotic-cold stress. Yeast two hybrid assay examined five pairs of stress/defense-related protein-protein interactions in the predicted protein interaction network. Furthermore, quantitative real-time PCR analysis indicated that abiotic stresses increased the expression of three candidate protein genes, i.e., TaGRP2, CDCP, and Wcor410c in wheat leaves. Virus-induced gene silencing indicated that three genes TaGRP2, CDCP, and Wcor410c were involved in modulating osmotic-cold stress in common wheat. Our study provides useful information for the elucidation of molecular and genetics bases of osmotic-cold combined stress in bread wheat.
Ranking metrics in gene set enrichment analysis: do they matter?

PubMed

Zyla, Joanna; Marczyk, Michal; Weiner, January; Polanska, Joanna

2017-05-12

There exist many methods for describing the complex relation between changes of gene expression in molecular pathways or gene ontologies under different experimental conditions. Among them, Gene Set Enrichment Analysis seems to be one of the most commonly used (over 10,000 citations). An important parameter, which could affect the final result, is the choice of a metric for the ranking of genes. Applying a default ranking metric may lead to poor results. In this work 28 benchmark data sets were used to evaluate the sensitivity and false positive rate of gene set analysis for 16 different ranking metrics including new proposals. Furthermore, the robustness of the chosen methods to sample size was tested. Using k-means clustering algorithm a group of four metrics with the highest performance in terms of overall sensitivity, overall false positive rate and computational load was established i.e. absolute value of Moderated Welch Test statistic, Minimum Significant Difference, absolute value of Signal-To-Noise ratio and Baumgartner-Weiss-Schindler test statistic. In case of false positive rate estimation, all selected ranking metrics were robust with respect to sample size. In case of sensitivity, the absolute value of Moderated Welch Test statistic and absolute value of Signal-To-Noise ratio gave stable results, while Baumgartner-Weiss-Schindler and Minimum Significant Difference showed better results for larger sample size. Finally, the Gene Set Enrichment Analysis method with all tested ranking metrics was parallelised and implemented in MATLAB, and is available at https://github.com/ZAEDPolSl/MrGSEA . Choosing a ranking metric in Gene Set Enrichment Analysis has critical impact on results of pathway enrichment analysis. The absolute value of Moderated Welch Test has the best overall sensitivity and Minimum Significant Difference has the best overall specificity of gene set analysis. When the number of non-normally distributed genes is high, using Baumgartner
Joint mapping of genes and conditions via multidimensional unfolding analysis

PubMed Central

Van Deun, Katrijn; Marchal, Kathleen; Heiser, Willem J; Engelen, Kristof; Van Mechelen, Iven

2007-01-01

Background Microarray compendia profile the expression of genes in a number of experimental conditions. Such data compendia are useful not only to group genes and conditions based on their similarity in overall expression over profiles but also to gain information on more subtle relations between genes and conditions. Getting a clear visual overview of all these patterns in a single easy-to-grasp representation is a useful preliminary analysis step: We propose to use for this purpose an advanced exploratory method, called multidimensional unfolding. Results We present a novel algorithm for multidimensional unfolding that overcomes both general problems and problems that are specific for the analysis of gene expression data sets. Applying the algorithm to two publicly available microarray compendia illustrates its power as a tool for exploratory data analysis: The unfolding analysis of a first data set resulted in a two-dimensional representation which clearly reveals temporal regulation patterns for the genes and a meaningful structure for the time points, while the analysis of a second data set showed the algorithm's ability to go beyond a mere identification of those genes that discriminate between different patient or tissue types. Conclusion Multidimensional unfolding offers a useful tool for preliminary explorations of microarray data: By relying on an easy-to-grasp low-dimensional geometric framework, relations among genes, among conditions and between genes and conditions are simultaneously represented in an accessible way which may reveal interesting patterns in the data. An additional advantage of the method is that it can be applied to the raw data without necessitating the choice of suitable genewise transformations of the data. PMID:17550582
Efficacy of combining ING4 and TRAIL genes in cancer-targeting gene virotherapy strategy: first evidence in preclinical hepatocellular carcinoma.

PubMed

Galal El-Shemi, A; Mohammed Ashshi, A; Oh, E; Jung, B-K; Basalamah, M; Alsaegh, A; Yun, C-O

2018-01-01

Current treatments of hepatocellular carcinoma (HCC) are ineffective and unsatisfactory in many aspects. Cancer-targeting gene virotherapy using oncolytic adenoviruses (OAds) armed with anticancer genes has shown efficacy and safety in clinical trials. Nowadays, both inhibitor of growth 4 (ING4), as a multimodal tumor suppressor gene, and tumor necrosis factor-related apoptosis-inducing ligand (TRAIL), as a potent apoptosis-inducing gene, are experiencing a renaissance in cancer gene therapy. Herein we investigated the antitumor activity and safety of mono- and combined therapy with OAds armed with ING4 (Ad-ΔB/ING4) and TRAIL (Ad-ΔB/TRAIL) gene, respectively, on preclinical models of human HCC. OAd-mediated expression of ING4 or TRAIL transgene was confirmed. Ad-ΔB/TRAIL and/or Ad-ΔB/ING4 exhibited potent killing effect on human HCC cells (HuH7 and Hep3B) but not on normal liver cells. Most importantly, systemic therapy with Ad-ΔB/ING4 plus Ad-ΔB/TRAIL elicited more eradicative effect on an orthotopic mouse model of human HCC than their monotherapy, without causing obvious overlapping toxicity. Mechanistically, Ad-ΔB/ING4 and Ad-ΔB/TRAIL were remarkably cooperated to induce antitumor apoptosis and immune response, and to repress tumor angiogenesis. This is the first study showing that concomitant therapy with Ad-ΔB/ING4 and Ad-ΔB/TRAIL may provide a potential strategy for HCC therapy and merits further investigations to realize its possible clinical translation.
Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

PubMed

Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

2014-01-01

Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.
Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

PubMed Central

Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

2013-01-01

The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867
Global analysis of gene expression in maize leaves treated with low temperature. II. Combined effect of severe cold (8 °C) and circadian rhythm.

PubMed

Jończyk, M; Sobkowiak, A; Trzcinska-Danielewicz, J; Skoneczny, M; Solecka, D; Fronk, J; Sowiński, P

2017-10-01

In maize seedlings, severe cold results in dysregulation of circadian pattern of gene expression causing profound modulation of transcription of genes related to photosynthesis and other key biological processes. Plants live highly cyclic life and their response to environmental stresses must allow for underlying biological rhythms. To study the interplay of a stress and a rhythmic cue we investigated transcriptomic response of maize seedlings to low temperature in the context of diurnal gene expression. Severe cold stress had pronounced effect on the circadian rhythm of a substantial proportion of genes. Their response was strikingly dual, comprising either flattening (partial or complete) of the diel amplitude or delay of expression maximum/minimum by several hours. Genes encoding central oscillator components behaved in the same dual manner, unlike their Arabidopsis counterparts reported earlier to cease cycling altogether upon cold treatment. Also numerous genes lacking circadian rhythm responded to the cold by undergoing up- or down-regulation. Notably, the transcriptome changes preceded major physiological manifestations of cold stress. In silico analysis of metabolic processes likely affected by observed gene expression changes indicated major down-regulation of photosynthesis, profound and multifarious modulation of plant hormone levels, and of chromatin structure, transcription, and translation. A role of trehalose and stachyose in cold stress signaling was also suggested. Meta-analysis of published transcriptomic data allowed discrimination between general stress response of maize and that unique to severe cold. Several cis- and trans-factors likely involved in the latter were predicted, albeit none of them seemed to have a major role. These results underscore a key role of modulation of diel gene expression in maize response to severe cold and the unique character of the cold-response of the maize circadian clock.
Prognostic utility of gene therapy with herpes simplex virus thymidine kinase for patients with high-grade malignant gliomas: a systematic review and meta analysis.

PubMed

Zhao, Fei; Tian, Jinhui; An, Lifeng; Yang, Kehu

2014-06-01

The aim of this study was to assess the effectiveness of adding viral vector-mediated gene therapy with herpes simplex virus thymidine kinase (HSV-tk) to standard treatment, in comparison with standard treatment alone to treat patients with high-grade gliomas (HGGs). A literature search of the databases PubMed, Embase, the Cochrane Library, Web of Science, and Chinese biomedicine was performed to identify eligible studies. Three randomized controlled trials (involving a total of 532 patients) were included in this systematic review. A meta-analysis of included studies demonstrated a significant increase in median survival time (MST) in patients who were treated with HSV-tk gene therapy (mean deviation 0.59, 95% CI: 0.41-0.76, p < 0.0001). The results of pooled analysis for different patient groups show that overall survival (OS) for all HGG patients was improved by adding gene therapy [hazard ratio (HR) = 0.91, 95% CI: 0.74-1.13, p = 0.42], while a different result was seen for glioblastoma multiforme (GBM) patients (HR = 1.06, 95% CI: 0.80-1.41, p = 0.70). Furthermore, the combined results for tumor progression implied that standard therapy was superior to gene therapy [odds ratio (OR) = 1.31, p = 0.09]; yet differences in HR and OR between experimental groups and control groups had no statistical significance (p > 0.05). Based on the best available evidence, it appears that adding gene therapy with HSV-tk has some effect in treating HGG patients, especially with respect to MST. However, neither the pooled analysis of OS, nor the combined analysis of tumor progress indicates any significant advantage to adding gene therapy compared with standard treatment alone. More prospective studies are needed to draw solid conclusions about whether gene therapy has significant prognostic advantage.
Gene context analysis in the Integrated Microbial Genomes (IMG) data management system.

PubMed

Mavromatis, Konstantinos; Chu, Ken; Ivanova, Natalia; Hooper, Sean D; Markowitz, Victor M; Kyrpides, Nikos C

2009-11-24

Computational methods for determining the function of genes in newly sequenced genomes have been traditionally based on sequence similarity to genes whose function has been identified experimentally. Function prediction methods can be extended using gene context analysis approaches such as examining the conservation of chromosomal gene clusters, gene fusion events and co-occurrence profiles across genomes. Context analysis is based on the observation that functionally related genes are often having similar gene context and relies on the identification of such events across phylogenetically diverse collection of genomes. We have used the data management system of the Integrated Microbial Genomes (IMG) as the framework to implement and explore the power of gene context analysis methods because it provides one of the largest available genome integrations. Visualization and search tools to facilitate gene context analysis have been developed and applied across all publicly available archaeal and bacterial genomes in IMG. These computations are now maintained as part of IMG's regular genome content update cycle. IMG is available at: http://img.jgi.doe.gov.
Phylomemetics—Evolutionary Analysis beyond the Gene

PubMed Central

Howe, Christopher J.; Windram, Heather F.

2011-01-01

Genes are propagated by error-prone copying, and the resulting variation provides the basis for phylogenetic reconstruction of evolutionary relationships. Horizontal gene transfer may be superimposed on a tree-like evolutionary pattern, with some relationships better depicted as networks. The copying of manuscripts by scribes is very similar to the replication of genes, and phylogenetic inference programs can be used directly for reconstructing the copying history of different versions of a manuscript text. Phylogenetic methods have also been used for some time to analyse the evolution of languages and the development of physical cultural artefacts. These studies can help to answer a range of anthropological questions. We propose the adoption of the term “phylomemetics” for phylogenetic analysis of reproducing non-genetic elements. PMID:21655311
Weighted gene co-expression network analysis of gene modules for the prognosis of esophageal cancer.

PubMed

Zhang, Cong; Sun, Qian

2017-06-01

Esophageal cancer is a common malignant tumor, whose pathogenesis and prognosis factors are not fully understood. This study aimed to discover the gene clusters that have similar functions and can be used to predict the prognosis of esophageal cancer. The matched microarray and RNA sequencing data of 185 patients with esophageal cancer were downloaded from The Cancer Genome Atlas (TCGA), and gene co-expression networks were built without distinguishing between squamous carcinoma and adenocarcinoma. The result showed that 12 modules were associated with one or more survival data such as recurrence status, recurrence time, vital status or vital time. Furthermore, survival analysis showed that 5 out of the 12 modules were related to progression-free survival (PFS) or overall survival (OS). As the most important module, the midnight blue module with 82 genes was related to PFS, apart from the patient age, tumor grade, primary treatment success, and duration of smoking and tumor histological type. Gene ontology enrichment analysis revealed that "glycoprotein binding" was the top enriched function of midnight blue module genes. Additionally, the blue module was the exclusive gene clusters related to OS. Platelet activating factor receptor (PTAFR) and feline Gardner-Rasheed (FGR) were the top hub genes in both modeling datasets and the STRING protein interaction database. In conclusion, our study provides novel insights into the prognosis-associated genes and screens out candidate biomarkers for esophageal cancer.
RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes

PubMed Central

Ono, Hiromasa; Ogasawara, Osamu; Okubo, Kosaku; Bono, Hidemasa

2017-01-01

Gene expression data are exponentially accumulating; thus, the functional annotation of such sequence data from metadata is urgently required. However, life scientists have difficulty utilizing the available data due to its sheer magnitude and complicated access. We have developed a web tool for browsing reference gene expression pattern of mammalian tissues and cell lines measured using different methods, which should facilitate the reuse of the precious data archived in several public databases. The web tool is called Reference Expression dataset (RefEx), and RefEx allows users to search by the gene name, various types of IDs, chromosomal regions in genetic maps, gene family based on InterPro, gene expression patterns, or biological categories based on Gene Ontology. RefEx also provides information about genes with tissue-specific expression, and the relative gene expression values are shown as choropleth maps on 3D human body images from BodyParts3D. Combined with the newly incorporated Functional Annotation of Mammals (FANTOM) dataset, RefEx provides insight regarding the functional interpretation of unfamiliar genes. RefEx is publicly available at http://refex.dbcls.jp/. PMID:28850115
A mesh generation and machine learning framework for Drosophila gene expression pattern image analysis

PubMed Central

2013-01-01

Background Multicellular organisms consist of cells of many different types that are established during development. Each type of cell is characterized by the unique combination of expressed gene products as a result of spatiotemporal gene regulation. Currently, a fundamental challenge in regulatory biology is to elucidate the gene expression controls that generate the complex body plans during development. Recent advances in high-throughput biotechnologies have generated spatiotemporal expression patterns for thousands of genes in the model organism fruit fly Drosophila melanogaster. Existing qualitative methods enhanced by a quantitative analysis based on computational tools we present in this paper would provide promising ways for addressing key scientific questions. Results We develop a set of computational methods and open source tools for identifying co-expressed embryonic domains and the associated genes simultaneously. To map the expression patterns of many genes into the same coordinate space and account for the embryonic shape variations, we develop a mesh generation method to deform a meshed generic ellipse to each individual embryo. We then develop a co-clustering formulation to cluster the genes and the mesh elements, thereby identifying co-expressed embryonic domains and the associated genes simultaneously. Experimental results indicate that the gene and mesh co-clusters can be correlated to key developmental events during the stages of embryogenesis we study. The open source software tool has been made available at http://compbio.cs.odu.edu/fly/. Conclusions Our mesh generation and machine learning methods and tools improve upon the flexibility, ease-of-use and accuracy of existing methods. PMID:24373308
Hierarchical Parallelization of Gene Differential Association Analysis

PubMed Central

2011-01-01

Background Microarray gene differential expression analysis is a widely used technique that deals with high dimensional data and is computationally intensive for permutation-based procedures. Microarray gene differential association analysis is even more computationally demanding and must take advantage of multicore computing technology, which is the driving force behind increasing compute power in recent years. In this paper, we present a two-layer hierarchical parallel implementation of gene differential association analysis. It takes advantage of both fine- and coarse-grain (with granularity defined by the frequency of communication) parallelism in order to effectively leverage the non-uniform nature of parallel processing available in the cutting-edge systems of today. Results Our results show that this hierarchical strategy matches data sharing behavior to the properties of the underlying hardware, thereby reducing the memory and bandwidth needs of the application. The resulting improved efficiency reduces computation time and allows the gene differential association analysis code to scale its execution with the number of processors. The code and biological data used in this study are downloadable from http://www.urmc.rochester.edu/biostat/people/faculty/hu.cfm. Conclusions The performance sweet spot occurs when using a number of threads per MPI process that allows the working sets of the corresponding MPI processes running on the multicore to fit within the machine cache. Hence, we suggest that practitioners follow this principle in selecting the appropriate number of MPI processes and threads within each MPI process for their cluster configurations. We believe that the principles of this hierarchical approach to parallelization can be utilized in the parallelization of other computationally demanding kernels. PMID:21936916

Hierarchical parallelization of gene differential association analysis.

PubMed

Needham, Mark; Hu, Rui; Dwarkadas, Sandhya; Qiu, Xing

2011-09-21

Microarray gene differential expression analysis is a widely used technique that deals with high dimensional data and is computationally intensive for permutation-based procedures. Microarray gene differential association analysis is even more computationally demanding and must take advantage of multicore computing technology, which is the driving force behind increasing compute power in recent years. In this paper, we present a two-layer hierarchical parallel implementation of gene differential association analysis. It takes advantage of both fine- and coarse-grain (with granularity defined by the frequency of communication) parallelism in order to effectively leverage the non-uniform nature of parallel processing available in the cutting-edge systems of today. Our results show that this hierarchical strategy matches data sharing behavior to the properties of the underlying hardware, thereby reducing the memory and bandwidth needs of the application. The resulting improved efficiency reduces computation time and allows the gene differential association analysis code to scale its execution with the number of processors. The code and biological data used in this study are downloadable from http://www.urmc.rochester.edu/biostat/people/faculty/hu.cfm. The performance sweet spot occurs when using a number of threads per MPI process that allows the working sets of the corresponding MPI processes running on the multicore to fit within the machine cache. Hence, we suggest that practitioners follow this principle in selecting the appropriate number of MPI processes and threads within each MPI process for their cluster configurations. We believe that the principles of this hierarchical approach to parallelization can be utilized in the parallelization of other computationally demanding kernels.
Genome-Wide Analysis of Syntenic Gene Deletion in the Grasses

PubMed Central

Schnable, James C.; Freeling, Michael; Lyons, Eric

2012-01-01

The grasses, Poaceae, are one of the largest and most successful angiosperm families. Like many radiations of flowering plants, the divergence of the major grass lineages was preceded by a whole-genome duplication (WGD), although these events are not rare for flowering plants. By combining identification of syntenic gene blocks with measures of gene pair divergence and different frequencies of ancient gene loss, we have separated the two subgenomes present in modern grasses. Reciprocal loss of duplicated genes or genomic regions has been hypothesized to reproductively isolate populations and, thus, speciation. However, in contrast to previous studies in yeast and teleost fishes, we found very little evidence of reciprocal loss of homeologous genes between the grasses, suggesting that post-WGD gene loss may not be the cause of the grass radiation. The sets of homeologous and orthologous genes and predicted locations of deleted genes identified in this study, as well as links to the CoGe comparative genomics web platform for analyzing pan-grass syntenic regions, are provided along with this paper as a resource for the grass genetics community. PMID:22275519
Identification of genes associated with the long-gut-persistence phenotype of the probiotic Lactobacillus johnsonii strain NCC533 using a combination of genomics and transcriptome analysis.

PubMed

Denou, Emmanuel; Pridmore, Raymond David; Berger, Bernard; Panoff, Jean-Michel; Arigoni, Fabrizio; Brüssow, Harald

2008-05-01

Lactobacillus johnsonii strains NCC533 and ATCC 33200 (the type strain of this species) differed significantly in gut residence time (12 versus 5 days) after oral feeding to mice. Genes affecting the long gut residence time of the probiotic strain NCC533 were targeted for analysis. We hypothesized that genes specific for this strain, which are expressed during passage of the bacterium through the gut, affect the phenotype. When the DNA of the type strain was hybridized against a microarray of the sequenced NCC533 strain, we identified 233 genes that were specific for the long-gut-persistence isolate. Whole-genome transcription analysis of the NCC533 strain using the microarray format identified 174 genes that were strongly and consistently expressed in the jejunum of mice monocolonized with this strain. Fusion of the two microarray data sets identified three gene loci that were both expressed in vivo and specific to the long-gut-persistence isolate. The identified genes included LJ1027 and LJ1028, two glycosyltransferase genes in the exopolysaccharide synthesis operon; LJ1654 to LJ1656, encoding a sugar phosphotransferase system (PTS) transporter annotated as mannose PTS; and LJ1680, whose product shares 30% amino acid identity with immunoglobulin A proteases from pathogenic bacteria. Knockout mutants were tested in vivo. The experiments revealed that deletion of LJ1654 to LJ1656 and LJ1680 decreased the gut residence time, while a mutant with a deleted exopolysaccharide biosynthesis cluster had a slightly increased residence time.
Integrative Analysis of GWASs, Human Protein Interaction, and Gene Expression Identified Gene Modules Associated With BMDs

PubMed Central

He, Hao; Zhang, Lei; Li, Jian; Wang, Yu-Ping; Zhang, Ji-Gang; Shen, Jie; Guo, Yan-Fang

2014-01-01

Context: To date, few systems genetics studies in the bone field have been performed. We designed our study from a systems-level perspective by integrating genome-wide association studies (GWASs), human protein-protein interaction (PPI) network, and gene expression to identify gene modules contributing to osteoporosis risk. Methods: First we searched for modules significantly enriched with bone mineral density (BMD)-associated genes in human PPI network by using 2 large meta-analysis GWAS datasets through a dense module search algorithm. One included 7 individual GWAS samples (Meta7). The other was from the Genetic Factors for Osteoporosis Consortium (GEFOS2). One was assigned as a discovery dataset and the other as an evaluation dataset, and vice versa. Results: In total, 42 modules and 129 modules were identified significantly in both Meta7 and GEFOS2 datasets for femoral neck and spine BMD, respectively. There were 3340 modules identified for hip BMD only in Meta7. As candidate modules, they were assessed for the biological relevance to BMD by gene set enrichment analysis in 2 expression profiles generated from circulating monocytes in subjects with low versus high BMD values. Interestingly, there were 2 modules significantly enriched in monocytes from the low BMD group in both gene expression datasets (nominal P value <.05). Two modules had 16 nonredundant genes. Functional enrichment analysis revealed that both modules were enriched for genes involved in Wnt receptor signaling and osteoblast differentiation. Conclusion: We highlighted 2 modules and novel genes playing important roles in the regulation of bone mass, providing important clues for therapeutic approaches for osteoporosis. PMID:25119315
Mapping a New Spontaneous Preterm Birth Susceptibility Gene, IGF1R, Using Linkage, Haplotype Sharing, and Association Analysis

PubMed Central

Luukkonen, Aino; Teramo, Kari; Puttonen, Hilkka; Ojaniemi, Marja; Varilo, Teppo; Chaudhari, Bimal P.; Plunkett, Jevon; Murray, Jeffrey C.; McCarroll, Steven A.; Muglia, Louis J.; Palotie, Aarno; Hallman, Mikko

2011-01-01

Preterm birth is the major cause of neonatal death and serious morbidity. Most preterm births are due to spontaneous onset of labor without a known cause or effective prevention. Both maternal and fetal genomes influence the predisposition to spontaneous preterm birth (SPTB), but the susceptibility loci remain to be defined. We utilized a combination of unique population structures, family-based linkage analysis, and subsequent case-control association to identify a susceptibility haplotype for SPTB. Clinically well-characterized SPTB families from northern Finland, a subisolate founded by a relatively small founder population that has subsequently experienced a number of bottlenecks, were selected for the initial discovery sample. Genome-wide linkage analysis using a high-density single-nucleotide polymorphism (SNP) array in seven large northern Finnish non-consanginous families identified a locus on 15q26.3 (HLOD 4.68). This region contains the IGF1R gene, which encodes the type 1 insulin-like growth factor receptor IGF-1R. Haplotype segregation analysis revealed that a 55 kb 12-SNP core segment within the IGF1R gene was shared identical-by-state (IBS) in five families. A follow-up case-control study in an independent sample representing the more general Finnish population showed an association of a 6-SNP IGF1R haplotype with SPTB in the fetuses, providing further evidence for IGF1R as a SPTB predisposition gene (frequency in cases versus controls 0.11 versus 0.05, P = 0.001, odds ratio 2.3). This study demonstrates the identification of a predisposing, low-frequency haplotype in a multifactorial trait using a well-characterized population and a combination of family and case-control designs. Our findings support the identification of the novel susceptibility gene IGF1R for predisposition by the fetal genome to being born preterm. PMID:21304894
A multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors for functional gene analysis.

PubMed

Weber, Kristoffer; Bartsch, Udo; Stocking, Carol; Fehse, Boris

2008-04-01

Functional gene analysis requires the possibility of overexpression, as well as downregulation of one, or ideally several, potentially interacting genes. Lentiviral vectors are well suited for this purpose as they ensure stable expression of complementary DNAs (cDNAs), as well as short-hairpin RNAs (shRNAs), and can efficiently transduce a wide spectrum of cell targets when packaged within the coat proteins of other viruses. Here we introduce a multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors designed according to the "building blocks" principle. Using a wide spectrum of different fluorescent markers, including drug-selectable enhanced green fluorescent protein (eGFP)- and dTomato-blasticidin-S resistance fusion proteins, LeGO vectors allow simultaneous analysis of multiple genes and shRNAs of interest within single, easily identifiable cells. Furthermore, each functional module is flanked by unique cloning sites, ensuring flexibility and individual optimization. The efficacy of these vectors for analyzing multiple genes in a single cell was demonstrated in several different cell types, including hematopoietic, endothelial, and neural stem and progenitor cells, as well as hepatocytes. LeGO vectors thus represent a valuable tool for investigating gene networks using conditional ectopic expression and knock-down approaches simultaneously.
Comparative and Evolutionary Analysis of the HES/HEY Gene Family Reveal Exon/Intron Loss and Teleost Specific Duplication Events

PubMed Central

Ma, Zhaowu; Zhou, Yang; Abbood, Nibras Najm; Liu, Jianfeng; Su, Li; Jia, Haibo; Guo, An-Yuan

2012-01-01

Background HES/HEY genes encode a family of basic helix-loop-helix (bHLH) transcription factors with both bHLH and Orange domain. HES/HEY proteins are direct targets of the Notch signaling pathway and play an essential role in developmental decisions, such as the developments of nervous system, somitogenesis, blood vessel and heart. Despite their important functions, the origin and evolution of this HES/HEY gene family has yet to be elucidated. Methods and Findings In this study, we identified genes of the HES/HEY family in representative species and performed evolutionary analysis to elucidate their origin and evolutionary process. Our results showed that the HES/HEY genes only existed in metazoans and may originate from the common ancestor of metazoans. We identified HES/HEY genes in more than 10 species representing the main lineages. Combining the bHLH and Orange domain sequences, we constructed the phylogenetic trees by different methods (Bayesian, ML, NJ and ME) and classified the HES/HEY gene family into four groups. Our results indicated that this gene family had undergone three expansions, which were along with the origins of Eumetazoa, vertebrate, and teleost. Gene structure analysis revealed that the HES/HEY genes were involved in exon and/or intron loss in different species lineages. Genes of this family were duplicated in bony fishes and doubled than other vertebrates. Furthermore, we studied the teleost-specific duplications in zebrafish and investigated the expression pattern of duplicated genes in different tissues by RT-PCR. Finally, we proposed a model to show the evolution of this gene family with processes of expansion, exon/intron loss, and motif loss. Conclusions Our study revealed the evolution of HES/HEY gene family, the expression and function divergence of duplicated genes, which also provide clues for the research of Notch function in development. This study shows a model of gene family analysis with gene structure evolution and
Functional Module Analysis for Gene Coexpression Networks with Network Integration.

PubMed

Zhang, Shuqin; Zhao, Hongyu; Ng, Michael K

2015-01-01

Network has been a general tool for studying the complex interactions between different genes, proteins, and other small molecules. Module as a fundamental property of many biological networks has been widely studied and many computational methods have been proposed to identify the modules in an individual network. However, in many cases, a single network is insufficient for module analysis due to the noise in the data or the tuning of parameters when building the biological network. The availability of a large amount of biological networks makes network integration study possible. By integrating such networks, more informative modules for some specific disease can be derived from the networks constructed from different tissues, and consistent factors for different diseases can be inferred. In this paper, we have developed an effective method for module identification from multiple networks under different conditions. The problem is formulated as an optimization model, which combines the module identification in each individual network and alignment of the modules from different networks together. An approximation algorithm based on eigenvector computation is proposed. Our method outperforms the existing methods, especially when the underlying modules in multiple networks are different in simulation studies. We also applied our method to two groups of gene coexpression networks for humans, which include one for three different cancers, and one for three tissues from the morbidly obese patients. We identified 13 modules with three complete subgraphs, and 11 modules with two complete subgraphs, respectively. The modules were validated through Gene Ontology enrichment and KEGG pathway enrichment analysis. We also showed that the main functions of most modules for the corresponding disease have been addressed by other researchers, which may provide the theoretical basis for further studying the modules experimentally.
The association of multiple interacting genes with specific phenotypes in rice using gene coexpression networks.

PubMed

Ficklin, Stephen P; Luo, Feng; Feltus, F Alex

2010-09-01

Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes.
Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm.

PubMed

Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder; Murphy, Denis J

2018-01-01

Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops.
Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm

PubMed Central

Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E.; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder

2018-01-01

Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops. PMID:29672525
Gene-centric meta-analysis in 87,736 individuals of European ancestry identifies multiple blood-pressure-related loci.

PubMed

Tragante, Vinicius; Barnes, Michael R; Ganesh, Santhi K; Lanktree, Matthew B; Guo, Wei; Franceschini, Nora; Smith, Erin N; Johnson, Toby; Holmes, Michael V; Padmanabhan, Sandosh; Karczewski, Konrad J; Almoguera, Berta; Barnard, John; Baumert, Jens; Chang, Yen-Pei Christy; Elbers, Clara C; Farrall, Martin; Fischer, Mary E; Gaunt, Tom R; Gho, Johannes M I H; Gieger, Christian; Goel, Anuj; Gong, Yan; Isaacs, Aaron; Kleber, Marcus E; Mateo Leach, Irene; McDonough, Caitrin W; Meijs, Matthijs F L; Melander, Olle; Nelson, Christopher P; Nolte, Ilja M; Pankratz, Nathan; Price, Tom S; Shaffer, Jonathan; Shah, Sonia; Tomaszewski, Maciej; van der Most, Peter J; Van Iperen, Erik P A; Vonk, Judith M; Witkowska, Kate; Wong, Caroline O L; Zhang, Li; Beitelshees, Amber L; Berenson, Gerald S; Bhatt, Deepak L; Brown, Morris; Burt, Amber; Cooper-DeHoff, Rhonda M; Connell, John M; Cruickshanks, Karen J; Curtis, Sean P; Davey-Smith, George; Delles, Christian; Gansevoort, Ron T; Guo, Xiuqing; Haiqing, Shen; Hastie, Claire E; Hofker, Marten H; Hovingh, G Kees; Kim, Daniel S; Kirkland, Susan A; Klein, Barbara E; Klein, Ronald; Li, Yun R; Maiwald, Steffi; Newton-Cheh, Christopher; O'Brien, Eoin T; Onland-Moret, N Charlotte; Palmas, Walter; Parsa, Afshin; Penninx, Brenda W; Pettinger, Mary; Vasan, Ramachandran S; Ranchalis, Jane E; M Ridker, Paul; Rose, Lynda M; Sever, Peter; Shimbo, Daichi; Steele, Laura; Stolk, Ronald P; Thorand, Barbara; Trip, Mieke D; van Duijn, Cornelia M; Verschuren, W Monique; Wijmenga, Cisca; Wyatt, Sharon; Young, J Hunter; Zwinderman, Aeilko H; Bezzina, Connie R; Boerwinkle, Eric; Casas, Juan P; Caulfield, Mark J; Chakravarti, Aravinda; Chasman, Daniel I; Davidson, Karina W; Doevendans, Pieter A; Dominiczak, Anna F; FitzGerald, Garret A; Gums, John G; Fornage, Myriam; Hakonarson, Hakon; Halder, Indrani; Hillege, Hans L; Illig, Thomas; Jarvik, Gail P; Johnson, Julie A; Kastelein, John J P; Koenig, Wolfgang; Kumari, Meena; März, Winfried; Murray, Sarah S; O'Connell, Jeffery R; Oldehinkel, Albertine J; Pankow, James S; Rader, Daniel J; Redline, Susan; Reilly, Muredach P; Schadt, Eric E; Kottke-Marchant, Kandice; Snieder, Harold; Snyder, Michael; Stanton, Alice V; Tobin, Martin D; Uitterlinden, André G; van der Harst, Pim; van der Schouw, Yvonne T; Samani, Nilesh J; Watkins, Hugh; Johnson, Andrew D; Reiner, Alex P; Zhu, Xiaofeng; de Bakker, Paul I W; Levy, Daniel; Asselbergs, Folkert W; Munroe, Patricia B; Keating, Brendan J

2014-03-06

Blood pressure (BP) is a heritable risk factor for cardiovascular disease. To investigate genetic associations with systolic BP (SBP), diastolic BP (DBP), mean arterial pressure (MAP), and pulse pressure (PP), we genotyped ~50,000 SNPs in up to 87,736 individuals of European ancestry and combined these in a meta-analysis. We replicated findings in an independent set of 68,368 individuals of European ancestry. Our analyses identified 11 previously undescribed associations in independent loci containing 31 genes including PDE1A, HLA-DQB1, CDK6, PRKAG2, VCL, H19, NUCB2, RELA, HOXC@ complex, FBN1, and NFAT5 at the Bonferroni-corrected array-wide significance threshold (p < 6 × 10(-7)) and confirmed 27 previously reported associations. Bioinformatic analysis of the 11 loci provided support for a putative role in hypertension of several genes, such as CDK6 and NUCB2. Analysis of potential pharmacological targets in databases of small molecules showed that ten of the genes are predicted to be a target for small molecules. In summary, we identified previously unknown loci associated with BP. Our findings extend our understanding of genes involved in BP regulation, which may provide new targets for therapeutic intervention or drug response stratification. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Comparison of reverse transcription-quantitative polymerase chain reaction methods and platforms for single cell gene expression analysis.

PubMed

Fox, Bridget C; Devonshire, Alison S; Baradez, Marc-Olivier; Marshall, Damian; Foy, Carole A

2012-08-15

Single cell gene expression analysis can provide insights into development and disease progression by profiling individual cellular responses as opposed to reporting the global average of a population. Reverse transcription-quantitative polymerase chain reaction (RT-qPCR) is the "gold standard" for the quantification of gene expression levels; however, the technical performance of kits and platforms aimed at single cell analysis has not been fully defined in terms of sensitivity and assay comparability. We compared three kits using purification columns (PicoPure) or direct lysis (CellsDirect and Cells-to-CT) combined with a one- or two-step RT-qPCR approach using dilutions of cells and RNA standards to the single cell level. Single cell-level messenger RNA (mRNA) analysis was possible using all three methods, although the precision, linearity, and effect of lysis buffer and cell background differed depending on the approach used. The impact of using a microfluidic qPCR platform versus a standard instrument was investigated for potential variability introduced by preamplification of template or scaling down of the qPCR to nanoliter volumes using laser-dissected single cell samples. The two approaches were found to be comparable. These studies show that accurate gene expression analysis is achievable at the single cell level and highlight the importance of well-validated experimental procedures for low-level mRNA analysis. Copyright © 2012 Elsevier Inc. All rights reserved.
Bioinformatics Analysis of NBS-LRR Encoding Resistance Genes in Setaria italica.

PubMed

Zhao, Yan; Weng, Qiaoyun; Song, Jinhui; Ma, Hailian; Yuan, Jincheng; Dong, Zhiping; Liu, Yinghui

2016-06-01

In plants, resistance (R) genes are involved in pathogen recognition and subsequent activation of innate immune responses. The nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes family forms the largest R-gene family among plant genomes and play an important role in plant disease resistance. In this paper, comprehensive analysis of NBS-encoding genes is performed in the whole Setaria italica genome. A total of 96 NBS-LRR genes are identified, and comprehensive overview of the NBS-LRR genes is undertaken, including phylogenetic analysis, chromosome locations, conserved motifs of proteins, and gene expression. Based on the domain, these genes are divided into two groups and distributed in all Setaria italica chromosomes. Most NBS-LRR genes are located at the distal tip of the long arms of the chromosomes. Setaria italica NBS-LRR proteins share at least one nucleotide-biding domain and one leucine-rich repeat domain. Our results also show the duplication of NBS-LRR genes in Setaria italica is related to their gene structure.
Blood transcriptomic comparison of individuals with and without autism spectrum disorder: A combined-samples mega-analysis.

PubMed

Tylee, Daniel S; Hess, Jonathan L; Quinn, Thomas P; Barve, Rahul; Huang, Hailiang; Zhang-James, Yanli; Chang, Jeffrey; Stamova, Boryana S; Sharp, Frank R; Hertz-Picciotto, Irva; Faraone, Stephen V; Kong, Sek Won; Glatt, Stephen J

2017-04-01

Blood-based microarray studies comparing individuals affected with autism spectrum disorder (ASD) and typically developing individuals help characterize differences in circulating immune cell functions and offer potential biomarker signal. We sought to combine the subject-level data from previously published studies by mega-analysis to increase the statistical power. We identified studies that compared ex vivo blood or lymphocytes from ASD-affected individuals and unrelated comparison subjects using Affymetrix or Illumina array platforms. Raw microarray data and clinical meta-data were obtained from seven studies, totaling 626 affected and 447 comparison subjects. Microarray data were processed using uniform methods. Covariate-controlled mixed-effect linear models were used to identify gene transcripts and co-expression network modules that were significantly associated with diagnostic status. Permutation-based gene-set analysis was used to identify functionally related sets of genes that were over- and under-expressed among ASD samples. Our results were consistent with diminished interferon-, EGF-, PDGF-, PI3K-AKT-mTOR-, and RAS-MAPK-signaling cascades, and increased ribosomal translation and NK-cell related activity in ASD. We explored evidence for sex-differences in the ASD-related transcriptomic signature. We also demonstrated that machine-learning classifiers using blood transcriptome data perform with moderate accuracy when data are combined across studies. Comparing our results with those from blood-based studies of protein biomarkers (e.g., cytokines and trophic factors), we propose that ASD may feature decoupling between certain circulating signaling proteins (higher in ASD samples) and the transcriptional cascades which they typically elicit within circulating immune cells (lower in ASD samples). These findings provide insight into ASD-related transcriptional differences in circulating immune cells. © 2016 Wiley Periodicals, Inc. © 2016 Wiley
RaMP: A Comprehensive Relational Database of Metabolomics Pathways for Pathway Enrichment Analysis of Genes and Metabolites

PubMed Central

Zhang, Bofei; Hu, Senyang; Baskin, Elizabeth; Patt, Andrew; Siddiqui, Jalal K.

2018-01-01

The value of metabolomics in translational research is undeniable, and metabolomics data are increasingly generated in large cohorts. The functional interpretation of disease-associated metabolites though is difficult, and the biological mechanisms that underlie cell type or disease-specific metabolomics profiles are oftentimes unknown. To help fully exploit metabolomics data and to aid in its interpretation, analysis of metabolomics data with other complementary omics data, including transcriptomics, is helpful. To facilitate such analyses at a pathway level, we have developed RaMP (Relational database of Metabolomics Pathways), which combines biological pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG), Reactome, WikiPathways, and the Human Metabolome DataBase (HMDB). To the best of our knowledge, an off-the-shelf, public database that maps genes and metabolites to biochemical/disease pathways and can readily be integrated into other existing software is currently lacking. For consistent and comprehensive analysis, RaMP enables batch and complex queries (e.g., list all metabolites involved in glycolysis and lung cancer), can readily be integrated into pathway analysis tools, and supports pathway overrepresentation analysis given a list of genes and/or metabolites of interest. For usability, we have developed a RaMP R package (https://github.com/Mathelab/RaMP-DB), including a user-friendly RShiny web application, that supports basic simple and batch queries, pathway overrepresentation analysis given a list of genes or metabolites of interest, and network visualization of gene-metabolite relationships. The package also includes the raw database file (mysql dump), thereby providing a stand-alone downloadable framework for public use and integration with other tools. In addition, the Python code needed to recreate the database on another system is also publicly available (https://github.com/Mathelab/RaMP-BackEnd). Updates for databases in RaMP will be
RaMP: A Comprehensive Relational Database of Metabolomics Pathways for Pathway Enrichment Analysis of Genes and Metabolites.

PubMed

Zhang, Bofei; Hu, Senyang; Baskin, Elizabeth; Patt, Andrew; Siddiqui, Jalal K; Mathé, Ewy A

2018-02-22

The value of metabolomics in translational research is undeniable, and metabolomics data are increasingly generated in large cohorts. The functional interpretation of disease-associated metabolites though is difficult, and the biological mechanisms that underlie cell type or disease-specific metabolomics profiles are oftentimes unknown. To help fully exploit metabolomics data and to aid in its interpretation, analysis of metabolomics data with other complementary omics data, including transcriptomics, is helpful. To facilitate such analyses at a pathway level, we have developed RaMP (Relational database of Metabolomics Pathways), which combines biological pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG), Reactome, WikiPathways, and the Human Metabolome DataBase (HMDB). To the best of our knowledge, an off-the-shelf, public database that maps genes and metabolites to biochemical/disease pathways and can readily be integrated into other existing software is currently lacking. For consistent and comprehensive analysis, RaMP enables batch and complex queries (e.g., list all metabolites involved in glycolysis and lung cancer), can readily be integrated into pathway analysis tools, and supports pathway overrepresentation analysis given a list of genes and/or metabolites of interest. For usability, we have developed a RaMP R package (https://github.com/Mathelab/RaMP-DB), including a user-friendly RShiny web application, that supports basic simple and batch queries, pathway overrepresentation analysis given a list of genes or metabolites of interest, and network visualization of gene-metabolite relationships. The package also includes the raw database file (mysql dump), thereby providing a stand-alone downloadable framework for public use and integration with other tools. In addition, the Python code needed to recreate the database on another system is also publicly available (https://github.com/Mathelab/RaMP-BackEnd). Updates for databases in RaMP will be
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus

NASA Astrophysics Data System (ADS)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.
shinyGISPA: A web application for characterizing phenotype by gene sets using multiple omics data combinations.

PubMed

Dwivedi, Bhakti; Kowalski, Jeanne

2018-01-01

While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/.
shinyGISPA: A web application for characterizing phenotype by gene sets using multiple omics data combinations

PubMed Central

Dwivedi, Bhakti

2018-01-01

While many methods exist for integrating multi-omics data or defining gene sets, there is no one single tool that defines gene sets based on merging of multiple omics data sets. We present shinyGISPA, an open-source application with a user-friendly web-based interface to define genes according to their similarity in several molecular changes that are driving a disease phenotype. This tool was developed to help facilitate the usability of a previously published method, Gene Integrated Set Profile Analysis (GISPA), among researchers with limited computer-programming skills. The GISPA method allows the identification of multiple gene sets that may play a role in the characterization, clinical application, or functional relevance of a disease phenotype. The tool provides an automated workflow that is highly scalable and adaptable to applications that go beyond genomic data merging analysis. It is available at http://shinygispa.winship.emory.edu/shinyGISPA/. PMID:29415010

Combined antitumor gene therapy with herpes simplex virus-thymidine kinase and short hairpin RNA specific for mammalian target of rapamycin.

PubMed

Woo, Ha-Na; Lee, Won Il; Kim, Ji Hyun; Ahn, Jeonghyun; Han, Jeong Hee; Lim, Sue Yeon; Lee, Won Woo; Lee, Heuiran

2015-12-01

A proof-of-concept study is presented using dual gene therapy that employed a small hairpin RNA (shRNA) specific for mammalian target of rapamycin (mTOR) and a herpes simplex virus-thymidine kinase (HSV-TK) gene to inhibit the growth of tumors. Recombinant adeno-associated virus (rAAV) vectors containing a mutant TK gene (sc39TK) were transduced into HeLa cells, and the prodrug ganciclovir (GCV) was administered to establish a suicide gene-therapy strategy. Additionally, rAAV vectors expressing an mTOR-targeted shRNA were employed to suppress mTOR-dependent tumor growth. GCV selectively induced death in tumor cells expressing TK, and the mTOR-targeted shRNA altered the cell cycle to impair tumor growth. Combining the TK-GCV system with mTOR inhibition suppressed tumor growth to a greater extent than that achieved with either treatment alone. Furthermore, HSV-TK expression and mTOR inhibition did not mutually interfere with each other. In conclusion, gene therapy that combines the TK-GCV system and mTOR inhibition shows promise as a novel strategy for cancer therapy.
Transcriptomic analysis in the developing zebrafish embryo after compound exposure: Individual gene expression and pathway regulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl; Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht; Institute for Risk Assessment Sciences

2013-10-01

The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol andmore » saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.« less
Seeking unique and common biological themes in multiple gene lists or datasets: pathway pattern extraction pipeline for pathway-level comparative analysis.

PubMed

Yi, Ming; Mudunuri, Uma; Che, Anney; Stephens, Robert M

2009-06-29

One of the challenges in the analysis of microarray data is to integrate and compare the selected (e.g., differential) gene lists from multiple experiments for common or unique underlying biological themes. A common way to approach this problem is to extract common genes from these gene lists and then subject these genes to enrichment analysis to reveal the underlying biology. However, the capacity of this approach is largely restricted by the limited number of common genes shared by datasets from multiple experiments, which could be caused by the complexity of the biological system itself. We now introduce a new Pathway Pattern Extraction Pipeline (PPEP), which extends the existing WPS application by providing a new pathway-level comparative analysis scheme. To facilitate comparing and correlating results from different studies and sources, PPEP contains new interfaces that allow evaluation of the pathway-level enrichment patterns across multiple gene lists. As an exploratory tool, this analysis pipeline may help reveal the underlying biological themes at both the pathway and gene levels. The analysis scheme provided by PPEP begins with multiple gene lists, which may be derived from different studies in terms of the biological contexts, applied technologies, or methodologies. These lists are then subjected to pathway-level comparative analysis for extraction of pathway-level patterns. This analysis pipeline helps to explore the commonality or uniqueness of these lists at the level of pathways or biological processes from different but relevant biological systems using a combination of statistical enrichment measurements, pathway-level pattern extraction, and graphical display of the relationships of genes and their associated pathways as Gene-Term Association Networks (GTANs) within the WPS platform. As a proof of concept, we have used the new method to analyze many datasets from our collaborators as well as some public microarray datasets. This tool provides a
Bioinformatic Analysis of Strawberry GSTF12 Gene

NASA Astrophysics Data System (ADS)

Wang, Xiran; Jiang, Leiyu; Tang, Haoru

2018-01-01

GSTF12 has always been known as a key factor of proanthocyanins accumulate in plant testa. Through bioinformatics analysis of the nucleotide and encoded protein sequence of GSTF12, it is more advantageous to the study of genes related to anthocyanin biosynthesis accumulation pathway. Therefore, we chosen GSTF12 gene of 11 kinds species, downloaded their nucleotide and protein sequence from NCBI as the research object, found strawberry GSTF12 gene via bioinformation analyse, constructed phylogenetic tree. At the same time, we analysed the strawberry GSTF12 gene of physical and chemical properties and its protein structure and so on. The phylogenetic tree showed that Strawberry and petunia were closest relative. By the protein prediction, we found that the protein owed one proper signal peptide without obvious transmembrane regions.
Evaluation and Selection of Appropriate Reference Genes for Real-Time Quantitative PCR Analysis of Gene Expression in Nile Tilapia (Oreochromis niloticus) during Vaccination and Infection

PubMed Central

Wang, Erlong; Wang, Kaiyu; Chen, Defang; Wang, Jun; He, Yang; Long, Bo; Yang, Lei; Yang, Qian; Geng, Yi; Huang, Xiaoli; Ouyang, Ping; Lai, Weimin

2015-01-01

qPCR as a powerful and attractive methodology has been widely applied to aquaculture researches for gene expression analyses. However, the suitable reference selection is critical for normalizing target genes expression in qPCR. In the present study, six commonly used endogenous controls were selected as candidate reference genes to evaluate and analyze their expression levels, stabilities and normalization to immune-related gene IgM expression during vaccination and infection in spleen of tilapia with RefFinder and GeNorm programs. The results showed that all of these candidate reference genes exhibited transcriptional variations to some extent at different periods. Among them, EF1A was the most stable reference with RefFinder, followed by 18S rRNA, ACTB, UBCE, TUBA and GAPDH respectively and the optimal number of reference genes for IgM normalization under different experiment sets was two with GeNorm. Meanwhile, combination the Cq (quantification cycle) value and the recommended comprehensive ranking of reference genes, EF1A and ACTB, the two optimal reference genes, were used together as reference genes for accurate analysis of immune-related gene expression during vaccination and infection in Nile tilapia with qPCR. Moreover, the highest IgM expression level was at two weeks post-vaccination when normalized to EF1A, 18S rRNA, ACTB, and EF1A together with ACTB compared to one week post-vaccination before normalizing, which was also consistent with the IgM antibody titers detection by ELISA. PMID:25941937
GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

PubMed

Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

2016-03-01

Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics
Systemic bioinformatics analysis of skeletal muscle gene expression profiles of sepsis

PubMed Central

Yang, Fang; Wang, Yumei

2018-01-01

Sepsis is a type of systemic inflammatory response syndrome with high morbidity and mortality. Skeletal muscle dysfunction is one of the major complications of sepsis that may also influence the outcome of sepsis. The aim of the present study was to explore and identify potential mechanisms and therapeutic targets of sepsis. Systemic bioinformatics analysis of skeletal muscle gene expression profiles from the Gene Expression Omnibus was performed. Differentially expressed genes (DEGs) in samples from patients with sepsis and control samples were screened out using the limma package. Differential co-expression and coregulation (DCE and DCR, respectively) analysis was performed based on the Differential Co-expression Analysis package to identify differences in gene co-expression and coregulation patterns between the control and sepsis groups. Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways of DEGs were identified using the Database for Annotation, Visualization and Integrated Discovery, and inflammatory, cancer and skeletal muscle development-associated biological processes and pathways were identified. DCE and DCR analysis revealed several potential therapeutic targets for sepsis, including genes and transcription factors. The results of the present study may provide a basis for the development of novel therapeutic targets and treatment methods for sepsis. PMID:29805480
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks

DOE PAGES

Peng, Jiajie; Uygun, Sahra; Kim, Taehyong; ...

2015-02-14

Background: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. Results: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstratemore » that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. Conclusions: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited.« less
The Role of Tomato WRKY Genes in Plant Responses to Combined Abiotic and Biotic Stresses

PubMed Central

Bai, Yuling; Sunarti, Sri; Kissoudis, Christos; Visser, Richard G. F.; van der Linden, C. G.

2018-01-01

In the field, plants constantly face a plethora of abiotic and biotic stresses that can impart detrimental effects on plants. In response to multiple stresses, plants can rapidly reprogram their transcriptome through a tightly regulated and highly dynamic regulatory network where WRKY transcription factors can act as activators or repressors. WRKY transcription factors have diverse biological functions in plants, but most notably are key players in plant responses to biotic and abiotic stresses. In tomato there are 83 WRKY genes identified. Here we review recent progress on functions of these tomato WRKY genes and their homologs in other plant species, such as Arabidopsis and rice, with a special focus on their involvement in responses to abiotic and biotic stresses. In particular, we highlight WRKY genes that play a role in plant responses to a combination of abiotic and biotic stresses.
Mosaic analysis of gene function in postnatal mouse brain development by using virus-based Cre recombination.

PubMed

Gibson, Daniel A; Ma, Le

2011-08-01

Normal brain function relies not only on embryonic development when major neuronal pathways are established, but also on postnatal development when neural circuits are matured and refined. Misregulation at this stage may lead to neurological and psychiatric disorders such as autism and schizophrenia. Many genes have been studied in the prenatal brain and found crucial to many developmental processes. However, their function in the postnatal brain is largely unknown, partly because their deletion in mice often leads to lethality during neonatal development, and partly because their requirement in early development hampers the postnatal analysis. To overcome these obstacles, floxed alleles of these genes are currently being generated in mice. When combined with transgenic alleles that express Cre recombinase in specific cell types, conditional deletion can be achieved to study gene function in the postnatal brain. However, this method requires additional alleles and extra time (3-6 months) to generate the mice with appropriate genotypes, thereby limiting the expansion of the genetic analysis to a large scale in the mouse brain. Here we demonstrate a complementary approach that uses virally-expressed Cre to study these floxed alleles rapidly and systematically in postnatal brain development. By injecting recombinant adeno-associated viruses (rAAVs) encoding Cre into the neonatal brain, we are able to delete the gene of interest in different regions of the brain. By controlling the viral titer and coexpressing a fluorescent protein marker, we can simultaneously achieve mosaic gene inactivation and sparse neuronal labeling. This method bypasses the requirement of many genes in early development, and allows us to study their cell autonomous function in many critical processes in postnatal brain development, including axonal and dendritic growth, branching, and tiling, as well as synapse formation and refinement. This method has been used successfully in our own lab
Phylogeny of flowering plants by the chloroplast genome sequences: in search of a "lucky gene".

PubMed

Logacheva, M D; Penin, A A; Samigullin, T H; Vallejo-Roman, C M; Antonov, A S

2007-12-01

One of the most complicated remaining problems of molecular-phylogenetic analysis is choosing an appropriate genome region. In an ideal case, such a region should have two specific properties: (i) results of analysis using this region should be similar to the results of multigene analysis using the maximal number of regions; (ii) this region should be arranged compactly and be significantly shorter than the multigene set. The second condition is necessary to facilitate sequencing and extension of taxons under analysis, the number of which is also crucial for molecular phylogenetic analysis. Such regions have been revealed for some groups of animals and have been designated as "lucky genes". We have carried out a computational experiment on analysis of 41 complete chloroplast genomes of flowering plants aimed at searching for a "lucky gene" for reconstruction of their phylogeny. It is shown that the phylogenetic tree inferred from a combination of translated nucleotide sequences of genes encoding subunits of plastid RNA polymerase is closest to the tree constructed using all protein coding sites of the chloroplast genome. The only node for which a contradiction is observed is unstable according to the different type analyses. For all the other genes or their combinations, the coincidence is significantly worse. The RNA polymerase genes are compactly arranged in the genome and are fourfold shorter than the total length of protein coding genes used for phylogenetic analysis. The combination of all necessary features makes this group of genes main candidates for the role of "lucky gene" in studying phylogeny of flowering plants.
Functional Analysis of the Arabidopsis TETRASPANIN Gene Family in Plant Growth and Development1[OPEN

PubMed Central

Wang, Feng; Muto, Antonella; Van de Velde, Jan; Neyt, Pia; Himanen, Kristiina; Vandepoele, Klaas; Van Lijsebettens, Mieke

2015-01-01

TETRASPANIN (TET) genes encode conserved integral membrane proteins that are known in animals to function in cellular communication during gamete fusion, immunity reaction, and pathogen recognition. In plants, functional information is limited to one of the 17 members of the Arabidopsis (Arabidopsis thaliana) TET gene family and to expression data in reproductive stages. Here, the promoter activity of all 17 Arabidopsis TET genes was investigated by pAtTET::NUCLEAR LOCALIZATION SIGNAL-GREEN FLUORESCENT PROTEIN/β-GLUCURONIDASE reporter lines throughout the life cycle, which predicted functional divergence in the paralogous genes per clade. However, partial overlap was observed for many TET genes across the clades, correlating with few phenotypes in single mutants and, therefore, requiring double mutant combinations for functional investigation. Mutational analysis showed a role for TET13 in primary root growth and lateral root development and redundant roles for TET5 and TET6 in leaf and root growth through negative regulation of cell proliferation. Strikingly, a number of TET genes were expressed in embryonic and seedling progenitor cells and remained expressed until the differentiation state in the mature plant, suggesting a dynamic function over developmental stages. The cis-regulatory elements together with transcription factor-binding data provided molecular insight into the sites, conditions, and perturbations that affect TET gene expression and positioned the TET genes in different molecular pathways; the data represent a hypothesis-generating resource for further functional analyses. PMID:26417009
Genome wide analysis reveals Zic3 interaction with distal regulatory elements of stage specific developmental genes in zebrafish.

PubMed

Winata, Cecilia L; Kondrychyn, Igor; Kumar, Vibhor; Srinivasan, Kandhadayar G; Orlov, Yuriy; Ravishankar, Ashwini; Prabhakar, Shyam; Stanton, Lawrence W; Korzh, Vladimir; Mathavan, Sinnakaruppan

2013-10-01

Zic3 regulates early embryonic patterning in vertebrates. Loss of Zic3 function is known to disrupt gastrulation, left-right patterning, and neurogenesis. However, molecular events downstream of this transcription factor are poorly characterized. Here we use the zebrafish as a model to study the developmental role of Zic3 in vivo, by applying a combination of two powerful genomics approaches--ChIP-seq and microarray. Besides confirming direct regulation of previously implicated Zic3 targets of the Nodal and canonical Wnt pathways, analysis of gastrula stage embryos uncovered a number of novel candidate target genes, among which were members of the non-canonical Wnt pathway and the neural pre-pattern genes. A similar analysis in zic3-expressing cells obtained by FACS at segmentation stage revealed a dramatic shift in Zic3 binding site locations and identified an entirely distinct set of target genes associated with later developmental functions such as neural development. We demonstrate cis-regulation of several of these target genes by Zic3 using in vivo enhancer assay. Analysis of Zic3 binding sites revealed a distribution biased towards distal intergenic regions, indicative of a long distance regulatory mechanism; some of these binding sites are highly conserved during evolution and act as functional enhancers. This demonstrated that Zic3 regulation of developmental genes is achieved predominantly through long distance regulatory mechanism and revealed that developmental transitions could be accompanied by dramatic changes in regulatory landscape.
Combined antitumor activity of the nitroreductase/CB1954 suicide gene system and γ-rays in HeLa cells in vitro

PubMed Central

Teng, Geling; Ju, Yuanrong; Yang, Yepeng; Hua, Hu; Chi, Jingyu; Mu, Xiuan

2016-01-01

Escherichia coli nitroreductase (NTR) may convert the prodrug CB1954 (5-(aziridin-1-yl)-2,4-dinitrobenzamide) into a bifunctional alkylating agent, which may lead to DNA crosslinks and the apoptosis of cancer cells. NTR/CB1954 has been demonstrated to be an effective gene therapy in cancer cells. The present study examined whether the NTR/CB1954 suicide gene system had cytotoxic effects on HeLa cells and may improve the radiosensitivity of HeLa cells to γ-rays. It was observed that the NTR/CB1954 suicide gene system exerted marked cytotoxic effects on HeLa cells. The combined therapeutic effects of NTR/CB1954 and γ-rays on HeLa cells demonstrated a synergistic effect. CB1954 at concentrations of 12.5 and 25 µmol/l increased the sensitization enhancement ratio of HeLa cells to 1.54 and 1.66, respectively. Therefore, when compared with monotherapy, the combined therapy of NTR/CB1954 and γ-rays may increase the apoptotic rate and enhance the radiosensitivity of HeLa cells. The combined therapy of γ-ray radiation and the NTR/CB1954 suicide gene system may be a novel and potent therapeutic method for the treatment of cervical carcinoma. PMID:27840931
Antitumor activity of combined endostatin and thymidine kinase gene therapy in C6 glioma models.

PubMed

Chen, Yan; Huang, Honglan; Yao, Chunshan; Su, Fengbo; Guan, Wenming; Yan, Shijun; Ni, Zhaohui

2016-09-01

The combination of Endostatin (ES) and Herpes Simplex Virus thymidine kinase (HSV-TK) gene therapy is known to have antitumor activity in bladder cancer. The potential effect of ES and TK therapy in glioma has not yet been investigated. In this study, pTK-internal ribosome entry site (IRES), pIRES-ES, and pTK-IRES-ES plasmids were constructed; pIRES empty vector served as the negative control. The recombinant constructs were transfected into human umbilical vein endothelial cells (HUVECs) ECV304 and C6 rat glioma cell line. Ganciclovir (GCV) was used to induce cell death in transfected C6 cells. We found that ECV304 cells expressing either ES or TK-ES showed reduced proliferation, decreased migration capacity, and increased apoptosis, as compared to untransfected cells or controls. pTK-IRES-ES/GCV or pTK-IRES/GCV significantly suppressed cell proliferation and induced cell apoptosis in C6 cells, as compared to the control. In addition, the administration of pIRES-ES, pTK-IRES/GCV, or pTK-IRES-ES/GCV therapy improved animal activity and behavior; was associated with prolonged animal survival, and a lower microvessel density (MVD) value in tumor tissues of C6 glioma rats. In comparison to others, dual gene therapy in form of pTK-IRES-ES/GCV had a significant antitumor activity against C6 glioma. These findings indicate combined TK and ES gene therapy was associated with a superior antitumor efficacy as compared to single gene therapy in C6 glioma. © 2016 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
Genome-wide digital transcript analysis of putative fruitlet abscission related genes regulated by ethephon in litchi

PubMed Central

Li, Caiqin; Wang, Yan; Ying, Peiyuan; Ma, Wuqiang; Li, Jianguo

2015-01-01

The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn.) causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ) and can be triggered by ethylene. However, a deep knowledge of the molecular events occurring in the FAZ is still unknown. Here, genome-wide digital transcript abundance (DTA) analysis of putative fruit abscission related genes regulated by ethephon in litchi were studied. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2730 statistically significant candidate genes were involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1867 early-responsive genes whose expressions were up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors (TFs), protein ubiquitination, ROS response, calcium signal transduction, and cell wall modification. These genes could be clustered into four groups and 13 subgroups according to their similar expression patterns. qRT-PCR displayed the expression pattern of 41 selected candidate genes, which proved the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlet. The possible molecular events to control the ethephon-promoted litchi fruitlet abscission were prompted out. The increased ethylene evolution in fruitlet would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, it is the first time to monitor the gene expression profile occurring in the FAZ-enriched pedicel during litchi fruit abscission induced by ethephon on the genome-wide level. This study will contribute to a better
Evaluation of Different Normalization and Analysis Procedures for Illumina Gene Expression Microarray Data Involving Small Changes

PubMed Central

Johnstone, Daniel M.; Riveros, Carlos; Heidari, Moones; Graham, Ross M.; Trinder, Debbie; Berretta, Regina; Olynyk, John K.; Scott, Rodney J.; Moscato, Pablo; Milward, Elizabeth A.

2013-01-01

While Illumina microarrays can be used successfully for detecting small gene expression changes due to their high degree of technical replicability, there is little information on how different normalization and differential expression analysis strategies affect outcomes. To evaluate this, we assessed concordance across gene lists generated by applying different combinations of normalization strategy and analytical approach to two Illumina datasets with modest expression changes. In addition to using traditional statistical approaches, we also tested an approach based on combinatorial optimization. We found that the choice of both normalization strategy and analytical approach considerably affected outcomes, in some cases leading to substantial differences in gene lists and subsequent pathway analysis results. Our findings suggest that important biological phenomena may be overlooked when there is a routine practice of using only one approach to investigate all microarray datasets. Analytical artefacts of this kind are likely to be especially relevant for datasets involving small fold changes, where inherent technical variation—if not adequately minimized by effective normalization—may overshadow true biological variation. This report provides some basic guidelines for optimizing outcomes when working with Illumina datasets involving small expression changes. PMID:27605185
Functional network analysis of genes differentially expressed during xylogenesis in soc1ful woody Arabidopsis plants.

PubMed

Davin, Nicolas; Edger, Patrick P; Hefer, Charles A; Mizrachi, Eshchar; Schuetz, Mathias; Smets, Erik; Myburg, Alexander A; Douglas, Carl J; Schranz, Michael E; Lens, Frederic

2016-06-01

Many plant genes are known to be involved in the development of cambium and wood, but how the expression and functional interaction of these genes determine the unique biology of wood remains largely unknown. We used the soc1ful loss of function mutant - the woodiest genotype known in the otherwise herbaceous model plant Arabidopsis - to investigate the expression and interactions of genes involved in secondary growth (wood formation). Detailed anatomical observations of the stem in combination with mRNA sequencing were used to assess transcriptome remodeling during xylogenesis in wild-type and woody soc1ful plants. To interpret the transcriptome changes, we constructed functional gene association networks of differentially expressed genes using the STRING database. This analysis revealed functionally enriched gene association hubs that are differentially expressed in herbaceous and woody tissues. In particular, we observed the differential expression of genes related to mechanical stress and jasmonate biosynthesis/signaling during wood formation in soc1ful plants that may be an effect of greater tension within woody tissues. Our results suggest that habit shifts from herbaceous to woody life forms observed in many angiosperm lineages could have evolved convergently by genetic changes that modulate the gene expression and interaction network, and thereby redeploy the conserved wood developmental program. © 2016 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Identification of novel risk genes associated with type 1 diabetes mellitus using a genome-wide gene-based association analysis.

PubMed

Qiu, Ying-Hua; Deng, Fei-Yan; Li, Min-Jing; Lei, Shu-Feng

2014-11-01

Type 1 diabetes mellitus is a serious disorder characterized by destruction of pancreatic β-cells, culminating in absolute insulin deficiency. Genetic factors contribute to the susceptibility of type 1 diabetes mellitus. The aim of the present study was to identify more susceptibility genes of type 1 diabetes mellitus. We carried out an initial gene-based genome-wide association study in a total of 4,075 type 1 diabetes mellitus cases and 2,604 controls by using the Gene-based Association Test using Extended Simes procedure. Furthermore, we carried out replication studies, differential expression analysis and functional annotation clustering analysis to support the significance of the identified susceptibility genes. We identified 452 genes associated with type 1 diabetes mellitus, even after adapting the genome-wide threshold for significance (P < 9.05E-04). Among these genes, 171 were newly identified for type 1 diabetes mellitus, which were ignored in single-nucleotide polymorphism-based association analysis and were not previously reported. We found that 53 genes have supportive evidence from replication studies and/or differential expression studies. In particular, seven genes including four non-human leukocyte antigen (HLA) genes (RASIP1, STRN4, BCAR1 and MYL2) are replicated in at least one independent population and also differentially expressed in peripheral blood mononuclear cells or monocytes. Furthermore, the associated genes tend to enrich in immune-related pathways or Gene Ontology project terms. The present results suggest the high power of gene-based association analysis in detecting disease-susceptibility genes. Our findings provide more insights into the genetic basis of type 1 diabetes mellitus.
Visual gene-network analysis reveals the cancer gene co-expression in human endometrial cancer

PubMed Central

2014-01-01

Background Endometrial cancers (ECs) are the most common form of gynecologic malignancy. Recent studies have reported that ECs reveal distinct markers for molecular pathogenesis, which in turn is linked to the various histological types of ECs. To understand further the molecular events contributing to ECs and endometrial tumorigenesis in general, a more precise identification of cancer-associated molecules and signaling networks would be useful for the detection and monitoring of malignancy, improving clinical cancer therapy, and personalization of treatments. Results ECs-specific gene co-expression networks were constructed by differential expression analysis and weighted gene co-expression network analysis (WGCNA). Important pathways and putative cancer hub genes contribution to tumorigenesis of ECs were identified. An elastic-net regularized classification model was built using the cancer hub gene signatures to predict the phenotypic characteristics of ECs. The 19 cancer hub gene signatures had high predictive power to distinguish among three key principal features of ECs: grade, type, and stage. Intriguingly, these hub gene networks seem to contribute to ECs progression and malignancy via cell-cycle regulation, antigen processing and the citric acid (TCA) cycle. Conclusions The results of this study provide a powerful biomarker discovery platform to better understand the progression of ECs and to uncover potential therapeutic targets in the treatment of ECs. This information might lead to improved monitoring of ECs and resulting improvement of treatment of ECs, the 4th most common of cancer in women. PMID:24758163

Microarray Data Analysis of Space Grown Arabidopsis Leaves for Genes Important in Vascular Patterning. Analysis of Space Grown Arabidopsis with Microarray Data from GeneLab: Identification of Genes Important in Vascular Patterning

NASA Technical Reports Server (NTRS)

Weitzel, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.

2016-01-01

Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photo-assimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASA's GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be up-regulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS (Auxin-Regulated Gene Involved in Organ Size)-like protein (potentially affecting cell elongation in the leaves), and an F-box/kelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm up-regulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASA's VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.
The Reconstruction and Analysis of Gene Regulatory Networks.

PubMed

Zheng, Guangyong; Huang, Tao

2018-01-01

In post-genomic era, an important task is to explore the function of individual biological molecules (i.e., gene, noncoding RNA, protein, metabolite) and their organization in living cells. For this end, gene regulatory networks (GRNs) are constructed to show relationship between biological molecules, in which the vertices of network denote biological molecules and the edges of network present connection between nodes (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). Biologists can understand not only the function of biological molecules but also the organization of components of living cells through interpreting the GRNs, since a gene regulatory network is a comprehensively physiological map of living cells and reflects influence of genetic and epigenetic factors (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). In this paper, we will review the inference methods of GRN reconstruction and analysis approaches of network structure. As a powerful tool for studying complex diseases and biological processes, the applications of the network method in pathway analysis and disease gene identification will be introduced.
Proteomic analysis of isolated chlamydomonas centrioles reveals orthologs of ciliary-disease genes.

PubMed

Keller, Lani C; Romijn, Edwin P; Zamora, Ivan; Yates, John R; Marshall, Wallace F

2005-06-21

The centriole is one of the most enigmatic organelles in the cell. Centrioles are cylindrical, microtubule-based barrels found in the core of the centrosome. Centrioles also act as basal bodies during interphase to nucleate the assembly of cilia and flagella. There are currently only a handful of known centriole proteins. We used mass-spectrometry-based MudPIT (multidimensional protein identification technology) to identify the protein composition of basal bodies (centrioles) isolated from the green alga Chlamydomonas reinhardtii. This analysis detected the majority of known centriole proteins, including centrin, epsilon tubulin, and the cartwheel protein BLD10p. By combining proteomic data with information about gene expression and comparative genomics, we identified 45 cross-validated centriole candidate proteins in two classes. Members of the first class of proteins (BUG1-BUG27) are encoded by genes whose expression correlates with flagellar assembly and which therefore may play a role in ciliogenesis-related functions of basal bodies. Members of the second class (POC1-POC18) are implicated by comparative-genomics and -proteomics studies to be conserved components of the centriole. We confirmed centriolar localization for the human homologs of four candidate proteins. Three of the cross-validated centriole candidate proteins are encoded by orthologs of genes (OFD1, NPHP-4, and PACRG) implicated in mammalian ciliary function and disease, suggesting that oral-facial-digital syndrome and nephronophthisis may involve a dysfunction of centrioles and/or basal bodies. By analyzing isolated Chlamydomonas basal bodies, we have been able to obtain the first reported proteomic analysis of the centriole.
Combining ability analysis for within-boll yield components in upland cotton (Gossypium hirsutum L.).

PubMed

Imran, M; Shakeel, A; Azhar, F M; Farooq, J; Saleem, M F; Saeed, A; Nazeer, W; Riaz, M; Naeem, M; Javaid, A

2012-08-24

Cotton is an important cash crop worldwide, accounting for a large percentage of world agricultural exports; however, yield per acre is still poor in many countries, including Pakistan. Diallel mating system was used to identify parents for improving within-boll yield and fiber quality parameters. Combining ability analysis was employed to obtain suitable parents for this purpose. The parental genotypes CP-15/2, NIAB Krishma, CIM-482, MS-39, and S-12 were crossed in complete diallel mating under green house conditions during 2009. The F₀ seed of 20 hybrids and five parents were planted in the field in randomized complete block design with three replications during 2010. There were highly significant differences among all F₁ hybrids and their parents. Specific combining ability (SCA) variance was greater than general combining ability (GCA) variance for bolls per plant (9.987), seeds per boll (0.635), seed density (5.672), lint per seed (4.174), boll size (3.69), seed cotton yield (0.315), and lint percentage (0.470), showing predominance of non-additive genes; while seed volume (3.84) was controlled by additive gene action based on maximum GCA variance. Cultivar MS-39 was found to be the best general combiner for seed volume (0.102), seeds per boll (0.448), and lint per seed (0.038) and its utilization produced valuable hybrids, including MS-39 x NIAB Krishma and MS-39 x S-12. The parental line CIM-482 had high GCA effects for boll size (0.33) and seeds per boll (0.90). It also showed good SCA with S-12 and NIAB Krishma for bolls per plant, with CP- 15/2 for boll size, and with MS-39 for seeds per boll. The hybrids, namely, CP-15/2 x NIAB Krishma, NIAB Krishma x S-12, NIAB Krishma x CIM-482, MS-39 x NIAB Krishma, MS-39 x CP-15/2, and S-12 x MS-39 showed promising results. Correlation analysis revealed that seed cotton yield showed significant positive correlation with bolls per plant, boll size and seeds per boll while it showed negative correlation with lint
A Systems Biology Approach To Identify the Combination Effects of Human Herpesvirus 8 Genes on NF-κB Activation▿

PubMed Central

Konrad, Andreas; Wies, Effi; Thurau, Mathias; Marquardt, Gaby; Naschberger, Elisabeth; Hentschel, Sonja; Jochmann, Ramona; Schulz, Thomas F.; Erfle, Holger; Brors, Benedikt; Lausen, Berthold; Neipel, Frank; Stürzl, Michael

2009-01-01

Human herpesvirus 8 (HHV-8) is the etiologic agent of Kaposi's sarcoma and primary effusion lymphoma. Activation of the cellular transcription factor nuclear factor-kappa B (NF-κB) is essential for latent persistence of HHV-8, survival of HHV-8-infected cells, and disease progression. We used reverse-transfected cell microarrays (RTCM) as an unbiased systems biology approach to systematically analyze the effects of HHV-8 genes on the NF-κB signaling pathway. All HHV-8 genes individually (n = 86) and, additionally, all K and latent genes in pairwise combinations (n = 231) were investigated. Statistical analyses of more than 14,000 transfections identified ORF75 as a novel and confirmed K13 as a known HHV-8 activator of NF-κB. K13 and ORF75 showed cooperative NF-κB activation. Small interfering RNA-mediated knockdown of ORF75 expression demonstrated that this gene contributes significantly to NF-κB activation in HHV-8-infected cells. Furthermore, our approach confirmed K10.5 as an NF-κB inhibitor and newly identified K1 as an inhibitor of both K13- and ORF75-mediated NF-κB activation. All results obtained with RTCM were confirmed with classical transfection experiments. Our work describes the first successful application of RTCM for the systematic analysis of pathofunctions of genes of an infectious agent. With this approach, ORF75 and K1 were identified as novel HHV-8 regulatory molecules on the NF-κB signal transduction pathway. The genes identified may be involved in fine-tuning of the balance between latency and lytic replication, since this depends critically on the state of NF-κB activity. PMID:19129458
A method for gene-based pathway analysis using genomewide association study summary statistics reveals nine new type 1 diabetes associations.

PubMed

Evangelou, Marina; Smyth, Deborah J; Fortune, Mary D; Burren, Oliver S; Walker, Neil M; Guo, Hui; Onengut-Gumuscu, Suna; Chen, Wei-Min; Concannon, Patrick; Rich, Stephen S; Todd, John A; Wallace, Chris

2014-12-01

Pathway analysis can complement point-wise single nucleotide polymorphism (SNP) analysis in exploring genomewide association study (GWAS) data to identify specific disease-associated genes that can be candidate causal genes. We propose a straightforward methodology that can be used for conducting a gene-based pathway analysis using summary GWAS statistics in combination with widely available reference genotype data. We used this method to perform a gene-based pathway analysis of a type 1 diabetes (T1D) meta-analysis GWAS (of 7,514 cases and 9,045 controls). An important feature of the conducted analysis is the removal of the major histocompatibility complex gene region, the major genetic risk factor for T1D. Thirty-one of the 1,583 (2%) tested pathways were identified to be enriched for association with T1D at a 5% false discovery rate. We analyzed these 31 pathways and their genes to identify SNPs in or near these pathway genes that showed potentially novel association with T1D and attempted to replicate the association of 22 SNPs in additional samples. Replication P-values were skewed (P=9.85×10-11) with 12 of the 22 SNPs showing P<0.05. Support, including replication evidence, was obtained for nine T1D associated variants in genes ITGB7 (rs11170466, P=7.86×10-9), NRP1 (rs722988, 4.88×10-8), BAD (rs694739, 2.37×10-7), CTSB (rs1296023, 2.79×10-7), FYN (rs11964650, P=5.60×10-7), UBE2G1 (rs9906760, 5.08×10-7), MAP3K14 (rs17759555, 9.67×10-7), ITGB1 (rs1557150, 1.93×10-6), and IL7R (rs1445898, 2.76×10-6). The proposed methodology can be applied to other GWAS datasets for which only summary level data are available. © 2014 The Authors. ** Genetic Epidemiology published by Wiley Periodicals, Inc.
An improved method for functional similarity analysis of genes based on Gene Ontology.

PubMed

Tian, Zhen; Wang, Chunyu; Guo, Maozu; Liu, Xiaoyan; Teng, Zhixia

2016-12-23

Measures of gene functional similarity are essential tools for gene clustering, gene function prediction, evaluation of protein-protein interaction, disease gene prioritization and other applications. In recent years, many gene functional similarity methods have been proposed based on the semantic similarity of GO terms. However, these leading approaches may make errorprone judgments especially when they measure the specificity of GO terms as well as the IC of a term set. Therefore, how to estimate the gene functional similarity reliably is still a challenging problem. We propose WIS, an effective method to measure the gene functional similarity. First of all, WIS computes the IC of a term by employing its depth, the number of its ancestors as well as the topology of its descendants in the GO graph. Secondly, WIS calculates the IC of a term set by means of considering the weighted inherited semantics of terms. Finally, WIS estimates the gene functional similarity based on the IC overlap ratio of term sets. WIS is superior to some other representative measures on the experiments of functional classification of genes in a biological pathway, collaborative evaluation of GO-based semantic similarity measures, protein-protein interaction prediction and correlation with gene expression. Further analysis suggests that WIS takes fully into account the specificity of terms and the weighted inherited semantics of terms between GO terms. The proposed WIS method is an effective and reliable way to compare gene function. The web service of WIS is freely available at http://nclab.hit.edu.cn/WIS/ .
Finding gene regulatory network candidates using the gene expression knowledge base.

PubMed

Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

2014-12-10

Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.
Individual and combined influence of ACE and ACTN3 genes on muscle phenotypes in Polish athletes.

PubMed

Orysiak, Joanna; Mazur-Różycka, Joanna; Busko, Krzysztof; Gajewski, Jan; Szczepanska, Beata; Malczewska-Lenczowska, Jadwiga

2017-02-08

The aim of this study was to examine the association between ACE and ACTN3 genes, independently or in combination, and muscle strength and power in male and female athletes. The study involved 398 young male (n=266) and female (n=132) athletes representing various sport disciplines (ice hockey, canoeing, swimming, volleyball). All were Caucasians. The following measurements were taken: height of jump and mechanical power in countermovement jump (CMJ) and spike jump (SPJ), and muscle strength of 10 muscle groups (flexors and extensors of the elbow, shoulder, hip, knee and trunk). The ID polymorphism of ACE and the R577X polymorphism of ACTN3 were typed using PCR (polymerase chain reaction) and PCR-RFLP (polymerase chain reaction - restriction fragment length polymorphism), respectively. The genotype distribution of the ACE and ACTN3 genes did not differ significantly between groups of athletes for either sex. There was no association between ACE and ACTN3 genotypes (alone or in combination) and sum of muscle strength, height of jump or mechanical power in both jump tests (CMJ and SPJ) for male and female athletes. These findings do not support an influential role of the ACE and ACTN3 genes in determining power/strength performance of elite athletes.
Microarray analysis of potential genes in the pathogenesis of recurrent oral ulcer.

PubMed

Han, Jingying; He, Zhiwei; Li, Kun; Hou, Lu

2015-01-01

Recurrent oral ulcer seriously threatens patients' daily life and health. This study investigated potential genes and pathways that participate in the pathogenesis of recurrent oral ulcer by high throughput bioinformatic analysis. RT-PCR and Western blot were applied to further verify screened interleukins effect. Recurrent oral ulcer related genes were collected from websites and papers, and further found out from Human Genome 280 6.0 microarray data. Each pathway of recurrent oral ulcer related genes were got through chip hybridization. RT-PCR was applied to test four recurrent oral ulcer related genes to verify the microarray data. Data transformation, scatter plot, clustering analysis, and expression pattern analysis were used to analyze recurrent oral ulcer related gene expression changes. Recurrent oral ulcer gene microarray was successfully established. Microarray showed that 551 genes involved in recurrent oral ulcer activity and 196 genes were recurrent oral ulcer related genes. Of them, 76 genes up-regulated, 62 genes down-regulated, and 58 genes up-/down-regulated. Total expression level up-regulated 752 times (60%) and down-regulated 485 times (40%). IL-2 plays an important role in the occurrence, development and recurrence of recurrent oral ulcer on the mRNA and protein levels. Gene microarray can be used to analyze potential genes and pathways in recurrent oral ulcer. IL-2 may be involved in the pathogenesis of recurrent oral ulcer.
Comprehensive analysis of the flowering genes in Chinese cabbage and examination of evolutionary pattern of CO-like genes in plant kingdom

NASA Astrophysics Data System (ADS)

Song, Xiaoming; Duan, Weike; Huang, Zhinan; Liu, Gaofeng; Wu, Peng; Liu, Tongkun; Li, Ying; Hou, Xilin

2015-09-01

In plants, flowering is the most important transition from vegetative to reproductive growth. The flowering patterns of monocots and eudicots are distinctly different, but few studies have described the evolutionary patterns of the flowering genes in them. In this study, we analysed the evolutionary pattern, duplication and expression level of these genes. The main results were as follows: (i) characterization of flowering genes in monocots and eudicots, including the identification of family-specific, orthologous and collinear genes; (ii) full characterization of CONSTANS-like genes in Brassica rapa (BraCOL genes), the key flowering genes; (iii) exploration of the evolution of COL genes in plant kingdom and construction of the evolutionary pattern of COL genes; (iv) comparative analysis of CO and FT genes between Brassicaceae and Grass, which identified several family-specific amino acids, and revealed that CO and FT protein structures were similar in B. rapa and Arabidopsis but different in rice; and (v) expression analysis of photoperiod pathway-related genes in B. rapa under different photoperiod treatments by RT-qPCR. This analysis will provide resources for understanding the flowering mechanisms and evolutionary pattern of COL genes. In addition, this genome-wide comparative study of COL genes may also provide clues for evolution of other flowering genes.
Combining Next Generation Sequencing with Bulked Segregant Analysis to Fine Map a Stem Moisture Locus in Sorghum (Sorghum bicolor L. Moench).

PubMed

Han, Yucui; Lv, Peng; Hou, Shenglin; Li, Suying; Ji, Guisu; Ma, Xue; Du, Ruiheng; Liu, Guoqing

2015-01-01

Sorghum is one of the most promising bioenergy crops. Stem juice yield, together with stem sugar concentration, determines sugar yield in sweet sorghum. Bulked segregant analysis (BSA) is a gene mapping technique for identifying genomic regions containing genetic loci affecting a trait of interest that when combined with deep sequencing could effectively accelerate the gene mapping process. In this study, a dry stem sorghum landrace was characterized and the stem water controlling locus, qSW6, was fine mapped using QTL analysis and the combined BSA and deep sequencing technologies. Results showed that: (i) In sorghum variety Jiliang 2, stem water content was around 80% before flowering stage. It dropped to 75% during grain filling with little difference between different internodes. In landrace G21, stem water content keeps dropping after the flag leaf stage. The drop from 71% at flowering time progressed to 60% at grain filling time. Large differences exist between different internodes with the lowest (51%) at the 7th and 8th internodes at dough stage. (ii) A quantitative trait locus (QTL) controlling stem water content mapped on chromosome 6 between SSR markers Ch6-2 and gpsb069 explained about 34.7-56.9% of the phenotypic variation for the 5th to 10th internodes, respectively. (iii) BSA and deep sequencing analysis narrowed the associated region to 339 kb containing 38 putative genes. The results could help reveal molecular mechanisms underlying juice yield of sorghum and thus to improve total sugar yield.
A Systematic Analysis of Candidate Genes Associated with Nicotine Addiction

PubMed Central

Liu, Meng; Li, Xia; Fan, Rui; Liu, Xinhua; Wang, Ju

2015-01-01

Nicotine, as the major psychoactive component of tobacco, has broad physiological effects within the central nervous system, but our understanding of the molecular mechanism underlying its neuronal effects remains incomplete. In this study, we performed a systematic analysis on a set of nicotine addiction-related genes to explore their characteristics at network levels. We found that NAGenes tended to have a more moderate degree and weaker clustering coefficient and to be less central in the network compared to alcohol addiction-related genes or cancer genes. Further, clustering of these genes resulted in six clusters with themes in synaptic transmission, signal transduction, metabolic process, and apoptosis, which provided an intuitional view on the major molecular functions of the genes. Moreover, functional enrichment analysis revealed that neurodevelopment, neurotransmission activity, and metabolism related biological processes were involved in nicotine addiction. In summary, by analyzing the overall characteristics of the nicotine addiction related genes, this study provided valuable information for understanding the molecular mechanisms underlying nicotine addiction. PMID:26097843
Combination of anginex gene therapy and radiation decelerates the growth and pulmonary metastasis of human osteosarcoma xenografts.

PubMed

Zhao, Kai; Yang, Shang-You; Geng, Jun; Gong, Xuan; Gong, Weiming; Shen, Lin; Ning, Bin

2018-06-01

Investigate whether rAAV-anginex gene therapy combined with radiotherapy could decrease growth and pulmonary metastasis of osteosarcoma in mice and examine the mechanisms involved in this therapeutic strategy. During in vitro experiment, multiple treatment regimes (rAAV-eGFP, radiotherapy, rAAV-anginex, combination therapy) were applied to determine effects on proliferation of endothelial cells (ECs) and G-292 osteosarcoma cells. During in vivo analysis, the same multiple treatment regimes were applied to osteosarcoma tumor-bearing mice. Use microcomputed tomography to evaluate tumor size. Eight weeks after tumor cell inoculation, immunohistochemistry was used to assess the therapeutic efficacy according to microvessel density (MVD), proliferating cell nuclear antigen (PCNA), and terminal-deoxynucleotidyl transferase-mediated nick-end labeling (TUNEL) assays. Metastasis of lungs was also evaluated by measuring number of metastatic nodules and wet weight of metastases. The proliferation of ECs and the tumor volumes in combination therapy group were inhibited more effectively than the other three groups at end point (P < 0.05). Cell clone assay showed anginex had radiosensitization effect on ECs. Immunohistochemistry showed tumors from mice treated with combination therapy exhibited the lowest MVD and proliferation rate, with highest apoptosis rate, as confirmed by IHC staining for CD34 and PCNA and TUNEL assays (P < 0.05). Combination therapy also induced the fewest metastatic nodules and lowest wet weights of the lungs (P < 0.05). rAAV-anginex combined with radiotherapy induced apoptosis of osteosarcoma cells and inhibited tumor growth and pulmonary metastasis on the experimental osteosarcoma models. We conclude that the primary mechanism of this process may be due to sensitizing effect of anginex to radiotherapy. © 2018 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
A Functional Genomic Meta-Analysis of Clinical Trials in Systemic Sclerosis: Toward Precision Medicine and Combination Therapy.

PubMed

Taroni, Jaclyn N; Martyanov, Viktor; Mahoney, J Matthew; Whitfield, Michael L

2017-05-01

Systemic sclerosis is an orphan, systemic autoimmune disease with no FDA-approved treatments. Its heterogeneity and rarity often result in underpowered clinical trials making the analysis and interpretation of associated molecular data challenging. We performed a meta-analysis of gene expression data from skin biopsies of patients with systemic sclerosis treated with five therapies: mycophenolate mofetil, rituximab, abatacept, nilotinib, and fresolimumab. A common clinical improvement criterion of -20% or -5 modified Rodnan skin score was applied to each study. We applied a machine learning approach that captured features beyond differential expression and was better at identifying targets of therapies than the differential expression alone. Regardless of treatment mechanism, abrogation of inflammatory pathways accompanied clinical improvement in multiple studies suggesting that high expression of immune-related genes indicates active and targetable disease. Our framework allowed us to compare different trials and ask if patients who failed one therapy would likely improve on a different therapy, based on changes in gene expression. Genes with high expression at baseline in fresolimumab nonimprovers were downregulated in mycophenolate mofetil improvers, suggesting that immunomodulatory or combination therapy may have benefitted these patients. This approach can be broadly applied to increase tissue specificity and sensitivity of differential expression results. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Avoiding pitfalls of internal controls: validation of reference genes for analysis by qRT-PCR and Western blot throughout rat retinal development.

PubMed

Rocha-Martins, Maurício; Njaine, Brian; Silveira, Mariana S

2012-01-01

Housekeeping genes have been commonly used as reference to normalize gene expression and protein content data because of its presumed constitutive expression. In this paper, we challenge the consensual idea that housekeeping genes are reliable controls for expression studies in the retina through the investigation of a panel of reference genes potentially suitable for analysis of different stages of retinal development. We applied statistical tools on combinations of retinal developmental stages to assess the most stable internal controls for quantitative RT-PCR (qRT-PCR). The stability of expression of seven putative reference genes (Actb, B2m, Gapdh, Hprt1, Mapk1, Ppia and Rn18s) was analyzed using geNorm, BestKeeper and Normfinder software. In addition, several housekeeping genes were tested as loading controls for Western blot in the same sample panel, using Image J. Overall, for qRT-PCR the combination of Gapdh and Mapk1 showed the highest stability for most experimental sets. Actb was downregulated in more mature stages, while Rn18s and Hprt1 showed the highest variability. We normalized the expression of cyclin D1 using various reference genes and demonstrated that spurious results may result from blind selection of internal controls. For Western blot significant variation could be seen among four putative internal controls (β-actin, cyclophilin b, α-tubulin and lamin A/C), while MAPK1 was stably expressed. Putative housekeeping genes exhibit significant variation in both mRNA and protein content during retinal development. Our results showed that distinct combinations of internal controls fit for each experimental set in the case of qRT-PCR and that MAPK1 is a reliable loading control for Western blot. The results indicate that biased study outcomes may follow the use of reference genes without prior validation for qRT-PCR and Western blot.
Identification of SLC20A1 and SLC15A4 among other genes as potential risk factors for combined pituitary hormone deficiency.

PubMed

Simm, Franziska; Griesbeck, Anne; Choukair, Daniela; Weiß, Birgit; Paramasivam, Nagarajan; Klammt, Jürgen; Schlesner, Matthias; Wiemann, Stefan; Martinez, Cristina; Hoffmann, Georg F; Pfäffle, Roland W; Bettendorf, Markus; Rappold, Gudrun A

2017-10-26

PurposeCombined pituitary hormone deficiency (CPHD) is characterized by a malformed or underdeveloped pituitary gland resulting in an impaired pituitary hormone secretion. Several transcription factors have been described in its etiology, but defects in known genes account for only a small proportion of cases.MethodsTo identify novel genetic causes for congenital hypopituitarism, we performed exome-sequencing studies on 10 patients with CPHD and their unaffected parents. Two candidate genes were sequenced in further 200 patients. Genotype data of known hypopituitary genes are reviewed.ResultsWe discovered 51 likely damaging variants in 38 genes; 12 of the 51 variants represent de novo events (24%); 11 of the 38 genes (29%) were present in the E12.5/E14.5 pituitary transcriptome. Targeted sequencing of two candidate genes, SLC20A1 and SLC15A4, of the solute carrier membrane transport protein family in 200 additional patients demonstrated two further variants predicted as damaging. We also found combinations of de novo (SLC20A1/SLC15A4) and transmitted variants (GLI2/LHX3) in the same individuals, leading to the full-blown CPHD phenotype.ConclusionThese data expand the pituitary target genes repertoire for diagnostics and further functional studies. Exome sequencing has identified a combination of rare variants in different genes that might explain incomplete penetrance in CPHD.Genetics in Medicine advance online publication, 26 October 2017; doi:10.1038/gim.2017.165.
NetGen: a novel network-based probabilistic generative model for gene set functional enrichment analysis.

PubMed

Sun, Duanchen; Liu, Yinliang; Zhang, Xiang-Sun; Wu, Ling-Yun

2017-09-21

High-throughput experimental techniques have been dramatically improved and widely applied in the past decades. However, biological interpretation of the high-throughput experimental results, such as differential expression gene sets derived from microarray or RNA-seq experiments, is still a challenging task. Gene Ontology (GO) is commonly used in the functional enrichment studies. The GO terms identified via current functional enrichment analysis tools often contain direct parent or descendant terms in the GO hierarchical structure. Highly redundant terms make users difficult to analyze the underlying biological processes. In this paper, a novel network-based probabilistic generative model, NetGen, was proposed to perform the functional enrichment analysis. An additional protein-protein interaction (PPI) network was explicitly used to assist the identification of significantly enriched GO terms. NetGen achieved a superior performance than the existing methods in the simulation studies. The effectiveness of NetGen was explored further on four real datasets. Notably, several GO terms which were not directly linked with the active gene list for each disease were identified. These terms were closely related to the corresponding diseases when accessed to the curated literatures. NetGen has been implemented in the R package CopTea publicly available at GitHub ( http://github.com/wulingyun/CopTea/ ). Our procedure leads to a more reasonable and interpretable result of the functional enrichment analysis. As a novel term combination-based functional enrichment analysis method, NetGen is complementary to current individual term-based methods, and can help to explore the underlying pathogenesis of complex diseases.
The MB2 gene family of Plasmodium species has a unique combination of S1 and GTP-binding domains

PubMed Central

Romero, Lisa C; Nguyen, Thanh V; Deville, Benoit; Ogunjumo, Oluwasanmi; James, Anthony A

2004-01-01

Background Identification and characterization of novel Plasmodium gene families is necessary for developing new anti-malarial therapeutics. The products of the Plasmodium falciparum gene, MB2, were shown previously to have a stage-specific pattern of subcellular localization and proteolytic processing. Results Genes homologous to MB2 were identified in five additional parasite species, P. knowlesi, P. gallinaceum, P. berghei, P. yoelii, and P. chabaudi. Sequence comparisons among the MB2 gene products reveal amino acid conservation of structural features, including putative S1 and GTP-binding domains, and putative signal peptides and nuclear localization signals. Conclusions The combination of domains is unique to this gene family and indicates that MB2 genes comprise a novel family and therefore may be a good target for drug development. PMID:15222903
RNA-Seq workflow: gene-level exploratory analysis and differential expression

PubMed Central

Love, Michael I.; Anders, Simon; Kim, Vladislav; Huber, Wolfgang

2015-01-01

Here we walk through an end-to-end gene-level RNA-Seq differential expression workflow using Bioconductor packages. We will start from the FASTQ files, show how these were aligned to the reference genome, and prepare a count matrix which tallies the number of RNA-seq reads/fragments within each gene for each sample. We will perform exploratory data analysis (EDA) for quality assessment and to explore the relationship between samples, perform differential gene expression analysis, and visually explore the results. PMID:26674615

GECKO: a complete large-scale gene expression analysis platform.

PubMed

Theilhaber, Joachim; Ulyanov, Anatoly; Malanthara, Anish; Cole, Jack; Xu, Dapeng; Nahf, Robert; Heuer, Michael; Brockel, Christoph; Bushnell, Steven

2004-12-10

Gecko (Gene Expression: Computation and Knowledge Organization) is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a data base, a computational engine implementing approximately 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph), in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (approximately 100 users) and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. The Gecko system is being made publicly available as free software http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.
Identification of Development-Related Genes in the Ovaries of Adult Harmonia axyridis (Pallas) Lady Beetles Using a Time- Series Analysis by RNA-seq.

PubMed

Du, Wenxiao; Zeng, Fanrong

2016-12-14

Adults of the lady beetle species Harmonia axyridis (Pallas) are bred artificially en masse for classic biological control, which requires egg-laying by the H. axyridis ovary. Development-related genes may impact the growth of the H. axyridis adult ovary but have not been reported. Here, we used integrative time-series RNA-seq analysis of the ovary in H. axyridis adults to detect development-related genes. A total of 28,558 unigenes were functionally annotated using seven types of databases to obtain an annotated unigene database for ovaries in H. axyridis adults. We also analysed differentially expressed genes (DEGs) between samples. Based on a combination of the results of this bioinformatics analysis with literature reports and gene expression level changes in four different stages, we focused on the development of oocyte reproductive stem cell and yolk formation process and identified 26 genes with high similarity to development-related genes. 20 DEGs were randomly chosen for quantitative real-time PCR (qRT-PCR) to validate the accuracy of the RNA-seq results. This study establishes a robust pipeline for the discovery of key genes using high-throughput sequencing and the identification of a class of development-related genes for characterization.
Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs

PubMed Central

Oduru, Sreedhar; Campbell, Janee L; Karri, SriTulasi; Hendry, William J; Khan, Shafiq A; Williams, Simon C

2003-01-01

Background Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish) genomes. Results 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells. PMID:12783626
Capturing the Alternative Cleavage and Polyadenylation Sites of 14 NAC Genes in Populus Using a Combination of 3'-RACE and High-Throughput Sequencing.

PubMed

Wang, Haoran; Wang, Mingxiu; Cheng, Qiang

2018-03-08

Detection of complex splice sites (SSs) and polyadenylation sites (PASs) of eukaryotic genes is essential for the elucidation of gene regulatory mechanisms. Transcriptome-wide studies using high-throughput sequencing (HTS) have revealed prevalent alternative splicing (AS) and alternative polyadenylation (APA) in plants. However, small-scale and high-depth HTS aimed at detecting genes or gene families are very few and limited. We explored a convenient and flexible method for profiling SSs and PASs, which combines rapid amplification of 3'-cDNA ends (3'-RACE) and HTS. Fourteen NAC (NAM, ATAF1/2, CUC2) transcription factor genes of Populus trichocarpa were analyzed by 3'-RACE-seq. Based on experimental reproducibility, boundary sequence analysis and reverse transcription PCR (RT-PCR) verification, only canonical SSs were considered to be authentic. Based on stringent criteria, candidate PASs without any internal priming features were chosen as authentic PASs and assumed to be PAS-rich markers. Thirty-four novel canonical SSs, six intronic/internal exons and thirty 3'-UTR PAS-rich markers were revealed by 3'-RACE-seq. Using 3'-RACE and real-time PCR, we confirmed that three APA transcripts ending in/around PAS-rich markers were differentially regulated in response to plant hormones. Our results indicate that 3'-RACE-seq is a robust and cost-effective method to discover SSs and label active regions subjected to APA for genes or gene families. The method is suitable for small-scale AS and APA research in the initial stage.
Gene expression analysis of pancreatic cell lines reveals genes overexpressed in pancreatic cancer.

PubMed

Alldinger, Ingo; Dittert, Dag; Peiper, Matthias; Fusco, Alberto; Chiappetta, Gennaro; Staub, Eike; Lohr, Matthias; Jesnowski, Ralf; Baretton, Gustavo; Ockert, Detlef; Saeger, Hans-Detlev; Grützmann, Robert; Pilarsky, Christian

2005-01-01

Pancreatic cancer is one of the leading causes of cancer-related death. Using DNA gene expression analysis based on a custom made Affymetrix cancer array, we investigated the expression pattern of both primary and established pancreatic carcinoma cell lines. We analyzed the gene expression of 5 established pancreatic cancer cell lines (AsPC-1, BxPC-3, Capan-1, Capan-2 and HPAF II) and 5 primary isolates, 1 of them derived from benign pancreatic duct cells. Out of 1,540 genes which were expressed in at least 3 experiments, we found 122 genes upregulated and 18 downregulated in tumor cell lines compared to benign cells with a fold change >3. Several of the upregulated genes (like Prefoldin 5, ADAM9 and E-cadherin) have been associated with pancreatic cancer before. The other differentially regulated genes, however, play a so far unknown role in the course of human pancreatic carcinoma. By means of immunohistochemistry we could show that thymosin beta-10 (TMSB10), upregulated in tumor cell lines, is expressed in human pancreatic carcinoma, but not in non-neoplastic pancreatic tissue, suggesting a role for TMSB10 in the carcinogenesis of pancreatic carcinoma. Using gene expression profiling of pancreatic cell lines we were able to identify genes differentially expressed in pancreatic adenocarcinoma, which might contribute to pancreatic cancer development. Copyright 2005 S. Karger AG, Basel.
Microarray-based bioinformatics analysis of the combined effects of SiNPs and PbAc on cardiovascular system in zebrafish.

PubMed

Hu, Hejing; Zhang, Yannan; Shi, Yanfeng; Feng, Lin; Duan, Junchao; Sun, Zhiwei

2017-10-01

With rapid development of nanotechnology and growing environmental pollution, the combined toxic effects of SiNPs and pollutants of heavy metals like lead have received global attentions. The aim of this study was to explore the cardiovascular effects of the co-exposure of SiNPs and lead acetate (PbAc) in zebrafish using microarray and bioinformatics analysis. Although there was no other obvious cardiovascular malformation except bleeding phenotype, bradycardia, angiogenesis inhibition and declined cardiac output in zebrafish co-exposed of SiNPs and PbAc at NOAEL level, significant changes were observed in mRNA and microRNA (miRNA) expression patterns. STC-GO analysis indicated that the co-exposure might have more toxic effects on cardiovascular system than that exposure alone. Key differentially expressed genes were discerned out based on the Dynamic-gene-network, including stxbp1a, ndfip2, celf4 and gsk3b. Furthermore, several miRNAs obtained from the miRNA-Gene-Network might play crucial roles in cardiovascular disease, such as dre-miR-93, dre-miR-34a, dre-miR-181c, dre-miR-7145, dre-miR-730, dre-miR-129-5p, dre-miR-19d, dre-miR-218b, dre-miR-221. Besides, the analysis of miRNA-pathway-network indicated that the zebrafish were stimulated by the co-exposure of SiNPs and PbAc, which might cause the disturbance of calcium homeostasis and endoplasmic reticulum stress. As a result, cardiac muscle contraction might be deteriorated. In general, our data provide abundant fundamental research clues to the combined toxicity of environmental pollutants and further in-depth verifications are needed. Copyright © 2017 Elsevier Ltd. All rights reserved.
Sample entropy analysis of cervical neoplasia gene-expression signatures

PubMed Central

Botting, Shaleen K; Trzeciakowski, Jerome P; Benoit, Michelle F; Salama, Salama A; Diaz-Arrastia, Concepcion R

2009-01-01

Background We introduce Approximate Entropy as a mathematical method of analysis for microarray data. Approximate entropy is applied here as a method to classify the complex gene expression patterns resultant of a clinical sample set. Since Entropy is a measure of disorder in a system, we believe that by choosing genes which display minimum entropy in normal controls and maximum entropy in the cancerous sample set we will be able to distinguish those genes which display the greatest variability in the cancerous set. Here we describe a method of utilizing Approximate Sample Entropy (ApSE) analysis to identify genes of interest with the highest probability of producing an accurate, predictive, classification model from our data set. Results In the development of a diagnostic gene-expression profile for cervical intraepithelial neoplasia (CIN) and squamous cell carcinoma of the cervix, we identified 208 genes which are unchanging in all normal tissue samples, yet exhibit a random pattern indicative of the genetic instability and heterogeneity of malignant cells. This may be measured in terms of the ApSE when compared to normal tissue. We have validated 10 of these genes on 10 Normal and 20 cancer and CIN3 samples. We report that the predictive value of the sample entropy calculation for these 10 genes of interest is promising (75% sensitivity, 80% specificity for prediction of cervical cancer over CIN3). Conclusion The success of the Approximate Sample Entropy approach in discerning alterations in complexity from biological system with such relatively small sample set, and extracting biologically relevant genes of interest hold great promise. PMID:19232110
Control of bacteriophage P2 gene expression: analysis of transcription of the ogr gene.

PubMed Central

Birkeland, N K; Lindqvist, B H; Christie, G E

1991-01-01

The bacteriophage P2 ogr gene encodes an 8.3-kDa protein that is a positive effector of P2 late gene transcription. The ogr gene is preceded by a promoter sequence (Pogr) resembling a normal Escherichia coli promoter and is located just downstream of a late transcription unit. We analyzed the kinetics and regulation of ogr gene transcription by using an ogr-specific antisense RNA probe in an S1 mapping assay. During a normal P2 infection, ogr gene transcription starts from Pogr at an intermediate time between the onset of early and late transcription. At late times after infection the ogr gene is cotranscribed with the late FETUD operon; the ogr gene product thus positively regulates its own synthesis from the P2 late promoter PF. Expression of the P2 late genes also requires P2 DNA replication. Complementation experiments and transcriptional analysis show that a nonreplicating P2 phage expresses the ogr gene from Pogr but is unable to transcribe the late genes. A P2 ogr-defective phage makes an increased level of ogr mRNA, consistent with autogenous control from Pogr. Transcription of the ogr gene in the prophage of a P2 heteroimmune lysogen is stimulated after infection with P2, suggesting that Pogr is under indirect immunity control and is activated by a yet-unidentified P2 early gene product during infection. Images FIG. 4 FIG. 5 FIG. 6 FIG. 7 FIG. 8 PMID:1938896
Functional Genomic Analysis of Cotton Genes with Agrobacterium-Mediated Virus-Induced Gene Silencing

PubMed Central

Gao, Xiquan; Shan, Libo

2015-01-01

Cotton (Gossypium spp.) is one of the most agronomically important crops worldwide for its unique textile fiber production and serving as food and feed stock. Molecular breeding and genetic engineering of useful genes into cotton have emerged as advanced approaches to improve cotton yield, fiber quality, and resistance to various stresses. However, the understanding of gene functions and regulations in cotton is largely hindered by the limited molecular and biochemical tools. Here, we describe the method of an Agrobacterium infiltration-based virus-induced gene silencing (VIGS) assay to transiently silence endogenous genes in cotton at 2-week-old seedling stage. The genes of interest could be readily silenced with a consistently high efficiency. To monitor gene silencing efficiency, we have cloned cotton GrCla1 from G. raimondii, a homolog gene of Arabidopsis Cloroplastos alterados 1 (AtCla1) involved in chloroplast development, and inserted into a tobacco rattle virus (TRV) binary vector pYL156. Silencing of GrCla1 results in albino phenotype on the newly emerging leaves, serving as a visual marker for silencing efficiency. To further explore the possibility of using VIGS assay to reveal the essential genes mediating disease resistance to Verticillium dahliae, a fungal pathogen causing severe Verticillium wilt in cotton, we developed a seedling infection assay to inoculate cotton seedlings when the genes of interest are silenced by VIGS. The method we describe here could be further explored for functional genomic analysis of cotton genes involved in development and various biotic and abiotic stresses. PMID:23386302
Functional genomic analysis of cotton genes with agrobacterium-mediated virus-induced gene silencing.

PubMed

Gao, Xiquan; Shan, Libo

2013-01-01

Cotton (Gossypium spp.) is one of the most agronomically important crops worldwide for its unique textile fiber production and serving as food and feed stock. Molecular breeding and genetic engineering of useful genes into cotton have emerged as advanced approaches to improve cotton yield, fiber quality, and resistance to various stresses. However, the understanding of gene functions and regulations in cotton is largely hindered by the limited molecular and biochemical tools. Here, we describe the method of an Agrobacterium infiltration-based virus-induced gene silencing (VIGS) assay to transiently silence endogenous genes in cotton at 2-week-old seedling stage. The genes of interest could be readily silenced with a consistently high efficiency. To monitor gene silencing efficiency, we have cloned cotton GrCla1 from G. raimondii, a homolog gene of Arabidopsis Cloroplastos alterados 1 (AtCla1) involved in chloroplast development, and inserted into a tobacco rattle virus (TRV) binary vector pYL156. Silencing of GrCla1 results in albino phenotype on the newly emerging leaves, serving as a visual marker for silencing efficiency. To further explore the possibility of using VIGS assay to reveal the essential genes mediating disease resistance to Verticillium dahliae, a fungal pathogen causing severe Verticillium wilt in cotton, we developed a seedling infection assay to inoculate cotton seedlings when the genes of interest are silenced by VIGS. The method we describe here could be further explored for functional genomic analysis of cotton genes involved in development and various biotic and abiotic stresses.
Analysis of gene expression profile microarray data in complex regional pain syndrome.

PubMed

Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing

2017-09-01

The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.
Gene expression analysis of flax seed development

PubMed Central

2011-01-01

Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise
Data analysis using a combination of independent component analysis and empirical mode decomposition

NASA Astrophysics Data System (ADS)

Lin, Shih-Lin; Tung, Pi-Cheng; Huang, Norden E.

2009-06-01

A combination of independent component analysis and empirical mode decomposition (ICA-EMD) is proposed in this paper to analyze low signal-to-noise ratio data. The advantages of ICA-EMD combination are these: ICA needs few sensory clues to separate the original source from unwanted noise and EMD can effectively separate the data into its constituting parts. The case studies reported here involve original sources contaminated by white Gaussian noise. The simulation results show that the ICA-EMD combination is an effective data analysis tool.
Common disease signatures from gene expression analysis in Huntington's disease human blood and brain.

PubMed

Mina, Eleni; van Roon-Mom, Willeke; Hettne, Kristina; van Zwet, Erik; Goeman, Jelle; Neri, Christian; A C 't Hoen, Peter; Mons, Barend; Roos, Marco

2016-08-01

Huntington's disease (HD) is a devastating brain disorder with no effective treatment or cure available. The scarcity of brain tissue makes it hard to study changes in the brain and impossible to perform longitudinal studies. However, peripheral pathology in HD suggests that it is possible to study the disease using peripheral tissue as a monitoring tool for disease progression and/or efficacy of novel therapies. In this study, we investigated if blood can be used to monitor disease severity and progression in brain. Since previous attempts using only gene expression proved unsuccessful, we compared blood and brain Huntington's disease signatures in a functional context. Microarray HD gene expression profiles from three brain regions were compared to the transcriptome of HD blood generated by next generation sequencing. The comparison was performed with a combination of weighted gene co-expression network analysis and literature based functional analysis (Concept Profile Analysis). Uniquely, our comparison of blood and brain datasets was not based on (the very limited) gene overlap but on the similarity between the gene annotations in four different semantic categories: "biological process", "cellular component", "molecular function" and "disease or syndrome". We identified signatures in HD blood reflecting a broad pathophysiological spectrum, including alterations in the immune response, sphingolipid biosynthetic processes, lipid transport, cell signaling, protein modification, spliceosome, RNA splicing, vesicle transport, cell signaling and synaptic transmission. Part of this spectrum was reminiscent of the brain pathology. The HD signatures in caudate nucleus and BA4 exhibited the highest similarity with blood, irrespective of the category of semantic annotations used. BA9 exhibited an intermediate similarity, while cerebellum had the least similarity. We present two signatures that were shared between blood and brain: immune response and spinocerebellar ataxias
Gene Therapy for Bone Defects in Oral and Maxillofacial Surgery: A Systematic Review and Meta-Analysis of Animal Studies.

PubMed

Fliefel, Riham; Kühnisch, Jan; Ehrenfeld, Michael; Otto, Sven

2017-02-15

Craniofacial bone defects are challenging problems for maxillofacial surgeons over the years. With the development of cell and molecular biology, gene therapy is a breaking new technology with the aim of regenerating tissues by acting as a delivery system for therapeutic genes in the craniofacial region rather than treating genetic disorders. A systematic review was conducted summarizing the articles reporting gene therapy in maxillofacial surgery to answer the question: Was gene therapy successfully applied to regenerate bone in the maxillofacial region? Electronic searching of online databases was performed in addition to hand searching of the references of included articles. No language or time restrictions were enforced. Meta-analysis was done to assess significant bone formation after delivery of gene material in the surgically induced maxillofacial defects. The search identified 2081 articles, of which 57 were included with 1726 animals. Bone morphogenetic proteins were commonly used proteins for gene therapy. Viral vectors were the universally used vectors. Sprague-Dawley rats were the frequently used animal model in experimental studies. The quality of the articles ranged from excellent to average. Meta-analysis results performed on 21 articles showed that defects favored bone formation by gene therapy. Funnel plot showed symmetry with the absence of publication bias. Gene therapy is on the top list of innovative strategies that developed in the last 10 years with the hope of developing a simple chair-side protocol in the near future, combining improvement of gene delivery as well as knowledge of the molecular basis of oral and maxillofacial structures.
A genome-wide analysis of the expansin genes in Malus × Domestica.

PubMed

Zhang, Shizhong; Xu, Ruirui; Gao, Zheng; Chen, Changtian; Jiang, Zesheng; Shu, Huairui

2014-04-01

Expansins were first identified as cell wall-loosening proteins; they are involved in regulating cell expansion, fruits softening and many other physiological processes. However, our knowledge about the expansin family members and their evolutionary relationships in fruit trees, such as apple, is limited. In this study, we identified 41 members of the expansin gene family in the genome of apple (Malus × Domestica L. Borkh). Phylogenetic analysis revealed that expansin genes in apple could be divided into four subfamilies according to their gene structures and protein motifs. By phylogenetic analysis of the expansins in five plants (Arabidopsis, rice, poplar, grape and apple), the expansins were divided into 17 subgroups. Our gene duplication analysis revealed that whole-genome and chromosomal-segment duplications contributed to the expansion of Mdexpansins. The microarray and expressed sequence tag (EST) data showed that 34 Mdexpansin genes could be divided into five groups by the EST analysis; they may also play different roles during fruit development. An expression model for MdEXPA16 and MdEXPA20 showed their potential role in developing fruit. Overall, our study provides useful data and novel insights into the functions and regulatory mechanisms of the expansin genes in apple, as well as their evolution and divergence. As the first step towards genome-wide analysis of the expansin genes in apple, our results have established a solid foundation for future studies on the function of the expansin genes in fruit development.
Pharmacogenomic Characterization and Isobologram Analysis of the Combination of Ascorbic Acid and Curcumin-Two Main Metabolites of Curcuma longa-in Cancer Cells.

PubMed

Ooko, Edna; Kadioglu, Onat; Greten, Henry J; Efferth, Thomas

2017-01-01

Curcuma longa has long been used in China and India as anti-inflammatory agent to treat a wide variety of conditions and also as a spice for varied curry preparations. The chemoprofile of the Curcuma species exhibits the presence of varied phytochemicals with curcumin being present in all three species but AA only being shown in C. longa . This study explored the effect of a curcumin/AA combination on human cancer cell lines. The curcumin/AA combination was assessed by isobologram analysis using the Loewe additivity drug interaction model. The drug combination showed additive cytotoxicity toward CCRF-CEM and CEM/ADR5000 leukemia cell lines and HCT116p53 +/+ and HCT116p53 -/- colon cancer cell line, while the glioblastoma cell lines U87MG and U87MG.ΔEGFR showed additive to supra-additive cytotoxicity. Gene expression profiles predicting sensitivity and resistance of tumor cells to induction by curcumin and AA were determined by microarray-based mRNA expressions, COMPARE, and hierarchical cluster analyses. Numerous genes involved in transcription ( TFAM, TCERG1, RGS13, C11orf31 ), apoptosis-regulation ( CRADD, CDK7, CDK19, CD81, TOM1 ) signal transduction ( NR1D2, HMGN1, ABCA1, DE4ND4B, TRIM27 ) DNA repair ( TOPBP1, RPA2 ), mRNA metabolism ( RBBP4, HNRNPR, SRSF4, NR2F2, PDK1, TGM2 ), and transporter genes ( ABCA1 ) correlated with cellular responsiveness to curcumin and ascorbic acid. In conclusion, this study shows the effect of the curcumin/AA combination and identifies several candidate genes that may regulate the response of varied cancer cells to curcumin and AA.
Intrinsic biocontainment: Multiplex genome safeguards combine transcriptional and recombinational control of essential yeast genes

PubMed Central

Cai, Yizhi; Agmon, Neta; Choi, Woo Jin; Ubide, Alba; Stracquadanio, Giovanni; Caravelli, Katrina; Hao, Haiping; Bader, Joel S.; Boeke, Jef D.

2015-01-01

Biocontainment may be required in a wide variety of situations such as work with pathogens, field release applications of engineered organisms, and protection of intellectual properties. Here, we describe the control of growth of the brewer’s yeast, Saccharomyces cerevisiae, using both transcriptional and recombinational “safeguard” control of essential gene function. Practical biocontainment strategies dependent on the presence of small molecules require them to be active at very low concentrations, rendering them inexpensive and difficult to detect. Histone genes were controlled by an inducible promoter and controlled by 30 nM estradiol. The stability of the engineered genes was separately regulated by the expression of a site-specific recombinase. The combined frequency of generating viable derivatives when both systems were active was below detection (<10−10), consistent with their orthogonal nature and the individual escape frequencies of <10−6. Evaluation of escaper mutants suggests strategies for reducing their emergence. Transcript profiling and growth test suggest high fitness of safeguarded strains, an important characteristic for wide acceptance. PMID:25624482
Fast gene ontology based clustering for microarray experiments.

PubMed

Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

2008-11-21

Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.
Changes in Physiological Parameters after Combined Exercise according to the I/D Polymorphism of hUCP2 Gene in Middle-Aged Obese Females

PubMed Central

DUK OH, Sang

2014-01-01

Abstract Background The purpose of this study was to determine whether a 45 bp insertion/deletion (I/D) polymorphism in human uncoupling protein 2 (hUCP2) gene was associated with changes in several cardiovascular risk and physical fitness factors in response to combined exercise during 12 weeks in Korean middle-aged women. The changes in physiological parameters after combined exercise during 12 weeks were compared between each genotype subgroups of hUCP2 gene to clarify the inter-individual differences in exercised-induced changes according to genetic predisposition. Methods A total of 185 women aged over 40 years living in Seoul, Korea were participated in this study, and analyzed before and after 12 weeks on combined exercise including aerobic exercise and strength training for body composition, hemodynamic parameters, physical fitness and metabolic variables. A 45 bp I/D polymorphism in hUCP2 gene was genotyped by polymerase chain reaction (PCR) amplification and agarose gel electrophoresis method. Results Combined exercise program during 12 weeks indicated the significant health-promoting effects for our participants on multiple body composition, hemodynamic parameters, physical fitness factors and metabolic parameters, respectively. With respect to a 45 bp I/D polymorphism in hUCP2 gene, this polymorphism was significantly associated with baseline %body fat of our participants (P <.05). Moreover, this polymorphism was significantly associated with the changes in %body fat and serum triglyceride(TG) level after combined exercise program during 12 weeks(P <.05). Conclusion Our data suggest that a 45 bp I/D polymorphism in hUCP2 gene may at least in part contribute to the inter-individual differences on the changes in some clinical and metabolic parameters following combined exercise in middle-aged women. PMID:25909061

Novel Mutations in HESX1 and PROP1 Genes in Combined Pituitary Hormone Deficiency.

PubMed

Avbelj Stefanija, Magdalena; Kotnik, Primož; Bratanič, Nina; Žerjav Tanšek, Mojca; Bertok, Sara; Bratina, Nataša; Battelino, Tadej; Trebušak Podkrajšek, Katarina

2015-01-01

The HESX1 gene is essential in forebrain development and pituitary organogenesis, and its mutations are the most commonly identified genetic cause of septo-optic dysplasia (SOD). The PROP1 gene is involved in anterior pituitary cell lineage specification and is commonly implicated in non-syndromic combined pituitary hormone deficiency (CPHD). We aimed to assess the involvement of HESX1 and PROP1 mutations in a cohort of patients with SOD and CPHD. Six patients with sporadic SOD and 16 patients with CPHD from 14 pedigrees were screened for mutations in HESX1 and PROP1 genes by exon sequencing. Half of the CPHD patients had variable associated clinical characteristics, such as hearing loss, orofacial cleft, kidney disorder or developmental delay. Novel variants were evaluated in silico and verified in SNP databases. A novel heterozygous p.Glu102Gly mutation in the HESX1 gene and a novel homozygous p.Arg121Thr mutation in the PROP1 gene were detected in 2 pedigrees with CPHD. A small previously reported deletion in PROP1 c.301_302delAG was detected in a separate patient with CPHD, in heterozygous state. No mutations were identified in patients with SOD. Our results expand the spectrum of mutations implicated in CPHD. The frequency of 15% of the PROP1 mutations in CPHD was low, likely due to the clinical heterogeneity of the cohort. © 2015 S. Karger AG, Basel.
Combined exposure to Maneb and Paraquat alters transcriptional regulation of neurogenesis-related genes in mice models of Parkinson's disease.

PubMed

Desplats, Paula; Patel, Pruthul; Kosberg, Kori; Mante, Michael; Patrick, Christina; Rockenstein, Edward; Fujita, Masayo; Hashimoto, Makoto; Masliah, Eliezer

2012-09-28

Parkinson's disease (PD) is a multifactorial disease where environmental factors act on genetically predisposed individuals. Although only 5% of PD manifestations are associated with specific mutations, majority of PD cases are of idiopathic origin, where environment plays a prominent role. Concurrent exposure to Paraquat (PQ) and Maneb (MB) in rural workers increases the risk for PD and exposure of adult mice to MB/PQ results in dopamine fiber loss and decreased locomotor activity. While PD is characterized by neuronal loss in the substantia nigra, we previously showed that accumulation of α-synuclein in the limbic system contributes to neurodegeneration by interfering with adult neurogenesis. We investigated the effect of pesticides on adult hippocampal neurogenesis in two transgenic models: Line 61, expressing the human wild type SNCA gene and Line LRRK2(G2019S), expressing the human LRRK2 gene with the mutation G2019S. Combined exposure to MB/PQ resulted in significant reduction of neuronal precursors and proliferating cells in non-transgenic animals, and this effect was increased in transgenic mice, in particular for Line 61, suggesting that α-synuclein accumulation and environmental toxins have a synergistic effect. We further investigated the transcription of 84 genes with direct function on neurogenesis. Overexpresion of α-synuclein resulted in the downregulation of 12% of target genes, most of which were functionally related to cell differentiation, while LRRK2 mutation had a minor impact on gene expression. MB/PQ also affected transcription in non-transgenic backgrounds, but when transgenic mice were exposed to the pesticides, profound alterations in gene expression affecting 27% of the studied targets were observed in both transgenic lines. Gene enrichment analysis showed that 1:3 of those genes were under the regulation of FoxF2 and FoxO3A, suggesting a primary role of these proteins in the response to genetic and environmental cues. We report that
Molecular cloning, structure, phylogeny and expression analysis of the invertase gene family in sugarcane.

PubMed

Wang, Liming; Zheng, Yuexia; Ding, Shihui; Zhang, Qing; Chen, Youqiang; Zhang, Jisen

2017-06-23

Invertases (INVs) are key enzymes regulating sucrose metabolism and are here revealed to be involved in responses to environmental stress in plants. To date, individual members of the invertase gene family and their expression patterns are unknown in sugarcane due to its complex genome despite their significance in sucrose metabolism. In this study, based on comparative genomics, eleven cDNA and twelve DNA sequences belonging to 14 non-redundant members of the invertase gene family were successfully cloned from sugarcane. A comprehensive analysis of the invertase gene family was carried out, including gene structures, phylogenetic relationships, functional domains, conserved motifs of proteins. The results revealed that the 14 invertase members from sugarcane could be clustered into three subfamilies, including 6 neutral/alkaline invertases (ShN/AINVs), and 8 acid invertases (ShAINVs). Faster divergence occurred in acid INVs than in neutral/alkaline INVs after the split of sugarcane and sorghum. At least a one-time gene duplication event was observed to have occurred in the four groups of acid INVs, whereas ShN/AINV1 and ShN/AINV2 in the β8 lineage were revealed to be the most recently duplicated genes among their paralogous genes in the β group of N/AINVs. Furthermore, comprehensive expression analysis of these genes was performed in sugarcane seedlings subjected to five abiotic stresses (drought, low temperature, glucose, fructose, and sucrose) using Quantitative Real-time PCR. The results suggested a functional divergence of INVs and their potential role in response to the five different treatments. Enzymatic activity in sugarcane seedlings was detected under five abiotic stresses treatments, and showed that the activities of all INVs were significantly inhibited in response to five different abiotic stresses, and that the neutral/alkaline INVs played a more prominent role in abiotic stresses than the acid INVs. In this study, we determined the INV gene family
Metabolic profiling of ob/ob mouse fatty liver using HR-MAS 1H-NMR combined with gene expression analysis reveals alterations in betaine metabolism and the transsulfuration pathway.

PubMed

Gogiashvili, Mikheil; Edlund, Karolina; Gianmoena, Kathrin; Marchan, Rosemarie; Brik, Alexander; Andersson, Jan T; Lambert, Jörg; Madjar, Katrin; Hellwig, Birte; Rahnenführer, Jörg; Hengstler, Jan G; Hergenröder, Roland; Cadenas, Cristina

2017-02-01

Metabolic perturbations resulting from excessive hepatic fat accumulation are poorly understood. Thus, in this study, leptin-deficient ob/ob mice, a mouse model of fatty liver disease, were used to investigate metabolic alterations in more detail. Metabolites were quantified in intact liver tissues of ob/ob (n = 8) and control (n = 8) mice using high-resolution magic angle spinning (HR-MAS) 1 H-NMR. In addition, after demonstrating that HR-MAS 1 H-NMR does not affect RNA integrity, transcriptional changes were measured by quantitative real-time PCR on RNA extracted from the same specimens after HR-MAS 1 H-NMR measurements. Importantly, the gene expression changes obtained agreed with those observed by Affymetrix microarray analysis performed on RNA isolated directly from fresh-frozen tissue. In total, 40 metabolites could be assigned in the spectra and subsequently quantified. Quantification of lactate was also possible after applying a lactate-editing pulse sequence that suppresses the lipid signal, which superimposes the lactate methyl resonance at 1.3 ppm. Significant differences were detected for creatinine, glutamate, glycine, glycolate, trimethylamine-N-oxide, dimethylglycine, ADP, AMP, betaine, phenylalanine, and uridine. Furthermore, alterations in one-carbon metabolism, supported by both metabolic and transcriptional changes, were observed. These included reduced demethylation of betaine to dimethylglycine and the reduced expression of genes coding for transsulfuration pathway enzymes, which appears to preserve methionine levels, but may limit glutathione synthesis. Overall, the combined approach is advantageous as it identifies changes not only at the single gene or metabolite level but also deregulated pathways, thus providing critical insight into changes accompanying fatty liver disease. Graphical abstract A Evaluation of RNA integrity before and after HR-MAS 1 H-NMR of intact mouse liver tissue. B Metabolite concentrations and gene expression
Analysis of blood-based gene expression in idiopathic Parkinson disease.

PubMed

Shamir, Ron; Klein, Christine; Amar, David; Vollstedt, Eva-Juliane; Bonin, Michael; Usenovic, Marija; Wong, Yvette C; Maver, Ales; Poths, Sven; Safer, Hershel; Corvol, Jean-Christophe; Lesage, Suzanne; Lavi, Ofer; Deuschl, Günther; Kuhlenbaeumer, Gregor; Pawlack, Heike; Ulitsky, Igor; Kasten, Meike; Riess, Olaf; Brice, Alexis; Peterlin, Borut; Krainc, Dimitri

2017-10-17

To examine whether gene expression analysis of a large-scale Parkinson disease (PD) patient cohort produces a robust blood-based PD gene signature compared to previous studies that have used relatively small cohorts (≤220 samples). Whole-blood gene expression profiles were collected from a total of 523 individuals. After preprocessing, the data contained 486 gene profiles (n = 205 PD, n = 233 controls, n = 48 other neurodegenerative diseases) that were partitioned into training, validation, and independent test cohorts to identify and validate a gene signature. Batch-effect reduction and cross-validation were performed to ensure signature reliability. Finally, functional and pathway enrichment analyses were applied to the signature to identify PD-associated gene networks. A gene signature of 100 probes that mapped to 87 genes, corresponding to 64 upregulated and 23 downregulated genes differentiating between patients with idiopathic PD and controls, was identified with the training cohort and successfully replicated in both an independent validation cohort (area under the curve [AUC] = 0.79, p = 7.13E-6) and a subsequent independent test cohort (AUC = 0.74, p = 4.2E-4). Network analysis of the signature revealed gene enrichment in pathways, including metabolism, oxidation, and ubiquitination/proteasomal activity, and misregulation of mitochondria-localized genes, including downregulation of COX4I1 , ATP5A1 , and VDAC3 . We present a large-scale study of PD gene expression profiling. This work identifies a reliable blood-based PD signature and highlights the importance of large-scale patient cohorts in developing potential PD biomarkers. © 2017 American Academy of Neurology.
htsint: a Python library for sequencing pipelines that combines data through gene set generation.

PubMed

Richards, Adam J; Herrel, Anthony; Bonneaud, Camille

2015-09-24

Sequencing technologies provide a wealth of details in terms of genes, expression, splice variants, polymorphisms, and other features. A standard for sequencing analysis pipelines is to put genomic or transcriptomic features into a context of known functional information, but the relationships between ontology terms are often ignored. For RNA-Seq, considering genes and their genetic variants at the group level enables a convenient way to both integrate annotation data and detect small coordinated changes between experimental conditions, a known caveat of gene level analyses. We introduce the high throughput data integration tool, htsint, as an extension to the commonly used gene set enrichment frameworks. The central aim of htsint is to compile annotation information from one or more taxa in order to calculate functional distances among all genes in a specified gene space. Spectral clustering is then used to partition the genes, thereby generating functional modules. The gene space can range from a targeted list of genes, like a specific pathway, all the way to an ensemble of genomes. Given a collection of gene sets and a count matrix of transcriptomic features (e.g. expression, polymorphisms), the gene sets produced by htsint can be tested for 'enrichment' or conditional differences using one of a number of commonly available packages. The database and bundled tools to generate functional modules were designed with sequencing pipelines in mind, but the toolkit nature of htsint allows it to also be used in other areas of genomics. The software is freely available as a Python library through GitHub at https://github.com/ajrichards/htsint.
Analysis of gene expression during parabolic flights reveals distinct early gravity responses in Arabidopsis roots.

PubMed

Aubry-Hivet, D; Nziengui, H; Rapp, K; Oliveira, O; Paponov, I A; Li, Y; Hauslage, J; Vagt, N; Braun, M; Ditengou, F A; Dovzhenko, A; Palme, K

2014-01-01

Plant roots are among most intensively studied biological systems in gravity research. Altered gravity induces asymmetric cell growth leading to root bending. Differential distribution of the phytohormone auxin underlies root responses to gravity, being coordinated by auxin efflux transporters from the PIN family. The objective of this study was to compare early transcriptomic changes in roots of Arabidopsis thaliana wild type, and pin2 and pin3 mutants under parabolic flight conditions and to correlate these changes to auxin distribution. Parabolic flights allow comparison of transient 1-g, hypergravity and microgravity effects in living organisms in parallel. We found common and mutation-related genes differentially expressed in response to transient microgravity phases. Gene ontology analysis of common genes revealed lipid metabolism, response to stress factors and light categories as primarily involved in response to transient microgravity phases, suggesting that fundamental reorganisation of metabolic pathways functions upstream of a further signal mediating hormonal network. Gene expression changes in roots lacking the columella-located PIN3 were stronger than in those deprived of the epidermis and cortex cell-specific PIN2. Moreover, repetitive exposure to microgravity/hypergravity and gravity/hypergravity flight phases induced an up-regulation of auxin responsive genes in wild type and pin2 roots, but not in pin3 roots, suggesting a critical function of PIN3 in mediating auxin fluxes in response to transient microgravity phases. Our study provides important insights towards understanding signal transduction processes in transient microgravity conditions by combining for the first time the parabolic flight platform with the transcriptome analysis of different genetic mutants in the model plant, Arabidopsis. © 2013 German Botanical Society and The Royal Botanical Society of the Netherlands.
Reference genes for normalization of gene expression studies in human osteoarthritic articular cartilage.

PubMed

Pombo-Suarez, Manuel; Calaza, Manuel; Gomez-Reino, Juan J; Gonzalez, Antonio

2008-01-29

Assessment of gene expression is an important component of osteoarthritis (OA) research, greatly improved by the development of quantitative real-time PCR (qPCR). This technique requires normalization for precise results, yet no suitable reference genes have been identified in human articular cartilage. We have examined ten well-known reference genes to determine the most adequate for this application. Analyses of expression stability in cartilage from 10 patients with hip OA, 8 patients with knee OA and 10 controls without OA were done with classical statistical tests and the software programs geNorm and NormFinder. Results from the three methods of analysis were broadly concordant. Some of the commonly used reference genes, GAPDH, ACTB and 18S RNA, performed poorly in our analysis. In contrast, the rarely used TBP, RPL13A and B2M genes were the best. It was necessary to use together several of these three genes to obtain the best results. The specific combination depended, to some extent, on the type of samples being compared. Our results provide a satisfactory set of previously unused reference genes for qPCR in hip and knee OA This confirms the need to evaluate the suitability of reference genes in every tissue and experimental situation before starting the quantitative assessment of gene expression by qPCR.
Identification of the key regulating genes of diminished ovarian reserve (DOR) by network and gene ontology analysis.

PubMed

Pashaiasl, Maryam; Ebrahimi, Mansour; Ebrahimie, Esmaeil

2016-09-01

Diminished ovarian reserve (DOR) is one of the reasons for infertility that not only affects both older and young women. Ovarian reserve assessment can be used as a new prognostic tool for infertility treatment decision making. Here, up- and down-regulated gene expression profiles of granulosa cells were analysed to generate a putative interaction map of the involved genes. In addition, gene ontology (GO) analysis was used to get insight intol the biological processes and molecular functions of involved proteins in DOR. Eleven up-regulated genes and nine down-regulated genes were identified and assessed by constructing interaction networks based on their biological processes. PTGS2, CTGF, LHCGR, CITED, SOCS2, STAR and FSTL3 were the key nodes in the up-regulated networks, while the IGF2, AMH, GREM, and FOXC1 proteins were key in the down-regulated networks. MIRN101-1, MIRN153-1 and MIRN194-1 inhibited the expression of SOCS2, while CSH1 and BMP2 positively regulated IGF1 and IGF2. Ossification, ovarian follicle development, vasculogenesis, sequence-specific DNA binding transcription factor activity, and golgi apparatus are the major differential groups between up-regulated and down-regulated genes in DOR. Meta-analysis of publicly available transcriptomic data highlighted the high coexpression of CTGF, connective tissue growth factor, with the other key regulators of DOR. CTGF is involved in organ senescence and focal adhesion pathway according to GO analysis. These findings provide a comprehensive system biology based insight into the aetiology of DOR through network and gene ontology analyses.
Gene expression analysis of bud and leaf color in tea.

PubMed

Wei, Kang; Zhang, Yazhen; Wu, Liyun; Li, Hailin; Ruan, Li; Bai, Peixian; Zhang, Chengcai; Zhang, Fen; Xu, Liyi; Wang, Liyuan; Cheng, Hao

2016-10-01

Purple shoot tea attributing to the high anthocyanin accumulation is of great interest for its wide health benefits. To better understand potential mechanisms involved in purple buds and leaves formation in tea plants, we performed transcriptome analysis of six green or purple shoot tea individuals from a F1 population using the Illumina sequencing method. Totally 292 million RNA-Seq reads were obtained and assembled into 112,233 unigenes, with an average length of 759 bp and an N50 of 1081 bp. Moreover, totally 2193 unigenes showed significant differences in expression levels between green and purple tea samples, with 1143 up- and 1050 down-regulated in the purple teas. Further real time PCR analysis confirmed RNA-Seq results. Our study identified 28 differentially expressed transcriptional factors and A CsMYB gene was found to be highly similar to AtPAP1 in Arabidopsis. Further analysis of differentially expressed genes involved in anthocyanin biosynthesis and transportation showed that the late biosynthetic genes and genes involved in anthocyanin transportation were largely affected but the early biosynthetic genes were less or none affected. Overall, the identification of a large number of differentially expressed genes offers a global view of the potential mechanisms associated with purple buds and leaves formation, which will facilitate molecular breeding in tea plants. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Microarray analysis and scale-free gene networks identify candidate regulators in drought-stressed roots of loblolly pine (P. taeda L.)

PubMed Central

2011-01-01

Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the
Combined antitumor activity of the nitroreductase/CB1954 suicide gene system and γ-rays in HeLa cells in vitro.

PubMed

Teng, Geling; Ju, Yuanrong; Yang, Yepeng; Hua, Hu; Chi, Jingyu; Mu, Xiuan

2016-12-01

Escherichia coli nitroreductase (NTR) may convert the prodrug CB1954 (5-(aziridin-1-yl)-2,4-dinitrobenzamide) into a bifunctional alkylating agent, which may lead to DNA crosslinks and the apoptosis of cancer cells. NTR/CB1954 has been demonstrated to be an effective gene therapy in cancer cells. The present study examined whether the NTR/CB1954 suicide gene system had cytotoxic effects on HeLa cells and may improve the radiosensitivity of HeLa cells to γ‑rays. It was observed that the NTR/CB1954 suicide gene system exerted marked cytotoxic effects on HeLa cells. The combined therapeutic effects of NTR/CB1954 and γ‑rays on HeLa cells demonstrated a synergistic effect. CB1954 at concentrations of 12.5 and 25 µmol/l increased the sensitization enhancement ratio of HeLa cells to 1.54 and 1.66, respectively. Therefore, when compared with monotherapy, the combined therapy of NTR/CB1954 and γ‑rays may increase the apoptotic rate and enhance the radiosensitivity of HeLa cells. The combined therapy of γ‑ray radiation and the NTR/CB1954 suicide gene system may be a novel and potent therapeutic method for the treatment of cervical carcinoma.
Prioritization of Epilepsy Associated Candidate Genes by Convergent Analysis

PubMed Central

Jia, Peilin; Ewers, Jeffrey M.; Zhao, Zhongming

2011-01-01

Background Epilepsy is a severe neurological disorder affecting a large number of individuals, yet the underlying genetic risk factors for epilepsy remain unclear. Recent studies have revealed several recurrent copy number variations (CNVs) that are more likely to be associated with epilepsy. The responsible gene(s) within these regions have yet to be definitively linked to the disorder, and the implications of their interactions are not fully understood. Identification of these genes may contribute to a better pathological understanding of epilepsy, and serve to implicate novel therapeutic targets for further research. Methodology/Principal Findings In this study, we examined genes within heterozygous deletion regions identified in a recent large-scale study, encompassing a diverse spectrum of epileptic syndromes. By integrating additional protein-protein interaction data, we constructed subnetworks for these CNV-region genes and also those previously studied for epilepsy. We observed 20 genes common to both networks, primarily concentrated within a small molecular network populated by GABA receptor, BDNF/MAPK signaling, and estrogen receptor genes. From among the hundreds of genes in the initial networks, these were designated by convergent evidence for their likely association with epilepsy. Importantly, the identified molecular network was found to contain complex interrelationships, providing further insight into epilepsy's underlying pathology. We further performed pathway enrichment and crosstalk analysis and revealed a functional map which indicates the significant enrichment of closely related neurological, immune, and kinase regulatory pathways. Conclusions/Significance The convergent framework we proposed here provides a unique and powerful approach to screening and identifying promising disease genes out of typically hundreds to thousands of genes in disease-related CNV-regions. Our network and pathway analysis provides important implications for the
Prioritization of epilepsy associated candidate genes by convergent analysis.

PubMed

Jia, Peilin; Ewers, Jeffrey M; Zhao, Zhongming

2011-02-24

Epilepsy is a severe neurological disorder affecting a large number of individuals, yet the underlying genetic risk factors for epilepsy remain unclear. Recent studies have revealed several recurrent copy number variations (CNVs) that are more likely to be associated with epilepsy. The responsible gene(s) within these regions have yet to be definitively linked to the disorder, and the implications of their interactions are not fully understood. Identification of these genes may contribute to a better pathological understanding of epilepsy, and serve to implicate novel therapeutic targets for further research. In this study, we examined genes within heterozygous deletion regions identified in a recent large-scale study, encompassing a diverse spectrum of epileptic syndromes. By integrating additional protein-protein interaction data, we constructed subnetworks for these CNV-region genes and also those previously studied for epilepsy. We observed 20 genes common to both networks, primarily concentrated within a small molecular network populated by GABA receptor, BDNF/MAPK signaling, and estrogen receptor genes. From among the hundreds of genes in the initial networks, these were designated by convergent evidence for their likely association with epilepsy. Importantly, the identified molecular network was found to contain complex interrelationships, providing further insight into epilepsy's underlying pathology. We further performed pathway enrichment and crosstalk analysis and revealed a functional map which indicates the significant enrichment of closely related neurological, immune, and kinase regulatory pathways. The convergent framework we proposed here provides a unique and powerful approach to screening and identifying promising disease genes out of typically hundreds to thousands of genes in disease-related CNV-regions. Our network and pathway analysis provides important implications for the underlying molecular mechanisms for epilepsy. The strategy can be
Topography of the Duchenne muscular dystrophy (DMD) gene: FIGE and cDNA analysis of 194 cases reveals 115 deletions and 13 duplications.

PubMed Central

Den Dunnen, J T; Grootscholten, P M; Bakker, E; Blonden, L A; Ginjaar, H B; Wapenaar, M C; van Paassen, H M; van Broeckhoven, C; Pearson, P L; van Ommen, G J

1989-01-01

We have studied 34 Becker and 160 Duchenne muscular dystrophy (DMD) patients with the dystrophin cDNA, using conventional blots and FIGE analysis. One hundred twenty-eight mutations (65%) were found, 115 deletions and 13 duplications, of which 106 deletions and 11 duplications could be precisely mapped in relation to both the mRNA and the major and minor mutation hot spots. Junction fragments, ideal markers for carrier detection, were found in 23 (17%) of the 128 cases. We identified eight new cDNA RFLPs within the DMD gene. With the use of cDNA probes we have completed the long-range map of the DMD gene, by the identification of a 680-kb SfiI fragment containing the gene's 3' end. The size of the DMD gene is now determined to be about 2.3 million basepairs. The combination of cDNA hybridizations with long-range analysis of deletion and duplication patients yields a global picture of the exon spacing within the dystrophin gene. The gene shows a large variability of intron size, ranging from only a few kilobases to 160-180 kb for the P20 intron. Images Figure 1 Figure 4 PMID:2573997
Screening key candidate genes and pathways involved in insulinoma by microarray analysis.

PubMed

Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin

2018-06-01

Insulinoma is a rare type tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma.The gene expression data from GSE73338 were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues.A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis.The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecule events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.
Buckling analysis for anisotropic laminated plates under combined inplane loads

NASA Technical Reports Server (NTRS)

Viswanathan, A. V.; Tamekuni, M.; Baker, L. L.

1974-01-01

The buckling analysis presented considers rectangular flat or curved general laminates subjected to combined inplane normal and shear loads. Linear theory is used in the analysis. All prebuckling deformations and any initial imperfections are ignored. The analysis method can be readily extended to longitudinally stiffened structures subjected to combined inplane normal and shear loads.
Molecular Insights on Post-chemotherapy Retinoblastoma by Microarray Gene Expression Analysis

PubMed Central

Nalini, Venkatesan; Segu, Ramya; Deepa, Perinkulam Ravi; Khetan, Vikas; Vasudevan, Madavan; Krishnakumar, Subramanian

2013-01-01

Purpose Management of Retinoblastoma (RB), a pediatric ocular cancer is limited by drug-resistance and drug-dosage related side effects during chemotherapy. Molecular de-regulation in post-chemotherapy RB tumors was investigated. Materials and Methods cDNA microarray analysis of two post-chemotherapy and one pre-chemotherapy RB tumor tissues was performed, followed by Principle Component Analysis, Gene ontology, Pathway Enrichment analysis and Biological Analysis Network (BAN) modeling. The drug modulation role of two significantly up-regulated genes (p≤0.05) − Ect2 (Epithelial-cell-transforming-sequence-2), and PRAME (preferentially-expressed-Antigen-in-Melanoma) was assessed by qRT-PCR, immunohistochemistry and cell viability assays. Results Differential up-regulation of 1672 genes and down-regulation of 2538 genes was observed in RB tissues (relative to normal adult retina), while 1419 genes were commonly de-regulated between pre-chemotherapy and post- chemotherapy RB. Twenty one key gene ontology categories, pathways, biomarkers and phenotype groups harboring 250 differentially expressed genes were dys-regulated (EZH2, NCoR1, MYBL2, RB1, STAMN1, SYK, JAK1/2, STAT1/2, PLK2/4, BIRC5, LAMN1, Ect2, PRAME and ABCC4). Differential molecular expressions of PRAME and Ect2 in RB tumors with and without chemotherapy were analyzed. There was neither up- regulation of MRP1, nor any significant shift in chemotherapeutic IC50, in PRAME over-expressed versus non-transfected RB cells. Conclusion Cell cycle regulatory genes were dys-regulated post-chemotherapy. Ect2 gene was expressed in response to chemotherapy-induced stress. PRAME does not contribute to drug resistance in RB, yet its nuclear localization and BAN information, points to its possible regulatory role in RB. PMID:24092970
Enrichr: a comprehensive gene set enrichment analysis web server 2016 update

PubMed Central

Kuleshov, Maxim V.; Jones, Matthew R.; Rouillard, Andrew D.; Fernandez, Nicolas F.; Duan, Qiaonan; Wang, Zichen; Koplev, Simon; Jenkins, Sherry L.; Jagodnik, Kathleen M.; Lachmann, Alexander; McDermott, Michael G.; Monteiro, Caroline D.; Gundersen, Gregory W.; Ma'ayan, Avi

2016-01-01

Enrichment analysis is a popular method for analyzing gene sets generated by genome-wide experiments. Here we present a significant update to one of the tools in this domain called Enrichr. Enrichr currently contains a large collection of diverse gene set libraries available for analysis and download. In total, Enrichr currently contains 180 184 annotated gene sets from 102 gene set libraries. New features have been added to Enrichr including the ability to submit fuzzy sets, upload BED files, improved application programming interface and visualization of the results as clustergrams. Overall, Enrichr is a comprehensive resource for curated gene sets and a search engine that accumulates biological knowledge for further biological discoveries. Enrichr is freely available at: http://amp.pharm.mssm.edu/Enrichr. PMID:27141961
Time-Course Analysis of Gene Expression During the Saccharomyces cerevisiae Hypoxic Response.

PubMed

Bendjilali, Nasrine; MacLeon, Samuel; Kalra, Gurmannat; Willis, Stephen D; Hossian, A K M Nawshad; Avery, Erica; Wojtowicz, Olivia; Hickman, Mark J

2017-01-05

Many cells experience hypoxia, or low oxygen, and respond by dramatically altering gene expression. In the yeast Saccharomyces cerevisiae, genes that respond are required for many oxygen-dependent cellular processes, such as respiration, biosynthesis, and redox regulation. To more fully characterize the global response to hypoxia, we exposed yeast to hypoxic conditions, extracted RNA at different times, and performed RNA sequencing (RNA-seq) analysis. Time-course statistical analysis revealed hundreds of genes that changed expression by up to 550-fold. The genes responded with varying kinetics suggesting that multiple regulatory pathways are involved. We identified most known oxygen-regulated genes and also uncovered new regulated genes. Reverse transcription-quantitative PCR (RT-qPCR) analysis confirmed that the lysine methyltransferase EFM6 and the recombinase DMC1, both conserved in humans, are indeed oxygen-responsive. Looking more broadly, oxygen-regulated genes participate in expected processes like respiration and lipid metabolism, but also in unexpected processes like amino acid and vitamin metabolism. Using principle component analysis, we discovered that the hypoxic response largely occurs during the first 2 hr and then a new steady-state expression state is achieved. Moreover, we show that the oxygen-dependent genes are not part of the previously described environmental stress response (ESR) consisting of genes that respond to diverse types of stress. While hypoxia appears to cause a transient stress, the hypoxic response is mostly characterized by a transition to a new state of gene expression. In summary, our results reveal that hypoxia causes widespread and complex changes in gene expression to prepare the cell to function with little or no oxygen. Copyright © 2017 Bendjilali et al.

[Preliminary analysis of retinal gene expression profile of diabetic rat].

PubMed

Mei, Yan; Zhou, Hong-ying; Xiang, Tao; Lu, You-guang; Li, Ai-dong; Tang, En-jie; Yang, Hui-jun

2005-10-01

Establishing the retinal gene expression profiles of non-diabetic rat and diabetic rat and comparing the profiles in order to analyze the possible genes related with diabetic retinopathy. The whole retinal transcriptional fragments of non-diabetic rat and 8-week diabetic rat were obtained by restriction fragments differential display-PCR (RFDD-PCR). Bioinformatic analysis of retinal gene expression was performed using soft wares, including Fragment Analysis. After comparison of the expression profiles, the related gene fragments of diabetic retinopathy were initially selected as the target gene of further approach. A total of 3639 significant fragments were obtained. By means of more than 3-fold contrast of fluorescent intensity as the differential expression standard, the authors got 840 differential fragments, accounting for 23.08% of the expressed numbers and including 5 visual related genes, 13 excitatory neruotransmitter genes and 3 inhibitory neurotransmitter genes. At the 8th week, the expression of Rhodopsin kinase, beta-arrestin, Phosducinìrod photoreceptor cGMP-gated channel and Rpe65 as well as iGlu R1-4 were down-regulated. mGluRs and GABA-Rs were all up-regulated, whereas the expression of GlyR was unchanged. These results prompt again that the changes in retinal nervous layer of rat have occurred at an early stage of diabetes. The genes expression pattern of visual related genes and excitatory and inhibitory neurotransmitters in rat diabetic retina have been involved in neuro-dysfunctions of diabetic retina.
Mechanisms of colitis-accelerated colon carcinogenesis and its prevention with the combination of aspirin and curcumin: Transcriptomic analysis using RNA-seq.

PubMed

Guo, Yue; Su, Zheng-Yuan; Zhang, Chengyue; Gaspar, John M; Wang, Rui; Hart, Ronald P; Verzi, Michael P; Kong, Ah-Ng Tony

2017-07-01

Colorectal cancer (CRC) remains the leading cause of cancer-related death in the world. Aspirin (ASA) and curcumin (CUR) are widely investigated chemopreventive candidates for CRC. However, the precise mechanisms of their action and their combinatorial effects have not been evaluated. The purpose of the present study was to determine the effect of ASA, CUR, and their combination in azoxymethane/dextran sulfate sodium (AOM/DSS)-induced colitis-accelerated colorectal cancer (CAC). We also aimed to characterize the differential gene expression profiles in AOM/DSS-induced tumors as well as in tumors modulated by ASA and CUR using RNA-seq. Diets supplemented with 0.02% ASA, 2% CUR or 0.01% ASA+1% CUR were given to mice from 1week prior to the AOM injection until the experiment was terminated 22weeks after AOM initiation. Our results showed that CUR had a superior inhibitory effect in colon tumorigenesis compared to that of ASA. The combination of ASA and CUR at a lower dose exhibited similar efficacy to that of a higher dose of CUR at 2%. RNA isolated from colonic tissue from the control group and from tumor samples from the experimental groups was subjected to RNA-seq. Transcriptomic analysis suggested that the low-dose combination of ASA and CUR modulated larger gene sets than the single treatment. These differentially expressed genes were situated in several canonical pathways important in the inflammatory network and liver metastasis in CAC. We identified a small subset of genes as potential molecular targets involved in the preventive action of the combination of ASA and CUR. Taken together, the current results provide the first evidence in support of the chemopreventive effect of a low-dose combination of ASA and CUR in CAC. Moreover, the transcriptional profile obtained in our study may provide a framework for identifying the mechanisms underlying the carcinogenesis process from normal colonic tissue to tumor development as well as the cancer inhibitory effects
Weighted gene co‑expression network analysis in identification of key genes and networks for ischemic‑reperfusion remodeling myocardium.

PubMed

Guo, Nan; Zhang, Nan; Yan, Liqiu; Lian, Zheng; Wang, Jiawang; Lv, Fengfeng; Wang, Yunfei; Cao, Xufen

2018-06-14

Acute myocardial infarction induces ventricular remodeling, which is implicated in dilated heart and heart failure. The pathogenical mechanism of myocardium remodeling remains to be elucidated. The aim of the present study was to identify key genes and networks for myocardium remodeling following ischemia‑reperfusion (IR). First, the mRNA expression data from the National Center for Biotechnology Information database were downloaded to identify differences in mRNA expression of the IR heart at days 2 and 7. Then, weighted gene co‑expression network analysis, hierarchical clustering, protein‑protein interaction (PPI) network, Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway were used to identify key genes and networks for the heart remodeling process following IR. A total of 3,321 differentially expressed genes were identified during the heart remodeling process. A total of 6 modules were identified through gene co‑expression network analysis. GO and KEGG analysis results suggested that each module represented a different biological function and was associated with different pathways. Finally, hub genes of each module were identified by PPI network construction. The present study revealed that heart remodeling following IR is a complicated process, involving extracellular matrix organization, neural development, apoptosis and energy metabolism. The dysregulated genes, including SRC proto‑oncogene, non‑receptor tyrosine kinase, discs large MAGUK scaffold protein 1, ATP citrate lyase, RAN, member RAS oncogene family, tumor protein p53, and polo like kinase 2, may be essential for heart remodeling following IR and may be used as potential targets for the inhibition of heart remodeling following acute myocardial infarction.
Genome-Wide Analysis of the NADK Gene Family in Plants

PubMed Central

Li, Wen-Yan; Wang, Xiang; Li, Ri; Li, Wen-Qiang; Chen, Kun-Ming

2014-01-01

Background NAD(H) kinase (NADK) is the key enzyme that catalyzes de novo synthesis of NADP(H) from NAD(H) for NADP(H)-based metabolic pathways. In plants, NADKs form functional subfamilies. Studies of these families in Arabidopsis thaliana indicate that they have undergone considerable evolutionary selection; however, the detailed evolutionary history and functions of the various NADKs in plants are not clearly understood. Principal Findings We performed a comparative genomic analysis that identified 74 NADK gene homologs from 24 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots and eudicots. Phylogenetic and structural analysis classified these NADK genes into four well-conserved subfamilies with considerable variety in the domain organization and gene structure among subfamily members. In addition to the typical NAD_kinase domain, additional domains, such as adenylate kinase, dual-specificity phosphatase, and protein tyrosine phosphatase catalytic domains, were found in subfamily II. Interestingly, NADKs in subfamily III exhibited low sequence similarity (∼30%) in the kinase domain within the subfamily and with the other subfamilies. These observations suggest that gene fusion and exon shuffling may have occurred after gene duplication, leading to specific domain organization seen in subfamilies II and III, respectively. Further analysis of the exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures, during the process of structural evolution of NADK family genes. Finally, both available global microarray data analysis and qRT-RCR experiments revealed that the NADK genes in Arabidopsis and Oryza sativa show different expression patterns in different developmental stages and under several different abiotic/biotic stresses and hormone treatments, underscoring the functional diversity
Identification of novel and known oocyte-specific genes using complementary DNA subtraction and microarray analysis in three different species.

PubMed

Vallée, Maud; Gravel, Catherine; Palin, Marie-France; Reghenas, Hélène; Stothard, Paul; Wishart, David S; Sirard, Marc-André

2005-07-01

The main objective of the present study was to identify novel oocyte-specific genes in three different species: bovine, mouse, and Xenopus laevis. To achieve this goal, two powerful technologies were combined: a polymerase chain reaction (PCR)-based cDNA subtraction, and cDNA microarrays. Three subtractive libraries consisting of 3456 clones were established and enriched for oocyte-specific transcripts. Sequencing analysis of the positive insert-containing clones resulted in the following classification: 53% of the clones corresponded to known cDNAs, 26% were classified as uncharacterized cDNAs, and a final 9% were classified as novel sequences. All these clones were used for cDNA microarray preparation. Results from these microarray analyses revealed that in addition to already known oocyte-specific genes, such as GDF9, BMP15, and ZP, known genes with unknown function in the oocyte were identified, such as a MLF1-interacting protein (MLF1IP), B-cell translocation gene 4 (BTG4), and phosphotyrosine-binding protein (xPTB). Furthermore, 15 novel oocyte-specific genes were validated by reverse transcription-PCR to confirm their preferential expression in the oocyte compared to somatic tissues. The results obtained in the present study confirmed that microarray analysis is a robust technique to identify true positives from the suppressive subtractive hybridization experiment. Furthermore, obtaining oocyte-specific genes from three species simultaneously allowed us to look at important genes that are conserved across species. Further characterization of these novel oocyte-specific genes will lead to a better understanding of the molecular mechanisms related to the unique functions found in the oocyte.
Validation of reference genes for quantitative RT-PCR studies of gene expression in perennial ryegrass (Lolium perenne L.)

PubMed Central

2010-01-01

Background Perennial ryegrass (Lolium perenne L.) is an important pasture and turf crop. Biotechniques such as gene expression studies are being employed to improve traits in this temperate grass. Quantitative reverse transcription-polymerase chain reaction (qRT-PCR) is among the best methods available for determining changes in gene expression. Before analysis of target gene expression, it is essential to select an appropriate normalisation strategy to control for non-specific variation between samples. Reference genes that have stable expression at different biological and physiological states can be effectively used for normalisation; however, their expression stability must be validated before use. Results Existing Serial Analysis of Gene Expression data were queried to identify six moderately expressed genes that had relatively stable gene expression throughout the year. These six candidate reference genes (eukaryotic elongation factor 1 alpha, eEF1A; TAT-binding protein homolog 1, TBP-1; eukaryotic translation initiation factor 4 alpha, eIF4A; YT521-B-like protein family protein, YT521-B; histone 3, H3; ubiquitin-conjugating enzyme, E2) were validated for qRT-PCR normalisation in 442 diverse perennial ryegrass (Lolium perenne L.) samples sourced from field- and laboratory-grown plants under a wide range of experimental conditions. Eukaryotic EF1A is encoded by members of a multigene family exhibiting differential expression and necessitated the expression analysis of different eEF1A encoding genes; a highly expressed eEF1A (h), a moderately, but stably expressed eEF1A (s), and combined expression of multigene eEF1A (m). NormFinder identified eEF1A (s) and YT521-B as the best combination of two genes for normalisation of gene expression data in perennial ryegrass following different defoliation management in the field. Conclusions This study is unique in the magnitude of samples tested with the inclusion of numerous field-grown samples, helping pave the way to
Bi-directional gene set enrichment and canonical correlation analysis identify key diet-sensitive pathways and biomarkers of metabolic syndrome.

PubMed

Morine, Melissa J; McMonagle, Jolene; Toomey, Sinead; Reynolds, Clare M; Moloney, Aidan P; Gormley, Isobel C; Gaora, Peadar O; Roche, Helen M

2010-10-07

Currently, a number of bioinformatics methods are available to generate appropriate lists of genes from a microarray experiment. While these lists represent an accurate primary analysis of the data, fewer options exist to contextualise those lists. The development and validation of such methods is crucial to the wider application of microarray technology in the clinical setting. Two key challenges in clinical bioinformatics involve appropriate statistical modelling of dynamic transcriptomic changes, and extraction of clinically relevant meaning from very large datasets. Here, we apply an approach to gene set enrichment analysis that allows for detection of bi-directional enrichment within a gene set. Furthermore, we apply canonical correlation analysis and Fisher's exact test, using plasma marker data with known clinical relevance to aid identification of the most important gene and pathway changes in our transcriptomic dataset. After a 28-day dietary intervention with high-CLA beef, a range of plasma markers indicated a marked improvement in the metabolic health of genetically obese mice. Tissue transcriptomic profiles indicated that the effects were most dramatic in liver (1270 genes significantly changed; p < 0.05), followed by muscle (601 genes) and adipose (16 genes). Results from modified GSEA showed that the high-CLA beef diet affected diverse biological processes across the three tissues, and that the majority of pathway changes reached significance only with the bi-directional test. Combining the liver tissue microarray results with plasma marker data revealed 110 CLA-sensitive genes showing strong canonical correlation with one or more plasma markers of metabolic health, and 9 significantly overrepresented pathways among this set; each of these pathways was also significantly changed by the high-CLA diet. Closer inspection of two of these pathways--selenoamino acid metabolism and steroid biosynthesis--illustrated clear diet-sensitive changes in
Bi-directional gene set enrichment and canonical correlation analysis identify key diet-sensitive pathways and biomarkers of metabolic syndrome

PubMed Central

2010-01-01

Background Currently, a number of bioinformatics methods are available to generate appropriate lists of genes from a microarray experiment. While these lists represent an accurate primary analysis of the data, fewer options exist to contextualise those lists. The development and validation of such methods is crucial to the wider application of microarray technology in the clinical setting. Two key challenges in clinical bioinformatics involve appropriate statistical modelling of dynamic transcriptomic changes, and extraction of clinically relevant meaning from very large datasets. Results Here, we apply an approach to gene set enrichment analysis that allows for detection of bi-directional enrichment within a gene set. Furthermore, we apply canonical correlation analysis and Fisher's exact test, using plasma marker data with known clinical relevance to aid identification of the most important gene and pathway changes in our transcriptomic dataset. After a 28-day dietary intervention with high-CLA beef, a range of plasma markers indicated a marked improvement in the metabolic health of genetically obese mice. Tissue transcriptomic profiles indicated that the effects were most dramatic in liver (1270 genes significantly changed; p < 0.05), followed by muscle (601 genes) and adipose (16 genes). Results from modified GSEA showed that the high-CLA beef diet affected diverse biological processes across the three tissues, and that the majority of pathway changes reached significance only with the bi-directional test. Combining the liver tissue microarray results with plasma marker data revealed 110 CLA-sensitive genes showing strong canonical correlation with one or more plasma markers of metabolic health, and 9 significantly overrepresented pathways among this set; each of these pathways was also significantly changed by the high-CLA diet. Closer inspection of two of these pathways - selenoamino acid metabolism and steroid biosynthesis - illustrated clear diet
DGCA: A comprehensive R package for Differential Gene Correlation Analysis.

PubMed

McKenzie, Andrew T; Katsyv, Igor; Song, Won-Min; Wang, Minghui; Zhang, Bin

2016-11-15

Dissecting the regulatory relationships between genes is a critical step towards building accurate predictive models of biological systems. A powerful approach towards this end is to systematically study the differences in correlation between gene pairs in more than one distinct condition. In this study we develop an R package, DGCA (for Differential Gene Correlation Analysis), which offers a suite of tools for computing and analyzing differential correlations between gene pairs across multiple conditions. To minimize parametric assumptions, DGCA computes empirical p-values via permutation testing. To understand differential correlations at a systems level, DGCA performs higher-order analyses such as measuring the average difference in correlation and multiscale clustering analysis of differential correlation networks. Through a simulation study, we show that the straightforward z-score based method that DGCA employs significantly outperforms the existing alternative methods for calculating differential correlation. Application of DGCA to the TCGA RNA-seq data in breast cancer not only identifies key changes in the regulatory relationships between TP53 and PTEN and their target genes in the presence of inactivating mutations, but also reveals an immune-related differential correlation module that is specific to triple negative breast cancer (TNBC). DGCA is an R package for systematically assessing the difference in gene-gene regulatory relationships under different conditions. This user-friendly, effective, and comprehensive software tool will greatly facilitate the application of differential correlation analysis in many biological studies and thus will help identification of novel signaling pathways, biomarkers, and targets in complex biological systems and diseases.
Analysis of functional importance of binding sites in the Drosophila gap gene network model.

PubMed

Kozlov, Konstantin; Gursky, Vitaly V; Kulakovskiy, Ivan V; Dymova, Arina; Samsonova, Maria

2015-01-01

The statistical thermodynamics based approach provides a promising framework for construction of the genotype-phenotype map in many biological systems. Among important aspects of a good model connecting the DNA sequence information with that of a molecular phenotype (gene expression) is the selection of regulatory interactions and relevant transcription factor bindings sites. As the model may predict different levels of the functional importance of specific binding sites in different genomic and regulatory contexts, it is essential to formulate and study such models under different modeling assumptions. We elaborate a two-layer model for the Drosophila gap gene network and include in the model a combined set of transcription factor binding sites and concentration dependent regulatory interaction between gap genes hunchback and Kruppel. We show that the new variants of the model are more consistent in terms of gene expression predictions for various genetic constructs in comparison to previous work. We quantify the functional importance of binding sites by calculating their impact on gene expression in the model and calculate how these impacts correlate across all sites under different modeling assumptions. The assumption about the dual interaction between hb and Kr leads to the most consistent modeling results, but, on the other hand, may obscure existence of indirect interactions between binding sites in regulatory regions of distinct genes. The analysis confirms the previously formulated regulation concept of many weak binding sites working in concert. The model predicts a more or less uniform distribution of functionally important binding sites over the sets of experimentally characterized regulatory modules and other open chromatin domains.
Replication of 6 obesity genes in a meta-analysis of genome-wide association studies from diverse ancestries.

PubMed

Tan, Li-Jun; Zhu, Hu; He, Hao; Wu, Ke-Hao; Li, Jian; Chen, Xiang-Ding; Zhang, Ji-Gang; Shen, Hui; Tian, Qing; Krousel-Wood, Marie; Papasian, Christopher J; Bouchard, Claude; Pérusse, Louis; Deng, Hong-Wen

2014-01-01

Obesity is a major public health problem with a significant genetic component. Multiple DNA polymorphisms/genes have been shown to be strongly associated with obesity, typically in populations of European descent. The aim of this study was to verify the extent to which 6 confirmed obesity genes (FTO, CTNNBL1, ADRB2, LEPR, PPARG and UCP2 genes) could be replicated in 8 different samples (n = 11,161) and to explore whether the same genes contribute to obesity-susceptibility in populations of different ancestries (five Caucasian, one Chinese, one African-American and one Hispanic population). GWAS-based data sets with 1000 G imputed variants were tested for association with obesity phenotypes individually in each population, and subsequently combined in a meta-analysis. Multiple variants at the FTO locus showed significant associations with BMI, fat mass (FM) and percentage of body fat (PBF) in meta-analysis. The strongest association was detected at rs7185735 (P-value = 1.01×10(-7) for BMI, 1.80×10(-6) for FM, and 5.29×10(-4) for PBF). Variants at the CTNNBL1, LEPR and PPARG loci demonstrated nominal association with obesity phenotypes (meta-analysis P-values ranging from 1.15×10(-3) to 4.94×10(-2)). There was no evidence of association with variants at ADRB2 and UCP2 genes. When stratified by sex and ethnicity, FTO variants showed sex-specific and ethnic-specific effects on obesity traits. Thus, it is likely that FTO has an important role in the sex- and ethnic-specific risk of obesity. Our data confirmed the role of FTO, CTNNBL1, LEPR and PPARG in obesity predisposition. These findings enhanced our knowledge of genetic associations between these genes and obesity-related phenotypes, and provided further justification for pursuing functional studies of these genes in the pathophysiology of obesity. Sex and ethnic differences in genetic susceptibility across populations of diverse ancestries may contribute to a more targeted prevention and customized
System Biology Approach: Gene Network Analysis for Muscular Dystrophy.

PubMed

Censi, Federica; Calcagnini, Giovanni; Mattei, Eugenio; Giuliani, Alessandro

2018-01-01

Phenotypic changes at different organization levels from cell to entire organism are associated to changes in the pattern of gene expression. These changes involve the entire genome expression pattern and heavily rely upon correlation patterns among genes. The classical approach used to analyze gene expression data builds upon the application of supervised statistical techniques to detect genes differentially expressed among two or more phenotypes (e.g., normal vs. disease). The use of an a posteriori, unsupervised approach based on principal component analysis (PCA) and the subsequent construction of gene correlation networks can shed a light on unexpected behaviour of gene regulation system while maintaining a more naturalistic view on the studied system.In this chapter we applied an unsupervised method to discriminate DMD patient and controls. The genes having the highest absolute scores in the discrimination between the groups were then analyzed in terms of gene expression networks, on the basis of their mutual correlation in the two groups. The correlation network structures suggest two different modes of gene regulation in the two groups, reminiscent of important aspects of DMD pathogenesis.
Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project

PubMed Central

Horton, Roger; Gibson, Richard; Coggill, Penny; Miretti, Marcos; Allcock, Richard J.; Almeida, Jeff; Forbes, Simon; Gilbert, James G. R.; Halls, Karen; Harrow, Jennifer L.; Hart, Elizabeth; Howe, Kevin; Jackson, David K.; Palmer, Sophie; Roberts, Anne N.; Sims, Sarah; Stewart, C. Andrew; Traherne, James A.; Trevanion, Steve; Wilming, Laurens; Rogers, Jane; de Jong, Pieter J.; Elliott, John F.; Sawcer, Stephen; Todd, John A.; Trowsdale, John

2008-01-01

The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine. PMID:18193213
Bacterial responses to antibiotics and their combinations.

PubMed

Mitosch, Karin; Bollenbach, Tobias

2014-12-01

Antibiotics affect bacterial cell physiology at many levels. Rather than just compensating for the direct cellular defects caused by the drug, bacteria respond to antibiotics by changing their morphology, macromolecular composition, metabolism, gene expression and possibly even their mutation rate. Inevitably, these processes affect each other, resulting in a complex response with changes in the expression of numerous genes. Genome-wide approaches can thus help in gaining a comprehensive understanding of bacterial responses to antibiotics. In addition, a combination of experimental and theoretical approaches is needed for identifying general principles that underlie these responses. Here, we review recent progress in our understanding of bacterial responses to antibiotics and their combinations, focusing on effects at the levels of growth rate and gene expression. We concentrate on studies performed in controlled laboratory conditions, which combine promising experimental techniques with quantitative data analysis and mathematical modeling. While these basic research approaches are not immediately applicable in the clinic, uncovering the principles and mechanisms underlying bacterial responses to antibiotics may, in the long term, contribute to the development of new treatment strategies to cope with and prevent the rise of resistant pathogenic bacteria.
SemanticSCo: A platform to support the semantic composition of services for gene expression analysis.

PubMed

Guardia, Gabriela D A; Ferreira Pires, Luís; da Silva, Eduardo G; de Farias, Cléver R G

2017-02-01

Gene expression studies often require the combined use of a number of analysis tools. However, manual integration of analysis tools can be cumbersome and error prone. To support a higher level of automation in the integration process, efforts have been made in the biomedical domain towards the development of semantic web services and supporting composition environments. Yet, most environments consider only the execution of simple service behaviours and requires users to focus on technical details of the composition process. We propose a novel approach to the semantic composition of gene expression analysis services that addresses the shortcomings of the existing solutions. Our approach includes an architecture designed to support the service composition process for gene expression analysis, and a flexible strategy for the (semi) automatic composition of semantic web services. Finally, we implement a supporting platform called SemanticSCo to realize the proposed composition approach and demonstrate its functionality by successfully reproducing a microarray study documented in the literature. The SemanticSCo platform provides support for the composition of RESTful web services semantically annotated using SAWSDL. Our platform also supports the definition of constraints/conditions regarding the order in which service operations should be invoked, thus enabling the definition of complex service behaviours. Our proposed solution for semantic web service composition takes into account the requirements of different stakeholders and addresses all phases of the service composition process. It also provides support for the definition of analysis workflows at a high-level of abstraction, thus enabling users to focus on biological research issues rather than on the technical details of the composition process. The SemanticSCo source code is available at https://github.com/usplssb/SemanticSCo. Copyright Â© 2017 Elsevier Inc. All rights reserved.
Mumps virus F gene and HN gene sequencing as a molecular tool to study mumps virus transmission.

PubMed

Gouma, Sigrid; Cremer, Jeroen; Parkkali, Saara; Veldhuijzen, Irene; van Binnendijk, Rob S; Koopmans, Marion P G

2016-11-01

Various mumps outbreaks have occurred in the Netherlands since 2004, particularly among persons who had received 2 doses of measles, mumps, and rubella (MMR) vaccination. Genomic typing of pathogens can be used to track outbreaks, but the established genotyping of mumps virus based on the small hydrophobic (SH) gene sequences did not provide sufficient resolution. Therefore, we expanded the sequencing to include fusion (F) gene and haemagglutinin-neuraminidase (HN) gene sequences in addition to the SH gene sequences from 109 mumps virus genotype G strains obtained between 2004 and mid 2015 in the Netherlands. When the molecular information from these 3 genes was combined, we were able to identify separate mumps virus clusters and track mumps virus transmission. The analyses suggested that multiple mumps virus introductions occurred in the Netherlands between 2004 and 2015 resulting in several mumps outbreaks throughout this period, whereas during some local outbreaks the molecular data pointed towards endemic circulation. Combined analysis of epidemiological data and sequence data collected in 2015 showed good support for the phylogenetic clustering. Copyright Â© 2016 Elsevier B.V. All rights reserved.
An Estimation of Erinaceidae Phylogeny: A Combined Analysis Approach

PubMed Central

Yamaguchi, Nobuyuki; Ai, Huai-Sen; Wang, Ying-Xiang; Zhang, Ya-Ping; Jiang, Xue-Long

2012-01-01

Background Erinaceidae is a family of small mammals that include the spiny hedgehogs (Erinaceinae) and the silky-furred moonrats and gymnures (Galericinae). These animals are widely distributed across Eurasia and Africa, from the tundra to the tropics and the deserts to damp forests. The importance of these animals lies in the fact that they are the oldest known living placental mammals, which are well represented in the fossil record, a rarity fact given their size and vulnerability to destruction during fossilization. Although the Family has been well studied, their phylogenetic relationships remain controversial. To test previous phylogenetic hypotheses, we combined molecular and morphological data sets, including representatives of all the genera. Methodology and Principal Findings We included in the analyses 3,218 bp mitochondrial genes, one hundred and thirty-five morphological characters, twenty-two extant erinaceid taxa, and five outgroup taxa. Phylogenetic relationships were reconstructed using both partitioned and combined data sets. As in previous analyses, our results strongly support the monophyly of both subfamilies (Galericinae and Erinaceinae), the Hylomys group (to include Neotetracus and Neohylomys), and a sister-relationship of Atelerix and Erinaceus. As well, we verified that the extremely long branch lengths within the Galericinae are consistent with their fossil records. Not surprisingly, we found significant incongruence between the phylogenetic signals of the genes and the morphological characters, specifically in the case of Hylomys parvus, Mesechinus, and relationships between Hemiechinus and Paraechinus. Conclusions Although we discovered new clues to understanding the evolutionary relationships within the Erinaceidae, our results nonetheless, strongly suggest that more robust analyses employing more complete taxon sampling (to include fossils) and multiple unlinked genes would greatly enhance our understanding of the Erinaceidae. Until
Meta-analysis of cancer gene expression signatures reveals new cancer genes, SAGE tags and tumor associated regions of co-regulation

PubMed Central

Kavak, Erşen; Ünlü, Mustafa; Nistér, Monica; Koman, Ahmet

2010-01-01

Cancer is among the major causes of human death and its mechanism(s) are not fully understood. We applied a novel meta-analysis approach to multiple sets of merged serial analysis of gene expression and microarray cancer data in order to analyze transcriptome alterations in human cancer. Our methodology, which we denote ‘COgnate Gene Expression patterNing in tumours’ (COGENT), unmasked numerous genes that were differentially expressed in multiple cancers. COGENT detected well-known tumor-associated (TA) genes such as TP53, EGFR and VEGF, as well as many multi-cancer, but not-yet-tumor-associated genes. In addition, we identified 81 co-regulated regions on the human genome (RIDGEs) by using expression data from all cancers. Some RIDGEs (28%) consist of paralog genes while another subset (30%) are specifically dysregulated in tumors but not in normal tissues. Furthermore, a significant number of RIDGEs are associated with GC-rich regions on the genome. All assembled data is freely available online (www.oncoreveal.org) as a tool implementing COGENT analysis of multi-cancer genes and RIDGEs. These findings engender a deeper understanding of cancer biology by demonstrating the existence of a pool of under-studied multi-cancer genes and by highlighting the cancer-specificity of some TA-RIDGEs. PMID:20621981
Combining Gene and Stem Cell Therapy for Peripheral Nerve Tissue Engineering.

PubMed

Busuttil, Francesca; Rahim, Ahad A; Phillips, James B

2017-02-15

Despite a substantially increased understanding of neuropathophysiology, insufficient functional recovery after peripheral nerve injury remains a significant clinical challenge. Nerve regeneration following injury is dependent on Schwann cells, the supporting cells in the peripheral nervous system. Following nerve injury, Schwann cells adopt a proregenerative phenotype, which supports and guides regenerating nerves. However, this phenotype may not persist long enough to ensure functional recovery. Tissue-engineered nerve repair devices containing therapeutic cells that maintain the appropriate phenotype may help enhance nerve regeneration. The combination of gene and cell therapy is an emerging experimental strategy that seeks to provide the optimal environment for axonal regeneration and reestablishment of functional circuits. This review aims to summarize current preclinical evidence with potential for future translation from bench to bedside.
Genome-scale analysis of positionally relocated genes

PubMed Central

Bhutkar, Arjun; Russo, Susan M.; Smith, Temple F.; Gelbart, William M.

2007-01-01

During evolution, genome reorganization includes large-scale events such as inversions, translocations, and segmental or even whole-genome duplications, as well as fine-scale events such as the relocation of individual genes. This latter category, which we will refer to as positionally relocated genes (PRGs), is the subject of this report. Assessment of the magnitude of such PRGs and of possible contributing mechanisms is aided by a comparative analysis of related genomes, where conserved chromosomal organization can aid in identifying genes that have acquired a new location in a lineage of these genomes. Here we utilize two methods to comprehensively identify relocated protein-coding genes in the recently sequenced genomes of 12 species of genus Drosophila. We use exceptions to the general rule of maintenance of chromosome arm (Muller element) association for most Drosophila genes to identify one major class of PRGs. We also identify a partially overlapping set of PRGs among “embedded genes,” located within the extents of other surrounding genes. We provide evidence that PRG movements have at least two different origins: Some events occur via retrotransposition of processed RNAs and others via a DNA-based transposition mechanism. Overall, we identify several hundred PRGs that arose within a lineage of the genus Drosophila phylogeny and provide suggestive evidence that a few thousand such events have occurred within the radiation of the insect order Diptera, thereby illustrating the magnitude of the contribution of PRG movement to chromosomal reorganization during evolution. PMID:17989252

Xylella fastidiosa gene expression analysis by DNA microarrays.

PubMed

Travensolo, Regiane F; Carareto-Alves, Lucia M; Costa, Maria V C G; Lopes, Tiago J S; Carrilho, Emanuel; Lemos, Eliana G M

2009-04-01

Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM(2) and liquid BCYE). All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others). The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.
Development of Molecular Markers Linked to Powdery Mildew Resistance Gene Pm4b by Combining SNP Discovery from Transcriptome Sequencing Data with Bulked Segregant Analysis (BSR-Seq) in Wheat.

PubMed

Wu, Peipei; Xie, Jingzhong; Hu, Jinghuang; Qiu, Dan; Liu, Zhiyong; Li, Jingting; Li, Miaomiao; Zhang, Hongjun; Yang, Li; Liu, Hongwei; Zhou, Yang; Zhang, Zhongjun; Li, Hongjie

2018-01-01

Powdery mildew resistance gene Pm4b , originating from Triticum persicum , is effective against the prevalent Blumeria graminis f. sp. tritici ( Bgt ) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F 2:3 mapping population (237 families) derived from a pair of isogenic lines VPM1/7 ∗ Bainong 3217 F 4 (carrying Pm4b ) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F 2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. Four Pm4b -linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13 , Xics43 , and Xics76 , were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with
Development of Molecular Markers Linked to Powdery Mildew Resistance Gene Pm4b by Combining SNP Discovery from Transcriptome Sequencing Data with Bulked Segregant Analysis (BSR-Seq) in Wheat

PubMed Central

Wu, Peipei; Xie, Jingzhong; Hu, Jinghuang; Qiu, Dan; Liu, Zhiyong; Li, Jingting; Li, Miaomiao; Zhang, Hongjun; Yang, Li; Liu, Hongwei; Zhou, Yang; Zhang, Zhongjun; Li, Hongjie

2018-01-01

Powdery mildew resistance gene Pm4b, originating from Triticum persicum, is effective against the prevalent Blumeria graminis f. sp. tritici (Bgt) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F2:3 mapping population (237 families) derived from a pair of isogenic lines VPM1/7∗Bainong 3217 F4 (carrying Pm4b) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. Four Pm4b-linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13, Xics43, and Xics76, were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with disease
The human MCP-2 gene (SCYA8): Cloning, sequence analysis, tissue expression, and assignment to the CC chemokine gene contig on chromosome 17q11.2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Van Coillie, E.; Fiten, P.; Van Damme, J.

1997-03-01

Monocyte chemotactic proteins (MCPs) form a subfamily of chemokines that recruit leukocytes to sites of inflammation and that may contribute to tumor-associated leukocyte infiltration and to the antiviral state against HIV infection. With the use of degenerate primers that were based on CC chemokine consensus sequences, the known MIP-1{alpha}/LD78{alpha}, MCP-1, and MCP-3 genes and the previously unidentified eotaxin and MCP-2 genes were isolated from a YAC contig from human chromosome 17q11.2. The amplified genomic MCP-2 fragment was used to isolate an MCP-2 cosmid from which the gene sequence was determined. The MCP-2 gene shares with the MCP-1 and MCP-3 genesmore » a conserved intron-exon structure and a coding nucleotide sequence homology of 77%. By Northern blot analysis the 1.0-kb MCP-2 mRNA was predominantly detectable in the small intestine, peripheral blood, heart, placenta, lung, skeletal muscle, ovary, colon, spinal cord, pancreas, and thymus. Transcripts of 1.5 and 2.4 kb were found in the testis, the small intestine, and the colon. The isolation of the MCP-2 gene from the chemokine contig localized it on YAC clones of chromosome 17q11.2, which also contain the eotaxin, MCP-1, MCP-3, and NCC-1/MCP-4 genes. The combination of using degenerate primer PCR and YACs illustrates that novel genes can efficiently be isolated from gene cluster contigs with less redundancy and effort than the isolation of novel ESTs. 42 refs., 5 figs., 2 tabs.« less
Combining Static Analysis and Model Checking for Software Analysis

NASA Technical Reports Server (NTRS)

Brat, Guillaume; Visser, Willem; Clancy, Daniel (Technical Monitor)

2003-01-01

We present an iterative technique in which model checking and static analysis are combined to verify large software systems. The role of the static analysis is to compute partial order information which the model checker uses to reduce the state space. During exploration, the model checker also computes aliasing information that it gives to the static analyzer which can then refine its analysis. The result of this refined analysis is then fed back to the model checker which updates its partial order reduction. At each step of this iterative process, the static analysis computes optimistic information which results in an unsafe reduction of the state space. However we show that the process converges to a fired point at which time the partial order information is safe and the whole state space is explored.
Antitumor Effects of Epidrug/IFNα Combination Driven by Modulated Gene Signatures in Both Colorectal Cancer and Dendritic Cells.

PubMed

Fragale, Alessandra; Romagnoli, Giulia; Licursi, Valerio; Buoncervello, Maria; Del Vecchio, Giorgia; Giuliani, Caterina; Parlato, Stefania; Leone, Celeste; De Angelis, Marta; Canini, Irene; Toschi, Elena; Belardelli, Filippo; Negri, Rodolfo; Capone, Imerio; Presutti, Carlo; Gabriele, Lucia

2017-07-01

Colorectal cancer results from the progressive accumulation of genetic and epigenetic alterations. IFN signaling defects play an important role in the carcinogenesis process, in which the inability of IFN transcription regulatory factors (IRF) to access regulatory sequences in IFN-stimulated genes (ISG) in tumors and in immune cells may be pivotal. We reported that low-dose combination of two FDA-approved epidrugs, azacytidine (A) and romidepsin (R), with IFNα2 (ARI) hampers the aggressiveness of both colorectal cancer metastatic and stem cells in vivo and triggers immunogenic cell death signals that stimulate dendritic cell (DC) function. Here, we investigated the molecular signals induced by ARI treatment and found that this drug combination increased the accessibility to regulatory sequences of ISGs and IRFs that were epigenetically silenced in both colorectal cancer cells and DCs. Likewise, specific ARI-induced histone methylation and acetylation changes marked epigenetically affected ISG promoters in both metastatic cancer cells and DCs. Analysis by ChIP-seq confirmed such ARI-induced epigenetically regulated IFN signature. The activation of this signal endowed DCs with a marked migratory capability. Our results establish a direct correlation between reexpression of silenced ISGs by epigenetic control and ARI anticancer activity and provide new knowledge for the development of innovative combined therapeutic strategies for colorectal cancer. Cancer Immunol Res; 5(7); 604-16. ©2017 AACR . ©2017 American Association for Cancer Research.
Integrated Computational Analysis of Genes Associated with Human Hereditary Insensitivity to Pain. A Drug Repurposing Perspective

PubMed Central

Lötsch, Jörn; Lippmann, Catharina; Kringel, Dario; Ultsch, Alfred

2017-01-01

Genes causally involved in human insensitivity to pain provide a unique molecular source of studying the pathophysiology of pain and the development of novel analgesic drugs. The increasing availability of “big data” enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 20 genes causally involved in human hereditary insensitivity to pain with the knowledge about the functions of thousands of genes. An integrated computational analysis proposed that among the functions of this set of genes, the processes related to nervous system development and to ceramide and sphingosine signaling pathways are particularly important. This is in line with earlier suggestions to use these pathways as therapeutic target in pain. Following identification of the biological processes characterizing hereditary insensitivity to pain, the biological processes were used for a similarity analysis with the functions of n = 4,834 database-queried drugs. Using emergent self-organizing maps, a cluster of n = 22 drugs was identified sharing important functional features with hereditary insensitivity to pain. Several members of this cluster had been implicated in pain in preclinical experiments. Thus, the present concept of machine-learned knowledge discovery for pain research provides biologically plausible results and seems to be suitable for drug discovery by identifying a narrow choice of repurposing candidates, demonstrating that contemporary machine-learned methods offer innovative approaches to knowledge discovery from available evidence. PMID:28848388
A new method for detection and discrimination of Pepino mosaic virus isolates using high resolution melting analysis of the triple gene block 3.

PubMed

Hasiów-Jaroszewska, Beata; Komorowska, Beata

2013-10-01

Diagnostic methods distinguished different Pepino mosaic virus (PepMV) genotypes but the methods do not detect sequence variation in particular gene segments. The necrotic and non-necrotic isolates (pathotypes) of PepMV share a 99% sequence similarity. These isolates differ from each other at one nucleotide site in the triple gene block 3. In this study, a combination of real-time reverse transcription polymerase chain reaction and high resolution melting curve analysis of triple gene block 3 was developed for simultaneous detection and differentiation of PepMV pathotypes. The triple gene block 3 region carrying a transition A → G was amplified using two primer pairs from twelve virus isolates, and was subjected to high resolution melting curve analysis. The results showed two distinct melting curve profiles related to each pathotype. The results also indicated that the high resolution melting method could readily differentiate between necrotic and non-necrotic PepMV pathotypes. Copyright © 2013 Elsevier B.V. All rights reserved.
Separate and combined effects of genetic variants and pre-treatment whole blood gene expression on response to exposure-based cognitive behavioural therapy for anxiety disorders.

PubMed

Coleman, Jonathan R I; Lester, Kathryn J; Roberts, Susanna; Keers, Robert; Lee, Sang Hyuck; De Jong, Simone; Gaspar, Héléna; Teismann, Tobias; Wannemüller, André; Schneider, Silvia; Jöhren, Peter; Margraf, Jürgen; Breen, Gerome; Eley, Thalia C

2017-04-01

Exposure-based cognitive behavioural therapy (eCBT) is an effective treatment for anxiety disorders. Response varies between individuals. Gene expression integrates genetic and environmental influences. We analysed the effect of gene expression and genetic markers separately and together on treatment response. Adult participants (n ≤ 181) diagnosed with panic disorder or a specific phobia underwent eCBT as part of standard care. Percentage decrease in the Clinical Global Impression severity rating was assessed across treatment, and between baseline and a 6-month follow-up. Associations with treatment response were assessed using expression data from 3,233 probes, and expression profiles clustered in a data- and literature-driven manner. A total of 3,343,497 genetic variants were used to predict treatment response alone and combined in polygenic risk scores. Genotype and expression data were combined in expression quantitative trait loci (eQTL) analyses. Expression levels were not associated with either treatment phenotype in any analysis. A total of 1,492 eQTLs were identified with q < 0.05, but interactions between genetic variants and treatment response did not affect expression levels significantly. Genetic variants did not significantly predict treatment response alone or in polygenic risk scores. We assessed gene expression alone and alongside genetic variants. No associations with treatment outcome were identified. Future studies require larger sample sizes to discover associations.
The banana E2 gene family: Genomic identification, characterization, expression profiling analysis.

PubMed

Dong, Chen; Hu, Huigang; Jue, Dengwei; Zhao, Qiufang; Chen, Hongliang; Xie, Jianghui; Jia, Liqiang

2016-04-01

The E2 is at the center of a cascade of Ub1 transfers, and it links activation of the Ub1 by E1 to its eventual E3-catalyzed attachment to substrate. Although the genome-wide analysis of this family has been performed in some species, little is known about analysis of E2 genes in banana. In this study, 74 E2 genes of banana were identified and phylogenetically clustered into thirteen subgroups. The predicted banana E2 genes were distributed across all 11 chromosomes at different densities. Additionally, the E2 domain, gene structure and motif compositions were analyzed. The expression of all of the banana E2 genes was analyzed in the root, stem, leaf, flower organs, five stages of fruit development and under abiotic stresses. All of the banana E2 genes, with the exception of few genes in each group, were expressed in at least one of the organs and fruit developments, which indicated that the E2 genes might involve in various aspects of the physiological and developmental processes of the banana. Quantitative RT-PCR (qRT-PCR) analysis identified that 45 E2s under drought and 33 E2s under salt were induced. To the best of our knowledge, this report describes the first genome-wide analysis of the banana E2 gene family, and the results should provide valuable information for understanding the classification, cloning and putative functions of this family. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
GWATCH: a web platform for automated gene association discovery analysis.

PubMed

Svitin, Anton; Malov, Sergey; Cherkasov, Nikolay; Geerts, Paul; Rotkevich, Mikhail; Dobrynin, Pavel; Shevchenko, Andrey; Guan, Li; Troyer, Jennifer; Hendrickson, Sher; Dilks, Holli Hutcheson; Oleksyk, Taras K; Donfield, Sharyne; Gomperts, Edward; Jabs, Douglas A; Sezgin, Efe; Van Natta, Mark; Harrigan, P Richard; Brumme, Zabrina L; O'Brien, Stephen J

2014-01-01

As genome-wide sequence analyses for complex human disease determinants are expanding, it is increasingly necessary to develop strategies to promote discovery and validation of potential disease-gene associations. Here we present a dynamic web-based platform - GWATCH - that automates and facilitates four steps in genetic epidemiological discovery: 1) Rapid gene association search and discovery analysis of large genome-wide datasets; 2) Expanded visual display of gene associations for genome-wide variants (SNPs, indels, CNVs), including Manhattan plots, 2D and 3D snapshots of any gene region, and a dynamic genome browser illustrating gene association chromosomal regions; 3) Real-time validation/replication of candidate or putative genes suggested from other sources, limiting Bonferroni genome-wide association study (GWAS) penalties; 4) Open data release and sharing by eliminating privacy constraints (The National Human Genome Research Institute (NHGRI) Institutional Review Board (IRB), informed consent, The Health Insurance Portability and Accountability Act (HIPAA) of 1996 etc.) on unabridged results, which allows for open access comparative and meta-analysis. GWATCH is suitable for both GWAS and whole genome sequence association datasets. We illustrate the utility of GWATCH with three large genome-wide association studies for HIV-AIDS resistance genes screened in large multicenter cohorts; however, association datasets from any study can be uploaded and analyzed by GWATCH.
Overexpression of the Squalene Epoxidase Gene Alone and in Combination with the 3-Hydroxy-3-methylglutaryl Coenzyme A Gene Increases Ganoderic Acid Production in Ganoderma lingzhi.

PubMed

Zhang, De-Huai; Jiang, Lu-Xi; Li, Na; Yu, Xuya; Zhao, Peng; Li, Tao; Xu, Jun-Wei

2017-06-14

The squalene epoxidase (SE) gene from the biosynthetic pathway of ganoderic acid (GA) was cloned and overexpressed in Ganoderma lingzhi. The strain that overexpressed the SE produced approximately 2 times more GA molecules than the wild-type (WT) strain. Moreover, SE overexpression upregulated lanosterol synthase gene expression in the biosynthetic pathway. These results indicated that SE stimulates GA accumulation. Then, the SE and 3-hydroxy-3-methylglutaryl coenzyme A (HMGR) genes were simultaneously overexpressed in G. lingzhi. Compared with the individual overexpression of SE or HMGR, the combined overexpression of the two genes further enhanced individual GA production. The overexpressing strain produced maximum GA-T, GA-S, GA-Mk, and GA-Me contents of 90.4 ± 7.5, 35.9 ± 5.4, 6.2 ± 0.5, and 61.8 ± 5.8 μg/100 mg dry weight, respectively. These values were 5.9, 4.5, 2.4, and 5.8 times higher than those produced by the WT strain. This is the first example of the successful manipulation of multiple biosynthetic genes to improve GA content in G. lingzhi.
Pharmacogenomic Characterization and Isobologram Analysis of the Combination of Ascorbic Acid and Curcumin—Two Main Metabolites of Curcuma longa—in Cancer Cells

PubMed Central

Ooko, Edna; Kadioglu, Onat; Greten, Henry J.; Efferth, Thomas

2017-01-01

Curcuma longa has long been used in China and India as anti-inflammatory agent to treat a wide variety of conditions and also as a spice for varied curry preparations. The chemoprofile of the Curcuma species exhibits the presence of varied phytochemicals with curcumin being present in all three species but AA only being shown in C. longa. This study explored the effect of a curcumin/AA combination on human cancer cell lines. The curcumin/AA combination was assessed by isobologram analysis using the Loewe additivity drug interaction model. The drug combination showed additive cytotoxicity toward CCRF-CEM and CEM/ADR5000 leukemia cell lines and HCT116p53+/+ and HCT116p53−/− colon cancer cell line, while the glioblastoma cell lines U87MG and U87MG.ΔEGFR showed additive to supra-additive cytotoxicity. Gene expression profiles predicting sensitivity and resistance of tumor cells to induction by curcumin and AA were determined by microarray-based mRNA expressions, COMPARE, and hierarchical cluster analyses. Numerous genes involved in transcription (TFAM, TCERG1, RGS13, C11orf31), apoptosis-regulation (CRADD, CDK7, CDK19, CD81, TOM1) signal transduction (NR1D2, HMGN1, ABCA1, DE4ND4B, TRIM27) DNA repair (TOPBP1, RPA2), mRNA metabolism (RBBP4, HNRNPR, SRSF4, NR2F2, PDK1, TGM2), and transporter genes (ABCA1) correlated with cellular responsiveness to curcumin and ascorbic acid. In conclusion, this study shows the effect of the curcumin/AA combination and identifies several candidate genes that may regulate the response of varied cancer cells to curcumin and AA. PMID:28210221
Differential analysis between somatic mutation and germline variation profiles reveals cancer-related genes.

PubMed

Przytycki, Pawel F; Singh, Mona

2017-08-25

A major aim of cancer genomics is to pinpoint which somatically mutated genes are involved in tumor initiation and progression. We introduce a new framework for uncovering cancer genes, differential mutation analysis, which compares the mutational profiles of genes across cancer genomes with their natural germline variation across healthy individuals. We present DiffMut, a fast and simple approach for differential mutational analysis, and demonstrate that it is more effective in discovering cancer genes than considerably more sophisticated approaches. We conclude that germline variation across healthy human genomes provides a powerful means for characterizing somatic mutation frequency and identifying cancer driver genes. DiffMut is available at https://github.com/Singh-Lab/Differential-Mutation-Analysis .
The Association of Multiple Interacting Genes with Specific Phenotypes in Rice Using Gene Coexpression Networks1[C][W][OA

PubMed Central

Ficklin, Stephen P.; Luo, Feng; Feltus, F. Alex

2010-01-01

Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes. PMID:20668062
Characterization of an Artificial Swine-Origin Influenza Virus with the Same Gene Combination as H1N1/2009 Virus: A Genesis Clue of Pandemic Strain

PubMed Central

Pu, Juan; Fan, Lihong; Shi, Weimin; Hu, Yanxin; Yang, Jun; Xu, Qi; Wang, Jingjing; Hou, Dongjun; Ma, Guangpeng; Liu, Jinhua

2011-01-01

Pandemic H1N1/2009 influenza virus, derived from a reassortment of avian, human, and swine influenza viruses, possesses a unique gene segment combination that had not been detected previously in animal and human populations. Whether such a gene combination could result in the pathogenicity and transmission as H1N1/2009 virus remains unclear. In the present study, we used reverse genetics to construct a reassortant virus (rH1N1) with the same gene combination as H1N1/2009 virus (NA and M genes from a Eurasian avian-like H1N1 swine virus and another six genes from a North American triple-reassortant H1N2 swine virus). Characterization of rH1N1 in mice showed that this virus had higher replicability and pathogenicity than those of the seasonal human H1N1 and Eurasian avian-like swine H1N1 viruses, but was similar to the H1N1/2009 and triple-reassortant H1N2 viruses. Experiments performed on guinea pigs showed that rH1N1 was not transmissible, whereas pandemic H1N1/2009 displayed efficient transmissibility. To further determine which gene segment played a key role in transmissibility, we constructed a series of reassortants derived from rH1N1 and H1N1/2009 viruses. Direct contact transmission studies demonstrated that the HA and NS genes contributed to the transmission of H1N1/2009 virus. Second, the HA gene of H1N1/2009 virus, when combined with the H1N1/2009 NA gene, conferred efficient contact transmission among guinea pigs. The present results reveal that not only gene segment reassortment but also amino acid mutation were needed for the generation of the pandemic influenza virus. PMID:21799774
Characterization of an artificial swine-origin influenza virus with the same gene combination as H1N1/2009 virus: a genesis clue of pandemic strain.

PubMed

Zhao, Xueli; Sun, Yipeng; Pu, Juan; Fan, Lihong; Shi, Weimin; Hu, Yanxin; Yang, Jun; Xu, Qi; Wang, Jingjing; Hou, Dongjun; Ma, Guangpeng; Liu, Jinhua

2011-01-01

Pandemic H1N1/2009 influenza virus, derived from a reassortment of avian, human, and swine influenza viruses, possesses a unique gene segment combination that had not been detected previously in animal and human populations. Whether such a gene combination could result in the pathogenicity and transmission as H1N1/2009 virus remains unclear. In the present study, we used reverse genetics to construct a reassortant virus (rH1N1) with the same gene combination as H1N1/2009 virus (NA and M genes from a Eurasian avian-like H1N1 swine virus and another six genes from a North American triple-reassortant H1N2 swine virus). Characterization of rH1N1 in mice showed that this virus had higher replicability and pathogenicity than those of the seasonal human H1N1 and Eurasian avian-like swine H1N1 viruses, but was similar to the H1N1/2009 and triple-reassortant H1N2 viruses. Experiments performed on guinea pigs showed that rH1N1 was not transmissible, whereas pandemic H1N1/2009 displayed efficient transmissibility. To further determine which gene segment played a key role in transmissibility, we constructed a series of reassortants derived from rH1N1 and H1N1/2009 viruses. Direct contact transmission studies demonstrated that the HA and NS genes contributed to the transmission of H1N1/2009 virus. Second, the HA gene of H1N1/2009 virus, when combined with the H1N1/2009 NA gene, conferred efficient contact transmission among guinea pigs. The present results reveal that not only gene segment reassortment but also amino acid mutation were needed for the generation of the pandemic influenza virus.
A global analysis of protein expression profiles in Sinorhizobium meliloti: discovery of new genes for nodule occupancy and stress adaptation.

PubMed

Djordjevic, Michael A; Chen, Han Cai; Natera, Siria; Van Noorden, Giel; Menzel, Christian; Taylor, Scott; Renard, Clotilde; Geiger, Otto; Weiller, Georg F

2003-06-01

A proteomic examination of Sinorhizobium meliloti strain 1021 was undertaken using a combination of 2-D gel electrophoresis, peptide mass fingerprinting, and bioinformatics. Our goal was to identify (i) putative symbiosis- or nutrient-stress-specific proteins, (ii) the biochemical pathways active under different conditions, (iii) potential new genes, and (iv) the extent of posttranslational modifications of S. meliloti proteins. In total, we identified the protein products of 810 genes (13.1% of the genome's coding capacity). The 810 genes generated 1,180 gene products, with chromosomal genes accounting for 78% of the gene products identified (18.8% of the chromosome's coding capacity). The activity of 53 metabolic pathways was inferred from bioinformatic analysis of proteins with assigned Enzyme Commission numbers. Of the remaining proteins that did not encode enzymes, ABC-type transporters composed 12.7% and regulatory proteins 3.4% of the total. Proteins with up to seven transmembrane domains were identified in membrane preparations. A total of 27 putative nodule-specific proteins and 35 nutrient-stress-specific proteins were identified and used as a basis to define genes and describe processes occurring in S. meliloti cells in nodules and under stress. Several nodule proteins from the plant host were present in the nodule bacteria preparations. We also identified seven potentially novel proteins not predicted from the DNA sequence. Post-translational modifications such as N-terminal processing could be inferred from the data. The posttranslational addition of UMP to the key regulator of nitrogen metabolism, PII, was demonstrated. This work demonstrates the utility of combining mass spectrometry with protein arraying or separation techniques to identify candidate genes involved in important biological processes and niche occupations that may be intransigent to other methods of gene expression profiling.
[Detection of novel genetic markers of susceptibility to preeclampsia based on an analysis of the regulatory genes in the placental tissue].

PubMed

Serebrova, V N; Trifonova, E A; Gabidulina, T V; Bukharina, I Yu; Agarkova, T A; Evtushenko, I D; Maksimova, N R; Stepanov, V A

2016-01-01

Regulatory single nucleotide polymorphisms (rSNPs) are the least-studied group of SNP; however, they play an essential role in the development of human pathology by altering the level of candidate genes expression. In this work, we analyzed 29 rSNPs in 17 new candidate genes associated with preeclampsia (PE) according to the analysis of the transcriptome in placental tissue. Three ethnic groups have been studied (yakut, russian, and buryat). We have detected significant associations of PE with eight rSNPs in six differentially expressed genes, i.e., rs10423795 in the LHB gene; rs3771787 in the HK2 gene; rs72959687 in the INHA gene; rs12678229, rs2227262, and rs3802252 in the NDRG1 gene; rs34845949 in the SASH1 gene; and rs66707428 in the PPP1R12C gene. We used a new approach to detecting genetic markers of multifactorial diseases in the case of PE based on a combination of genomic, transcriptomic, and bioinformatic approaches. This approach proved its efficiency and may be applied to detecting new potential genetic markers in genes involved in disease pathogenesis, which reduces missing heritability in multifactorial diseases.
Combining Human Epigenetics and Sleep Studies in Caenorhabditis elegans: A Cross-Species Approach for Finding Conserved Genes Regulating Sleep.

PubMed

Huang, Huiyan; Zhu, Yong; Eliot, Melissa N; Knopik, Valerie S; McGeary, John E; Carskadon, Mary A; Hart, Anne C

2017-06-01

We aimed to test a combined approach to identify conserved genes regulating sleep and to explore the association between DNA methylation and sleep length. We identified candidate genes associated with shorter versus longer sleep duration in college students based on DNA methylation using Illumina Infinium HumanMethylation450 BeadChip arrays. Orthologous genes in Caenorhabditis elegans were identified, and we examined whether their loss of function affected C. elegans sleep. For genes whose perturbation affected C. elegans sleep, we subsequently undertook a small pilot study to re-examine DNA methylation in an independent set of human participants with shorter versus longer sleep durations. Eighty-seven out of 485,577 CpG sites had significant differential methylation in young adults with shorter versus longer sleep duration, corresponding to 52 candidate genes. We identified 34 C. elegans orthologs, including NPY/flp-18 and flp-21, which are known to affect sleep. Loss of five additional genes alters developmentally timed C. elegans sleep (B4GALT6/bre-4, DOCK180/ced-5, GNB2L1/rack-1, PTPRN2/ida-1, ZFYVE28/lst-2). For one of these genes, ZFYVE28 (also known as hLst2), the pilot replication study again found decreased DNA methylation associated with shorter sleep duration at the same two CpG sites in the first intron of ZFYVE28. Using an approach that combines human epigenetics and C. elegans sleep studies, we identified five genes that play previously unidentified roles in C. elegans sleep. We suggest sleep duration in humans may be associated with differential DNA methylation at specific sites and that the conserved genes identified here likely play roles in C. elegans sleep and in other species. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.

[FANCA gene mutation analysis in Fanconi anemia patients].

PubMed

Chen, Fei; Peng, Guang-Jie; Zhang, Kejian; Hu, Qun; Zhang, Liu-Qing; Liu, Ai-Guo

2005-10-01

To screen the FANCA gene mutation and explore the FANCA protein function in Fanconi anemia (FA) patients. FANCA protein expression and its interaction with FANCF were analyzed using Western blot and immunoprecipitation in 3 cases of FA-A. Genomic DNA was used for MLPA analysis followed by sequencing. FANCA protein was undetectable and FANCA and FANCF protein interaction was impaired in these 3 cases of FA-A. Each case of FA-A contained biallelic pathogenic mutations in FANCA gene. No functional FANCA protein was found in these 3 cases of FA-A, and intragenic deletion, frame shift and splice site mutation were the major pathogenic mutations found in FANCA gene.
Molecular, phylogenetic and comparative genomic analysis of the cytokinin oxidase/dehydrogenase gene family in the Poaceae.

PubMed

Mameaux, Sabine; Cockram, James; Thiel, Thomas; Steuernagel, Burkhard; Stein, Nils; Taudien, Stefan; Jack, Peter; Werner, Peter; Gray, John C; Greenland, Andy J; Powell, Wayne

2012-01-01

The genomes of cereals such as wheat (Triticum aestivum) and barley (Hordeum vulgare) are large and therefore problematic for the map-based cloning of agronomicaly important traits. However, comparative approaches within the Poaceae permit transfer of molecular knowledge between species, despite their divergence from a common ancestor sixty million years ago. The finding that null variants of the rice gene cytokinin oxidase/dehydrogenase 2 (OsCKX2) result in large yield increases provides an opportunity to explore whether similar gains could be achieved in other Poaceae members. Here, phylogenetic, molecular and comparative analyses of CKX families in the sequenced grass species rice, brachypodium, sorghum, maize and foxtail millet, as well as members identified from the transcriptomes/genomes of wheat and barley, are presented. Phylogenetic analyses define four Poaceae CKX clades. Comparative analyses showed that CKX phylogenetic groupings can largely be explained by a combination of local gene duplication, and the whole-genome duplication event that predates their speciation. Full-length OsCKX2 homologues in barley (HvCKX2.1, HvCKX2.2) and wheat (TaCKX2.3, TaCKX2.4, TaCKX2.5) are characterized, with comparative analysis at the DNA, protein and genetic/physical map levels suggesting that true CKX2 orthologs have been identified. Furthermore, our analysis shows CKX2 genes in barley and wheat have undergone a Triticeae-specific gene-duplication event. Finally, by identifying ten of the eleven CKX genes predicted to be present in barley by comparative analyses, we show that next-generation sequencing approaches can efficiently determine the gene space of large-genome crops. Together, this work provides the foundation for future functional investigation of CKX family members within the Poaceae. © 2011 National Institute of Agricultural Botany (NIAB). Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell
Suitable Reference Genes for Accurate Gene Expression Analysis in Parsley (Petroselinum crispum) for Abiotic Stresses and Hormone Stimuli

PubMed Central

Li, Meng-Yao; Song, Xiong; Wang, Feng; Xiong, Ai-Sheng

2016-01-01

Parsley, one of the most important vegetables in the Apiaceae family, is widely used in the food, medicinal, and cosmetic industries. Recent studies on parsley mainly focus on its chemical composition, and further research involving the analysis of the plant's gene functions and expressions is required. qPCR is a powerful method for detecting very low quantities of target transcript levels and is widely used to study gene expression. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, four software, namely geNorm, NormFinder, BestKeeper, and RefFinder were used to evaluate the expression stabilities of eight candidate reference genes of parsley (GAPDH, ACTIN, eIF-4α, SAND, UBC, TIP41, EF-1α, and TUB) under various conditions, including abiotic stresses (heat, cold, salt, and drought) and hormone stimuli treatments (GA, SA, MeJA, and ABA). Results showed that EF-1α and TUB were the most stable genes for abiotic stresses, whereas EF-1α, GAPDH, and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1α and TUB were the most stable reference genes among all tested samples, and UBC was the least stable one. Expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study can guide the selection of suitable reference genes in gene expression in parsley. PMID:27746803
Suitable Reference Genes for Accurate Gene Expression Analysis in Parsley (Petroselinum crispum) for Abiotic Stresses and Hormone Stimuli.

PubMed

Li, Meng-Yao; Song, Xiong; Wang, Feng; Xiong, Ai-Sheng

2016-01-01

Parsley, one of the most important vegetables in the Apiaceae family, is widely used in the food, medicinal, and cosmetic industries. Recent studies on parsley mainly focus on its chemical composition, and further research involving the analysis of the plant's gene functions and expressions is required. qPCR is a powerful method for detecting very low quantities of target transcript levels and is widely used to study gene expression. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, four software, namely geNorm, NormFinder, BestKeeper, and RefFinder were used to evaluate the expression stabilities of eight candidate reference genes of parsley ( GAPDH, ACTIN, eIF-4 α, SAND, UBC, TIP41, EF-1 α, and TUB ) under various conditions, including abiotic stresses (heat, cold, salt, and drought) and hormone stimuli treatments (GA, SA, MeJA, and ABA). Results showed that EF-1 α and TUB were the most stable genes for abiotic stresses, whereas EF-1 α, GAPDH , and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1 α and TUB were the most stable reference genes among all tested samples, and UBC was the least stable one. Expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study can guide the selection of suitable reference genes in gene expression in parsley.
Mean field analysis of a spatial stochastic model of a gene regulatory network.

PubMed

Sturrock, M; Murray, P J; Matzavinos, A; Chaplain, M A J

2015-10-01

A gene regulatory network may be defined as a collection of DNA segments which interact with each other indirectly through their RNA and protein products. Such a network is said to contain a negative feedback loop if its products inhibit gene transcription, and a positive feedback loop if a gene product promotes its own production. Negative feedback loops can create oscillations in mRNA and protein levels while positive feedback loops are primarily responsible for signal amplification. It is often the case in real biological systems that both negative and positive feedback loops operate in parameter regimes that result in low copy numbers of gene products. In this paper we investigate the spatio-temporal dynamics of a single feedback loop in a eukaryotic cell. We first develop a simplified spatial stochastic model of a canonical feedback system (either positive or negative). Using a Gillespie's algorithm, we compute sample trajectories and analyse their corresponding statistics. We then derive a system of equations that describe the spatio-temporal evolution of the stochastic means. Subsequently, we examine the spatially homogeneous case and compare the results of numerical simulations with the spatially explicit case. Finally, using a combination of steady-state analysis and data clustering techniques, we explore model behaviour across a subregion of the parameter space that is difficult to access experimentally and compare the parameter landscape of our spatio-temporal and spatially-homogeneous models.
A PCR primer bank for quantitative gene expression analysis.

PubMed

Wang, Xiaowei; Seed, Brian

2003-12-15

Although gene expression profiling by microarray analysis is a useful tool for assessing global levels of transcriptional activity, variability associated with the data sets usually requires that observed differences be validated by some other method, such as real-time quantitative polymerase chain reaction (real-time PCR). However, non-specific amplification of non-target genes is frequently observed in the latter, confounding the analysis in approximately 40% of real-time PCR attempts when primer-specific labels are not used. Here we present an experimentally validated algorithm for the identification of transcript-specific PCR primers on a genomic scale that can be applied to real-time PCR with sequence-independent detection methods. An online database, PrimerBank, has been created for researchers to retrieve primer information for their genes of interest. PrimerBank currently contains 147 404 primers encompassing most known human and mouse genes. The primer design algorithm has been tested by conventional and real-time PCR for a subset of 112 primer pairs with a success rate of 98.2%.
Early Evolution of Vertebrate Mybs: An Integrative Perspective Combining Synteny, Phylogenetic, and Gene Expression Analyses

PubMed Central

Campanini, Emeline B.; Vandewege, Michael W.; Pillai, Nisha E.; Tay, Boon-Hui; Jones, Justin L.; Venkatesh, Byrappa; Hoffmann, Federico G.

2015-01-01

Abstract The genes in the Myb superfamily encode for three related transcription factors in most vertebrates, A-, B-, and c-Myb, with functionally distinct roles, whereas most invertebrates have a single Myb. B-Myb plays an essential role in cell division and cell cycle progression, c-Myb is involved in hematopoiesis, and A-Myb is involved in spermatogenesis and regulating expression of pachytene PIWI interacting RNAs, a class of small RNAs involved in posttranscriptional gene regulation and the maintenance of reproductive tissues. Comparisons between teleost fish and tetrapods suggest that the emergence and functional divergence of the Myb genes were linked to the two rounds of whole-genome duplication early in vertebrate evolution. We combined phylogenetic, synteny, structural, and gene expression analyses of the Myb paralogs from elephant shark and lampreys with data from 12 bony vertebrates to reconstruct the early evolution of vertebrate Mybs. Phylogenetic and synteny analyses suggest that the elephant shark and Japanese lamprey have copies of the A-, B-, and c-Myb genes, implying their origin could be traced back to the common ancestor of lampreys and gnathostomes. However, structural and gene expression analyses suggest that their functional roles diverged between gnathostomes and cyclostomes. In particular, we did not detect A-Myb expression in testis suggesting that the involvement of A-Myb in the pachytene PIWI interacting RNA pathway is probably a gnathostome-specific innovation. We speculate that the secondary loss of a central domain in lamprey A-Myb underlies the functional differences between the cyclostome and gnathostome A-Myb proteins. PMID:26475318
GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis.

PubMed

Zheng, Qi; Wang, Xiu-Jie

2008-07-01

Gene Ontology (GO) analysis has become a commonly used approach for functional studies of large-scale genomic or transcriptomic data. Although there have been a lot of software with GO-related analysis functions, new tools are still needed to meet the requirements for data generated by newly developed technologies or for advanced analysis purpose. Here, we present a Gene Ontology Enrichment Analysis Software Toolkit (GOEAST), an easy-to-use web-based toolkit that identifies statistically overrepresented GO terms within given gene sets. Compared with available GO analysis tools, GOEAST has the following improved features: (i) GOEAST displays enriched GO terms in graphical format according to their relationships in the hierarchical tree of each GO category (biological process, molecular function and cellular component), therefore, provides better understanding of the correlations among enriched GO terms; (ii) GOEAST supports analysis for data from various sources (probe or probe set IDs of Affymetrix, Illumina, Agilent or customized microarrays, as well as different gene identifiers) and multiple species (about 60 prokaryote and eukaryote species); (iii) One unique feature of GOEAST is to allow cross comparison of the GO enrichment status of multiple experiments to identify functional correlations among them. GOEAST also provides rigorous statistical tests to enhance the reliability of analysis results. GOEAST is freely accessible at http://omicslab.genetics.ac.cn/GOEAST/
Reference gene selection for quantitative gene expression studies during biological invasions: A test on multiple genes and tissues in a model ascidian Ciona savignyi.

PubMed

Huang, Xuena; Gao, Yangchun; Jiang, Bei; Zhou, Zunchun; Zhan, Aibin

2016-01-15

As invasive species have successfully colonized a wide range of dramatically different local environments, they offer a good opportunity to study interactions between species and rapidly changing environments. Gene expression represents one of the primary and crucial mechanisms for rapid adaptation to local environments. Here, we aim to select reference genes for quantitative gene expression analysis based on quantitative Real-Time PCR (qRT-PCR) for a model invasive ascidian, Ciona savignyi. We analyzed the stability of ten candidate reference genes in three tissues (siphon, pharynx and intestine) under two key environmental stresses (temperature and salinity) in the marine realm based on three programs (geNorm, NormFinder and delta Ct method). Our results demonstrated only minor difference for stability rankings among the three methods. The use of different single reference gene might influence the data interpretation, while multiple reference genes could minimize possible errors. Therefore, reference gene combinations were recommended for different tissues - the optimal reference gene combination for siphon was RPS15 and RPL17 under temperature stress, and RPL17, UBQ and TubA under salinity treatment; for pharynx, TubB, TubA and RPL17 were the most stable genes under temperature stress, while TubB, TubA and UBQ were the best under salinity stress; for intestine, UBQ, RPS15 and RPL17 were the most reliable reference genes under both treatments. Our results suggest that the necessity of selection and test of reference genes for different tissues under varying environmental stresses. The results obtained here are expected to reveal mechanisms of gene expression-mediated invasion success using C. savignyi as a model species. Copyright © 2015 Elsevier B.V. All rights reserved.
Mapping of Brain Activity by Automated Volume Analysis of Immediate Early Genes.

PubMed

Renier, Nicolas; Adams, Eliza L; Kirst, Christoph; Wu, Zhuhao; Azevedo, Ricardo; Kohl, Johannes; Autry, Anita E; Kadiri, Lolahon; Umadevi Venkataraju, Kannan; Zhou, Yu; Wang, Victoria X; Tang, Cheuk Y; Olsen, Olav; Dulac, Catherine; Osten, Pavel; Tessier-Lavigne, Marc

2016-06-16

Understanding how neural information is processed in physiological and pathological states would benefit from precise detection, localization, and quantification of the activity of all neurons across the entire brain, which has not, to date, been achieved in the mammalian brain. We introduce a pipeline for high-speed acquisition of brain activity at cellular resolution through profiling immediate early gene expression using immunostaining and light-sheet fluorescence imaging, followed by automated mapping and analysis of activity by an open-source software program we term ClearMap. We validate the pipeline first by analysis of brain regions activated in response to haloperidol. Next, we report new cortical regions downstream of whisker-evoked sensory processing during active exploration. Last, we combine activity mapping with axon tracing to uncover new brain regions differentially activated during parenting behavior. This pipeline is widely applicable to different experimental paradigms, including animal species for which transgenic activity reporters are not readily available. Copyright © 2016 Elsevier Inc. All rights reserved.
Mapping of brain activity by automated volume analysis of immediate early genes

PubMed Central

Renier, Nicolas; Adams, Eliza L.; Kirst, Christoph; Wu, Zhuhao; Azevedo, Ricardo; Kohl, Johannes; Autry, Anita E.; Kadiri, Lolahon; Venkataraju, Kannan Umadevi; Zhou, Yu; Wang, Victoria X.; Tang, Cheuk Y.; Olsen, Olav; Dulac, Catherine; Osten, Pavel; Tessier-Lavigne, Marc

2016-01-01

Summary Understanding how neural information is processed in physiological and pathological states would benefit from precise detection, localization and quantification of the activity of all neurons across the entire brain, which has not to date been achieved in the mammalian brain. We introduce a pipeline for high speed acquisition of brain activity at cellular resolution through profiling immediate early gene expression using immunostaining and light-sheet fluorescence imaging, followed by automated mapping and analysis of activity by an open-source software program we term ClearMap. We validate the pipeline first by analysis of brain regions activated in response to Haloperidol. Next, we report new cortical regions downstream of whisker-evoked sensory processing during active exploration. Lastly, we combine activity mapping with axon tracing to uncover new brain regions differentially activated during parenting behavior. This pipeline is widely applicable to different experimental paradigms, including animal species for which transgenic activity reporters are not readily available. PMID:27238021
Genomewide analysis of TCP transcription factor gene family in Malus domestica.

PubMed

Xu, Ruirui; Sun, Peng; Jia, Fengjuan; Lu, Longtao; Li, Yuanyuan; Zhang, Shizhong; Huang, Jinguang

2014-12-01

Teosinte branched 1/cycloidea/proliferating cell factor 1 (TCP) proteins are a large family of transcriptional regulators in angiosperms. They are involved in various biological processes, including development and plant metabolism pathways. In this study, a total of 52 TCP genes were identified in apple (Malus domestica) genome. Bioinformatic methods were employed to predicate and analyse their relevant gene classification, gene structure, chromosome location, sequence alignment and conserved domains of MdTCP proteins. Expression analysis from microarray data showed that the expression levels of 28 and 51 MdTCP genes changed during the ripening and rootstock-scion interaction processes, respectively. The expression patterns of 12 selected MdTCP genes were analysed in different tissues and in response to abiotic stresses. All of the selected genes were detected in at least one of the tissues tested, and most of them were modulated by adverse treatments indicating that the MdTCPs were involved in various developmental and physiological processes. To the best of our knowledge, this is the first study of a genomewide analysis of apple TCP gene family. These results provide valuable information for studies on functions of the TCP transcription factor genes in apple.
Length bias correction in gene ontology enrichment analysis using logistic regression.

PubMed

Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H

2012-01-01

When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.
Digital transcriptome analysis of putative sex-determination genes in papaya (Carica papaya).

PubMed

Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

2012-01-01

Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Y(h)) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Y(h) chromosome, implying a loss of many genes on the Y(h) chromosome. Nevertheless, candidate Y(h) chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya.
Digital Transcriptome Analysis of Putative Sex-Determination Genes in Papaya (Carica papaya)

PubMed Central

Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

2012-01-01

Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Yh) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Yh chromosome, implying a loss of many genes on the Yh chromosome. Nevertheless, candidate Yh chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya. PMID:22815863
Epigenetic modulation of AR gene expression in prostate cancer DU145 cells with the combination of sodium butyrate and 5'-Aza-2'-deoxycytidine.

PubMed

Fialova, Barbora; Luzna, Petra; Gursky, Jan; Langova, Katerina; Kolar, Zdenek; Trtkova, Katerina Smesny

2016-10-01

The androgen receptor (AR) plays an essential role in the development and progression of prostate cancer. Castration-resistant prostate cancer (CRPC) is a consequence of androgen deprivation therapy. Unchecked CRPC followed by metastasis is lethal. Some CRPCs show decreased AR gene expression due to epigenetic mechanisms such as DNA methylation and histone deacetylation. The aim of this study was to epigenetically modulate the methylated state of the AR gene leading to targeted demethylation and AR gene expression in androgen-independent human prostate cancer DU145 cell line, representing the CRPC model with very low or undetectable AR levels. The cell treatment was based on single and combined applications of two epigenetic inhibitors, sodium butyrate (NaB) as histone deacetylases inhibitor and 5'-Aza-2'-deoxycytidine (Aza-dC) as DNA methyltransferases inhibitor. We found that the Aza-dC in combination with NaB may help reduce the toxicity of higher NaB concentrations in cancer cells. In normal RWPE-1 cells and even stronger in cancer DU145 cells, the combined treatment induced both AR gene expression on the mRNA level and increased histone H4 acetylation in AR gene promoter. Also activation and maintenance of G2/M cell cycle arrest and better survival in normal RWPE-1 cells compared to cancer DU145 cells were observed after the treatments. These results imply the selective toxicity effect of both inhibitors used and their potentially more effective combined use in the epigenetic therapy of prostate cancer patients.
Bioinformatics Identification of Modules of Transcription Factor Binding Sites in Alzheimer's Disease-Related Genes by In Silico Promoter Analysis and Microarrays

PubMed Central

Augustin, Regina; Lichtenthaler, Stefan F.; Greeff, Michael; Hansen, Jens; Wurst, Wolfgang; Trümbach, Dietrich

2011-01-01

The molecular mechanisms and genetic risk factors underlying Alzheimer's disease (AD) pathogenesis are only partly understood. To identify new factors, which may contribute to AD, different approaches are taken including proteomics, genetics, and functional genomics. Here, we used a bioinformatics approach and found that distinct AD-related genes share modules of transcription factor binding sites, suggesting a transcriptional coregulation. To detect additional coregulated genes, which may potentially contribute to AD, we established a new bioinformatics workflow with known multivariate methods like support vector machines, biclustering, and predicted transcription factor binding site modules by using in silico analysis and over 400 expression arrays from human and mouse. Two significant modules are composed of three transcription factor families: CTCF, SP1F, and EGRF/ZBPF, which are conserved between human and mouse APP promoter sequences. The specific combination of in silico promoter and multivariate analysis can identify regulation mechanisms of genes involved in multifactorial diseases. PMID:21559189
gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels.

PubMed

Larson, Nicholas B; McDonnell, Shannon; Cannon Albright, Lisa; Teerlink, Craig; Stanford, Janet; Ostrander, Elaine A; Isaacs, William B; Xu, Jianfeng; Cooney, Kathleen A; Lange, Ethan; Schleutker, Johanna; Carpten, John D; Powell, Isaac; Bailey-Wilson, Joan E; Cussenot, Olivier; Cancel-Tassin, Geraldine; Giles, Graham G; MacInnis, Robert J; Maier, Christiane; Whittemore, Alice S; Hsieh, Chih-Lin; Wiklund, Fredrik; Catalona, William J; Foulkes, William; Mandal, Diptasri; Eeles, Rosalind; Kote-Jarai, Zsofia; Ackerman, Michael J; Olson, Timothy M; Klein, Christopher J; Thibodeau, Stephen N; Schaid, Daniel J

2017-05-01

Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results. © 2017 WILEY PERIODICALS, INC.
Stability evaluation of reference genes for gene expression analysis by RT-qPCR in soybean under different conditions.

PubMed

Wan, Qiao; Chen, Shuilian; Shan, Zhihui; Yang, Zhonglu; Chen, Limiao; Zhang, Chanjuan; Yuan, Songli; Hao, Qinnan; Zhang, Xiaojuan; Qiu, Dezhen; Chen, Haifeng; Zhou, Xinan

2017-01-01

Real-time quantitative reverse transcription PCR is a sensitive and widely used technique to quantify gene expression. To achieve a reliable result, appropriate reference genes are highly required for normalization of transcripts in different samples. In this study, 9 previously published reference genes (60S, Fbox, ELF1A, ELF1B, ACT11, TUA5, UBC4, G6PD, CYP2) of soybean [Glycine max (L.) Merr.] were selected. The expression stability of the 9 genes was evaluated under conditions of biotic stress caused by infection with soybean mosaic virus, nitrogen stress, across different cultivars and developmental stages. ΔCt and geNorm algorithms were used to evaluate and rank the expression stability of the 9 reference genes. Results obtained from two algorithms showed high consistency. Moreover, results of pairwise variation showed that two reference genes were sufficient to normalize the expression levels of target genes under each experimental setting. For virus infection, ELF1A and ELF1B were the most stable reference genes for accurate normalization. For different developmental stages, Fbox and G6PD had the highest expression stability between two soybean cultivars (Tanlong No. 1 and Tanlong No. 2). ELF1B and ACT11 were identified as the most stably expressed reference genes both under nitrogen stress and among different cultivars. The results showed that none of the candidate reference genes were uniformly expressed at different conditions, and selecting appropriate reference genes was pivotal for gene expression studies with particular condition and tissue. The most stable combination of genes identified in this study will help to achieve more accurate and reliable results in a wide variety of samples in soybean.
Full-length Transcriptome Sequencing and Modular Organization Analysis of Naringin/Neoeriocitrin Related Gene Expression Pattern in Drynaria roosii.

PubMed

Sun, Mei-Yu; Li, Jing-Yi; Li, Dong; Huang, Feng-Jie; Wang, Di; Li, Hui; Xing, Quan; Zhu, Hui-Bin; Shi, Lei

2018-04-12

Drynaria roosii (Nakaike) is a traditional Chinese medicinal fern, known as 'GuSuiBu'. The corresponding effective components of naringin/neoeriocitrin share highly similar chemical structure and medicinal function. Our HPLC-MS/MS results showed that the accumulation of naringin/neoeriocitrin depended on specific tissues or ages. However, little was known about the expression patterns of naringin/neoeriocitrin related genes involved in their regulatory pathways. For lack of the basic genetic information, we applied a combination of SMRT sequencing and SGS to generate the complete and full-length transcriptome of D. roosii. According to the SGS data, the DEG-based heat map analysis revealed the naringin/neoeriocitrin related gene expression exhibited obvious tissue- and time-specific transcriptomic differences. Using the systems biology method of modular organization analysis, we clustered 16,472 DEGs into 17 gene modules and studied the relationships between modules and tissue/time point samples, as well as modules and naringin/neoeriocitrin contents. Hereinto, naringin/neoeriocitrin related DEGs distributed in nine distinct modules, and DEGs in these modules showed significant different patterns of transcript abundance to be linked with specific tissues or ages. Moreover, WGCNA results further identified that PAL, 4CL, C4H and C3H, HCT acted as the major hub genes involved in naringin and neoeriocitrin synthesis respectively and exhibited high co-expression with MYB- and bHLH-regulated genes. In this work, modular organization and co-expression networks elucidated the tissue- and time-specificity of gene expression pattern, as well as hub genes associated with naringin/neoeriocitrin synthesis in D. roosii. Simultaneously, the comprehensive transcriptome dataset provided the important genetic information for further research on D. roosii.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.