Jackson, Belinda M; Abete-Luzi, Patricia; Krause, Michael W; Eisenmann, David M
2014-04-16
The Wnt signaling pathway plays a fundamental role during metazoan development, where it regulates diverse processes, including cell fate specification, cell migration, and stem cell renewal. Activation of the beta-catenin-dependent/canonical Wnt pathway up-regulates expression of Wnt target genes to mediate a cellular response. In the nematode Caenorhabditis elegans, a canonical Wnt signaling pathway regulates several processes during larval development; however, few target genes of this pathway have been identified. To address this deficit, we used a novel approach of conditionally activated Wnt signaling during a defined stage of larval life by overexpressing an activated beta-catenin protein, then used microarray analysis to identify genes showing altered expression compared with control animals. We identified 166 differentially expressed genes, of which 104 were up-regulated. A subset of the up-regulated genes was shown to have altered expression in mutants with decreased or increased Wnt signaling; we consider these genes to be bona fide C. elegans Wnt pathway targets. Among these was a group of six genes, including the cuticular collagen genes, bli-1 col-38, col-49, and col-71. These genes show a peak of expression in the mid L4 stage during normal development, suggesting a role in adult cuticle formation. Consistent with this finding, reduction of function for several of the genes causes phenotypes suggestive of defects in cuticle function or integrity. Therefore, this work has identified a large number of putative Wnt pathway target genes during larval life, including a small subset of Wnt-regulated collagen genes that may function in synthesis of the adult cuticle.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Klopp, Ann H.; Jhingran, Anuja; Ramdas, Latha
2008-05-01
Purpose: The purpose of this study was to investigate early gene expression changes after chemoradiation in a human solid tumor, allowing identification of chemoradiation-induced gene expression changes in the tumor as well as the tumor microenvironment. In addition we aimed to identify a gene expression profile that was associated with clinical outcome. Methods and Materials: Microarray experiments were performed on cervical cancer specimens obtained before and 48 h after chemoradiation from 12 patients with Stage IB2 to IIIB squamous cell carcinoma of the cervix treated between April 2001 and August 2002. Results: A total of 262 genes were identified thatmore » were significantly changed after chemoradiation. Genes involved in DNA repair were identified including DDB2, ERCC4, GADD45A, and XPC. In addition, significantly regulated cell-to-cell signaling pathways included insulin-like growth factor-1 (IGF-1), interferon, and vascular endothelial growth factor signaling. At a median follow-up of 41 months, 5 of 12 patients had experienced either local or distant failure. Supervised clustering analysis identified a 58-gene set from the pretreatment samples that were differentially expressed between patients with and without recurrence. Genes involved in integrin signaling and apoptosis pathways were identified in this gene set. Immortalization-upregulated protein (IMUP), IGF-2, and ARHD had particularly marked differences in expression between patients with and without recurrence. Conclusions: Genetic profiling identified genes regulated by chemoradiation including DNA damage and cell-to-cell signaling pathways. Genes associated with recurrence were identified that will require validation in an independent patient data set to determine whether the 58-gene set associated with clinical outcome could be useful as a prognostic assay.« less
Recurrent R-spondin fusions in colon cancer.
Seshagiri, Somasekar; Stawiski, Eric W; Durinck, Steffen; Modrusan, Zora; Storm, Elaine E; Conboy, Caitlin B; Chaudhuri, Subhra; Guan, Yinghui; Janakiraman, Vasantharajan; Jaiswal, Bijay S; Guillory, Joseph; Ha, Connie; Dijkgraaf, Gerrit J P; Stinson, Jeremy; Gnad, Florian; Huntley, Melanie A; Degenhardt, Jeremiah D; Haverty, Peter M; Bourgon, Richard; Wang, Weiru; Koeppen, Hartmut; Gentleman, Robert; Starr, Timothy K; Zhang, Zemin; Largaespada, David A; Wu, Thomas D; de Sauvage, Frederic J
2012-08-30
Identifying and understanding changes in cancer genomes is essential for the development of targeted therapeutics. Here we analyse systematically more than 70 pairs of primary human colon tumours by applying next-generation sequencing to characterize their exomes, transcriptomes and copy-number alterations. We have identified 36,303 protein-altering somatic changes that include several new recurrent mutations in the Wnt pathway gene TCF7L2, chromatin-remodelling genes such as TET2 and TET3 and receptor tyrosine kinases including ERBB3. Our analysis for significantly mutated cancer genes identified 23 candidates, including the cell cycle checkpoint kinase ATM. Copy-number and RNA-seq data analysis identified amplifications and corresponding overexpression of IGF2 in a subset of colon tumours. Furthermore, using RNA-seq data we identified multiple fusion transcripts including recurrent gene fusions involving R-spondin family members RSPO2 and RSPO3 that together occur in 10% of colon tumours. The RSPO fusions were mutually exclusive with APC mutations, indicating that they probably have a role in the activation of Wnt signalling and tumorigenesis. Consistent with this we show that the RSPO fusion proteins were capable of potentiating Wnt signalling. The R-spondin gene fusions and several other gene mutations identified in this study provide new potential opportunities for therapeutic intervention in colon cancer.
Recurrent R-spondin fusions in colon cancer
Seshagiri, Somasekar; Stawiski, Eric W.; Durinck, Steffen; Modrusan, Zora; Storm, Elaine E.; Conboy, Caitlin B.; Chaudhuri, Subhra; Guan, Yinghui; Janakiraman, Vasantharajan; Jaiswal, Bijay S.; Guillory, Joseph; Ha, Connie; Dijkgraaf, Gerrit J. P.; Stinson, Jeremy; Gnad, Florian; Huntley, Melanie A.; Degenhardt, Jeremiah D.; Haverty, Peter M.; Bourgon, Richard; Wang, Weiru; Koeppen, Hartmut; Gentleman, Robert; Starr, Timothy K.; Zhang, Zemin; Largaespada, David A.; Wu, Thomas D.; de Sauvage, Frederic J
2013-01-01
Identifying and understanding changes in cancer genomes is essential for the development of targeted therapeutics1. Here we analyse systematically more than 70 pairs of primary human colon tumours by applying next-generation sequencing to characterize their exomes, transcriptomes and copy-number alterations. We have identified 36,303 protein-altering somatic changes that include several new recurrent mutations in the Wnt pathway gene TCF7L2, chromatin-remodelling genes such as TET2 and TET3 and receptor tyrosine kinases including ERBB3. Our analysis for significantly mutated cancer genes identified 23 candidates, including the cell cycle checkpoint kinase ATM. Copy-number and RNA-seq data analysis identified amplifications and corresponding overexpression of IGF2 in a subset of colon tumours. Furthermore, using RNA-seq data we identified multiple fusion transcripts including recurrent gene fusions involving R-spondin family members RSPO2 and RSPO3 that together occur in 10% of colon tumours. The RSPO fusions were mutually exclusive with APC mutations, indicating that they probably have a role in the activation of Wnt signalling and tumorigenesis. Consistent with this we show that the RSPO fusion proteins were capable of potentiating Wnt signalling. The R-spondin gene fusions and several other gene mutations identified in this study provide new potential opportunities for therapeutic intervention in colon cancer. PMID:22895193
A transposon-based genetic screen in mice identifies genes altered in colorectal cancer.
Starr, Timothy K; Allaei, Raha; Silverstein, Kevin A T; Staggs, Rodney A; Sarver, Aaron L; Bergemann, Tracy L; Gupta, Mihir; O'Sullivan, M Gerard; Matise, Ilze; Dupuy, Adam J; Collier, Lara S; Powers, Scott; Oberg, Ann L; Asmann, Yan W; Thibodeau, Stephen N; Tessarollo, Lino; Copeland, Neal G; Jenkins, Nancy A; Cormier, Robert T; Largaespada, David A
2009-03-27
Human colorectal cancers (CRCs) display a large number of genetic and epigenetic alterations, some of which are causally involved in tumorigenesis (drivers) and others that have little functional impact (passengers). To help distinguish between these two classes of alterations, we used a transposon-based genetic screen in mice to identify candidate genes for CRC. Mice harboring mutagenic Sleeping Beauty (SB) transposons were crossed with mice expressing SB transposase in gastrointestinal tract epithelium. Most of the offspring developed intestinal lesions, including intraepithelial neoplasia, adenomas, and adenocarcinomas. Analysis of over 16,000 transposon insertions identified 77 candidate CRC genes, 60 of which are mutated and/or dysregulated in human CRC and thus are most likely to drive tumorigenesis. These genes include APC, PTEN, and SMAD4. The screen also identified 17 candidate genes that had not previously been implicated in CRC, including POLI, PTPRK, and RSPO2.
Long Term Follow up of the Delayed Effects of Acute Radiation Exposure in Primates
2017-10-01
66 of 94 We will then use shRNAs and/or CRISPR constructs targeting the gene of interest to knock down its expression in stem cells prior to...DLBCLs Mutational profiling identifies 150 driver genes Gene expression identifies sub- groups including cell of origin Unbiased CRISPR screen...Exome sequencing in 1,001 DLBCL patients comprehensively identifies 150 driver genes d Unbiased CRISPR screen in DLBCL cell lines identifies essential
Zhou, Shiyong; Liu, Pengfei; Zhang, Huilai
2017-01-01
Acute myeloid leukemia (AML) is a frequently occurring malignant disease of the blood and may result from a variety of genetic disorders. The present study aimed to identify the underlying mechanisms associated with the therapeutic effects of decitabine and cytarabine on AML, using microarray analysis. The microarray datasets GSE40442 and GSE40870 were downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) and differentially methylated sites were identified in AML cells treated with decitabine compared with those treated with cytarabine via the Linear Models for Microarray Data package, following data pre-processing. Gene Ontology (GO) analysis of DEGs was performed using the Database for Annotation, Visualization and Integrated Analysis Discovery. Genes corresponding to the differentially methylated sites were obtained using the annotation package of the methylation microarray platform. The overlapping genes were identified, which exhibited the opposite variation trend between gene expression and DNA methylation. Important transcription factor (TF)-gene pairs were screened out, and a regulated network subsequently constructed. A total of 190 DEGs and 540 differentially methylated sites were identified in AML cells treated with decitabine compared with those treated with cytarabine. A total of 36 GO terms of DEGs were enriched, including nucleosomes, protein-DNA complexes and the nucleosome assembly. The 540 differentially methylated sites were located on 240 genes, including the acid-repeat containing protein (ACRC) gene that was additionally differentially expressed. In addition, 60 TF pairs and overlapped methylated sites, and 140 TF-pairs and DEGs were screened out. The regulated network included 68 nodes and 140 TF-gene pairs. The present study identified various genes including ACRC and proliferating cell nuclear antigen, in addition to various TFs, including TATA-box binding protein associated factor 1 and CCCTC-binding factor, which may be potential therapeutic targets of AML. PMID:28498449
Zhou, Shiyong; Liu, Pengfei; Zhang, Huilai
2017-07-01
Acute myeloid leukemia (AML) is a frequently occurring malignant disease of the blood and may result from a variety of genetic disorders. The present study aimed to identify the underlying mechanisms associated with the therapeutic effects of decitabine and cytarabine on AML, using microarray analysis. The microarray datasets GSE40442 and GSE40870 were downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) and differentially methylated sites were identified in AML cells treated with decitabine compared with those treated with cytarabine via the Linear Models for Microarray Data package, following data pre‑processing. Gene Ontology (GO) analysis of DEGs was performed using the Database for Annotation, Visualization and Integrated Analysis Discovery. Genes corresponding to the differentially methylated sites were obtained using the annotation package of the methylation microarray platform. The overlapping genes were identified, which exhibited the opposite variation trend between gene expression and DNA methylation. Important transcription factor (TF)‑gene pairs were screened out, and a regulated network subsequently constructed. A total of 190 DEGs and 540 differentially methylated sites were identified in AML cells treated with decitabine compared with those treated with cytarabine. A total of 36 GO terms of DEGs were enriched, including nucleosomes, protein‑DNA complexes and the nucleosome assembly. The 540 differentially methylated sites were located on 240 genes, including the acid‑repeat containing protein (ACRC) gene that was additionally differentially expressed. In addition, 60 TF pairs and overlapped methylated sites, and 140 TF‑pairs and DEGs were screened out. The regulated network included 68 nodes and 140 TF‑gene pairs. The present study identified various genes including ACRC and proliferating cell nuclear antigen, in addition to various TFs, including TATA‑box binding protein associated factor 1 and CCCTC‑binding factor, which may be potential therapeutic targets of AML.
Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE
Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.
2009-01-01
Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438
Defining the human macula transcriptome and candidate retinal disease genes using EyeSAGE.
Bowes Rickman, Catherine; Ebright, Jessica N; Zavodni, Zachary J; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P; Wistow, Graeme; Boon, Kathy; Hauser, Michael A
2006-06-01
To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. The EyeSAGE database, combining three different gene-profiling platforms including the authors' multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions.
Tiffin, Nicki; Meintjes, Ayton; Ramesar, Rajkumar; Bajic, Vladimir B.; Rayner, Brian
2010-01-01
Multiple factors underlie susceptibility to essential hypertension, including a significant genetic and ethnic component, and environmental effects. Blood pressure response of hypertensive individuals to salt is heterogeneous, but salt sensitivity appears more prevalent in people of indigenous African origin. The underlying genetics of salt-sensitive hypertension, however, are poorly understood. In this study, computational methods including text- and data-mining have been used to select and prioritize candidate aetiological genes for salt-sensitive hypertension. Additionally, we have compared allele frequencies and copy number variation for single nucleotide polymorphisms in candidate genes between indigenous Southern African and Caucasian populations, with the aim of identifying candidate genes with significant variability between the population groups: identifying genetic variability between population groups can exploit ethnic differences in disease prevalence to aid with prioritisation of good candidate genes. Our top-ranking candidate genes include parathyroid hormone precursor (PTH) and type-1angiotensin II receptor (AGTR1). We propose that the candidate genes identified in this study warrant further investigation as potential aetiological genes for salt-sensitive hypertension. PMID:20886000
Identifying key genes in glaucoma based on a benchmarked dataset and the gene regulatory network.
Chen, Xi; Wang, Qiao-Ling; Zhang, Meng-Hui
2017-10-01
The current study aimed to identify key genes in glaucoma based on a benchmarked dataset and gene regulatory network (GRN). Local and global noise was added to the gene expression dataset to produce a benchmarked dataset. Differentially-expressed genes (DEGs) between patients with glaucoma and normal controls were identified utilizing the Linear Models for Microarray Data (Limma) package based on benchmarked dataset. A total of 5 GRN inference methods, including Zscore, GeneNet, context likelihood of relatedness (CLR) algorithm, Partial Correlation coefficient with Information Theory (PCIT) and GEne Network Inference with Ensemble of Trees (Genie3) were evaluated using receiver operating characteristic (ROC) and precision and recall (PR) curves. The interference method with the best performance was selected to construct the GRN. Subsequently, topological centrality (degree, closeness and betweenness) was conducted to identify key genes in the GRN of glaucoma. Finally, the key genes were validated by performing reverse transcription-quantitative polymerase chain reaction (RT-qPCR). A total of 176 DEGs were detected from the benchmarked dataset. The ROC and PR curves of the 5 methods were analyzed and it was determined that Genie3 had a clear advantage over the other methods; thus, Genie3 was used to construct the GRN. Following topological centrality analysis, 14 key genes for glaucoma were identified, including IL6 , EPHA2 and GSTT1 and 5 of these 14 key genes were validated by RT-qPCR. Therefore, the current study identified 14 key genes in glaucoma, which may be potential biomarkers to use in the diagnosis of glaucoma and aid in identifying the molecular mechanism of this disease.
A Systematic Genetic Screen to Dissect the MicroRNA Pathway in Drosophila.
Pressman, Sigal; Reinke, Catherine A; Wang, Xiaohong; Carthew, Richard W
2012-04-01
A central goal of microRNA biology is to elucidate the genetic program of miRNA function and regulation. However, relatively few of the effectors that execute miRNA repression have been identified. Because such genes may function in many developmental processes, mutations in them are expected to be pleiotropic and thus are discarded in most standard genetic screens. Here, we describe a systematic screen designed to identify all Drosophila genes in ∼40% of the genome that function in the miRNA pathway. To identify potentially pleiotropic genes, the screen analyzed clones of homozygous mutant cells in heterozygous animals. We identified 45 mutations representing 24 genes, and we molecularly characterized 9 genes. These include 4 previously known genes that encode core components of the miRNA pathway, including Drosha, Pasha, Dicer-1, and Ago1. The rest are new genes that function through chromatin remodeling, signaling, and mRNA decapping. The results suggest genetic screens that use clonal analysis can elucidate the miRNA program and that ∼100 genes are required to execute the miRNA program.
Harripaul, R; Vasli, N; Mikhailov, A; Rafiq, M A; Mittal, K; Windpassinger, C; Sheikh, T I; Noor, A; Mahmood, H; Downey, S; Johnson, M; Vleuten, K; Bell, L; Ilyas, M; Khan, F S; Khan, V; Moradi, M; Ayaz, M; Naeem, F; Heidari, A; Ahmed, I; Ghadami, S; Agha, Z; Zeinali, S; Qamar, R; Mozhdehipanah, H; John, P; Mir, A; Ansar, M; French, L; Ayub, M; Vincent, J B
2018-04-01
Approximately 1% of the global population is affected by intellectual disability (ID), and the majority receive no molecular diagnosis. Previous studies have indicated high levels of genetic heterogeneity, with estimates of more than 2500 autosomal ID genes, the majority of which are autosomal recessive (AR). Here, we combined microarray genotyping, homozygosity-by-descent (HBD) mapping, copy number variation (CNV) analysis, and whole exome sequencing (WES) to identify disease genes/mutations in 192 multiplex Pakistani and Iranian consanguineous families with non-syndromic ID. We identified definite or candidate mutations (or CNVs) in 51% of families in 72 different genes, including 26 not previously reported for ARID. The new ARID genes include nine with loss-of-function mutations (ABI2, MAPK8, MPDZ, PIDD1, SLAIN1, TBC1D23, TRAPPC6B, UBA7 and USP44), and missense mutations include the first reports of variants in BDNF or TET1 associated with ID. The genes identified also showed overlap with de novo gene sets for other neuropsychiatric disorders. Transcriptional studies showed prominent expression in the prenatal brain. The high yield of AR mutations for ID indicated that this approach has excellent clinical potential and should inform clinical diagnostics, including clinical whole exome and genome sequencing, for populations in which consanguinity is common. As with other AR disorders, the relevance will also apply to outbred populations.
Yoo, Seungyeul; Takikawa, Sachiko; Geraghty, Patrick; Argmann, Carmen; Campbell, Joshua; Lin, Luan; Huang, Tao; Tu, Zhidong; Foronjy, Robert F; Feronjy, Robert; Spira, Avrum; Schadt, Eric E; Powell, Charles A; Zhu, Jun
2015-01-01
Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple genes sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a 'causal' role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology.
Identification of key target genes and pathways in laryngeal carcinoma
Liu, Feng; Du, Jintao; Liu, Jun; Wen, Bei
2016-01-01
The purpose of the present study was to screen the key genes associated with laryngeal carcinoma and to investigate the molecular mechanism of laryngeal carcinoma progression. The gene expression profile of GSE10935 [Gene Expression Omnibus (GEO) accession number], including 12 specimens from laryngeal papillomas and 12 specimens from normal laryngeal epithelia controls, was downloaded from the GEO database. Differentially expressed genes (DEGs) were screened in laryngeal papillomas compared with normal controls using Limma package in R language, followed by Gene Ontology (GO) enrichment analysis and pathway enrichment analysis. Furthermore, the protein-protein interaction (PPI) network of DEGs was constructed using Cytoscape software and modules were analyzed using MCODE plugin from the PPI network. Furthermore, significant biological pathway regions (sub-pathway) were identified by using iSubpathwayMiner analysis. A total of 67 DEGs were identified, including 27 up-regulated genes and 40 down-regulated genes and they were involved in different GO terms and pathways. PPI network analysis revealed that Ras association (RalGDS/AF-6) domain family member 1 (RASSF1) was a hub protein. The sub-pathway analysis identified 9 significantly enriched sub-pathways, including glycolysis/gluconeogenesis and nitrogen metabolism. Genes such as phosphoglycerate kinase 1 (PGK1), carbonic anhydrase II (CA2), and carbonic anhydrase XII (CA12) whose node degrees were >10 were identified in the disease risk sub-pathway. Genes in the sub-pathway, such as RASSF1, PGK1, CA2 and CA12 were presumed to serve critical roles in laryngeal carcinoma. The present study identified DEGs and their sub-pathways in the disease, which may serve as potential targets for treatment of laryngeal carcinoma. PMID:27446427
Lipid metabolism in Rhodnius prolixus: Lessons from the genome.
Majerowicz, David; Calderón-Fernández, Gustavo M; Alves-Bezerra, Michele; De Paula, Iron F; Cardoso, Lívia S; Juárez, M Patricia; Atella, Georgia C; Gondim, Katia C
2017-01-05
The kissing bug Rhodnius prolixus is both an important vector of Chagas' disease and an interesting model for investigation into the field of physiology, including lipid metabolism. The publication of this insect genome will bring a huge amount of new molecular biology data to be used in future experiments. Although this work represents a promising scenario, a preliminary analysis of the sequence data is necessary to identify and annotate the genes involved in lipid metabolism. Here, we used bioinformatics tools and gene expression analysis to explore genes from different genes families and pathways, including genes for fat breakdown, as lipases and phospholipases, and enzymes from β-oxidation, fatty acid metabolism, and acyl-CoA and glycerolipid synthesis. The R. prolixus genome encodes 31 putative lipase genes, including 21 neutral lipases and 5 acid lipases. The expression profiles of some of these genes were analyzed. We were able to identify nine phospholipase A2 genes. A variety of gene families that participate in fatty acid synthesis and modification were studied, including fatty acid synthase, elongase, desaturase and reductase. Concerning the synthesis of glycerolipids, we found a second isoform of glycerol-3-phosphate acyltransferase that was ubiquitously expressed throughout the organs. Finally, all genes involved in fatty acid β-oxidation were identified, but not a long-chain acyl-CoA dehydrogenase. These results provide fundamental data to be used in future research on insect lipid metabolism and its possible relevance to Chagas' disease transmission. Copyright © 2016 Elsevier B.V. All rights reserved.
[Genome-wide identification and analysis of WRKY transcription factors in Medicago truncatula].
Song, Hui; Nan, Zhibiao
2014-02-01
WRKY gene family plays important roles in plant by involving in transcriptional regulations during various physiologically processes such as development, metabolism and responses to biotic and abiotic stresses. WRKY genes have been identified in various plants. However, only few WRKY genes in Medicago truncatula have been identified with systematic analysis and comparison. In this study, we identified 93 WRKY genes through analyses of M. truncatula genome. These genes include 19 type-I genes, 49 type II genes and 13 type-III genes, and 12 non-regular type genes. All of these genes were characterized through analyses of gene duplication, chromosomal locations, structural diversity, conserved protein motifs and phylogenetic relations. The results showed that 11 times of gene duplication event occurred in WRKY gene family involving 24 genes. WRKY genes, containing 6 gene clusters, are unevenly distributed into chromosome 1 to 6, and there is the purifying selection pressure in WRKY group III genes.
Identification of key microRNAs and genes in preeclampsia by bioinformatics analysis
Luo, Shouling; Cao, Nannan; Tang, Yao; Gu, Weirong
2017-01-01
Preeclampsia is a leading cause of perinatal maternal–foetal mortality and morbidity. The aim of this study is to identify the key microRNAs and genes in preeclampsia and uncover their potential functions. We downloaded the miRNA expression profile of GSE84260 and the gene expression profile of GSE73374 from the Gene Expression Omnibus database. Differentially expressed miRNAs and genes were identified and compared to miRNA-target information from MiRWalk 2.0, and a total of 65 differentially expressed miRNAs (DEMIs), including 32 up-regulated miRNAs and 33 down-regulated miRNAs, and 91 differentially expressed genes (DEGs), including 83 up-regulated genes and 8 down-regulated genes, were identified. The pathway enrichment analyses of the DEMIs showed that the up-regulated DEMIs were enriched in the Hippo signalling pathway and MAPK signalling pathway, and the down-regulated DEMIs were enriched in HTLV-I infection and miRNAs in cancers. The gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) enrichment analyses of the DEGs were performed using Multifaceted Analysis Tool for Human Transcriptome. The up-regulated DEGs were enriched in biological processes (BPs), including the response to cAMP, response to hydrogen peroxide and cell-cell adhesion mediated by integrin; no enrichment of down-regulated DEGs was identified. KEGG analysis showed that the up-regulated DEGs were enriched in the Hippo signalling pathway and pathways in cancer. A PPI network of the DEGs was constructed by using Cytoscape software, and FOS, STAT1, MMP14, ITGB1, VCAN, DUSP1, LDHA, MCL1, MET, and ZFP36 were identified as the hub genes. The current study illustrates a characteristic microRNA profile and gene profile in preeclampsia, which may contribute to the interpretation of the progression of preeclampsia and provide novel biomarkers and therapeutic targets for preeclampsia. PMID:28594854
Putnam, Christopher D.; Srivatsan, Anjana; Nene, Rahul V.; Martinez, Sandra L.; Clotfelter, Sarah P.; Bell, Sara N.; Somach, Steven B.; E.S. de Souza, Jorge; Fonseca, André F.; de Souza, Sandro J.; Kolodner, Richard D.
2016-01-01
Gross chromosomal rearrangements (GCRs) play an important role in human diseases, including cancer. The identity of all Genome Instability Suppressing (GIS) genes is not currently known. Here multiple Saccharomyces cerevisiae GCR assays and query mutations were crossed into arrays of mutants to identify progeny with increased GCR rates. One hundred eighty two GIS genes were identified that suppressed GCR formation. Another 438 cooperatively acting GIS genes were identified that were not GIS genes, but suppressed the increased genome instability caused by individual query mutations. Analysis of TCGA data using the human genes predicted to act in GIS pathways revealed that a minimum of 93% of ovarian and 66% of colorectal cancer cases had defects affecting one or more predicted GIS gene. These defects included loss-of-function mutations, copy-number changes associated with reduced expression, and silencing. In contrast, acute myeloid leukaemia cases did not appear to have defects affecting the predicted GIS genes. PMID:27071721
Go, Yoon Young; Park, Moo Kyun; Kwon, Jee Young; Seo, Young Rok; Chae, Sung-Won; Song, Jae-Jun
2015-12-01
The primary aim of this study is to evaluate the gene expression profile of Asian sand dust (ASD)-treated human middle ear epithelial cell (HMEEC) using microarray analysis. The HMEEC was treated with ASD (400 µg/mL) and total RNA was extracted for microarray analysis. Molecular pathways among differentially expressed genes were further analyzed. For selected genes, the changes in gene expression were confirmed by real-time polymerase chain reaction. A total of 1,274 genes were differentially expressed by ASD. Among them, 1,138 genes were 2 folds up-regulated, whereas 136 genes were 2 folds down-regulated. Up-regulated genes were mainly involved in cellular processes, including apoptosis, cell differentiation, and cell proliferation. Down-regulated genes affected cellular processes, including apoptosis, cell cycle, cell differentiation, and cell proliferation. The 10 genes including ADM, CCL5, EDN1, EGR1, FOS, GHRL, JUN, SOCS3, TNF, and TNFSF10 were identified as main modulators in up-regulated genes. A total of 11 genes including CSF3, DKK1, FOSL1, FST, TERT, MMP13, PTHLH, SPRY2, TGFBR2, THBS1, and TIMP1 acted as main components of pathway associated with 2-fold down regulated genes. We identified the differentially expressed genes in ASD-treated HMEEC. Our work indicates that air pollutant like ASD, may play an important role in the pathogenesis of otitis media.
Yoo, Seungyeul; Takikawa, Sachiko; Geraghty, Patrick; Argmann, Carmen; Campbell, Joshua; Lin, Luan; Huang, Tao; Tu, Zhidong; Feronjy, Robert; Spira, Avrum; Schadt, Eric E.; Powell, Charles A.; Zhu, Jun
2015-01-01
Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple genes sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a ‘causal’ role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology. PMID:25569234
Zeeberg, Barry R; Riss, Joseph; Kane, David W; Bussey, Kimberly J; Uchio, Edward; Linehan, W Marston; Barrett, J Carl; Weinstein, John N
2004-01-01
Background When processing microarray data sets, we recently noticed that some gene names were being changed inadvertently to non-gene names. Results A little detective work traced the problem to default date format conversions and floating-point format conversions in the very useful Excel program package. The date conversions affect at least 30 gene names; the floating-point conversions affect at least 2,000 if Riken identifiers are included. These conversions are irreversible; the original gene names cannot be recovered. Conclusions Users of Excel for analyses involving gene names should be aware of this problem, which can cause genes, including medically important ones, to be lost from view and which has contaminated even carefully curated public databases. We provide work-arounds and scripts for circumventing the problem. PMID:15214961
Li, Shicheng; Sun, Xiao; Miao, Shuncheng; Liu, Jia; Jiao, Wenjie
2017-11-01
Cigarette smoking is one of the greatest preventable risk factors for developing cancer, and most cases of lung squamous cell carcinoma (lung SCC) are associated with smoking. The pathogenesis mechanism of tumor progress is unclear. This study aimed to identify biomarkers in smoking-related lung cancer, including protein-coding gene, long noncoding RNA, and transcription factors. We selected and obtained messenger RNA microarray datasets and clinical data from the Gene Expression Omnibus database to identify gene expression altered by cigarette smoking. Integrated bioinformatic analysis was used to clarify biological functions of the identified genes, including Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, the construction of a protein-protein interaction network, transcription factor, and statistical analyses. Subsequent quantitative real-time PCR was utilized to verify these bioinformatic analyses. Five hundred and ninety-eight differentially expressed genes and 21 long noncoding RNA were identified in smoking-related lung SCC. GO and KEGG pathway analysis showed that identified genes were enriched in the cancer-related functions and pathways. The protein-protein interaction network revealed seven hub genes identified in lung SCC. Several transcription factors and their binding sites were predicted. The results of real-time quantitative PCR revealed that AURKA and BIRC5 were significantly upregulated and LINC00094 was downregulated in the tumor tissues of smoking patients. Further statistical analysis indicated that dysregulation of AURKA, BIRC5, and LINC00094 indicated poor prognosis in lung SCC. Protein-coding genes AURKA, BIRC5, and LINC00094 could be biomarkers or therapeutic targets for smoking-related lung SCC. © 2017 The Authors. Thoracic Cancer published by China Lung Oncology Group and John Wiley & Sons Australia, Ltd.
Thomassen, Mads; Tan, Qihua; Kruse, Torben A
2009-01-01
Breast cancer cells exhibit complex karyotypic alterations causing deregulation of numerous genes. Some of these genes are probably causal for cancer formation and local growth whereas others are causal for the various steps of metastasis. In a fraction of tumors deregulation of the same genes might be caused by epigenetic modulations, point mutations or the influence of other genes. We have investigated the relation of gene expression and chromosomal position, using eight datasets including more than 1200 breast tumors, to identify chromosomal regions and candidate genes possibly causal for breast cancer metastasis. By use of "Gene Set Enrichment Analysis" we have ranked chromosomal regions according to their relation to metastasis. Overrepresentation analysis identified regions with increased expression for chromosome 1q41-42, 8q24, 12q14, 16q22, 16q24, 17q12-21.2, 17q21-23, 17q25, 20q11, and 20q13 among metastasizing tumors and reduced gene expression at 1p31-21, 8p22-21, and 14q24. By analysis of genes with extremely imbalanced expression in these regions we identified DIRAS3 at 1p31, PSD3, LPL, EPHX2 at 8p21-22, and FOS at 14q24 as candidate metastasis suppressor genes. Potential metastasis promoting genes includes RECQL4 at 8q24, PRMT7 at 16q22, GINS2 at 16q24, and AURKA at 20q13.
Identifying gene networks underlying the neurobiology of ethanol and alcoholism.
Wolen, Aaron R; Miles, Michael F
2012-01-01
For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.
Kim, Jae Yoon; Moon, Jun-Cheol; Kim, Hyo Chul; Shin, Seungho; Song, Kitae; Kim, Kyung-Hee; Lee, Byung-Moo
2017-01-01
Premise of the study: Positional cloning in combination with phenotyping is a general approach to identify disease-resistance gene candidates in plants; however, it requires several time-consuming steps including population or fine mapping. Therefore, in the present study, we suggest a new combined strategy to improve the identification of disease-resistance gene candidates. Methods and Results: Downy mildew (DM)–resistant maize was selected from five cultivars using a spreader row technique. Positional cloning and bioinformatics tools were used to identify the DM-resistance quantitative trait locus marker (bnlg1702) and 47 protein-coding gene annotations. Eventually, five DM-resistance gene candidates, including bZIP34, Bak1, and Ppr, were identified by quantitative reverse-transcription PCR (RT-PCR) without fine mapping of the bnlg1702 locus. Conclusions: The combined protocol with the spreader row technique, quantitative trait locus positional cloning, and quantitative RT-PCR was effective for identifying DM-resistance candidate genes. This cloning approach may be applied to other whole-genome-sequenced crops or resistance to other diseases. PMID:28224059
Data on the genome-wide identification of CNL R-genes in Setaria italica (L.) P. Beauv.
Andersen, Ethan J; Nepal, Madhav P
2017-08-01
We report data associated with the identification of 242 disease resistance genes (R-genes) in the genome of Setaria italica as presented in "Genetic diversity of disease resistance genes in foxtail millet ( Setaria italica L.)" (Andersen and Nepal, 2017) [1]. Our data describe the structure and evolution of the Coiled-coil, Nucleotide-binding site, Leucine-rich repeat (CNL) R-genes in foxtail millet. The CNL genes were identified through rigorous extraction and analysis of recently available plant genome sequences using cutting-edge analytical software. Data visualization includes gene structure diagrams, chromosomal syntenic maps, a chromosomal density plot, and a maximum-likelihood phylogenetic tree comparing Sorghum bicolor , Panicum virgatum , Setaria italica , and Arabidopsis thaliana . Compilation of InterProScan annotations, Gene Ontology (GO) annotations, and Basic Local Alignment Search Tool (BLAST) results for the 242 R-genes identified in the foxtail millet genome are also included in tabular format.
Edenberg, Howard J; Foroud, Tatiana
2014-01-01
Multiple lines of evidence strongly indicate that genetic factors contribute to the risk for alcohol use disorders (AUD). There is substantial heterogeneity in AUD, which complicates studies seeking to identify specific genetic factors. To identify these genetic effects, several different alcohol-related phenotypes have been analyzed, including diagnosis and quantitative measures related to AUDs. Study designs have used candidate gene analyses, genetic linkage studies, genomewide association studies (GWAS), and analyses of rare variants. Two genes that encode enzymes of alcohol metabolism have the strongest effect on AUD: aldehyde dehydrogenase 2 and alcohol dehydrogenase 1B each has strongly protective variants that reduce risk, with odds ratios approximately 0.2-0.4. A number of other genes important in AUD have been identified and replicated, including GABRA2 and alcohol dehydrogenases 1B and 4. GWAS have identified additional candidates. Rare variants are likely also to play a role; studies of these are just beginning. A multifaceted approach to gene identification, targeting both rare and common variations and assembling much larger datasets for meta-analyses, is critical for identifying the key genes and pathways important in AUD. © 2014 Elsevier B.V. All rights reserved.
2011-01-01
Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the genes identified are known to be up-regulated in response to osmotic stress in pine and other plant species and encode proteins involved in both signal transduction and stress tolerance. Gene expression levels returned to control values within a 48-hour recovery period in all but 76 transcripts. Correlation network analysis indicates a scale-free network topology for the pine root transcriptome and identifies central nodes that may serve as drivers of drought-responsive transcriptome dynamics in the roots of loblolly pine. PMID:21609476
Mutation spectrum of Chinese patients with Bartter syndrome.
Han, Yue; Lin, Yi; Sun, Qing; Wang, Shujuan; Gao, Yanxia; Shao, Leping
2017-11-24
Bartter syndrome (BS) has been rarely reported in Chinese population except for a few case reports. This investigation was aimed to analyze the mutations of the causal genes in sixteen Chinese patients with BS, and review their followup and treatment. Identify mutations by the next generation sequencing and the multiplex ligation-dependent probe amplification (MLPA). Clinical characteristics and biochemical findings at the first presentation as well as follow-up were reviewed. 15 different CLCNKB gene mutations were identified in fourteen patients with BS, including 11 novel ones. A novel missense mutation and a novel small deletion were found from SLC12A1 gene. A novel gross deletion was found in CLCNKA gene. A recurrent missense mutation was identified from BSND gene. We found that the whole gene deletion mutation of CLCNKB gene was the most frequent mutation (32%), and the rate of gross deletion was up to 50 percent in this group of Chinese patients. The present study has found 19 mutations, including 14 novel ones, which would enrich the human gene mutation database (HGMD) and provide valuable references to the genetic counseling and diagnosis of the Chinese population.
The Essential Genome of Escherichia coli K-12
2018-01-01
ABSTRACT Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. PMID:29463657
Gibbons, John G.; Beauvais, Anne; Beau, Remi; McGary, Kriston L.
2012-01-01
Aspergillus fumigatus is the most common and deadly pulmonary fungal infection worldwide. In the lung, the fungus usually forms a dense colony of filaments embedded in a polymeric extracellular matrix. To identify candidate genes involved in this biofilm (BF) growth, we used RNA-Seq to compare the transcriptomes of BF and liquid plankton (PL) growth. Sequencing and mapping of tens of millions sequence reads against the A. fumigatus transcriptome identified 3,728 differentially regulated genes in the two conditions. Although many of these genes, including the ones coding for transcription factors, stress response, the ribosome, and the translation machinery, likely reflect the different growth demands in the two conditions, our experiment also identified hundreds of candidate genes for the observed differences in morphology and pathobiology between BF and PL. We found an overrepresentation of upregulated genes in transport, secondary metabolism, and cell wall and surface functions. Furthermore, upregulated genes showed significant spatial structure across the A. fumigatus genome; they were more likely to occur in subtelomeric regions and colocalized in 27 genomic neighborhoods, many of which overlapped with known or candidate secondary metabolism gene clusters. We also identified 1,164 genes that were downregulated. This gene set was not spatially structured across the genome and was overrepresented in genes participating in primary metabolic functions, including carbon and amino acid metabolism. These results add valuable insight into the genetics of biofilm formation in A. fumigatus and other filamentous fungi and identify many relevant, in the context of biofilm biology, candidate genes for downstream functional experiments. PMID:21724936
Guo, Zhiqiang; Zhao, Chuncheng; Wang, Zheng
2014-09-26
To identify critical genes and biological pathways in acute lung injury (ALI), a comparative analysis of gene expression profiles of patients with ALI + sepsis compared with patients with sepsis alone were performed with bioinformatic tools. GSE10474 was downloaded from Gene Expression Omnibus, including a collective of 13 whole blood samples with ALI + sepsis and 21 whole blood samples with sepsis alone. After pre-treatment with robust multichip averaging (RMA) method, differential analysis was conducted using simpleaffy package based upon t-test and fold change. Hierarchical clustering was also performed using function hclust from package stats. Beisides, functional enrichment analysis was conducted using iGepros. Moreover, the gene regulatory network was constructed with information from Kyoto Encyclopedia of Genes and Genomes (KEGG) and then visualized by Cytoscape. A total of 128 differentially expressed genes (DEGs) were identified, including 47 up- and 81 down-regulated genes. The significantly enriched functions included negative regulation of cell proliferation, regulation of response to stimulus and cellular component morphogenesis. A total of 27 DEGs were significantly enriched in 16 KEGG pathways, such as protein digestion and absorption, fatty acid metabolism, amoebiasis, etc. Furthermore, the regulatory network of these 27 DEGs was constructed, which involved several key genes, including protein tyrosine kinase 2 (PTK2), v-src avian sarcoma (SRC) and Caveolin 2 (CAV2). PTK2, SRC and CAV2 may be potential markers for diagnosis and treatment of ALI. The virtual slide(s) for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/5865162912987143.
Mattison, Christopher P; Rai, Ruhi; Settlage, Robert E; Hinchliffe, Doug J; Madison, Crista; Bland, John M; Brashear, Suzanne; Graham, Charles J; Tarver, Matthew R; Florane, Christopher; Bechtel, Peter J
2017-02-22
The pecan nut is a nutrient-rich part of a healthy diet full of beneficial fatty acids and antioxidants, but can also cause allergic reactions in people suffering from food allergy to the nuts. The transcriptome of a developing pecan nut was characterized to identify the gene expression occurring during the process of nut development and to highlight those genes involved in fatty acid metabolism and those that commonly act as food allergens. Pecan samples were collected at several time points during the embryo development process including the water, gel, dough, and mature nut stages. Library preparation and sequencing were performed using Illumina-based mRNA HiSeq with RNA from four time points during the growing season during August and September 2012. Sequence analysis with Trinotate software following the Trinity protocol identified 133,000 unigenes with 52,267 named transcripts and 45,882 annotated genes. A total of 27,312 genes were defined by GO annotation. Gene expression clustering analysis identified 12 different gene expression profiles, each containing a number of genes. Three pecan seed storage proteins that commonly act as allergens, Car i 1, Car i 2, and Car i 4, were significantly up-regulated during the time course. Up-regulated fatty acid metabolism genes that were identified included acyl-[ACP] desaturase and omega-6 desaturase genes involved in oleic and linoleic acid metabolism. Notably, a few of the up-regulated acyl-[ACP] desaturase and omega-6 desaturase genes that were identified have expression patterns similar to the allergen genes based upon gene expression clustering and qPCR analysis. These findings suggest the possibility of coordinated accumulation of lipids and allergens during pecan nut embryogenesis.
Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors
Bii, Victor M.; Trobridge, Grant D.
2016-01-01
Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types. PMID:27792127
Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.
2016-01-01
Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499
Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G
2016-04-05
Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.
NGS testing for cardiomyopathy: Utility of adding RASopathy-associated genes.
Ceyhan-Birsoy, Ozge; Miatkowski, Maya M; Hynes, Elizabeth; Funke, Birgit H; Mason-Suares, Heather
2018-04-25
RASopathies include a group of syndromes caused by pathogenic germline variants in RAS-MAPK pathway genes and typically present with facial dysmorphology, cardiovascular disease, and musculoskeletal anomalies. Recently, variants in RASopathy-associated genes have been reported in individuals with apparently nonsyndromic cardiomyopathy, suggesting that subtle features may be overlooked. To determine the utility and burden of adding RASopathy-associated genes to cardiomyopathy panels, we tested 11 RASopathy-associated genes by next-generation sequencing (NGS), including NGS-based copy number variant assessment, in 1,111 individuals referred for genetic testing for hypertrophic cardiomyopathy (HCM) or dilated cardiomyopathy (DCM). Disease-causing variants were identified in 0.6% (four of 692) of individuals with HCM, including three missense variants in the PTPN11, SOS1, and BRAF genes. Overall, 36 variants of uncertain significance (VUSs) were identified, averaging ∼3VUSs/100 cases. This study demonstrates that adding a subset of the RASopathy-associated genes to cardiomyopathy panels will increase clinical diagnoses without significantly increasing the number of VUSs/case. © 2018 Wiley Periodicals, Inc.
Wilkinson, J R; Yu, J; Abbas, H K; Scheffler, B E; Kim, H S; Nierman, W C; Bhatnagar, D; Cleveland, T E
2007-10-01
Aflatoxins are toxic and carcinogenic polyketide metabolites produced by fungal species, including Aspergillus flavus and A. parasiticus. The biosynthesis of aflatoxins is modulated by many environmental factors, including the availability of a carbon source. The gene expression profile of A. parasiticus was evaluated during a shift from a medium with low concentration of simple sugars, yeast extract (YE), to a similar medium with sucrose, yeast extract sucrose (YES). Gene expression and aflatoxins (B1, B2, G1, and G2) were quantified from fungal mycelia harvested pre- and post-shifting. When compared with YE media, YES caused temporary reduction of the aflatoxin levels detected at 3-h post-shifting and they remained low well past 12 h post-shift. Aflatoxin levels did not exceed the levels in YE until 24 h post-shift, at which time point a tenfold increase was observed over YE. Microarray analysis comparing the RNA samples from the 48-h YE culture to the YES samples identified a total of 2120 genes that were expressed across all experiments, including most of the aflatoxin biosynthesis genes. One-way analysis of variance (ANOVA) identified 56 genes that were expressed with significant variation across all time points. Three genes responsible for converting norsolorinic acid to averantin were identified among these significantly expressed genes. The potential involvement of these genes in the regulation of aflatoxin biosynthesis is discussed.
Saltatory Evolution of the Ectodermal Neural Cortex Gene Family at the Vertebrate Origin
Feiner, Nathalie; Murakami, Yasunori; Breithut, Lisa; Mazan, Sylvie; Meyer, Axel; Kuraku, Shigehiro
2013-01-01
The ectodermal neural cortex (ENC) gene family, whose members are implicated in neurogenesis, is part of the kelch repeat superfamily. To date, ENC genes have been identified only in osteichthyans, although other kelch repeat-containing genes are prevalent throughout bilaterians. The lack of elaborate molecular phylogenetic analysis with exhaustive taxon sampling has obscured the possible link of the establishment of this gene family with vertebrate novelties. In this study, we identified ENC homologs in diverse vertebrates by means of database mining and polymerase chain reaction screens. Our analysis revealed that the ENC3 ortholog was lost in the basal eutherian lineage through single-gene deletion and that the triplication between ENC1, -2, and -3 occurred early in vertebrate evolution. Including our original data on the catshark and the zebrafish, our comparison revealed high conservation of the pleiotropic expression pattern of ENC1 and shuffling of expression domains between ENC1, -2, and -3. Compared with many other gene families including developmental key regulators, the ENC gene family is unique in that conventional molecular phylogenetic inference could identify no obvious invertebrate ortholog. This suggests a composite nature of the vertebrate-specific gene repertoire, consisting not only of de novo genes introduced at the vertebrate origin but also of long-standing genes with no apparent invertebrate orthologs. Some of the latter, including the ENC gene family, may be too rapidly evolving to provide sufficient phylogenetic signals marking orthology to their invertebrate counterparts. Such gene families that experienced saltatory evolution likely remain to be explored and might also have contributed to phenotypic evolution of vertebrates. PMID:23843192
Li, Angsheng; Yin, Xianchen; Pan, Yicheng
2016-01-01
In this study, we propose a method for constructing cell sample networks from gene expression profiles, and a structural entropy minimisation principle for detecting natural structure of networks and for identifying cancer cell subtypes. Our method establishes a three-dimensional gene map of cancer cell types and subtypes. The identified subtypes are defined by a unique gene expression pattern, and a three-dimensional gene map is established by defining the unique gene expression pattern for each identified subtype for cancers, including acute leukaemia, lymphoma, multi-tissue, lung cancer and healthy tissue. Our three-dimensional gene map demonstrates that a true tumour type may be divided into subtypes, each defined by a unique gene expression pattern. Clinical data analyses demonstrate that most cell samples of an identified subtype share similar survival times, survival indicators and International Prognostic Index (IPI) scores and indicate that distinct subtypes identified by our algorithms exhibit different overall survival times, survival ratios and IPI scores. Our three-dimensional gene map establishes a high-definition, one-to-one map between the biologically and medically meaningful tumour subtypes and the gene expression patterns, and identifies remarkable cells that form singleton submodules. PMID:26842724
Gu, Xiao-Cui; Zhang, Ya-Nan; Kang, Ke; Dong, Shuang-Lin; Zhang, Long-Wa
2015-01-01
The red turpentine beetle (RTB), Dendroctonus valens LeConte (Coleoptera: Curculionidae, Scolytinae), is a destructive invasive pest of conifers which has become the second most important forest pest nationwide in China. Dendroctonus valens is known to use host odors and aggregation pheromones, as well as non-host volatiles, in host location and mass-attack modulation, and thus antennal olfaction is of the utmost importance for the beetles' survival and fitness. However, information on the genes underlying olfaction has been lacking in D. valens. Here, we report the antennal transcriptome of D. valens from next-generation sequencing, with the goal of identifying the olfaction gene repertoire that is involved in D. valens odor-processing. We obtained 51 million reads that were assembled into 61,889 genes, including 39,831 contigs and 22,058 unigenes. In total, we identified 68 novel putative odorant reception genes, including 21 transcripts encoding for putative odorant binding proteins (OBP), six chemosensory proteins (CSP), four sensory neuron membrane proteins (SNMP), 22 odorant receptors (OR), four gustatory receptors (GR), three ionotropic receptors (IR), and eight ionotropic glutamate receptors. We also identified 155 odorant/xenobiotic degradation enzymes from the antennal transcriptome, putatively identified to be involved in olfaction processes including cytochrome P450s, glutathione-S-transferases, and aldehyde dehydrogenase. Predicted protein sequences were compared with counterparts in Tribolium castaneum, Megacyllene caryae, Ips typographus, Dendroctonus ponderosae, and Agrilus planipennis. The antennal transcriptome described here represents the first study of the repertoire of odor processing genes in D. valens. The genes reported here provide a significant addition to the pool of identified olfactory genes in Coleoptera, which might represent novel targets for insect management. The results from our study also will assist with evolutionary analyses of coleopteran olfaction.
Dong, Shuang-Lin; Zhang, Long-Wa
2015-01-01
Background The red turpentine beetle (RTB), Dendroctonus valens LeConte (Coleoptera: Curculionidae, Scolytinae), is a destructive invasive pest of conifers which has become the second most important forest pest nationwide in China. Dendroctonus valens is known to use host odors and aggregation pheromones, as well as non-host volatiles, in host location and mass-attack modulation, and thus antennal olfaction is of the utmost importance for the beetles’ survival and fitness. However, information on the genes underlying olfaction has been lacking in D. valens. Here, we report the antennal transcriptome of D. valens from next-generation sequencing, with the goal of identifying the olfaction gene repertoire that is involved in D. valens odor-processing. Results We obtained 51 million reads that were assembled into 61,889 genes, including 39,831 contigs and 22,058 unigenes. In total, we identified 68 novel putative odorant reception genes, including 21 transcripts encoding for putative odorant binding proteins (OBP), six chemosensory proteins (CSP), four sensory neuron membrane proteins (SNMP), 22 odorant receptors (OR), four gustatory receptors (GR), three ionotropic receptors (IR), and eight ionotropic glutamate receptors. We also identified 155 odorant/xenobiotic degradation enzymes from the antennal transcriptome, putatively identified to be involved in olfaction processes including cytochrome P450s, glutathione-S-transferases, and aldehyde dehydrogenase. Predicted protein sequences were compared with counterparts in Tribolium castaneum, Megacyllene caryae, Ips typographus, Dendroctonus ponderosae, and Agrilus planipennis. Conclusion The antennal transcriptome described here represents the first study of the repertoire of odor processing genes in D. valens. The genes reported here provide a significant addition to the pool of identified olfactory genes in Coleoptera, which might represent novel targets for insect management. The results from our study also will assist with evolutionary analyses of coleopteran olfaction. PMID:25938508
Schwab, Stefan; Ramos, Humberto J; Souza, Emanuel M; Pedrosa, Fábio O; Yates, Marshall G; Chubatsu, Leda S; Rigo, Liu U
2007-05-01
Random mutagenesis using transposons with promoterless reporter genes has been widely used to examine differential gene expression patterns in bacteria. Using this approach, we have identified 26 genes of the endophytic nitrogen-fixing bacterium Herbaspirillum seropedicae regulated in response to ammonium content in the growth medium. These include nine genes involved in the transport of nitrogen compounds, such as the high-affinity ammonium transporter AmtB, and uptake systems for alternative nitrogen sources; nine genes coding for proteins responsible for restoring intracellular ammonium levels through enzymatic reactions, such as nitrogenase, amidase, and arginase; and a third group includes metabolic switch genes, coding for sensor kinases or transcription regulation factors, whose role in metabolism was previously unknown. Also, four genes identified were of unknown function. This paper describes their involvement in response to ammonium limitation. The results provide a preliminary profile of the metabolic response of Herbaspirillum seropedicae to ammonium stress.
Selection signatures in Shetland ponies.
Frischknecht, M; Flury, C; Leeb, T; Rieder, S; Neuditschko, M
2016-06-01
Shetland ponies were selected for numerous traits including small stature, strength, hardiness and longevity. Despite the different selection criteria, Shetland ponies are well known for their small stature. We performed a selection signature analysis including genome-wide SNPs of 75 Shetland ponies and 76 large-sized horses. Based upon this dataset, we identified a selection signature on equine chromosome (ECA) 1 between 103.8 Mb and 108.5 Mb. A total of 33 annotated genes are located within this interval including the IGF1R gene at 104.2 Mb and the ADAMTS17 gene at 105.4 Mb. These two genes are well known to have a major impact on body height in numerous species including humans. Homozygosity mapping in the Shetland ponies identified a region with increased homozygosity between 107.4 Mb and 108.5 Mb. None of the annotated genes in this region have so far been associated with height. Thus, we cannot exclude the possibility that the identified selection signature on ECA1 is associated with some trait other than height, for which Shetland ponies were selected. © 2016 Stichting International Foundation for Animal Genetics.
Ingram, Jennifer L; Antao-Menezes, Aurita; Turpin, Elizabeth A; Wallace, Duncan G; Mangum, James B; Pluta, Linda J; Thomas, Russell S; Bonner, James C
2007-01-01
Background Exposure to vanadium pentoxide (V2O5) is a cause of occupational bronchitis. We evaluated gene expression profiles in cultured human lung fibroblasts exposed to V2O5 in vitro in order to identify candidate genes that could play a role in inflammation, fibrosis, and repair during the pathogenesis of V2O5-induced bronchitis. Methods Normal human lung fibroblasts were exposed to V2O5 in a time course experiment. Gene expression was measured at various time points over a 24 hr period using the Affymetrix Human Genome U133A 2.0 Array. Selected genes that were significantly changed in the microarray experiment were validated by RT-PCR. Results V2O5 altered more than 1,400 genes, of which ~300 were induced while >1,100 genes were suppressed. Gene ontology categories (GO) categories unique to induced genes included inflammatory response and immune response, while GO catogories unique to suppressed genes included ubiquitin cycle and cell cycle. A dozen genes were validated by RT-PCR, including growth factors (HBEGF, VEGF, CTGF), chemokines (IL8, CXCL9, CXCL10), oxidative stress response genes (SOD2, PIPOX, OXR1), and DNA-binding proteins (GAS1, STAT1). Conclusion Our study identified a variety of genes that could play pivotal roles in inflammation, fibrosis and repair during V2O5-induced bronchitis. The induction of genes that mediate inflammation and immune responses, as well as suppression of genes involved in growth arrest appear to be important to the lung fibrotic reaction to V2O5. PMID:17459161
Genome complexity in the coelacanth is reflected in its adaptive immune system
Saha, Nil Ratan; Ota, Tatsuya; Litman, Gary W.; Hansen, John; Parra, Zuly; Hsu, Ellen; Buonocore, Francesco; Canapa, Adriana; Cheng, Jan-Fang; Amemiya, Chris T.
2014-01-01
We have analyzed the available genome and transcriptome resources from the coelacanth in order to characterize genes involved in adaptive immunity. Two highly distinctive IgW-encoding loci have been identified that exhibit a unique genomic organization, including a multiplicity of tandemly repeated constant region exons. The overall organization of the IgW loci precludes typical heavy chain class switching. A locus encoding IgM could not be identified either computationally or by using several different experimental strategies. Four distinct sets of genes encoding Ig light chains were identified. This includes a variant sigma-type Ig light chain previously identified only in cartilaginous fishes and which is now provisionally denoted sigma-2. Genes encoding α/β and γ/δ T-cell receptors, and CD3, CD4, and CD8 co-receptors also were characterized. Ig heavy chain variable region genes and TCR components are interspersed within the TCR α/δ locus; this organization previously was reported only in tetrapods and raises questions regarding evolution and functional cooption of genes encoding variable regions. The composition, organization and syntenic conservation of the major histocompatibility complex locus have been characterized. We also identified large numbers of genes encoding cytokines and their receptors, and other genes associated with adaptive immunity. In terms of sequence identity and organization, the adaptive immune genes of the coelacanth more closely resemble orthologous genes in tetrapods than those in teleost fishes, consistent with current phylogenomic interpretations. Overall, the work reported described herein highlights the complexity inherent in the coelacanth genome and provides a rich catalog of immune genes for future investigations.
Savige, Judy; Dagher, Hayat; Povey, Sue
2014-07-01
This study examined whether gene-specific DNA variant databases for inherited diseases of the kidney fulfilled the Human Variome Project recommendations of being complete, accurate, clinically relevant and freely available. A recent review identified 60 inherited renal diseases caused by mutations in 132 genes. The disease name, MIM number, gene name, together with "mutation" or "database," were used to identify web-based databases. Fifty-nine diseases (98%) due to mutations in 128 genes had a variant database. Altogether there were 349 databases (a median of 3 per gene, range 0-6), but no gene had two databases with the same number of variants, and 165 (50%) databases included fewer than 10 variants. About half the databases (180, 54%) had been updated in the previous year. Few (77, 23%) were curated by "experts" but these included nine of the 11 with the most variants. Even fewer databases (41, 12%) included clinical features apart from the name of the associated disease. Most (223, 67%) could be accessed without charge, including those for 50 genes (40%) with the maximum number of variants. Future efforts should focus on encouraging experts to collaborate on a single database for each gene affected in inherited renal disease, including both unpublished variants, and clinical phenotypes. © 2014 WILEY PERIODICALS, INC.
Microarray analysis reveals key genes and pathways in Tetralogy of Fallot
He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai
2017-01-01
The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test, and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, which were all comprised of upregulated genes. Genes enriched in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were dominantly enriched in pathways associated with ribosomes and protein translation. The Xbox binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF. PMID:28713939
Llera-Herrera, Raúl; García-Gasca, Alejandra; Abreu-Goodger, Cei; Huvet, Arnaud; Ibarra, Ana M.
2013-01-01
Despite the great advances in sequencing technologies, genomic and transcriptomic information for marine non-model species with ecological, evolutionary, and economical interest is still scarce. In this work we aimed to identify genes expressed during spermatogenesis in the functional hermaphrodite scallop Nodipecten subnodosus (Mollusca: Bivalvia: Pectinidae), with the purpose of obtaining a panel of genes that would allow for the study of differentially transcribed genes between diploid and triploid scallops in the context of meiotic arrest and reproductive sterility. Because our aim was to isolate genes involved in meiosis and other testis maturation-related processes, we generated suppressive subtractive hybridization libraries of testis vs. inactive gonad. We obtained 352 and 177 ESTs by clone sequencing, and using pyrosequencing (454-Roche) we maximized the identified ESTs to 34,276 reads. A total of 1,153 genes from the testis library had a blastx hit and GO annotation, including genes specific for meiosis, spermatogenesis, sex-differentiation, and transposable elements. Some of the identified meiosis genes function in chromosome pairing (scp2, scp3), recombination and DNA repair (dmc1, rad51, ccnb1ip1/hei10), and meiotic checkpoints (rad1, hormad1, dtl/cdt2). Gene expression analyses in different gametogenic stages in both sexual regions of the gonad of meiosis genes confirmed that the expression was specific or increased towards the maturing testis. Spermatogenesis genes included known testis-specific ones (kelch-10, shippo1, adad1), with some of these known to be associated to sterility. Sex differentiation genes included one of the most conserved genes at the bottom of the sex-determination cascade (dmrt1). Transcript from transposable elements, reverse transcriptase, and transposases in this library evidenced that transposition is an active process during spermatogenesis in N. subnodosus. In relation to the inactive library, we identified 833 transcripts with functional annotation related to activation of the transcription and translation machinery, as well as to germline control and maintenance. PMID:24066034
Uddin, Raihan; Singh, Shiva M.
2017-01-01
As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in “learning and memory” related functions and pathways. Subsequent differential network analysis of this “learning and memory” module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning. PMID:29066959
Uddin, Raihan; Singh, Shiva M
2017-01-01
As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they provide a new insight and generate new hypotheses into the molecular mechanisms responsible for age associated learning impairment, including spatial learning.
Analysis of blood-based gene expression in idiopathic Parkinson disease.
Shamir, Ron; Klein, Christine; Amar, David; Vollstedt, Eva-Juliane; Bonin, Michael; Usenovic, Marija; Wong, Yvette C; Maver, Ales; Poths, Sven; Safer, Hershel; Corvol, Jean-Christophe; Lesage, Suzanne; Lavi, Ofer; Deuschl, Günther; Kuhlenbaeumer, Gregor; Pawlack, Heike; Ulitsky, Igor; Kasten, Meike; Riess, Olaf; Brice, Alexis; Peterlin, Borut; Krainc, Dimitri
2017-10-17
To examine whether gene expression analysis of a large-scale Parkinson disease (PD) patient cohort produces a robust blood-based PD gene signature compared to previous studies that have used relatively small cohorts (≤220 samples). Whole-blood gene expression profiles were collected from a total of 523 individuals. After preprocessing, the data contained 486 gene profiles (n = 205 PD, n = 233 controls, n = 48 other neurodegenerative diseases) that were partitioned into training, validation, and independent test cohorts to identify and validate a gene signature. Batch-effect reduction and cross-validation were performed to ensure signature reliability. Finally, functional and pathway enrichment analyses were applied to the signature to identify PD-associated gene networks. A gene signature of 100 probes that mapped to 87 genes, corresponding to 64 upregulated and 23 downregulated genes differentiating between patients with idiopathic PD and controls, was identified with the training cohort and successfully replicated in both an independent validation cohort (area under the curve [AUC] = 0.79, p = 7.13E-6) and a subsequent independent test cohort (AUC = 0.74, p = 4.2E-4). Network analysis of the signature revealed gene enrichment in pathways, including metabolism, oxidation, and ubiquitination/proteasomal activity, and misregulation of mitochondria-localized genes, including downregulation of COX4I1 , ATP5A1 , and VDAC3 . We present a large-scale study of PD gene expression profiling. This work identifies a reliable blood-based PD signature and highlights the importance of large-scale patient cohorts in developing potential PD biomarkers. © 2017 American Academy of Neurology.
Ping, Yanyan; Deng, Yulan; Wang, Li; Zhang, Hongyi; Zhang, Yong; Xu, Chaohan; Zhao, Hongying; Fan, Huihui; Yu, Fulong; Xiao, Yun; Li, Xia
2015-01-01
The driver genetic aberrations collectively regulate core cellular processes underlying cancer development. However, identifying the modules of driver genetic alterations and characterizing their functional mechanisms are still major challenges for cancer studies. Here, we developed an integrative multi-omics method CMDD to identify the driver modules and their affecting dysregulated genes through characterizing genetic alteration-induced dysregulated networks. Applied to glioblastoma (GBM), the CMDD identified a core gene module of 17 genes, including seven known GBM drivers, and their dysregulated genes. The module showed significant association with shorter survival of GBM. When classifying driver genes in the module into two gene sets according to their genetic alteration patterns, we found that one gene set directly participated in the glioma pathway, while the other indirectly regulated the glioma pathway, mostly, via their dysregulated genes. Both of the two gene sets were significant contributors to survival and helpful for classifying GBM subtypes, suggesting their critical roles in GBM pathogenesis. Also, by applying the CMDD to other six cancers, we identified some novel core modules associated with overall survival of patients. Together, these results demonstrate integrative multi-omics data can identify driver modules and uncover their dysregulated genes, which is useful for interpreting cancer genome. PMID:25653168
Ming, De-Song; Chen, Qing-Qing; Chen, Xiao-Tin
2018-05-14
To clarify the resistance mechanisms of Pannonibacter phragmitetus 31801, isolated from the blood of a liver abscess patient, at the genomic level, we performed whole genomic sequencing using a PacBio RS II single-molecule real-time long-read sequencer. Bioinformatic analysis of the resulting sequence was then carried out to identify any possible resistance genes. Analyses included Basic Local Alignment Search Tool searches against the Antibiotic Resistance Genes Database, ResFinder analysis of the genome sequence, and Resistance Gene Identifier analysis within the Comprehensive Antibiotic Resistance Database. Prophages, clustered regularly interspaced short palindromic repeats (CRISPR), and other putative virulence factors were also identified using PHAST, CRISPRfinder, and the Virulence Factors Database, respectively. The circular chromosome and single plasmid of P. phragmitetus 31801 contained multiple antibiotic resistance genes, including those coding for three different types of β-lactamase [NPS β-lactamase (EC 3.5.2.6), β-lactamase class C, and a metal-dependent hydrolase of β-lactamase superfamily I]. In addition, genes coding for subunits of several multidrug-resistance efflux pumps were identified, including those targeting macrolides (adeJ, cmeB), tetracycline (acrB, adeAB), fluoroquinolones (acrF, ceoB), and aminoglycosides (acrD, amrB, ceoB, mexY, smeB). However, apart from the tripartite macrolide efflux pump macAB-tolC, the genome did not appear to contain the complete complement of subunit genes required for production of most of the major multidrug-resistance efflux pumps.
Poole, William; Leinonen, Kalle; Shmulevich, Ilya
2017-01-01
Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390
Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady
2017-02-01
Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.
A Systems Toxicology Approach Reveals Biological Pathways Dysregulated by Prenatal Arsenic Exposure
Laine, Jessica E.; Fry, Rebecca C.
2016-01-01
BACKGROUND Prenatal exposure to inorganic arsenic (iAs) is associated with dysregulated gene and protein expression in the fetus, both evident at birth. Potential epigenetic mechanisms that underlie these changes include but are not limited to the methylation of cytosines (CpG). OBJECTIVE The aim of the present study was to compile datasets from studies on prenatal arsenic exposure to identify whether key genes, proteins, or both and their associated biological pathways are perturbed. METHODS We compiled datasets from 12 studies that analyzed the relationship between prenatal iAs exposure and fetal changes to the epigenome (5-methyl cytosine), transcriptome (mRNA expression), and/or proteome (protein expression changes). FINDINGS Across the 12 studies, a set of 845 unique genes was identified and found to enrich for their role in biological pathways, including those signaled by peroxisome proliferator-activated receptor, nuclear factor of kappa light polypeptide gene enhancer in B-cells inhibitor, and the glucocorticoid receptor. Tumor necrosis factor was identified as a putative cellular regulator underlying most (n = 277) of the identified iAs-associated genes or proteins. CONCLUSIONS Given their common identification across numerous human cohorts and their known toxicologic role in disease, the identified genes and pathways may underlie altered disease susceptibility associated with prenatal exposure to iAs. PMID:27325076
The WRKY transcription factor family and senescence in switchgrass.
Rinerson, Charles I; Scully, Erin D; Palmer, Nathan A; Donze-Reiner, Teresa; Rabara, Roel C; Tripathi, Prateek; Shen, Qingxi J; Sattler, Scott E; Rohila, Jai S; Sarath, Gautam; Rushton, Paul J
2015-11-09
Early aerial senescence in switchgrass (Panicum virgatum) can significantly limit biomass yields. WRKY transcription factors that can regulate senescence could be used to reprogram senescence and enhance biomass yields. All potential WRKY genes present in the version 1.0 of the switchgrass genome were identified and curated using manual and bioinformatic methods. Expression profiles of WRKY genes in switchgrass flag leaf RNA-Seq datasets were analyzed using clustering and network analyses tools to identify both WRKY and WRKY-associated gene co-expression networks during leaf development and senescence onset. We identified 240 switchgrass WRKY genes including members of the RW5 and RW6 families of resistance proteins. Weighted gene co-expression network analysis of the flag leaf transcriptomes across development readily separated clusters of co-expressed genes into thirteen modules. A visualization highlighted separation of modules associated with the early and senescence-onset phases of flag leaf growth. The senescence-associated module contained 3000 genes including 23 WRKYs. Putative promoter regions of senescence-associated WRKY genes contained several cis-element-like sequences suggestive of responsiveness to both senescence and stress signaling pathways. A phylogenetic comparison of senescence-associated WRKY genes from switchgrass flag leaf with senescence-associated WRKY genes from other plants revealed notable hotspots in Group I, IIb, and IIe of the phylogenetic tree. We have identified and named 240 WRKY genes in the switchgrass genome. Twenty three of these genes show elevated mRNA levels during the onset of flag leaf senescence. Eleven of the WRKY genes were found in hotspots of related senescence-associated genes from multiple species and thus represent promising targets for future switchgrass genetic improvement. Overall, individual WRKY gene expression profiles could be readily linked to developmental stages of flag leaves.
Gupta, Vikas; Estrada, April D; Blakley, Ivory; Reid, Rob; Patel, Ketan; Meyer, Mason D; Andersen, Stig Uggerhøj; Brown, Allan F; Lila, Mary Ann; Loraine, Ann E
2015-01-01
Blueberries are a rich source of antioxidants and other beneficial compounds that can protect against disease. Identifying genes involved in synthesis of bioactive compounds could enable the breeding of berry varieties with enhanced health benefits. Toward this end, we annotated a previously sequenced draft blueberry genome assembly using RNA-Seq data from five stages of berry fruit development and ripening. Genome-guided assembly of RNA-Seq read alignments combined with output from ab initio gene finders produced around 60,000 gene models, of which more than half were similar to proteins from other species, typically the grape Vitis vinifera. Comparison of gene models to the PlantCyc database of metabolic pathway enzymes identified candidate genes involved in synthesis of bioactive compounds, including bixin, an apocarotenoid with potential disease-fighting properties, and defense-related cyanogenic glycosides, which are toxic. Cyanogenic glycoside (CG) biosynthetic enzymes were highly expressed in green fruit, and a candidate CG detoxification enzyme was up-regulated during fruit ripening. Candidate genes for ethylene, anthocyanin, and 400 other biosynthetic pathways were also identified. Homology-based annotation using Blast2GO and InterPro assigned Gene Ontology terms to around 15,000 genes. RNA-Seq expression profiling showed that blueberry growth, maturation, and ripening involve dynamic gene expression changes, including coordinated up- and down-regulation of metabolic pathway enzymes and transcriptional regulators. Analysis of RNA-seq alignments identified developmentally regulated alternative splicing, promoter use, and 3' end formation. We report genome sequence, gene models, functional annotations, and RNA-Seq expression data that provide an important new resource enabling high throughput studies in blueberry.
Li, Li; Brunk, Brian P.; Kissinger, Jessica C.; Pape, Deana; Tang, Keliang; Cole, Robert H.; Martin, John; Wylie, Todd; Dante, Mike; Fogarty, Steven J.; Howe, Daniel K.; Liberator, Paul; Diaz, Carmen; Anderson, Jennifer; White, Michael; Jerome, Maria E.; Johnson, Emily A.; Radke, Jay A.; Stoeckert, Christian J.; Waterston, Robert H.; Clifton, Sandra W.; Roos, David S.; Sibley, L. David
2003-01-01
Large-scale EST sequencing projects for several important parasites within the phylum Apicomplexa were undertaken for the purpose of gene discovery. Included were several parasites of medical importance (Plasmodium falciparum, Toxoplasma gondii) and others of veterinary importance (Eimeria tenella, Sarcocystis neurona, and Neospora caninum). A total of 55,192 ESTs, deposited into dbEST/GenBank, were included in the analyses. The resulting sequences have been clustered into nonredundant gene assemblies and deposited into a relational database that supports a variety of sequence and text searches. This database has been used to compare the gene assemblies using BLAST similarity comparisons to the public protein databases to identify putative genes. Of these new entries, ∼15%–20% represent putative homologs with a conservative cutoff of p < 10−9, thus identifying many conserved genes that are likely to share common functions with other well-studied organisms. Gene assemblies were also used to identify strain polymorphisms, examine stage-specific expression, and identify gene families. An interesting class of genes that are confined to members of this phylum and not shared by plants, animals, or fungi, was identified. These genes likely mediate the novel biological features of members of the Apicomplexa and hence offer great potential for biological investigation and as possible therapeutic targets. [The sequence data from this study have been submitted to dbEST division of GenBank under accession nos.: Toxoplasma gondii: –, –, –, –, – , –, –, –, –. Plasmodium falciparum: –, –, –, –. Sarcocystis neurona: , , , , , , , , , , , , , –, –, –, –, –. Eimeria tenella: –, –, –, –, –, –, –, –, – , –, –, –, –, –, –, –, –, –, –, –. Neospora caninum: –, –, , – , –, –.] PMID:12618375
Bae, Nancy S.; Seberg, Andrew P.; Carroll, Leslie P.; Swanson, Mark J.
2017-01-01
The yeast Saccharomyces cerevisiae responds to amino acid deprivation by activating a pathway conserved in eukaryotes to overcome the starvation stress. We have screened the entire yeast heterozygous deletion collection to identify strains haploinsufficient for growth in the presence of sulfometuron methyl, which causes starvation for isoleucine and valine. We have discovered that cells devoid of MET15 are sensitive to sulfometuron methyl, and loss of heterozygosity at the MET15 locus can complicate screening the heterozygous deletion collection. We identified 138 cases of loss of heterozygosity in this screen. After eliminating the issues of the MET15 loss of heterozygosity, strains isolated from the collection were retested on sulfometuron methyl. To determine the general effect of the mutations for a starvation response, SMM-sensitive strains were tested for the ability to grow in the presence of canavanine, which induces arginine starvation, and strains that were MET15 were also tested for growth in the presence of ethionine, which causes methionine starvation. Many of the genes identified in our study were not previously identified as starvation-responsive genes, including a number of essential genes that are not easily screened in a systematic way. The genes identified span a broad range of biological functions, including many involved in some level of gene expression. Several unnamed proteins have also been identified, giving a clue as to possible functions of the encoded proteins. PMID:28209762
Genome-wide screen identifies a novel prognostic signature for breast cancer survival
Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey; ...
2017-01-21
Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Genome-wide screen identifies a novel prognostic signature for breast cancer survival
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey
Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Neupane, Achal; Nepal, Madhav P.; Piya, Sarbottam; Subramanian, Senthil; Rohila, Jai S.; Reese, R. Neil; Benson, Benjamin V.
2013-01-01
Mitogen-activated protein kinase (MAPK) genes in eukaryotes regulate various developmental and physiological processes including those associated with biotic and abiotic stresses. Although MAPKs in some plant species including Arabidopsis have been identified, they are yet to be identified in soybean. Major objectives of this study were to identify GmMAPKs, assess their evolutionary relationships, and analyze their functional divergence. We identified a total of 38 MAPKs, eleven MAPKKs, and 150 MAPKKKs in soybean. Within the GmMAPK family, we also identified a new clade of six genes: four genes with TEY and two genes with TQY motifs requiring further investigation into possible legume-specific functions. The results indicated the expansion of the GmMAPK families attributable to the ancestral polyploidy events followed by chromosomal rearrangements. The GmMAPK and GmMAPKKK families were substantially larger than those in other plant species. The duplicated GmMAPK members presented complex evolutionary relationships and functional divergence when compared to their counterparts in Arabidopsis. We also highlighted existing nomenclatural issues, stressing the need for nomenclatural consistency. GmMAPK identification is vital to soybean crop improvement, and novel insights into the evolutionary relationships will enhance our understanding about plant genome evolution. PMID:24137047
Catto, James W F; Abbod, Maysam F; Wild, Peter J; Linkens, Derek A; Pilarsky, Christian; Rehman, Ishtiaq; Rosario, Derek J; Denzinger, Stefan; Burger, Maximilian; Stoehr, Robert; Knuechel, Ruth; Hartmann, Arndt; Hamdy, Freddie C
2010-03-01
New methods for identifying bladder cancer (BCa) progression are required. Gene expression microarrays can reveal insights into disease biology and identify novel biomarkers. However, these experiments produce large datasets that are difficult to interpret. To develop a novel method of microarray analysis combining two forms of artificial intelligence (AI): neurofuzzy modelling (NFM) and artificial neural networks (ANN) and validate it in a BCa cohort. We used AI and statistical analyses to identify progression-related genes in a microarray dataset (n=66 tumours, n=2800 genes). The AI-selected genes were then investigated in a second cohort (n=262 tumours) using immunohistochemistry. We compared the accuracy of AI and statistical approaches to identify tumour progression. AI identified 11 progression-associated genes (odds ratio [OR]: 0.70; 95% confidence interval [CI], 0.56-0.87; p=0.0004), and these were more discriminate than genes chosen using statistical analyses (OR: 1.24; 95% CI, 0.96-1.60; p=0.09). The expression of six AI-selected genes (LIG3, FAS, KRT18, ICAM1, DSG2, and BRCA2) was determined using commercial antibodies and successfully identified tumour progression (concordance index: 0.66; log-rank test: p=0.01). AI-selected genes were more discriminate than pathologic criteria at determining progression (Cox multivariate analysis: p=0.01). Limitations include the use of statistical correlation to identify 200 genes for AI analysis and that we did not compare regression identified genes with immunohistochemistry. AI and statistical analyses use different techniques of inference to determine gene-phenotype associations and identify distinct prognostic gene signatures that are equally valid. We have identified a prognostic gene signature whose members reflect a variety of carcinogenic pathways that could identify progression in non-muscle-invasive BCa. 2009 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Identification of transcriptional factors and key genes in primary osteoporosis by DNA microarray.
Xie, Wengui; Ji, Lixin; Zhao, Teng; Gao, Pengfei
2015-05-09
A number of genes have been identified to be related with primary osteoporosis while less is known about the comprehensive interactions between regulating genes and proteins. We aimed to identify the differentially expressed genes (DEGs) and regulatory effects of transcription factors (TFs) involved in primary osteoporosis. The gene expression profile GSE35958 was obtained from Gene Expression Omnibus database, including 5 primary osteoporosis and 4 normal bone tissues. The differentially expressed genes between primary osteoporosis and normal bone tissues were identified by the same package in R language. The TFs of these DEGs were predicted with the Essaghir A method. DAVID (The Database for Annotation, Visualization and Integrated Discovery) was applied to perform the GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment analysis of DEGs. After analyzing regulatory effects, a regulatory network was built between TFs and the related DEGs. A total of 579 DEGs was screened, including 310 up-regulated genes and 269 down-regulated genes in primary osteoporosis samples. In GO terms, more up-regulated genes were enriched in transcription regulator activity, and secondly in transcription factor activity. A total 10 significant pathways were enriched in KEGG analysis, including colorectal cancer, Wnt signaling pathway, Focal adhesion, and MAPK signaling pathway. Moreover, total 7 TFs were enriched, of which CTNNB1, SP1, and TP53 regulated most up-regulated DEGs. The discovery of the enriched TFs might contribute to the understanding of the mechanism of primary osteoporosis. Further research on genes and TFs related to the WNT signaling pathway and MAPK pathway is urgent for clinical diagnosis and directing treatment of primary osteoporosis.
Mutation spectrum of Chinese patients with Bartter syndrome
Han, Yue; Lin, Yi; Sun, Qing; Wang, Shujuan; Gao, Yanxia; Shao, Leping
2017-01-01
Objective Bartter syndrome (BS) has been rarely reported in Chinese population except for a few case reports. This investigation was aimed to analyze the mutations of the causal genes in sixteen Chinese patients with BS, and review their followup and treatment. Methods Identify mutations by the next generation sequencing and the multiplex ligation-dependent probe amplification (MLPA). Clinical characteristics and biochemical findings at the first presentation as well as follow-up were reviewed. Results 15 different CLCNKB gene mutations were identified in fourteen patients with BS, including 11 novel ones. A novel missense mutation and a novel small deletion were found from SLC12A1 gene. A novel gross deletion was found in CLCNKA gene. A recurrent missense mutation was identified from BSND gene. We found that the whole gene deletion mutation of CLCNKB gene was the most frequent mutation (32%), and the rate of gross deletion was up to 50 percent in this group of Chinese patients. Conclusion The present study has found 19 mutations, including 14 novel ones, which would enrich the human gene mutation database (HGMD) and provide valuable references to the genetic counseling and diagnosis of the Chinese population. PMID:29254190
Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma
Chambers, John C; Zhang, Weihua; Sehmi, Joban; Li, Xinzhong; Wass, Mark N; Van der Harst, Pim; Holm, Hilma; Sanna, Serena; Kavousi, Maryam; Baumeister, Sebastian E; Coin, Lachlan J; Deng, Guohong; Gieger, Christian; Heard-Costa, Nancy L; Hottenga, Jouke-Jan; Kühnel, Brigitte; Kumar, Vinod; Lagou, Vasiliki; Liang, Liming; Luan, Jian’an; Vidal, Pedro Marques; Leach, Irene Mateo; O’Reilly, Paul F; Peden, John F; Rahmioglu, Nilufer; Soininen, Pasi; Speliotes, Elizabeth K; Yuan, Xin; Thorleifsson, Gudmar; Alizadeh, Behrooz Z; Atwood, Larry D; Borecki, Ingrid B; Brown, Morris J; Charoen, Pimphen; Cucca, Francesco; Das, Debashish; de Geus, Eco J C; Dixon, Anna L; Döring, Angela; Ehret, Georg; Eyjolfsson, Gudmundur I; Farrall, Martin; Forouhi, Nita G; Friedrich, Nele; Goessling, Wolfram; Gudbjartsson, Daniel F; Harris, Tamara B; Hartikainen, Anna-Liisa; Heath, Simon; Hirschfield, Gideon M; Hofman, Albert; Homuth, Georg; Hyppönen, Elina; Janssen, Harry L A; Johnson, Toby; Kangas, Antti J; Kema, Ido P; Kühn, Jens P; Lai, Sandra; Lathrop, Mark; Lerch, Markus M; Li, Yun; Liang, T Jake; Lin, Jing-Ping; Loos, Ruth J F; Martin, Nicholas G; Moffatt, Miriam F; Montgomery, Grant W; Munroe, Patricia B; Musunuru, Kiran; Nakamura, Yusuke; O’Donnell, Christopher J; Olafsson, Isleifur; Penninx, Brenda W; Pouta, Anneli; Prins, Bram P; Prokopenko, Inga; Puls, Ralf; Ruokonen, Aimo; Savolainen, Markku J; Schlessinger, David; Schouten, Jeoffrey N L; Seedorf, Udo; Sen-Chowdhry, Srijita; Siminovitch, Katherine A; Smit, Johannes H; Spector, Timothy D; Tan, Wenting; Teslovich, Tanya M; Tukiainen, Taru; Uitterlinden, Andre G; Van der Klauw, Melanie M; Vasan, Ramachandran S; Wallace, Chris; Wallaschofski, Henri; Wichmann, H-Erich; Willemsen, Gonneke; Würtz, Peter; Xu, Chun; Yerges-Armstrong, Laura M; Abecasis, Goncalo R; Ahmadi, Kourosh R; Boomsma, Dorret I; Caulfield, Mark; Cookson, William O; van Duijn, Cornelia M; Froguel, Philippe; Matsuda, Koichi; McCarthy, Mark I; Meisinger, Christa; Mooser, Vincent; Pietiläinen, Kirsi H; Schumann, Gunter; Snieder, Harold; Sternberg, Michael J E; Stolk, Ronald P; Thomas, Howard C; Thorsteinsdottir, Unnur; Uda, Manuela; Waeber, Gérard; Wareham, Nicholas J; Waterworth, Dawn M; Watkins, Hugh; Whitfield, John B; Witteman, Jacqueline C M; Wolffenbuttel, Bruce H R; Fox, Caroline S; Ala-Korpela, Mika; Stefansson, Kari; Vollenweider, Peter; Völzke, Henry; Schadt, Eric E; Scott, James; Järvelin, Marjo-Riitta; Elliott, Paul; Kooner, Jaspal S
2012-01-01
Concentrations of liver enzymes in plasma are widely used as indicators of liver disease. We carried out a genome-wide association study in 61,089 individuals, identifying 42 loci associated with concentrations of liver enzymes in plasma, of which 32 are new associations (P = 10−8 to P = 10−190). We used functional genomic approaches including metabonomic profiling and gene expression analyses to identify probable candidate genes at these regions. We identified 69 candidate genes, including genes involved in biliary transport (ATP8B1 and ABCB11), glucose, carbohydrate and lipid metabolism (FADS1, FADS2, GCKR, JMJD1C, HNF1A, MLXIPL, PNPLA3, PPP1R3B, SLC2A2 and TRIB1), glycoprotein biosynthesis and cell surface glycobiology (ABO, ASGR1, FUT2, GPLD1 and ST3GAL4), inflammation and immunity (CD276, CDH6, GCKR, HNF1A, HPR, ITGA1, RORA and STAT4) and glutathione metabolism (GSTT1, GSTT2 and GGT), as well as several genes of uncertain or unknown function (including ABHD12, EFHD1, EFNA1, EPHA2, MICAL3 and ZNF827). Our results provide new insight into genetic mechanisms and pathways influencing markers of liver function. PMID:22001757
Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma.
Chambers, John C; Zhang, Weihua; Sehmi, Joban; Li, Xinzhong; Wass, Mark N; Van der Harst, Pim; Holm, Hilma; Sanna, Serena; Kavousi, Maryam; Baumeister, Sebastian E; Coin, Lachlan J; Deng, Guohong; Gieger, Christian; Heard-Costa, Nancy L; Hottenga, Jouke-Jan; Kühnel, Brigitte; Kumar, Vinod; Lagou, Vasiliki; Liang, Liming; Luan, Jian'an; Vidal, Pedro Marques; Mateo Leach, Irene; O'Reilly, Paul F; Peden, John F; Rahmioglu, Nilufer; Soininen, Pasi; Speliotes, Elizabeth K; Yuan, Xin; Thorleifsson, Gudmar; Alizadeh, Behrooz Z; Atwood, Larry D; Borecki, Ingrid B; Brown, Morris J; Charoen, Pimphen; Cucca, Francesco; Das, Debashish; de Geus, Eco J C; Dixon, Anna L; Döring, Angela; Ehret, Georg; Eyjolfsson, Gudmundur I; Farrall, Martin; Forouhi, Nita G; Friedrich, Nele; Goessling, Wolfram; Gudbjartsson, Daniel F; Harris, Tamara B; Hartikainen, Anna-Liisa; Heath, Simon; Hirschfield, Gideon M; Hofman, Albert; Homuth, Georg; Hyppönen, Elina; Janssen, Harry L A; Johnson, Toby; Kangas, Antti J; Kema, Ido P; Kühn, Jens P; Lai, Sandra; Lathrop, Mark; Lerch, Markus M; Li, Yun; Liang, T Jake; Lin, Jing-Ping; Loos, Ruth J F; Martin, Nicholas G; Moffatt, Miriam F; Montgomery, Grant W; Munroe, Patricia B; Musunuru, Kiran; Nakamura, Yusuke; O'Donnell, Christopher J; Olafsson, Isleifur; Penninx, Brenda W; Pouta, Anneli; Prins, Bram P; Prokopenko, Inga; Puls, Ralf; Ruokonen, Aimo; Savolainen, Markku J; Schlessinger, David; Schouten, Jeoffrey N L; Seedorf, Udo; Sen-Chowdhry, Srijita; Siminovitch, Katherine A; Smit, Johannes H; Spector, Timothy D; Tan, Wenting; Teslovich, Tanya M; Tukiainen, Taru; Uitterlinden, Andre G; Van der Klauw, Melanie M; Vasan, Ramachandran S; Wallace, Chris; Wallaschofski, Henri; Wichmann, H-Erich; Willemsen, Gonneke; Würtz, Peter; Xu, Chun; Yerges-Armstrong, Laura M; Abecasis, Goncalo R; Ahmadi, Kourosh R; Boomsma, Dorret I; Caulfield, Mark; Cookson, William O; van Duijn, Cornelia M; Froguel, Philippe; Matsuda, Koichi; McCarthy, Mark I; Meisinger, Christa; Mooser, Vincent; Pietiläinen, Kirsi H; Schumann, Gunter; Snieder, Harold; Sternberg, Michael J E; Stolk, Ronald P; Thomas, Howard C; Thorsteinsdottir, Unnur; Uda, Manuela; Waeber, Gérard; Wareham, Nicholas J; Waterworth, Dawn M; Watkins, Hugh; Whitfield, John B; Witteman, Jacqueline C M; Wolffenbuttel, Bruce H R; Fox, Caroline S; Ala-Korpela, Mika; Stefansson, Kari; Vollenweider, Peter; Völzke, Henry; Schadt, Eric E; Scott, James; Järvelin, Marjo-Riitta; Elliott, Paul; Kooner, Jaspal S
2011-10-16
Concentrations of liver enzymes in plasma are widely used as indicators of liver disease. We carried out a genome-wide association study in 61,089 individuals, identifying 42 loci associated with concentrations of liver enzymes in plasma, of which 32 are new associations (P = 10(-8) to P = 10(-190)). We used functional genomic approaches including metabonomic profiling and gene expression analyses to identify probable candidate genes at these regions. We identified 69 candidate genes, including genes involved in biliary transport (ATP8B1 and ABCB11), glucose, carbohydrate and lipid metabolism (FADS1, FADS2, GCKR, JMJD1C, HNF1A, MLXIPL, PNPLA3, PPP1R3B, SLC2A2 and TRIB1), glycoprotein biosynthesis and cell surface glycobiology (ABO, ASGR1, FUT2, GPLD1 and ST3GAL4), inflammation and immunity (CD276, CDH6, GCKR, HNF1A, HPR, ITGA1, RORA and STAT4) and glutathione metabolism (GSTT1, GSTT2 and GGT), as well as several genes of uncertain or unknown function (including ABHD12, EFHD1, EFNA1, EPHA2, MICAL3 and ZNF827). Our results provide new insight into genetic mechanisms and pathways influencing markers of liver function.
Genetic Factors of Autoimmune Thyroid Diseases in Japanese
Ban, Yoshiyuki
2012-01-01
Autoimmune thyroid diseases (AITDs), including Graves' disease (GD) and Hashimoto's thyroiditis (HT), are caused by immune response to self-thyroid antigens and affect approximately 2–5% of the general population. Genetic susceptibility in combination with external factors, such as smoking, viral/bacterial infection, and chemicals, is believed to initiate the autoimmune response against thyroid antigens. Abundant epidemiological data, including family and twin studies, point to a strong genetic influence on the development of AITDs. Various techniques have been employed to identify genes contributing to the etiology of AITDs, including candidate gene analysis and whole genome screening. These studies have enabled the identification of several loci (genetic regions) that are linked to AITDs, and, in some of these loci, putative AITD susceptibility genes have been identified. Some of these genes/loci are unique to GD and HT and some are common to both diseases, indicating that there is a shared genetic susceptibility to GD and HT. Known AITD-susceptibility genes are classified into three groups: HLA genes, non-HLA immune-regulatory genes (e.g., CTLA-4, PTPN22, and CD40), and thyroid-specific genes (e.g., TSHR and Tg). In this paper, we will summarize the latest findings on AITD susceptibility genes in Japanese. PMID:22242199
Multi-gene panel testing in Korean patients with common genetic generalized epilepsy syndromes.
Lee, Cha Gon; Lee, Jeehun; Lee, Munhyang
2018-01-01
Genetic heterogeneity of common genetic generalized epilepsy syndromes is frequently considered. The present study conducted a focused analysis of potential candidate or susceptibility genes for common genetic generalized epilepsy syndromes using multi-gene panel testing with next-generation sequencing. This study included patients with juvenile myoclonic epilepsy, juvenile absence epilepsy, and epilepsy with generalized tonic-clonic seizures alone. We identified pathogenic variants according to the American College of Medical Genetics and Genomics guidelines and identified susceptibility variants using case-control association analyses and family analyses for familial cases. A total of 57 patients were enrolled, including 51 sporadic cases and 6 familial cases. Twenty-two pathogenic and likely pathogenic variants of 16 different genes were identified. CACNA1H was the most frequently observed single gene. Variants of voltage-gated Ca2+ channel genes, including CACNA1A, CACNA1G, and CACNA1H were observed in 32% of variants (n = 7/22). Analyses to identify susceptibility variants using case-control association analysis indicated that KCNMA1 c.400G>C was associated with common genetic generalized epilepsy syndromes. Only 1 family (family A) exhibited a candidate pathogenic variant p.(Arg788His) on CACNA1H, as determined via family analyses. This study identified candidate genetic variants in about a quarter of patients (n = 16/57) and an average of 2.8 variants was identified in each patient. The results reinforced the polygenic disorder with very high locus and allelic heterogeneity of common GGE syndromes. Further, voltage-gated Ca2+ channels are suggested as important contributors to common genetic generalized epilepsy syndromes. This study extends our comprehensive understanding of common genetic generalized epilepsy syndromes.
Keel, Brittney N; Zarek, Christina M; Keele, John W; Kuehn, Larry A; Snelling, Warren M; Oliver, William T; Freetly, Harvey C; Lindholm-Perry, Amanda K
2018-06-04
Feed intake and body weight gain are economically important inputs and outputs of beef production systems. The purpose of this study was to discover differentially expressed genes that will be robust for feed intake and gain across a large segment of the cattle industry. Transcriptomic studies often suffer from issues with reproducibility and cross-validation. One way to improve reproducibility is by integrating multiple datasets via meta-analysis. RNA sequencing (RNA-Seq) was performed on longissimus dorsi muscle from 80 steers (5 cohorts, each with 16 animals) selected from the outside fringe of a bivariate gain and feed intake distribution to understand the genes and pathways involved in feed efficiency. In each cohort, 16 steers were selected from one of four gain and feed intake phenotypes (n = 4 per phenotype) in a 2 × 2 factorial arrangement with gain and feed intake as main effect variables. Each cohort was analyzed as a single experiment using a generalized linear model and results from the 5 cohort analyses were combined in a meta-analysis to identify differentially expressed genes (DEG) across the cohorts. A total of 51 genes were differentially expressed for the main effect of gain, 109 genes for the intake main effect, and 11 genes for the gain x intake interaction (P corrected < 0.05). A jackknife sensitivity analysis showed that, in general, the meta-analysis produced robust DEGs for the two main effects and their interaction. Pathways identified from over-represented genes included mitochondrial energy production and oxidative stress pathways for the main effect of gain due to DEG including GPD1, NDUFA6, UQCRQ, ACTC1, and MGST3. For intake, metabolic pathways including amino acid biosynthesis and degradation were identified, and for the interaction analysis the pathways identified included GADD45, pyridoxal 5'phosphate salvage, and caveolar mediated endocytosis signaling. Variation among DEG identified by cohort suggests that environment and breed may play large roles in the expression of genes associated with feed efficiency in the muscle of beef cattle. Meta-analyses of transcriptome data from groups of animals over multiple cohorts may be necessary to elucidate the genetics contributing these types of biological phenotypes.
Taguchi, Y-h
2015-01-01
Transgenerational epigenetics (TGE) are currently considered important in disease, but the mechanisms involved are not yet fully understood. TGE abnormalities expected to cause disease are likely to be initiated during development and to be mediated by aberrant gene expression associated with aberrant promoter methylation that is heritable between generations. However, because methylation is removed and then re-established during development, it is not easy to identify promoter methylation abnormalities by comparing normal lineages with those expected to exhibit TGE abnormalities. This study applied the recently proposed principal component analysis (PCA)-based unsupervised feature extraction to previously reported and publically available gene expression/promoter methylation profiles of rat primordial germ cells, between E13 and E16 of the F3 generation vinclozolin lineage that are expected to exhibit TGE abnormalities, to identify multiple genes that exhibited aberrant gene expression/promoter methylation during development. The biological feasibility of the identified genes were tested via enrichment analyses of various biological concepts including pathway analysis, gene ontology terms and protein-protein interactions. All validations suggested superiority of the proposed method over three conventional and popular supervised methods that employed t test, limma and significance analysis of microarrays, respectively. The identified genes were globally related to tumors, the prostate, kidney, testis and the immune system and were previously reported to be related to various diseases caused by TGE. Among the genes reported by PCA-based unsupervised feature extraction, we propose that chemokine signaling pathways and leucine rich repeat proteins are key factors that initiate transgenerational epigenetic-mediated diseases, because multiple genes included in these two categories were identified in this study.
2015-01-01
Background Transgenerational epigenetics (TGE) are currently considered important in disease, but the mechanisms involved are not yet fully understood. TGE abnormalities expected to cause disease are likely to be initiated during development and to be mediated by aberrant gene expression associated with aberrant promoter methylation that is heritable between generations. However, because methylation is removed and then re-established during development, it is not easy to identify promoter methylation abnormalities by comparing normal lineages with those expected to exhibit TGE abnormalities. Methods This study applied the recently proposed principal component analysis (PCA)-based unsupervised feature extraction to previously reported and publically available gene expression/promoter methylation profiles of rat primordial germ cells, between E13 and E16 of the F3 generation vinclozolin lineage that are expected to exhibit TGE abnormalities, to identify multiple genes that exhibited aberrant gene expression/promoter methylation during development. Results The biological feasibility of the identified genes were tested via enrichment analyses of various biological concepts including pathway analysis, gene ontology terms and protein-protein interactions. All validations suggested superiority of the proposed method over three conventional and popular supervised methods that employed t test, limma and significance analysis of microarrays, respectively. The identified genes were globally related to tumors, the prostate, kidney, testis and the immune system and were previously reported to be related to various diseases caused by TGE. Conclusions Among the genes reported by PCA-based unsupervised feature extraction, we propose that chemokine signaling pathways and leucine rich repeat proteins are key factors that initiate transgenerational epigenetic-mediated diseases, because multiple genes included in these two categories were identified in this study. PMID:26677731
The Essential Genome of Escherichia coli K-12.
Goodall, Emily C A; Robinson, Ashley; Johnston, Iain G; Jabbari, Sara; Turner, Keith A; Cunningham, Adam F; Lund, Peter A; Cole, Jeffrey A; Henderson, Ian R
2018-02-20
Transposon-directed insertion site sequencing (TraDIS) is a high-throughput method coupling transposon mutagenesis with short-fragment DNA sequencing. It is commonly used to identify essential genes. Single gene deletion libraries are considered the gold standard for identifying essential genes. Currently, the TraDIS method has not been benchmarked against such libraries, and therefore, it remains unclear whether the two methodologies are comparable. To address this, a high-density transposon library was constructed in Escherichia coli K-12. Essential genes predicted from sequencing of this library were compared to existing essential gene databases. To decrease false-positive identification of essential genes, statistical data analysis included corrections for both gene length and genome length. Through this analysis, new essential genes and genes previously incorrectly designated essential were identified. We show that manual analysis of TraDIS data reveals novel features that would not have been detected by statistical analysis alone. Examples include short essential regions within genes, orientation-dependent effects, and fine-resolution identification of genome and protein features. Recognition of these insertion profiles in transposon mutagenesis data sets will assist genome annotation of less well characterized genomes and provides new insights into bacterial physiology and biochemistry. IMPORTANCE Incentives to define lists of genes that are essential for bacterial survival include the identification of potential targets for antibacterial drug development, genes required for rapid growth for exploitation in biotechnology, and discovery of new biochemical pathways. To identify essential genes in Escherichia coli , we constructed a transposon mutant library of unprecedented density. Initial automated analysis of the resulting data revealed many discrepancies compared to the literature. We now report more extensive statistical analysis supported by both literature searches and detailed inspection of high-density TraDIS sequencing data for each putative essential gene for the E. coli model laboratory organism. This paper is important because it provides a better understanding of the essential genes of E. coli , reveals the limitations of relying on automated analysis alone, and provides a new standard for the analysis of TraDIS data. Copyright © 2018 Goodall et al.
Liu, Jian; Cheng, Yuhu; Wang, Xuesong; Zhang, Lin; Liu, Hui
2017-08-17
It is urgent to diagnose colorectal cancer in the early stage. Some feature genes which are important to colorectal cancer development have been identified. However, for the early stage of colorectal cancer, less is known about the identity of specific cancer genes that are associated with advanced clinical stage. In this paper, we conducted a feature extraction method named Optimal Mean based Block Robust Feature Extraction method (OMBRFE) to identify feature genes associated with advanced colorectal cancer in clinical stage by using the integrated colorectal cancer data. Firstly, based on the optimal mean and L 2,1 -norm, a novel feature extraction method called Optimal Mean based Robust Feature Extraction method (OMRFE) is proposed to identify feature genes. Then the OMBRFE method which introduces the block ideology into OMRFE method is put forward to process the colorectal cancer integrated data which includes multiple genomic data: copy number alterations, somatic mutations, methylation expression alteration, as well as gene expression changes. Experimental results demonstrate that the OMBRFE is more effective than previous methods in identifying the feature genes. Moreover, genes identified by OMBRFE are verified to be closely associated with advanced colorectal cancer in clinical stage.
Mutations in the Kinase Domain of the HER2/ERBB2 Gene Identified in a Wide Variety of Human Cancers.
Wen, Wenhsiang; Chen, Wangjuh Sting; Xiao, Nick; Bender, Ryan; Ghazalpour, Anatole; Tan, Zheng; Swensen, Jeffrey; Millis, Sherri Z; Basu, Gargi; Gatalica, Zoran; Press, Michael F
2015-09-01
The HER2 (official name ERBB2) gene encodes a membrane receptor in the epidermal growth factor receptor family amplified and overexpressed in adenocarcinoma. Activating mutations also occur in several cancers. We report mutation analyses of the HER2 kinase domain in 7497 histologically diverse cancers. Forty-five genes, including the kinase domain of HER2 with HER2 IHC and dual in situ hybridization, were analyzed in tumors from 7497 patients with cancer, including 850 breast, 770 colorectal, 910 non-small cell lung, 823 uterine or cervical, 1372 ovarian, and 297 pancreatic cancers, as well as 323 melanomas and 2152 other solid tumors. Sixty-nine HER2 kinase domain mutations were identified in tumors from 68 patients (approximately 1% of all cases, ranging from absent in sarcomas to 4% in urothelial cancers), which included previously published activating mutations and 13 novel mutations. Fourteen cases with coexisting HER2 mutation and amplification and/or overexpression were identified. Fifty-two of 68 patients had additional mutations in other analyzed genes, whereas 16 patients (23%) had HER2 mutations identified as the sole driver mutation. HER2 mutations coexisted with HER2 gene amplification and overexpression and with mutations in other functionally important genes. HER2 mutations were identified as the only driver mutation in a significant proportion of solid cancers. Evaluation of anti-HER2 therapies in nonamplified, HER2-mutated cancers is warranted. Copyright © 2015 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni
2013-01-01
Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function. PMID:24278029
Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni
2013-11-01
Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune function.
Yang, Yan; Zhou, Yuan; Chi, Yingjun; Fan, Baofang; Chen, Zhixiang
2017-12-19
WRKY proteins are a superfamily of plant transcription factors with important roles in plants. WRKY proteins have been extensively analyzed in plant species including Arabidopsis and rice. Here we report characterization of soybean WRKY gene family and their functional analysis in resistance to soybean cyst nematode (SCN), the most important soybean pathogen. Through search of the soybean genome, we identified 174 genes encoding WRKY proteins that can be classified into seven groups as established in other plants. WRKY variants including a WRKY-related protein unique to legumes have also been identified. Expression analysis reveals both diverse expression patterns in different soybean tissues and preferential expression of specific WRKY groups in certain tissues. Furthermore, a large number of soybean WRKY genes were responsive to salicylic acid. To identify soybean WRKY genes that promote soybean resistance to SCN, we first screened soybean WRKY genes for enhancing SCN resistance when over-expressed in transgenic soybean hairy roots. To confirm the results, we transformed five WRKY genes into a SCN-susceptible soybean cultivar and generated transgenic soybean lines. Transgenic soybean lines overexpressing three WRKY transgenes displayed increased resistance to SCN. Thus, WRKY genes could be explored to develop new soybean cultivars with enhanced resistance to SCN.
McKay, James D.; Hung, Rayjean J.; Han, Younghun; Zong, Xuchen; Carreras-Torres, Robert; Christiani, David C.; Caporaso, Neil E.; Johansson, Mattias; Xiao, Xiangjun; Li, Yafang; Byun, Jinyoung; Dunning, Alison; Pooley, Karen A.; Qian, David C.; Ji, Xuemei; Liu, Geoffrey; Timofeeva, Maria N.; Bojesen, Stig E.; Wu, Xifeng; Le Marchand, Loic; Albanes, Demetrios; Bickeböller, Heike; Aldrich, Melinda C.; Bush, William S.; Tardon, Adonina; Rennert, Gad; Teare, M. Dawn; Field, John K.; Kiemeney, Lambertus A.; Lazarus, Philip; Haugen, Aage; Lam, Stephen; Schabath, Matthew B.; Andrew, Angeline S.; Shen, Hongbing; Hong, Yun-Chul; Yuan, Jian-Min; Bertazzi, Pier Alberto; Pesatori, Angela C.; Ye, Yuanqing; Diao, Nancy; Su, Li; Zhang, Ruyang; Brhane, Yonathan; Leighl, Natasha; Johansen, Jakob S.; Mellemgaard, Anders; Saliba, Walid; Haiman, Christopher A.; Wilkens, Lynne R.; Fernandez-Somoano, Ana; Fernandez-Tardon, Guillermo; van der Heijden, Henricus F.M.; Kim, Jin Hee; Dai, Juncheng; Hu, Zhibin; Davies, Michael PA; Marcus, Michael W.; Brunnström, Hans; Manjer, Jonas; Melander, Olle; Muller, David C.; Overvad, Kim; Trichopoulou, Antonia; Tumino, Rosario; Doherty, Jennifer A.; Barnett, Matt P.; Chen, Chu; Goodman, Gary E.; Cox, Angela; Taylor, Fiona; Woll, Penella; Brüske, Irene; Wichmann, H.-Erich; Manz, Judith; Muley, Thomas R.; Risch, Angela; Rosenberger, Albert; Grankvist, Kjell; Johansson, Mikael; Shepherd, Frances A.; Tsao, Ming-Sound; Arnold, Susanne M.; Haura, Eric B.; Bolca, Ciprian; Holcatova, Ivana; Janout, Vladimir; Kontic, Milica; Lissowska, Jolanta; Mukeria, Anush; Ognjanovic, Simona; Orlowski, Tadeusz M.; Scelo, Ghislaine; Swiatkowska, Beata; Zaridze, David; Bakke, Per; Skaug, Vidar; Zienolddiny, Shanbeh; Duell, Eric J.; Butler, Lesley M.; Koh, Woon-Puay; Gao, Yu-Tang; Houlston, Richard S.; McLaughlin, John; Stevens, Victoria L.; Joubert, Philippe; Lamontagne, Maxime; Nickle, David C.; Obeidat, Ma’en; Timens, Wim; Zhu, Bin; Song, Lei; Kachuri, Linda; Artigas, María Soler; Tobin, Martin D.; Wain, Louise V.; Rafnar, Thorunn; Thorgeirsson, Thorgeir E.; Reginsson, Gunnar W.; Stefansson, Kari; Hancock, Dana B.; Bierut, Laura J.; Spitz, Margaret R.; Gaddis, Nathan C.; Lutz, Sharon M.; Gu, Fangyi; Johnson, Eric O.; Kamal, Ahsan; Pikielny, Claudio; Zhu, Dakai; Lindströem, Sara; Jiang, Xia; Tyndale, Rachel F.; Chenevix-Trench, Georgia; Beesley, Jonathan; Bossé, Yohan; Chanock, Stephen; Brennan, Paul; Landi, Maria Teresa; Amos, Christopher I.
2017-01-01
Summary While several lung cancer susceptibility loci have been identified, much of lung cancer heritability remains unexplained. Here, 14,803 cases and 12,262 controls of European descent were genotyped on the OncoArray and combined with existing data for an aggregated GWAS analysis of lung cancer on 29,266 patients and 56,450 controls. We identified 18 susceptibility loci achieving genome wide significance, including 10 novel loci. The novel loci highlighted the striking heterogeneity in genetic susceptibility across lung cancer histological subtypes, with four loci associated with lung cancer overall and six with lung adenocarcinoma. Gene expression quantitative trait analysis (eQTL) in 1,425 normal lung tissues highlighted RNASET2, SECISBP2L and NRG1 as candidate genes. Other loci include genes such as a cholinergic nicotinic receptor, CHRNA2, and the telomere-related genes, OFBC1 and RTEL1. Further exploration of the target genes will continue to provide new insights into the etiology of lung cancer. PMID:28604730
2012-01-01
Background Thalidomide is an anti-inflammatory and anti-angiogenic drug currently used for the treatment of several diseases, including erythema nodosum leprosum, which occurs in patients with lepromatous leprosy. In this research, we use DNA microarray analysis to identify the impact of thalidomide on gene expression responses in human cells after lipopolysaccharide (LPS) stimulation. We employed a two-stage framework. Initially, we identified 1584 altered genes in response to LPS. Modulation of this set of genes was then analyzed in the LPS stimulated cells treated with thalidomide. Results We identified 64 genes with altered expression induced by thalidomide using the rank product method. In addition, the lists of up-regulated and down-regulated genes were investigated by means of bioinformatics functional analysis, which allowed for the identification of biological processes affected by thalidomide. Confirmatory analysis was done in five of the identified genes using real time PCR. Conclusions The results showed some genes that can further our understanding of the biological mechanisms in the action of thalidomide. Of the five genes evaluated with real time PCR, three were down regulated and two were up regulated confirming the initial results of the microarray analysis. PMID:22695124
Li, Ming-Rui; Shi, Feng-Xue; Li, Ya-Ling; Jiang, Peng; Jiao, Lili
2017-01-01
Abstract Chinese ginseng (Panax ginseng Meyer) is a medicinally important herb and plays crucial roles in traditional Chinese medicine. Pharmacological analyses identified diverse bioactive components from Chinese ginseng. However, basic biological attributes including domestication and selection of the ginseng plant remain under-investigated. Here, we presented a genome-wide view of the domestication and selection of cultivated ginseng based on the whole genome data. A total of 8,660 protein-coding genes were selected for genome-wide scanning of the 30 wild and cultivated ginseng accessions. In complement, the 45s rDNA, chloroplast and mitochondrial genomes were included to perform phylogenetic and population genetic analyses. The observed spatial genetic structure between northern cultivated ginseng (NCG) and southern cultivated ginseng (SCG) accessions suggested multiple independent origins of cultivated ginseng. Genome-wide scanning further demonstrated that NCG and SCG have undergone distinct selection pressures during the domestication process, with more genes identified in the NCG (97 genes) than in the SCG group (5 genes). Functional analyses revealed that these genes are involved in diverse pathways, including DNA methylation, lignin biosynthesis, and cell differentiation. These findings suggested that the SCG and NCG groups have distinct demographic histories. Candidate genes identified are useful for future molecular breeding of cultivated ginseng. PMID:28922794
Beck, David A. C.; Hendrickson, Erik L.; Vorobev, Alexey; Wang, Tiansong; Lim, Sujung; Kalyuzhnaya, Marina G.; Lidstrom, Mary E.; Hackett, Murray; Chistoserdova, Ludmila
2011-01-01
Methylotenera species, unlike their close relatives in the genera Methylophilus, Methylobacillus, and Methylovorus, neither exhibit the activity of methanol dehydrogenase nor possess mxaFI genes encoding this enzyme, yet they are able to grow on methanol. In this work, we integrated a genome-wide proteomics approach, shotgun proteomics, and a genome-wide transcriptomics approach, shotgun transcriptome sequencing (RNA-seq), of Methylotenera mobilis JLW8 to identify genes and enzymes potentially involved in methanol oxidation, with special attention to alternative nitrogen sources, to address the question of whether nitrate could play a role as an electron acceptor in place of oxygen. Both proteomics and transcriptomics identified a limited number of genes and enzymes specifically responding to methanol. This set includes genes involved in oxidative stress response systems, a number of oxidoreductases, including XoxF-type alcohol dehydrogenases, a type II secretion system, and proteins without a predicted function. Nitrate stimulated expression of some genes in assimilatory nitrate reduction and denitrification pathways, while ammonium downregulated some of the nitrogen metabolism genes. However, none of these genes appeared to respond to methanol, which suggests that oxygen may be the main electron sink during growth on methanol. This study identifies initial targets for future focused physiological studies, including mutant analysis, which will provide further details into this novel process. PMID:21764938
Identification of candidate genes for familial early-onset essential tremor.
Liu, Xinmin; Hernandez, Nora; Kisselev, Sergey; Floratos, Aris; Sawle, Ashley; Ionita-Laza, Iuliana; Ottman, Ruth; Louis, Elan D; Clark, Lorraine N
2016-07-01
Essential tremor (ET) is one of the most common causes of tremor in humans. Despite its high heritability and prevalence, few susceptibility genes for ET have been identified. To identify ET genes, whole-exome sequencing was performed in 37 early-onset ET families with an autosomal-dominant inheritance pattern. We identified candidate genes for follow-up functional studies in five ET families. In two independent families, we identified variants predicted to affect function in the nitric oxide (NO) synthase 3 gene (NOS3) that cosegregated with disease. NOS3 is highly expressed in the central nervous system (including cerebellum), neurons and endothelial cells, and is one of three enzymes that converts l-arginine to the neurotransmitter NO. In one family, a heterozygous variant, c.46G>A (p.(Gly16Ser)), in NOS3, was identified in three affected ET cases and was absent in an unaffected family member; and in a second family, a heterozygous variant, c.164C>T (p.(Pro55Leu)), was identified in three affected ET cases (dizygotic twins and their mother). Both variants result in amino-acid substitutions of highly conserved amino-acid residues that are predicted to be deleterious and damaging by in silico analysis. In three independent families, variants predicted to affect function were also identified in other genes, including KCNS2 (KV9.2), HAPLN4 (BRAL2) and USP46. These genes are highly expressed in the cerebellum and Purkinje cells, and influence function of the gamma-amino butyric acid (GABA)-ergic system. This is in concordance with recent evidence that the pathophysiological process in ET involves cerebellar dysfunction and possibly cerebellar degeneration with a reduction in Purkinje cells, and a decrease in GABA-ergic tone.
Transcriptional profiling of predator-induced phenotypic plasticity in Daphnia pulex.
Rozenberg, Andrey; Parida, Mrutyunjaya; Leese, Florian; Weiss, Linda C; Tollrian, Ralph; Manak, J Robert
2015-01-01
Predator-induced defences are a prominent example of phenotypic plasticity found from single-celled organisms to vertebrates. The water flea Daphnia pulex is a very convenient ecological genomic model for studying predator-induced defences as it exhibits substantial morphological changes under predation risk. Most importantly, however, genetically identical clones can be transcriptionally profiled under both control and predation risk conditions and be compared due to the availability of the sequenced reference genome. Earlier gene expression analyses of candidate genes as well as a tiled genomic microarray expression experiment have provided insights into some genes involved in predator-induced phenotypic plasticity. Here we performed the first RNA-Seq analysis to identify genes that were differentially expressed in defended vs. undefended D. pulex specimens in order to explore the genetic mechanisms underlying predator-induced defences at a qualitatively novel level. We report 230 differentially expressed genes (158 up- and 72 down-regulated) identified in at least two of three different assembly approaches. Several of the differentially regulated genes belong to families of paralogous genes. The most prominent classes amongst the up-regulated genes include cuticle genes, zinc-metalloproteinases and vitellogenin genes. Furthermore, several genes from this group code for proteins recruited in chromatin-reorganization or regulation of the cell cycle (cyclins). Down-regulated gene classes include C-type lectins, proteins involved in lipogenesis, and other families, some of which encode proteins with no known molecular function. The RNA-Seq transcriptome data presented in this study provide important insights into gene regulatory patterns underlying predator-induced defences. In particular, we characterized different effector genes and gene families found to be regulated in Daphnia in response to the presence of an invertebrate predator. These effector genes are mostly in agreement with expectations based on observed phenotypic changes including morphological alterations, i.e., expression of proteins involved in formation of protective structures and in cuticle strengthening, as well as proteins required for resource re-allocation. Our findings identify key genetic pathways associated with anti-predator defences.
An Integrated Systems Genetics and Omics Toolkit to Probe Gene Function.
Li, Hao; Wang, Xu; Rukina, Daria; Huang, Qingyao; Lin, Tao; Sorrentino, Vincenzo; Zhang, Hongbo; Bou Sleiman, Maroun; Arends, Danny; McDaid, Aaron; Luan, Peiling; Ziari, Naveed; Velázquez-Villegas, Laura A; Gariani, Karim; Kutalik, Zoltan; Schoonjans, Kristina; Radcliffe, Richard A; Prins, Pjotr; Morgenthaler, Stephan; Williams, Robert W; Auwerx, Johan
2018-01-24
Identifying genetic and environmental factors that impact complex traits and common diseases is a high biomedical priority. Here, we developed, validated, and implemented a series of multi-layered systems approaches, including (expression-based) phenome-wide association, transcriptome-/proteome-wide association, and (reverse-) mediation analysis, in an open-access web server (systems-genetics.org) to expedite the systems dissection of gene function. We applied these approaches to multi-omics datasets from the BXD mouse genetic reference population, and identified and validated associations between genes and clinical and molecular phenotypes, including previously unreported links between Rpl26 and body weight, and Cpt1a and lipid metabolism. Furthermore, through mediation and reverse-mediation analysis we established regulatory relations between genes, such as the co-regulation of BCKDHA and BCKDHB protein levels, and identified targets of transcription factors E2F6, ZFP277, and ZKSCAN1. Our multifaceted toolkit enabled the identification of gene-gene and gene-phenotype links that are robust and that translate well across populations and species, and can be universally applied to any populations with multi-omics datasets. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
de Abreu Neto, Joao B.; Frei, Michael
2016-01-01
Plants are exposed to a wide range of abiotic stresses (AS), which often occur in combination. Because physiological investigations typically focus on one stress, our understanding of unspecific stress responses remains limited. The plant redox homeostasis, i.e., the production and removal of reactive oxygen species (ROS), may be involved in many environmental stress conditions. Therefore, this study intended to identify genes, which are activated in diverse AS, focusing on ROS-related pathways. We conducted a meta-analysis (MA) of microarray experiments, focusing on rice. Transcriptome data were mined from public databases and fellow researchers, which represented 36 different experiments and investigated diverse AS, including ozone stress, drought, heat, cold, salinity, and mineral deficiencies/toxicities. To overcome the inherent artifacts of different MA methods, data were processed using Fisher, rOP, REM, and product of rank (GeneSelector), and genes identified by most approaches were considered as shared differentially expressed genes (DEGs). Two MA strategies were adopted: first, datasets were separated into shoot, root, and seedling experiments, and these tissues were analyzed separately to identify shared DEGs. Second, shoot and seedling experiments were classed into oxidative stress (OS), i.e., ozone and hydrogen peroxide treatments directly producing ROS in plant tissue, and other AS, in which ROS production is indirect. In all tissues and stress conditions, genes a priori considered as ROS-related were overrepresented among the DEGs, as they represented 4% of all expressed genes but 7–10% of the DEGs. The combined MA approach was substantially more conservative than individual MA methods and identified 1001 shared DEGs in shoots, 837 shared DEGs in root, and 1172 shared DEGs in seedlings. Within the OS and AS groups, 990 and 1727 shared DEGs were identified, respectively. In total, 311 genes were shared between OS and AS, including many regulatory genes. Combined co-expression analysis identified among those a cluster of 42 genes, many involved in the photosynthetic apparatus and responsive to drought, iron deficiency, arsenic toxicity, and ozone. Our data demonstrate the importance of redox homeostasis in plant stress responses and the power of MA to identify candidate genes underlying unspecific signaling pathways. PMID:26793229
Genome-wide differential gene expression in immortalized DF-1 chicken embryo fibroblast cell line
2011-01-01
Background When compared to primary chicken embryo fibroblast (CEF) cells, the immortal DF-1 CEF line exhibits enhanced growth rates and susceptibility to oxidative stress. Although genes responsible for cell cycle regulation and antioxidant functions have been identified, the genome-wide transcription profile of immortal DF-1 CEF cells has not been previously reported. Global gene expression in primary CEF and DF-1 cells was performed using a 4X44K chicken oligo microarray. Results A total of 3876 differentially expressed genes were identified with a 2 fold level cutoff that included 1706 up-regulated and 2170 down-regulated genes in DF-1 cells. Network and functional analyses using Ingenuity Pathways Analysis (IPA, Ingenuity® Systems, http://www.ingenuity.com) revealed that 902 of 3876 differentially expressed genes were classified into a number of functional groups including cellular growth and proliferation, cell cycle, cellular movement, cancer, genetic disorders, and cell death. Also, the top 5 gene networks with intermolecular connections were identified. Bioinformatic analyses suggested that DF-1 cells were characterized by enhanced molecular mechanisms for cell cycle progression and proliferation, suppressing cell death pathways, altered cellular morphogenesis, and accelerated capacity for molecule transport. Key molecules for these functions include E2F1, BRCA1, SRC, CASP3, and the peroxidases. Conclusions The global gene expression profiles provide insight into the cellular mechanisms that regulate the unique characteristics observed in immortal DF-1 CEF cells. PMID:22111699
Bradley, S P; Pahari, M; Uknis, M E; Rastellini, C; Cicalese, L
2006-01-01
The cellular and histological events that occur during the regeneration process in invertebrates have been studied in the field of visceral regeneration. We would like to explore the molecular aspects of the regeneration process in the small intestine. The aim of this study was to characterize the gene expression profiles of the intestinal graft to identify which genes may have a role in regeneration of graft tissue posttransplant. In a patient undergoing living related small bowel transplantation (LRSBTx) in our institution, mucosal biopsies were obtained from the recipient intestine and donor graft at the time of transplant and at weeks 1, 2, 3, and 6 posttransplant. Total RNA was isolated from sample biopsies followed by gene expression profiles determined from the replicate samples (n = 3) for each biopsy using the Affymetrix U133 Plus 2.0 Human GeneChip set. Two profiles were obtained from the data. One profile showed rapid increase of 45 genes immediately after transplant by week 1 with significant changes (P < .05) greater than threefold including the chemokine CXC9 and glutathione-related stress factors, GPX2 and GSTA4. The second profile identified 133 genes that were significantly decreased by threefold or greater immediately after transplant week 1, including UCC1, the human homolog of the Ependymin gene. We have identified two gene expression profiles representing early graft responses to small bowel transplantation. These profiles will serve to identify and study those genes whose products may play a role in accelerating tissue regeneration following segmental LRSBTx.
Clustered Xenopus keratin genes: A genomic, transcriptomic, and proteomic analysis.
Suzuki, Ken-Ichi T; Suzuki, Miyuki; Shigeta, Mitsuki; Fortriede, Joshua D; Takahashi, Shuji; Mawaribuchi, Shuuji; Yamamoto, Takashi; Taira, Masanori; Fukui, Akimasa
2017-06-15
Keratin genes belong to the intermediate filament superfamily and their expression is altered following morphological and physiological changes in vertebrate epithelial cells. Keratin genes are divided into two groups, type I and II, and are clustered on vertebrate genomes, including those of Xenopus species. Various keratin genes have been identified and characterized by their unique expression patterns throughout ontogeny in Xenopus laevis; however, compilation of previously reported and newly identified keratin genes in two Xenopus species is required for our further understanding of keratin gene evolution, not only in amphibians but also in all terrestrial vertebrates. In this study, 120 putative type I and II keratin genes in total were identified based on the genome data from two Xenopus species. We revealed that most of these genes are highly clustered on two homeologous chromosomes, XLA9_10 and XLA2 in X. laevis, and XTR10 and XTR2 in X. tropicalis, which are orthologous to those of human, showing conserved synteny among tetrapods. RNA-Seq data from various embryonic stages and adult tissues highlighted the unique expression profiles of orthologous and homeologous keratin genes in developmental stage- and tissue-specific manners. Moreover, we identified dozens of epidermal keratin proteins from the whole embryo, larval skin, tail, and adult skin using shotgun proteomics. In light of our results, we discuss the radiation, diversification, and unique expression of the clustered keratin genes, which are closely related to epidermal development and terrestrial adaptation during amphibian evolution, including Xenopus speciation. Copyright © 2016 Elsevier Inc. All rights reserved.
Zhang, Le-Le; Zhang, Zi-Ning; Wu, Xian; Jiang, Yong-Jun; Fu, Ya-Jing; Shang, Hong
2017-09-12
A small proportion of HIV-infected patients remain clinically and/or immunologically stable for years, including elite controllers (ECs) who have undetectable viremia (<50 copies/ml) and long-term nonprogressors (LTNPs) who maintain normal CD4 + T cell counts for prolonged periods (>10 years). However, the mechanism of nonprogression needs to be further resolved. In this study, a transcriptome meta-analysis was performed on nonprogressor and progressor microarray data to identify differential transcriptome pathways and potential biomarkers. Using the INMEX (integrative meta-analysis of expression data) program, we performed the meta-analysis to identify consistently differentially expressed genes (DEGs) in nonprogressors and further performed functional interpretation (gene ontology analysis and pathway analysis) of the DEGs identified in the meta-analysis. Five microarray datasets (81 cases and 98 controls in total), including whole blood, CD4 + and CD8 + T cells, were collected for meta-analysis. We determined that nonprogressors have reduced expression of important interferon-stimulated genes (ISGs), CD38, lymphocyte activation gene 3 (LAG-3) in whole blood, CD4 + and CD8 + T cells. Gene ontology (GO) analysis showed a significant enrichment in DEGs that function in the type I interferon signaling pathway. Upregulated pathways, including the PI3K-Akt signaling pathway in whole blood, cytokine-cytokine receptor interaction in CD4 + T cells and the MAPK signaling pathway in CD8 + T cells, were identified in nonprogressors compared with progressors. In each metabolic functional category, the number of downregulated DEGs was more than the upregulated DEGs, and almost all genes were downregulated DEGs in the oxidative phosphorylation (OXPHOS) and tricarboxylic acid (TCA) cycle in the three types of samples. Our transcriptomic meta-analysis provides a comprehensive evaluation of the gene expression profiles in major blood types of nonprogressors, providing new insights in the understanding of HIV pathogenesis and developing strategies to delay HIV disease progression.
Mapping eQTLs in the Norfolk Island Genetic Isolate Identifies Candidate Genes for CVD Risk Traits
Benton, Miles C.; Lea, Rod A.; Macartney-Coxson, Donia; Carless, Melanie A.; Göring, Harald H.; Bellis, Claire; Hanna, Michelle; Eccles, David; Chambers, Geoffrey K.; Curran, Joanne E.; Harper, Jacquie L.; Blangero, John; Griffiths, Lyn R.
2013-01-01
Cardiovascular disease (CVD) affects millions of people worldwide and is influenced by numerous factors, including lifestyle and genetics. Expression quantitative trait loci (eQTLs) influence gene expression and are good candidates for CVD risk. Founder-effect pedigrees can provide additional power to map genes associated with disease risk. Therefore, we identified eQTLs in the genetic isolate of Norfolk Island (NI) and tested for associations between these and CVD risk factors. We measured genome-wide transcript levels of blood lymphocytes in 330 individuals and used pedigree-based heritability analysis to identify heritable transcripts. eQTLs were identified by genome-wide association testing of these transcripts. Testing for association between CVD risk factors (i.e., blood lipids, blood pressure, and body fat indices) and eQTLs revealed 1,712 heritable transcripts (p < 0.05) with heritability values ranging from 0.18 to 0.84. From these, we identified 200 cis-acting and 70 trans-acting eQTLs (p < 1.84 × 10−7) An eQTL-centric analysis of CVD risk traits revealed multiple associations, including 12 previously associated with CVD-related traits. Trait versus eQTL regression modeling identified four CVD risk candidates (NAAA, PAPSS1, NME1, and PRDX1), all of which have known biological roles in disease. In addition, we implicated several genes previously associated with CVD risk traits, including MTHFR and FN3KRP. We have successfully identified a panel of eQTLs in the NI pedigree and used this to implicate several genes in CVD risk. Future studies are required for further assessing the functional importance of these eQTLs and whether the findings here also relate to outbred populations. PMID:24314549
An epigenetic state associated with areas of gene duplication
Gimelbrant, Alexander A.; Chess, Andrew
2006-01-01
Asynchronous DNA replication is an epigenetically determined feature found in all cases of monoallelic expression, including genomic imprinting, X-inactivation, and random monoallelic expression of autosomal genes such as immunoglobulins and olfactory receptor genes. Most genes of the latter class were identified in experiments focused on genes functioning in the chemosensory and immune systems. We performed an unbiased survey of asynchronous replication in the mouse genome, excluding known asynchronously replicated genes. Fully 10% (eight of 80) of the genes tested exhibited asynchronous replication. A common feature of the newly identified asynchronously replicated areas is their proximity to areas of tandem gene duplication. Testing of other clustered areas supported the idea that such regions are enriched with asynchronously replicated genes. PMID:16687731
Ontology-based literature mining of E. coli vaccine-associated gene interaction networks.
Hur, Junguk; Özgür, Arzucan; He, Yongqun
2017-03-14
Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of these gene interaction networks identified top ranked E. coli genes and 6 INO interaction types (e.g., regulation and gene expression). Vaccine-related E. coli gene-gene interaction network was constructed using ontology-based literature mining strategy, which identified important E. coli vaccine genes and their interactions with other genes through specific interaction types.
Zhu, Haisun; Casselman, Amy; Reppert, Steven M.
2008-01-01
North American monarch butterflies (Danaus plexippus) undergo a spectacular fall migration. In contrast to summer butterflies, migrants are juvenile hormone (JH) deficient, which leads to reproductive diapause and increased longevity. Migrants also utilize time-compensated sun compass orientation to help them navigate to their overwintering grounds. Here, we describe a brain expressed sequence tag (EST) resource to identify genes involved in migratory behaviors. A brain EST library was constructed from summer and migrating butterflies. Of 9,484 unique sequences, 6068 had positive hits with the non-redundant protein database; the EST database likely represents ∼52% of the gene-encoding potential of the monarch genome. The brain transcriptome was cataloged using Gene Ontology and compared to Drosophila. Monarch genes were well represented, including those implicated in behavior. Three genes involved in increased JH activity (allatotropin, juvenile hormone acid methyltransfersase, and takeout) were upregulated in summer butterflies, compared to migrants. The locomotion-relevant turtle gene was marginally upregulated in migrants, while the foraging and single-minded genes were not differentially regulated. Many of the genes important for the monarch circadian clock mechanism (involved in sun compass orientation) were in the EST resource, including the newly identified cryptochrome 2. The EST database also revealed a novel Na+/K+ ATPase allele predicted to be more resistant to the toxic effects of milkweed than that reported previously. Potential genetic markers were identified from 3,486 EST contigs and included 1599 double-hit single nucleotide polymorphisms (SNPs) and 98 microsatellite polymorphisms. These data provide a template of the brain transcriptome for the monarch butterfly. Our “snap-shot” analysis of the differential regulation of candidate genes between summer and migratory butterflies suggests that unbiased, comprehensive transcriptional profiling will inform the molecular basis of migration. The identified SNPs and microsatellite polymorphisms can be used as genetic markers to address questions of population and subspecies structure. PMID:18183285
Moriarity, Branden S; Otto, George M; Rahrmann, Eric P; Rathe, Susan K; Wolf, Natalie K; Weg, Madison T; Manlove, Luke A; LaRue, Rebecca S; Temiz, Nuri A; Molyneux, Sam D; Choi, Kwangmin; Holly, Kevin J; Sarver, Aaron L; Scott, Milcah C; Forster, Colleen L; Modiano, Jaime F; Khanna, Chand; Hewitt, Stephen M; Khokha, Rama; Yang, Yi; Gorlick, Richard; Dyer, Michael A; Largaespada, David A
2016-01-01
Osteosarcomas are sarcomas of the bone, derived from osteoblasts or their precursors, with a high propensity to metastasize. Osteosarcoma is associated with massive genomic instability, making it problematic to identify driver genes using human tumors or prototypical mouse models, many of which involve loss of Trp53 function. To identify the genes driving osteosarcoma development and metastasis, we performed a Sleeping Beauty (SB) transposon-based forward genetic screen in mice with and without somatic loss of Trp53. Common insertion site (CIS) analysis of 119 primary tumors and 134 metastatic nodules identified 232 sites associated with osteosarcoma development and 43 sites associated with metastasis, respectively. Analysis of CIS-associated genes identified numerous known and new osteosarcoma-associated genes enriched in the ErbB, PI3K-AKT-mTOR and MAPK signaling pathways. Lastly, we identified several oncogenes involved in axon guidance, including Sema4d and Sema6d, which we functionally validated as oncogenes in human osteosarcoma. PMID:25961939
Shiba, Norio
2015-12-01
A new class of gene mutations, identified in the pathogenesis of adult acute myeloid leukemia (AML), includes DNMT3A, IDH1/2, TET2 and EZH2. However, these mutations are rare in pediatric AML cases, indicating that pathogeneses differ between adult and pediatric forms of AML. Meanwhile, the recent development of massively parallel sequencing technologies has provided a new opportunity to discover genetic changes across entire genomes or proteincoding sequences. In order to reveal a complete registry of gene mutations, we performed whole exome resequencing of paired tumor-normal specimens from 19 pediatric AML cases using Illumina HiSeq 2000. In total, 80 somatic mutations or 4.2 mutations per sample were identified. Many of the recurrent mutations identified in this study involved previously reported targets in AML, such as FLT3, CEBPA, KIT, CBL, NRAS, WT1 and EZH2. On the other hand, several genes were newly identified in the current study, including BCORL1 and major cohesin components such as SMC3 and RAD21. Whole exome resequencing revealed a complex array of gene mutations in pediatric AML genomes. Our results indicate that a subset of pediatric AML represents a discrete entity that could be discriminated from its adult counterpart, in terms of the spectrum of gene mutations.
Oti, Martin; Dutilh, Bas E.; Alonso, M. Eva; de la Calle-Mustienes, Elisa; Smeenk, Leonie; Rinne, Tuula; Parsaulian, Lilian; Bolat, Emine; Jurgelenaite, Rasa; Huynen, Martijn A.; Hoischen, Alexander; Veltman, Joris A.; Brunner, Han G.; Roscioli, Tony; Oates, Emily; Wilson, Meredith; Manzanares, Miguel; Gómez-Skarmeta, José Luis; Stunnenberg, Hendrik G.; Lohrum, Marion; van Bokhoven, Hans; Zhou, Huiqing
2010-01-01
Heterozygous mutations in p63 are associated with split hand/foot malformations (SHFM), orofacial clefting, and ectodermal abnormalities. Elucidation of the p63 gene network that includes target genes and regulatory elements may reveal new genes for other malformation disorders. We performed genome-wide DNA–binding profiling by chromatin immunoprecipitation (ChIP), followed by deep sequencing (ChIP–seq) in primary human keratinocytes, and identified potential target genes and regulatory elements controlled by p63. We show that p63 binds to an enhancer element in the SHFM1 locus on chromosome 7q and that this element controls expression of DLX6 and possibly DLX5, both of which are important for limb development. A unique micro-deletion including this enhancer element, but not the DLX5/DLX6 genes, was identified in a patient with SHFM. Our study strongly indicates disruption of a non-coding cis-regulatory element located more than 250 kb from the DLX5/DLX6 genes as a novel disease mechanism in SHFM1. These data provide a proof-of-concept that the catalogue of p63 binding sites identified in this study may be of relevance to the studies of SHFM and other congenital malformations that resemble the p63-associated phenotypes. PMID:20808887
StemTextSearch: Stem cell gene database with evidence from abstracts.
Chen, Chou-Cheng; Ho, Chung-Liang
2017-05-01
Previous studies have used many methods to find biomarkers in stem cells, including text mining, experimental data and image storage. However, no text-mining methods have yet been developed which can identify whether a gene plays a positive or negative role in stem cells. StemTextSearch identifies the role of a gene in stem cells by using a text-mining method to find combinations of gene regulation, stem-cell regulation and cell processes in the same sentences of biomedical abstracts. The dataset includes 5797 genes, with 1534 genes having positive roles in stem cells, 1335 genes having negative roles, 1654 genes with both positive and negative roles, and 1274 with an uncertain role. The precision of gene role in StemTextSearch is 0.66, and the recall is 0.78. StemTextSearch is a web-based engine with queries that specify (i) gene, (ii) category of stem cell, (iii) gene role, (iv) gene regulation, (v) cell process, (vi) stem-cell regulation, and (vii) species. StemTextSearch is available through http://bio.yungyun.com.tw/StemTextSearch.aspx. Copyright © 2017. Published by Elsevier Inc.
Shakoor, Nadia; Nair, Ramesh; Crasta, Oswald; Morris, Geoffrey; Feltus, Alex; Kresovich, Stephen
2014-01-23
Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.
2014-01-01
Background Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specific probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e.g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community. PMID:24456189
Wang, Anping; Zhang, Guibin
2017-11-01
The differentially expressed genes between glioblastoma (GBM) cells and normal human brain cells were investigated to performed pathway analysis and protein interaction network analysis for the differentially expressed genes. GSE12657 and GSE42656 gene chips, which contain gene expression profile of GBM were obtained from Gene Expression Omniub (GEO) database of National Center for Biotechnology Information (NCBI). The 'limma' data packet in 'R' software was used to analyze the differentially expressed genes in the two gene chips, and gene integration was performed using 'RobustRankAggreg' package. Finally, pheatmap software was used for heatmap analysis and Cytoscape, DAVID, STRING and KOBAS were used for protein-protein interaction, Gene Ontology (GO) and KEGG analyses. As results: i) 702 differentially expressed genes were identified in GSE12657, among those genes, 548 were significantly upregulated and 154 were significantly downregulated (p<0.01, fold-change >1), and 1,854 differentially expressed genes were identified in GSE42656, among the genes, 1,068 were significantly upregulated and 786 were significantly downregulated (p<0.01, fold-change >1). A total of 167 differentially expressed genes including 100 upregulated genes and 67 downregulated genes were identified after gene integration, and the genes showed significantly different expression levels in GBM compared with normal human brain cells (p<0.05). ii) Interactions between the protein products of 101 differentially expressed genes were identified using STRING and expression network was established. A key gene, called CALM3, was identified by Cytoscape software. iii) GO enrichment analysis showed that differentially expressed genes were mainly enriched in 'neurotransmitter:sodium symporter activity' and 'neurotransmitter transporter activity', which can affect the activity of neurotransmitter transportation. KEGG pathway analysis showed that the differentially expressed genes were mainly enriched in 'protein processing in endoplasmic reticulum', which can affect protein processing in endoplasmic reticulum. The results showed that: i) 167 differentially expressed genes were identified from two gene chips after integration; and ii) protein interaction network was established, and GO and KEGG pathway analyses were successfully performed to identify and annotate the key gene, which provide new insights for the studies on GBN at gene level.
Integrated multi-cohort transcriptional meta-analysis of neurodegenerative diseases.
Li, Matthew D; Burns, Terry C; Morgan, Alexander A; Khatri, Purvesh
2014-09-04
Neurodegenerative diseases share common pathologic features including neuroinflammation, mitochondrial dysfunction and protein aggregation, suggesting common underlying mechanisms of neurodegeneration. We undertook a meta-analysis of public gene expression data for neurodegenerative diseases to identify a common transcriptional signature of neurodegeneration. Using 1,270 post-mortem central nervous system tissue samples from 13 patient cohorts covering four neurodegenerative diseases, we identified 243 differentially expressed genes, which were similarly dysregulated in 15 additional patient cohorts of 205 samples including seven neurodegenerative diseases. This gene signature correlated with histologic disease severity. Metallothioneins featured prominently among differentially expressed genes, and functional pathway analysis identified specific convergent themes of dysregulation. MetaCore network analyses revealed various novel candidate hub genes (e.g. STAU2). Genes associated with M1-polarized macrophages and reactive astrocytes were strongly enriched in the meta-analysis data. Evaluation of genes enriched in neurons revealed 70 down-regulated genes, over half not previously associated with neurodegeneration. Comparison with aging brain data (3 patient cohorts, 221 samples) revealed 53 of these to be unique to neurodegenerative disease, many of which are strong candidates to be important in neuropathogenesis (e.g. NDN, NAP1L2). ENCODE ChIP-seq analysis predicted common upstream transcriptional regulators not associated with normal aging (REST, RBBP5, SIN3A, SP2, YY1, ZNF143, IKZF1). Finally, we removed genes common to neurodegeneration from disease-specific gene signatures, revealing uniquely robust immune response and JAK-STAT signaling in amyotrophic lateral sclerosis. Our results implicate pervasive bioenergetic deficits, M1-type microglial activation and gliosis as unifying themes of neurodegeneration, and identify numerous novel genes associated with neurodegenerative processes.
Integrated Analyses of Gene Expression Profiles Digs out Common Markers for Rheumatic Diseases
Wang, Lan; Wu, Long-Fei; Lu, Xin; Mo, Xing-Bo; Tang, Zai-Xiang; Lei, Shu-Feng; Deng, Fei-Yan
2015-01-01
Objective Rheumatic diseases have some common symptoms. Extensive gene expression studies, accumulated thus far, have successfully identified signature molecules for each rheumatic disease, individually. However, whether there exist shared factors across rheumatic diseases has yet to be tested. Methods We collected and utilized 6 public microarray datasets covering 4 types of representative rheumatic diseases including rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis, and osteoarthritis. Then we detected overlaps of differentially expressed genes across datasets and performed a meta-analysis aiming at identifying common differentially expressed genes that discriminate between pathological cases and normal controls. To further gain insights into the functions of the identified common differentially expressed genes, we conducted gene ontology enrichment analysis and protein-protein interaction analysis. Results We identified a total of eight differentially expressed genes (TNFSF10, CX3CR1, LY96, TLR5, TXN, TIA1, PRKCH, PRF1), each associated with at least 3 of the 4 studied rheumatic diseases. Meta-analysis warranted the significance of the eight genes and highlighted the general significance of four genes (CX3CR1, LY96, TLR5, and PRF1). Protein-protein interaction and gene ontology enrichment analyses indicated that the eight genes interact with each other to exert functions related to immune response and immune regulation. Conclusion The findings support that there exist common factors underlying rheumatic diseases. For rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis and osteoarthritis diseases, those common factors include TNFSF10, CX3CR1, LY96, TLR5, TXN, TIA1, PRKCH, and PRF1. In-depth studies on these common factors may provide keys to understanding the pathogenesis and developing intervention strategies for rheumatic diseases. PMID:26352601
Lack of haplotype structuring for two candidate genes for trypanotolerance in cattle.
Álvarez, I; Pérez-Pardal, L; Traoré, A; Fernández, I; Goyache, F
2016-04-01
Bovine trypanotolerance is a heritable trait associated to the ability of the individuals to control parasitaemia and anaemia. The INHBA (BTA4) and TICAM1 (BTA7) genes are strong candidates for trypanotolerance-related traits. The coding sequence of both genes (3951 bp in total) were analysed in a panel including 79 Asian, African and European cattle (Bos taurus and B. indicus) to identify naturally occurring polymorphisms on both genes. In general, the genetic diversity was low. Nineteen of the 33 mutations identified were found just one time. Seventeen different haplotypes were defined for the TICAM1 gene, and 9 and 12 were defined for the exon 1 and the exon 2 of the INHBA gene, respectively. There was no clear separation between cattle groups. The most frequent haplotypes identified in West African taurine samples were also identified in other cattle groups including Asian zebu and European cattle. Phylogenetic trees and principal component analysis confirmed that divergence among the cattle groups analysed was poor, particularly for the INHBA sequences. The European cattle subset had the lowest values of haplotype diversity for both the exon1 (monomorphic) and the exon2 (0.077 ± 0.066) of the INHBA gene. Neutrality tests, in general, did not suggest that the analysed genes were under positive selection. The assessed scenario would be consistent with the identification of recent mutations in evolutionary terms. © 2015 Blackwell Verlag GmbH.
Zhang, Quan; Zhu, Feng; Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua
2015-01-01
Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as revealed by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus.
Eleven loci with new reproducible genetic associations with allergic disease risk.
Ferreira, Manuel A R; Vonk, Judith M; Baurecht, Hansjörg; Marenholz, Ingo; Tian, Chao; Hoffman, Joshua D; Helmer, Quinta; Tillander, Annika; Ullemar, Vilhelmina; Lu, Yi; Rüschendorf, Franz; Hinds, David A; Hübner, Norbert; Weidinger, Stephan; Magnusson, Patrik K E; Jorgenson, Eric; Lee, Young-Ae; Boomsma, Dorret I; Karlsson, Robert; Almqvist, Catarina; Koppelman, Gerard H; Paternoster, Lavinia
2018-04-19
A recent genome-wide association study (GWAS) identified 99 loci that contain genetic risk variants shared between asthma, hay fever, and eczema. Many more risk loci shared between these common allergic diseases remain to be discovered, which could point to new therapeutic opportunities. We sought to identify novel risk loci shared between asthma, hay fever, and eczema by applying a gene-based test of association to results from a published GWAS that included data from 360,838 subjects. We used approximate conditional analysis to adjust the results from the published GWAS for the effects of the top risk variants identified in that study. We then analyzed the adjusted GWAS results with the EUGENE gene-based approach, which combines evidence for association with disease risk across regulatory variants identified in different tissues. Novel gene-based associations were followed up in an independent sample of 233,898 subjects from the UK Biobank study. Of the 19,432 genes tested, 30 had a significant gene-based association at a Bonferroni-corrected P value of 2.5 × 10 -6 . Of these, 20 were also significantly associated (P < .05/30 = .0016) with disease risk in the replication sample, including 19 that were located in 11 loci not reported to contain allergy risk variants in previous GWASs. Among these were 9 genes with a known function that is directly relevant to allergic disease: FOSL2, VPRBP, IPCEF1, PRR5L, NCF4, APOBR, IL27, ATXN2L, and LAT. For 4 genes (eg, ATXN2L), a genetically determined decrease in gene expression was associated with decreased allergy risk, and therefore drugs that inhibit gene expression or function are predicted to ameliorate disease symptoms. The opposite directional effect was observed for 14 genes, including IL27, a cytokine known to suppress T H 2 responses. Using a gene-based approach, we identified 11 risk loci for allergic disease that were not reported in previous GWASs. Functional studies that investigate the contribution of the 19 associated genes to the pathophysiology of allergic disease and assess their therapeutic potential are warranted. Copyright © 2018 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Li, Fengqi; Cao, Depan; Liu, Yang; Yang, Ting; Wang, Guirong
2015-01-01
The identification of genes under positive selection is a central goal of evolutionary biology. Many legume species, including Phaseolus vulgaris (common bean) and Phaseolus lunatus (lima bean), have important ecological and economic value. In this study, we sequenced and assembled the transcriptome of one Phaseolus species, lima bean. A comparison with the genomes of six other legume species, including the common bean, Medicago, lotus, soybean, chickpea, and pigeonpea, revealed 15 and 4 orthologous groups with signatures of positive selection among the two Phaseolus species and among the seven legume species, respectively. Characterization of these positively selected genes using Non redundant (nr) annotation, gene ontology (GO) classification, GO term enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses revealed that these genes are mostly involved in thylakoids, photosynthesis and metabolism. This study identified genes that may be related to the divergence of the Phaseolus and legume species. These detected genes are particularly good candidates for subsequent functional studies. PMID:26151849
Identifying a gene expression signature of cluster headache in blood
Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.
2017-01-01
Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859
Sima, Chao; Amundson, Sally A.; Zenhausern, Frederic
2018-01-01
Purpose To compile a list of genes that have been reported to be affected by external ionizing radiation (IR) and to assess their performance as candidate biomarkers for individual human radiation dosimetry. Methods Eligible studies were identified through extensive searches of the online databases from 1978 to 2017. Original English-language publications of microarray studies assessing radiation-induced changes in gene expression levels in human blood after external IR were included. Genes identified in at least half of the selected studies were retained for bio-statistical analysis in order to evaluate their diagnostic ability. Results 24 studies met the criteria and were included in this study. Radiation-induced expression of 10,170 unique genes was identified and the 31 genes that have been identified in at least 50% of studies (12/24 studies) were selected for diagnostic power analysis. Twenty-seven genes showed a significant Spearman’s correlation with radiation dose. Individually, TNFSF4, FDXR, MYC, ZMAT3 and GADD45A provided the best discrimination of radiation dose < 2 Gy and dose ≥ 2 Gy according to according to their maximized Youden’s index (0.67, 0.55, 0.55, 0.55 and 0.53 respectively). Moreover, 12 combinations of three genes display an area under the Receiver Operating Curve (ROC) curve (AUC) = 1 reinforcing the concept of biomarker combinations instead of looking for an ideal and unique biomarker. Conclusion Gene expression is a promising approach for radiation dosimetry assessment. A list of robust candidate biomarkers has been identified from analysis of the studies published to date, confirming for example the potential of well-known genes such as FDXR and TNFSF4 or highlighting other promising gene such as ZMAT3. However, heterogeneity in protocols and analysis methods will require additional studies to confirm these results. PMID:29879226
Wu, Lang; Shi, Wei; Long, Jirong; Guo, Xingyi; Michailidou, Kyriaki; Beesley, Jonathan; Bolla, Manjeet K; Shu, Xiao-Ou; Lu, Yingchang; Cai, Qiuyin; Al-Ejeh, Fares; Rozali, Esdy; Wang, Qin; Dennis, Joe; Li, Bingshan; Zeng, Chenjie; Feng, Helian; Gusev, Alexander; Barfield, Richard T; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Aronson, Kristan J; Auer, Paul L; Barrdahl, Myrto; Baynes, Caroline; Beckmann, Matthias W; Benitez, Javier; Bermisheva, Marina; Blomqvist, Carl; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broberg, Per; Brucker, Sara Y; Burwinkel, Barbara; Caldés, Trinidad; Canzian, Federico; Carter, Brian D; Castelao, J Esteban; Chang-Claude, Jenny; Chen, Xiaoqing; Cheng, Ting-Yuan David; Christiansen, Hans; Clarke, Christine L; Collée, Margriet; Cornelissen, Sten; Couch, Fergus J; Cox, David; Cox, Angela; Cross, Simon S; Cunningham, Julie M; Czene, Kamila; Daly, Mary B; Devilee, Peter; Doheny, Kimberly F; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dumont, Martine; Dwek, Miriam; Eccles, Diana M; Eilber, Ursula; Eliassen, A Heather; Engel, Christoph; Eriksson, Mikael; Fachal, Laura; Fasching, Peter A; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gabrielson, Marike; Gago-Dominguez, Manuela; Gapstur, Susan M; García-Closas, Montserrat; Gaudet, Mia M; Ghoussaini, Maya; Giles, Graham G; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Guénel, Pascal; Hahnen, Eric; Haiman, Christopher A; Håkansson, Niclas; Hall, Per; Hallberg, Emily; Hamann, Ute; Harrington, Patricia; Hein, Alexander; Hicks, Belynda; Hillemanns, Peter; Hollestelle, Antoinette; Hoover, Robert N; Hopper, John L; Huang, Guanmengqian; Humphreys, Keith; Hunter, David J; Jakubowska, Anna; Janni, Wolfgang; John, Esther M; Johnson, Nichola; Jones, Kristine; Jones, Michael E; Jung, Audrey; Kaaks, Rudolf; Kerin, Michael J; Khusnutdinova, Elza; Kosma, Veli-Matti; Kristensen, Vessela N; Lambrechts, Diether; Le Marchand, Loic; Li, Jingmei; Lindström, Sara; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Lubinski, Jan; Luccarini, Craig; Lux, Michael P; MacInnis, Robert J; Maishman, Tom; Kostovska, Ivana Maleva; Mannermaa, Arto; Manson, JoAnn E; Margolin, Sara; Mavroudis, Dimitrios; Meijers-Heijboer, Hanne; Meindl, Alfons; Menon, Usha; Meyer, Jeffery; Mulligan, Anna Marie; Neuhausen, Susan L; Nevanlinna, Heli; Neven, Patrick; Nielsen, Sune F; Nordestgaard, Børge G; Olopade, Olufunmilayo I; Olson, Janet E; Olsson, Håkan; Peterlongo, Paolo; Peto, Julian; Plaseska-Karanfilska, Dijana; Prentice, Ross; Presneau, Nadege; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rennert, Gad; Rennert, Hedy S; Rhenius, Valerie; Romero, Atocha; Romm, Jane; Rudolph, Anja; Saloustros, Emmanouil; Sandler, Dale P; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Schneeweiss, Andreas; Scott, Rodney J; Scott, Christopher G; Seal, Sheila; Shah, Mitul; Shrubsole, Martha J; Smeets, Ann; Southey, Melissa C; Spinelli, John J; Stone, Jennifer; Surowy, Harald; Swerdlow, Anthony J; Tamimi, Rulla M; Tapper, William; Taylor, Jack A; Terry, Mary Beth; Tessier, Daniel C; Thomas, Abigail; Thöne, Kathrin; Tollenaar, Rob A E M; Torres, Diana; Truong, Thérèse; Untch, Michael; Vachon, Celine; Van Den Berg, David; Vincent, Daniel; Waisfisz, Quinten; Weinberg, Clarice R; Wendt, Camilla; Whittemore, Alice S; Wildiers, Hans; Willett, Walter C; Winqvist, Robert; Wolk, Alicja; Xia, Lucy; Yang, Xiaohong R; Ziogas, Argyrios; Ziv, Elad; Dunning, Alison M; Pharoah, Paul D P; Simard, Jacques; Milne, Roger L; Edwards, Stacey L; Kraft, Peter; Easton, Douglas F; Chenevix-Trench, Georgia; Zheng, Wei
2018-06-18
The breast cancer risk variants identified in genome-wide association studies explain only a small fraction of the familial relative risk, and the genes responsible for these associations remain largely unknown. To identify novel risk loci and likely causal genes, we performed a transcriptome-wide association study evaluating associations of genetically predicted gene expression with breast cancer risk in 122,977 cases and 105,974 controls of European ancestry. We used data from the Genotype-Tissue Expression Project to establish genetic models to predict gene expression in breast tissue and evaluated model performance using data from The Cancer Genome Atlas. Of the 8,597 genes evaluated, significant associations were identified for 48 at a Bonferroni-corrected threshold of P < 5.82 × 10 -6 , including 14 genes at loci not yet reported for breast cancer. We silenced 13 genes and showed an effect for 11 on cell proliferation and/or colony-forming efficiency. Our study provides new insights into breast cancer genetics and biology.
The sieve element occlusion gene family in dicotyledonous plants
Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Noll, Gundula A
2011-01-01
Sieve element occlusion (SEO) genes encoding forisome subunits have been identified in Medicago truncatula and other legumes. Forisomes are structural phloem proteins uniquely found in Fabaceae sieve elements. They undergo a reversible conformational change after wounding, from a condensed to a dispersed state, thereby blocking sieve tube translocation and preventing the loss of photoassimilates. Recently, we identified SEO genes in several non-Fabaceae plants (lacking forisomes) and concluded that they most probably encode conventional non-forisome P-proteins. Molecular and phylogenetic analysis of the SEO gene family has identified domains that are characteristic for SEO proteins. Here, we extended our phylogenetic analysis by including additional SEO genes from several diverse species based on recently published genomic data. Our results strengthen the original assumption that SEO genes seem to be widespread in dicotyledonous angiosperms, and further underline the divergent evolution of SEO genes within the Fabaceae. PMID:21422825
The sieve element occlusion gene family in dicotyledonous plants.
Ernst, Antonia M; Rüping, Boris; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A
2011-01-01
Sieve element occlusion (SEO) genes encoding forisome subunits have been identified in Medicago truncatula and other legumes. Forisomes are structural phloem proteins uniquely found in Fabaceae sieve elements. They undergo a reversible conformational change after wounding, from a condensed to a dispersed state, thereby blocking sieve tube translocation and preventing the loss of photoassimilates. Recently, we identified SEO genes in several non-Fabaceae plants (lacking forisomes) and concluded that they most probably encode conventional non-forisome P-proteins. Molecular and phylogenetic analysis of the SEO gene family has identified domains that are characteristic for SEO proteins. Here, we extended our phylogenetic analysis by including additional SEO genes from several diverse species based on recently published genomic data. Our results strengthen the original assumption that SEO genes seem to be widespread in dicotyledonous angiosperms, and further underline the divergent evolution of SEO genes within the Fabaceae.
Identifying metabolic enzymes with multiple types of association evidence
Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M
2006-01-01
Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130
2010-01-01
Background Parkinson's disease is the second most common neurodegenerative disorder. The pathological hallmark of the disease is degeneration of midbrain dopaminergic neurons. Genetic association studies have linked 13 human chromosomal loci to Parkinson's disease. Identification of gene(s), as part of the etiology of Parkinson's disease, within the large number of genes residing in these loci can be achieved through several approaches, including screening methods, and considering appropriate criteria. Since several of the indentified Parkinson's disease genes are expressed in substantia nigra pars compact of the midbrain, expression within the neurons of this area could be a suitable criterion to limit the number of candidates and identify PD genes. Methods In this work we have used the combination of findings from six rodent transcriptome analysis studies on the gene expression profile of midbrain dopaminergic neurons and the PARK loci in OMIM (Online Mendelian Inheritance in Man) database, to identify new candidate genes for Parkinson's disease. Results Merging the two datasets, we identified 20 genes within PARK loci, 7 of which are located in an orphan Parkinson's disease locus and one, which had been identified as a disease gene. In addition to identifying a set of candidates for further genetic association studies, these results show that the criteria of expression in midbrain dopaminergic neurons may be used to narrow down the number of genes in PARK loci for such studies. PMID:20716345
Prioritizing causal disease genes using unbiased genomic features.
Deo, Rahul C; Musso, Gabriel; Tasan, Murat; Tang, Paul; Poon, Annie; Yuan, Christiana; Felix, Janine F; Vasan, Ramachandran S; Beroukhim, Rameen; De Marco, Teresa; Kwok, Pui-Yan; MacRae, Calum A; Roth, Frederick P
2014-12-03
Cardiovascular disease (CVD) is the leading cause of death in the developed world. Human genetic studies, including genome-wide sequencing and SNP-array approaches, promise to reveal disease genes and mechanisms representing new therapeutic targets. In practice, however, identification of the actual genes contributing to disease pathogenesis has lagged behind identification of associated loci, thus limiting the clinical benefits. To aid in localizing causal genes, we develop a machine learning approach, Objective Prioritization for Enhanced Novelty (OPEN), which quantitatively prioritizes gene-disease associations based on a diverse group of genomic features. This approach uses only unbiased predictive features and thus is not hampered by a preference towards previously well-characterized genes. We demonstrate success in identifying genetic determinants for CVD-related traits, including cholesterol levels, blood pressure, and conduction system and cardiomyopathy phenotypes. Using OPEN, we prioritize genes, including FLNC, for association with increased left ventricular diameter, which is a defining feature of a prevalent cardiovascular disorder, dilated cardiomyopathy or DCM. Using a zebrafish model, we experimentally validate FLNC and identify a novel FLNC splice-site mutation in a patient with severe DCM. Our approach stands to assist interpretation of large-scale genetic studies without compromising their fundamentally unbiased nature.
Bii, Victor M; Rae, Dustin T; Trobridge, Grant D
2015-11-24
Breast cancer (BC) is the second leading cause of malignancy among U.S. women. Metastasis results in a poor prognosis and increased mortality, but the molecular mechanisms by which metastatic tumors occur are not well understood. Identifying the genes that drive the metastatic process could provide targets for improved therapy and biomarkers to improve BC patient outcomes. Using a forward mutagenesis screen, BC cells mutagenized with a replication-incompetent gammaretroviral vector (γRV) were xenotransplanted into the mammary fat pad of immunodeficient mice. In this approach the vector provirus dysregulates nearby genes, providing a selective advantage to transduced cells to form metastases. Metastatic tumors were analyzed for proviral integration sites to identify nearby candidate metastasis genes. The γRV has a transgene cassette that allows for rescue in bacteria and rapid identification of vector integration sites. Using this approach, we identified the previously described metastasis gene WWTR1 (TAZ), and three other novel candidate metastasis genes including SHARPIN. SHARPIN was independently validated in vivo as a BC metastasis gene. Analysis of patient data showed that SHARPIN expression predicts metastasis-free survival after adjuvant therapy. Our approach has broad potential to identify genes involved in oncogenic processes for BC and other cancers. We show here it can identify both known (WWTR1) and novel (SHARPIN) BC metastasis genes.
Li, Guosheng; Jagadeeswaran, Guru; Mort, Andrew; Sunkar, Ramanjulu
2017-01-01
Histone modifications represent the crux of epigenetic gene regulation essential for most biological processes including abiotic stress responses in plants. Thus, identification of histone modifications at the genome-scale can provide clues for how some genes are 'turned-on' while some others are "turned-off" in response to stress. This chapter details a step-by-step protocol for identifying genome-wide histone modifications associated with stress-responsive gene regulation using chromatin immunoprecipitation (ChIP) followed by sequencing of the DNA (ChIP-seq).
Li, Ming-Rui; Shi, Feng-Xue; Li, Ya-Ling; Jiang, Peng; Jiao, Lili; Liu, Bao; Li, Lin-Feng
2017-09-01
Chinese ginseng (Panax ginseng Meyer) is a medicinally important herb and plays crucial roles in traditional Chinese medicine. Pharmacological analyses identified diverse bioactive components from Chinese ginseng. However, basic biological attributes including domestication and selection of the ginseng plant remain under-investigated. Here, we presented a genome-wide view of the domestication and selection of cultivated ginseng based on the whole genome data. A total of 8,660 protein-coding genes were selected for genome-wide scanning of the 30 wild and cultivated ginseng accessions. In complement, the 45s rDNA, chloroplast and mitochondrial genomes were included to perform phylogenetic and population genetic analyses. The observed spatial genetic structure between northern cultivated ginseng (NCG) and southern cultivated ginseng (SCG) accessions suggested multiple independent origins of cultivated ginseng. Genome-wide scanning further demonstrated that NCG and SCG have undergone distinct selection pressures during the domestication process, with more genes identified in the NCG (97 genes) than in the SCG group (5 genes). Functional analyses revealed that these genes are involved in diverse pathways, including DNA methylation, lignin biosynthesis, and cell differentiation. These findings suggested that the SCG and NCG groups have distinct demographic histories. Candidate genes identified are useful for future molecular breeding of cultivated ginseng. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Lu, Xiangfeng; Peloso, Gina M; Liu, Dajiang J; Wu, Ying; Zhang, He; Zhou, Wei; Li, Jun; Tang, Clara Sze-Man; Dorajoo, Rajkumar; Li, Huaixing; Long, Jirong; Guo, Xiuqing; Xu, Ming; Spracklen, Cassandra N; Chen, Yang; Liu, Xuezhen; Zhang, Yan; Khor, Chiea Chuen; Liu, Jianjun; Sun, Liang; Wang, Laiyuan; Gao, Yu-Tang; Hu, Yao; Yu, Kuai; Wang, Yiqin; Cheung, Chloe Yu Yan; Wang, Feijie; Huang, Jianfeng; Fan, Qiao; Cai, Qiuyin; Chen, Shufeng; Shi, Jinxiu; Yang, Xueli; Zhao, Wanting; Sheu, Wayne H-H; Cherny, Stacey Shawn; He, Meian; Feranil, Alan B; Adair, Linda S; Gordon-Larsen, Penny; Du, Shufa; Varma, Rohit; Chen, Yii-Der Ida; Shu, Xiao-Ou; Lam, Karen Siu Ling; Wong, Tien Yin; Ganesh, Santhi K; Mo, Zengnan; Hveem, Kristian; Fritsche, Lars G; Nielsen, Jonas Bille; Tse, Hung-Fat; Huo, Yong; Cheng, Ching-Yu; Chen, Y Eugene; Zheng, Wei; Tai, E Shyong; Gao, Wei; Lin, Xu; Huang, Wei; Abecasis, Goncalo; Kathiresan, Sekar; Mohlke, Karen L; Wu, Tangchun; Sham, Pak Chung; Gu, Dongfeng; Willer, Cristen J
2017-12-01
Most genome-wide association studies have been of European individuals, even though most genetic variation in humans is seen only in non-European samples. To search for novel loci associated with blood lipid levels and clarify the mechanism of action at previously identified lipid loci, we used an exome array to examine protein-coding genetic variants in 47,532 East Asian individuals. We identified 255 variants at 41 loci that reached chip-wide significance, including 3 novel loci and 14 East Asian-specific coding variant associations. After a meta-analysis including >300,000 European samples, we identified an additional nine novel loci. Sixteen genes were identified by protein-altering variants in both East Asians and Europeans, and thus are likely to be functional genes. Our data demonstrate that most of the low-frequency or rare coding variants associated with lipids are population specific, and that examining genomic data across diverse ancestries may facilitate the identification of functional genes at associated loci.
USDA-ARS?s Scientific Manuscript database
Parastagonospora nodorum is a necrotrophic fungal pathogen causing Septoria nodorum blotch (SNB) on wheat. We have identified nine necrotrophic effector-host dominant sensitivity gene interactions, and we have cloned three of the necrotrophic effector (NE) genes, including SnToxA, SnTox1 and SnTox3...
Wei, Lin; Tang, Ruqi; Lian, Baofeng; Zhao, Yingjun; He, Xianghuo; Xie, Lu
2014-01-01
Background Recently, a number of studies have performed genome or exome sequencing of hepatocellular carcinoma (HCC) and identified hundreds or even thousands of mutations in protein-coding genes. However, these studies have only focused on a limited number of candidate genes, and many important mutation resources remain to be explored. Principal Findings In this study, we integrated mutation data obtained from various sources and performed pathway and network analysis. We identified 113 pathways that were significantly mutated in HCC samples and found that the mutated genes included in these pathways contained high percentages of known cancer genes, and damaging genes and also demonstrated high conservation scores, indicating their important roles in liver tumorigenesis. Five classes of pathways that were mutated most frequently included (a) proliferation and apoptosis related pathways, (b) tumor microenvironment related pathways, (c) neural signaling related pathways, (d) metabolic related pathways, and (e) circadian related pathways. Network analysis further revealed that the mutated genes with the highest betweenness coefficients, such as the well-known cancer genes TP53, CTNNB1 and recently identified novel mutated genes GNAL and the ADCY family, may play key roles in these significantly mutated pathways. Finally, we highlight several key genes (e.g., RPS6KA3 and PCLO) and pathways (e.g., axon guidance) in which the mutations were associated with clinical features. Conclusions Our workflow illustrates the increased statistical power of integrating multiple studies of the same subject, which can provide biological insights that would otherwise be masked under individual sample sets. This type of bioinformatics approach is consistent with the necessity of making the best use of the ever increasing data provided in valuable databases, such as TCGA, to enhance the speed of deciphering human cancers. PMID:24988079
Zhang, Yuannv; Qiu, Zhaoping; Wei, Lin; Tang, Ruqi; Lian, Baofeng; Zhao, Yingjun; He, Xianghuo; Xie, Lu
2014-01-01
Recently, a number of studies have performed genome or exome sequencing of hepatocellular carcinoma (HCC) and identified hundreds or even thousands of mutations in protein-coding genes. However, these studies have only focused on a limited number of candidate genes, and many important mutation resources remain to be explored. In this study, we integrated mutation data obtained from various sources and performed pathway and network analysis. We identified 113 pathways that were significantly mutated in HCC samples and found that the mutated genes included in these pathways contained high percentages of known cancer genes, and damaging genes and also demonstrated high conservation scores, indicating their important roles in liver tumorigenesis. Five classes of pathways that were mutated most frequently included (a) proliferation and apoptosis related pathways, (b) tumor microenvironment related pathways, (c) neural signaling related pathways, (d) metabolic related pathways, and (e) circadian related pathways. Network analysis further revealed that the mutated genes with the highest betweenness coefficients, such as the well-known cancer genes TP53, CTNNB1 and recently identified novel mutated genes GNAL and the ADCY family, may play key roles in these significantly mutated pathways. Finally, we highlight several key genes (e.g., RPS6KA3 and PCLO) and pathways (e.g., axon guidance) in which the mutations were associated with clinical features. Our workflow illustrates the increased statistical power of integrating multiple studies of the same subject, which can provide biological insights that would otherwise be masked under individual sample sets. This type of bioinformatics approach is consistent with the necessity of making the best use of the ever increasing data provided in valuable databases, such as TCGA, to enhance the speed of deciphering human cancers.
Lockyer, Anne E; Spinks, Jenny; Kane, Richard A; Hoffmann, Karl F; Fitzpatrick, Jennifer M; Rollinson, David; Noble, Leslie R; Jones, Catherine S
2008-01-01
Background Biomphalaria glabrata is an intermediate snail host for Schistosoma mansoni, one of the important schistosomes infecting man. B. glabrata/S. mansoni provides a useful model system for investigating the intimate interactions between host and parasite. Examining differential gene expression between S. mansoni-exposed schistosome-resistant and susceptible snail lines will identify genes and pathways that may be involved in snail defences. Results We have developed a 2053 element cDNA microarray for B. glabrata containing clones from ORESTES (Open Reading frame ESTs) libraries, suppression subtractive hybridization (SSH) libraries and clones identified in previous expression studies. Snail haemocyte RNA, extracted from parasite-challenged resistant and susceptible snails, 2 to 24 h post-exposure to S. mansoni, was hybridized to the custom made cDNA microarray and 98 differentially expressed genes or gene clusters were identified, 94 resistant-associated and 4 susceptible-associated. Quantitative PCR analysis verified the cDNA microarray results for representative transcripts. Differentially expressed genes were annotated and clustered using gene ontology (GO) terminology and Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathway analysis. 61% of the identified differentially expressed genes have no known function including the 4 susceptible strain-specific transcripts. Resistant strain-specific expression of genes implicated in innate immunity of invertebrates was identified, including hydrolytic enzymes such as cathepsin L, a cysteine proteinase involved in lysis of phagocytosed particles; metabolic enzymes such as ornithine decarboxylase, the rate-limiting enzyme in the production of polyamines, important in inflammation and infection processes, as well as scavenging damaging free radicals produced during production of reactive oxygen species; stress response genes such as HSP70; proteins involved in signalling, such as importin 7 and copine 1, cytoplasmic intermediate filament (IF) protein and transcription enzymes such as elongation factor 1α and EF-2. Conclusion Production of the first cDNA microarray for profiling gene expression in B. glabrata provides a foundation for expanding our understanding of pathways and genes involved in the snail internal defence system (IDS). We demonstrate resistant strain-specific expression of genes potentially associated with the snail IDS, ranging from signalling and inflammation responses through to lysis of proteinacous products (encapsulated sporocysts or phagocytosed parasite components) and processing/degradation of these targeted products by ubiquitination. PMID:19114004
Lockyer, Anne E; Spinks, Jenny; Kane, Richard A; Hoffmann, Karl F; Fitzpatrick, Jennifer M; Rollinson, David; Noble, Leslie R; Jones, Catherine S
2008-12-29
Biomphalaria glabrata is an intermediate snail host for Schistosoma mansoni, one of the important schistosomes infecting man. B. glabrata/S. mansoni provides a useful model system for investigating the intimate interactions between host and parasite. Examining differential gene expression between S. mansoni-exposed schistosome-resistant and susceptible snail lines will identify genes and pathways that may be involved in snail defences. We have developed a 2053 element cDNA microarray for B. glabrata containing clones from ORESTES (Open Reading frame ESTs) libraries, suppression subtractive hybridization (SSH) libraries and clones identified in previous expression studies. Snail haemocyte RNA, extracted from parasite-challenged resistant and susceptible snails, 2 to 24 h post-exposure to S. mansoni, was hybridized to the custom made cDNA microarray and 98 differentially expressed genes or gene clusters were identified, 94 resistant-associated and 4 susceptible-associated. Quantitative PCR analysis verified the cDNA microarray results for representative transcripts. Differentially expressed genes were annotated and clustered using gene ontology (GO) terminology and Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathway analysis. 61% of the identified differentially expressed genes have no known function including the 4 susceptible strain-specific transcripts. Resistant strain-specific expression of genes implicated in innate immunity of invertebrates was identified, including hydrolytic enzymes such as cathepsin L, a cysteine proteinase involved in lysis of phagocytosed particles; metabolic enzymes such as ornithine decarboxylase, the rate-limiting enzyme in the production of polyamines, important in inflammation and infection processes, as well as scavenging damaging free radicals produced during production of reactive oxygen species; stress response genes such as HSP70; proteins involved in signalling, such as importin 7 and copine 1, cytoplasmic intermediate filament (IF) protein and transcription enzymes such as elongation factor 1alpha and EF-2. Production of the first cDNA microarray for profiling gene expression in B. glabrata provides a foundation for expanding our understanding of pathways and genes involved in the snail internal defence system (IDS). We demonstrate resistant strain-specific expression of genes potentially associated with the snail IDS, ranging from signalling and inflammation responses through to lysis of proteinacous products (encapsulated sporocysts or phagocytosed parasite components) and processing/degradation of these targeted products by ubiquitination.
Raelson, John V; Little, Randall D; Ruether, Andreas; Fournier, Hélène; Paquin, Bruno; Van Eerdewegh, Paul; Bradley, W E C; Croteau, Pascal; Nguyen-Huu, Quynh; Segal, Jonathan; Debrus, Sophie; Allard, René; Rosenstiel, Philip; Franke, Andre; Jacobs, Gunnar; Nikolaus, Susanna; Vidal, Jean-Michel; Szego, Peter; Laplante, Nathalie; Clark, Hilary F; Paulussen, René J; Hooper, John W; Keith, Tim P; Belouchi, Abdelmajid; Schreiber, Stefan
2007-09-11
Genome-wide association (GWA) studies offer a powerful unbiased method for the identification of multiple susceptibility genes for complex diseases. Here we report the results of a GWA study for Crohn's disease (CD) using family trios from the Quebec Founder Population (QFP). Haplotype-based association analyses identified multiple regions associated with the disease that met the criteria for genome-wide significance, with many containing a gene whose function appears relevant to CD. A proportion of these were replicated in two independent German Caucasian samples, including the established CD loci NOD2 and IBD5. The recently described IL23R locus was also identified and replicated. For this region, multiple individuals with all major haplotypes in the QFP were sequenced and extensive fine mapping performed to identify risk and protective alleles. Several additional loci, including a region on 3p21 containing several plausible candidate genes, a region near JAKMIP1 on 4p16.1, and two larger regions on chromosome 17 were replicated. Together with previously published loci, the spectrum of CD genes identified to date involves biochemical networks that affect epithelial defense mechanisms, innate and adaptive immune response, and the repair or remodeling of tissue.
Song, Jae-Jun; Kwon, Jee Young; Park, Moo Kyun; Seo, Young Rok
2013-10-01
The primary aim of this study is to reveal the effect of particulate matter (PM) on the human middle ear epithelial cell (HMEEC). The HMEEC was treated with PM (300 μg/ml) for 24 h. Total RNA was extracted and used for microarray analysis. Molecular pathways among differentially expressed genes were further analyzed by using Pathway Studio 9.0 software. For selected genes, the changes in gene expression were confirmed by real-time PCR. A total of 611 genes were regulated by PM. Among them, 366 genes were up-regulated, whereas 245 genes were down-regulated. Up-regulated genes were mainly involved in cellular processes, including reactive oxygen species generation, cell proliferation, apoptosis, cell differentiation, inflammatory response and immune response. Down-regulated genes affected several cellular processes, including cell differentiation, cell cycle, proliferation, apoptosis and cell migration. A total of 21 genes were discovered as crucial components in potential signaling networks containing 2-fold up regulated genes. Four genes, VEGFA, IL1B, CSF2 and HMOX1 were revealed as key mediator genes among the up-regulated genes. A total of 25 genes were revealed as key modulators in the signaling pathway associated with 2-fold down regulated genes. Four genes, including IGF1R, TIMP1, IL6 and FN1, were identified as the main modulator genes. We identified the differentially expressed genes in PM-treated HMEEC, whose expression profile may provide a useful clue for the understanding of environmental pathophysiology of otitis media. Our work indicates that air pollution, like PM, plays an important role in the pathogenesis of otitis media. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Gao, Ji; Li, Hongyan; Liu, Lei; Song, Lide; Lv, Yanting; Han, Yuping
2017-12-01
The aim of the present study was to investigate risk-related microRNAs (miRs) for bladder urothelial carcinoma (BUC) prognosis. Clinical and microRNA expression data downloaded from the Cancer Genome Atlas were utilized for survival analysis. Risk factor estimation was performed using Cox's proportional regression analysis. A microRNA-regulated target gene network was constructed and presented using Cytoscape. In addition, the Database for Annotation, Visualization and Integrated Discovery was used for Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes pathway enrichment, followed by protein-protein interaction (PPI) network analysis. Finally, the K-clique method was applied to analyze sub-pathways. A total of 16 significant microRNAs, including hsa-miR-3622a and hsa-miR-29a, were identified (P<0.05). Following Cox's proportional regression analysis, hsa-miR-29a was screened as a prognostic marker of BUC risk (P=0.0449). A regulation network of hsa-miR-29a comprising 417 target genes was constructed. These target genes were primarily enriched in GO terms, including collagen fibril organization, extracellular matrix (ECM) organization and pathways, such as focal adhesion (P<0.05). A PPI network including 197 genes and 510 interactions, was constructed. The top 21 genes in the network module were enriched in GO terms, including collagen fibril organization and pathways, such as ECM receptor interaction (P<0.05). Finally, 4 sub-pathways of cysteine and methionine metabolism, including paths 00270_4, 00270_1, 00270_2 and 00270_5, were obtained (P<0.01) and identified to be enriched through DNA (cytosine-5)-methyltransferase ( DNMT)3A, DNMT3B , methionine adenosyltransferase 2α ( MAT2A ) and spermine synthase ( SMS ). The identified microRNAs, particularly hsa-miR-29a and its 4 associated target genes DNMT3A, DNMT3B, MAT2A and SMS , may participate in the prognostic risk mechanism of BUC.
Paulo, Paula; Maia, Sofia; Pinto, Carla; Pinto, Pedro; Monteiro, Augusta; Peixoto, Ana; Teixeira, Manuel R
2018-04-01
Considering that mutations in known prostate cancer (PrCa) predisposition genes, including those responsible for hereditary breast/ovarian cancer and Lynch syndromes, explain less than 5% of early-onset/familial PrCa, we have sequenced 94 genes associated with cancer predisposition using next generation sequencing (NGS) in a series of 121 PrCa patients. We found monoallelic truncating/functionally deleterious mutations in seven genes, including ATM and CHEK2, which have previously been associated with PrCa predisposition, and five new candidate PrCa associated genes involved in cancer predisposing recessive disorders, namely RAD51C, FANCD2, FANCI, CEP57 and RECQL4. Furthermore, using in silico pathogenicity prediction of missense variants among 18 genes associated with breast/ovarian cancer and/or Lynch syndrome, followed by KASP genotyping in 710 healthy controls, we identified "likely pathogenic" missense variants in ATM, BRIP1, CHEK2 and TP53. In conclusion, this study has identified putative PrCa predisposing germline mutations in 14.9% of early-onset/familial PrCa patients. Further data will be necessary to confirm the genetic heterogeneity of inherited PrCa predisposition hinted in this study.
Identification and function analysis of contrary genes in Dupuytren's contracture.
Ji, Xianglu; Tian, Feng; Tian, Lijie
2015-07-01
The present study aimed to analyze the expression of genes involved in Dupuytren's contracture (DC), using bioinformatic methods. The profile of GSE21221 was downloaded from the gene expression ominibus, which included six samples, derived from fibroblasts and six healthy control samples, derived from carpal-tunnel fibroblasts. A Distributed Intrusion Detection System was used in order to identify differentially expressed genes. The term contrary genes is proposed. Contrary genes were the genes that exhibited opposite expression patterns in the positive and negative groups, and likely exhibited opposite functions. These were identified using Coexpress software. Gene ontology (GO) function analysis was conducted for the contrary genes. A network of GO terms was constructed using the reduce and visualize gene ontology database. Significantly expressed genes (801) and contrary genes (98) were screened. A significant association was observed between Chitinase-3-like protein 1 and ten genes in the positive gene set. Positive regulation of transcription and the activation of nuclear factor-κB (NF-κB)-inducing kinase activity exhibited the highest degree values in the network of GO terms. In the present study, the expression of genes involved in the development of DC was analyzed, and the concept of contrary genes proposed. The genes identified in the present study are involved in the positive regulation of transcription and activation of NF-κB-inducing kinase activity. The contrary genes and GO terms identified in the present study may potentially be used for DC diagnosis and treatment.
Verslues, Paul E.; Lasky, Jesse R.; Juenger, Thomas E.; Liu, Tzu-Wen; Kumar, M. Nagaraj
2014-01-01
Arabidopsis (Arabidopsis thaliana) exhibits natural genetic variation in drought response, including varying levels of proline (Pro) accumulation under low water potential. As Pro accumulation is potentially important for stress tolerance and cellular redox control, we conducted a genome-wide association (GWAS) study of low water potential-induced Pro accumulation using a panel of natural accessions and publicly available single-nucleotide polymorphism (SNP) data sets. Candidate genomic regions were prioritized for subsequent study using metrics considering both the strength and spatial clustering of the association signal. These analyses found many candidate regions likely containing gene(s) influencing Pro accumulation. Reverse genetic analysis of several candidates identified new Pro effector genes, including thioredoxins and several genes encoding Universal Stress Protein A domain proteins. These new Pro effector genes further link Pro accumulation to cellular redox and energy status. Additional new Pro effector genes found include the mitochondrial protease LON1, ribosomal protein RPL24A, protein phosphatase 2A subunit A3, a MADS box protein, and a nucleoside triphosphate hydrolase. Several of these new Pro effector genes were from regions with multiple SNPs, each having moderate association with Pro accumulation. This pattern supports the use of summary approaches that incorporate clusters of SNP associations in addition to consideration of individual SNP probability values. Further GWAS-guided reverse genetics promises to find additional effectors of Pro accumulation. The combination of GWAS and reverse genetics to efficiently identify new effector genes may be especially applicable for traits difficult to analyze by other genetic screening methods. PMID:24218491
McKay, James D; Hung, Rayjean J; Han, Younghun; Zong, Xuchen; Carreras-Torres, Robert; Christiani, David C; Caporaso, Neil E; Johansson, Mattias; Xiao, Xiangjun; Li, Yafang; Byun, Jinyoung; Dunning, Alison; Pooley, Karen A; Qian, David C; Ji, Xuemei; Liu, Geoffrey; Timofeeva, Maria N; Bojesen, Stig E; Wu, Xifeng; Le Marchand, Loic; Albanes, Demetrios; Bickeböller, Heike; Aldrich, Melinda C; Bush, William S; Tardon, Adonina; Rennert, Gad; Teare, M Dawn; Field, John K; Kiemeney, Lambertus A; Lazarus, Philip; Haugen, Aage; Lam, Stephen; Schabath, Matthew B; Andrew, Angeline S; Shen, Hongbing; Hong, Yun-Chul; Yuan, Jian-Min; Bertazzi, Pier Alberto; Pesatori, Angela C; Ye, Yuanqing; Diao, Nancy; Su, Li; Zhang, Ruyang; Brhane, Yonathan; Leighl, Natasha; Johansen, Jakob S; Mellemgaard, Anders; Saliba, Walid; Haiman, Christopher A; Wilkens, Lynne R; Fernandez-Somoano, Ana; Fernandez-Tardon, Guillermo; van der Heijden, Henricus F M; Kim, Jin Hee; Dai, Juncheng; Hu, Zhibin; Davies, Michael P A; Marcus, Michael W; Brunnström, Hans; Manjer, Jonas; Melander, Olle; Muller, David C; Overvad, Kim; Trichopoulou, Antonia; Tumino, Rosario; Doherty, Jennifer A; Barnett, Matt P; Chen, Chu; Goodman, Gary E; Cox, Angela; Taylor, Fiona; Woll, Penella; Brüske, Irene; Wichmann, H-Erich; Manz, Judith; Muley, Thomas R; Risch, Angela; Rosenberger, Albert; Grankvist, Kjell; Johansson, Mikael; Shepherd, Frances A; Tsao, Ming-Sound; Arnold, Susanne M; Haura, Eric B; Bolca, Ciprian; Holcatova, Ivana; Janout, Vladimir; Kontic, Milica; Lissowska, Jolanta; Mukeria, Anush; Ognjanovic, Simona; Orlowski, Tadeusz M; Scelo, Ghislaine; Swiatkowska, Beata; Zaridze, David; Bakke, Per; Skaug, Vidar; Zienolddiny, Shanbeh; Duell, Eric J; Butler, Lesley M; Koh, Woon-Puay; Gao, Yu-Tang; Houlston, Richard S; McLaughlin, John; Stevens, Victoria L; Joubert, Philippe; Lamontagne, Maxime; Nickle, David C; Obeidat, Ma'en; Timens, Wim; Zhu, Bin; Song, Lei; Kachuri, Linda; Artigas, María Soler; Tobin, Martin D; Wain, Louise V; Rafnar, Thorunn; Thorgeirsson, Thorgeir E; Reginsson, Gunnar W; Stefansson, Kari; Hancock, Dana B; Bierut, Laura J; Spitz, Margaret R; Gaddis, Nathan C; Lutz, Sharon M; Gu, Fangyi; Johnson, Eric O; Kamal, Ahsan; Pikielny, Claudio; Zhu, Dakai; Lindströem, Sara; Jiang, Xia; Tyndale, Rachel F; Chenevix-Trench, Georgia; Beesley, Jonathan; Bossé, Yohan; Chanock, Stephen; Brennan, Paul; Landi, Maria Teresa; Amos, Christopher I
2017-07-01
Although several lung cancer susceptibility loci have been identified, much of the heritability for lung cancer remains unexplained. Here 14,803 cases and 12,262 controls of European descent were genotyped on the OncoArray and combined with existing data for an aggregated genome-wide association study (GWAS) analysis of lung cancer in 29,266 cases and 56,450 controls. We identified 18 susceptibility loci achieving genome-wide significance, including 10 new loci. The new loci highlight the striking heterogeneity in genetic susceptibility across the histological subtypes of lung cancer, with four loci associated with lung cancer overall and six loci associated with lung adenocarcinoma. Gene expression quantitative trait locus (eQTL) analysis in 1,425 normal lung tissue samples highlights RNASET2, SECISBP2L and NRG1 as candidate genes. Other loci include genes such as a cholinergic nicotinic receptor, CHRNA2, and the telomere-related genes OFBC1 and RTEL1. Further exploration of the target genes will continue to provide new insights into the etiology of lung cancer.
Computational approaches were developed to identify factors that regulate Nrf2 in a large gene expression compendium of microarray profiles including >2000 comparisons which queried the effects of chemicals, genes, diets, and infectious agents on gene expression in the mouse l...
Jazayeri, Roshanak; Hu, Hao; Fattahi, Zohreh; Musante, Luciana; Abedini, Seyedeh Sedigheh; Hosseini, Masoumeh; Wienker, Thomas F; Ropers, Hans Hilger; Najmabadi, Hossein; Kahrizi, Kimia
2015-10-01
Intellectual disability (ID) is a neuro-developmental disorder which causes considerable socio-economic problems. Some ID individuals are also affected by ataxia, and the condition includes different mutations affecting several genes. We used whole exome sequencing (WES) in combination with homozygosity mapping (HM) to identify the genetic defects in five consanguineous families among our cohort study, with two affected children with ID and ataxia as major clinical symptoms. We identified three novel candidate genes, RIPPLY1, MRPL10, SNX14, and a new mutation in known gene SURF1. All are autosomal genes, except RIPPLY1, which is located on the X chromosome. Two are housekeeping genes, implicated in transcription and translation regulation and intracellular trafficking, and two encode mitochondrial proteins. The pathogenesis of these variants was evaluated by mutation classification, bioinformatic methods, review of medical and biological relevance, co-segregation studies in the particular family, and a normal population study. Linkage analysis and exome sequencing of a small number of affected family members is a powerful new technique which can be used to decrease the number of candidate genes in heterogenic disorders such as ID, and may even identify the responsible gene(s).
Candidate Chemosensory Genes in the Stemborer Sesamia nonagrioides
Glaser, Nicolas; Gallot, Aurore; Legeai, Fabrice; Montagné, Nicolas; Poivet, Erwan; Harry, Myriam; Calatayud, Paul-André; Jacquin-Joly, Emmanuelle
2013-01-01
The stemborer Sesamia nonagrioides is an important pest of maize in the Mediterranean Basin. Like other moths, this noctuid uses its chemosensory system to efficiently interact with its environment. However, very little is known on the molecular mechanisms that underlie chemosensation in this species. Here, we used next-generation sequencing (454 and Illumina) on different tissues from adult and larvae, including chemosensory organs and female ovipositors, to describe the chemosensory transcriptome of S. nonagrioides and identify key molecular components of the pheromone production and detection systems. We identified a total of 68 candidate chemosensory genes in this species, including 31 candidate binding-proteins and 23 chemosensory receptors. In particular, we retrieved the three co-receptors Orco, IR25a and IR8a necessary for chemosensory receptor functioning. Focusing on the pheromonal communication system, we identified a new pheromone-binding protein in this species, four candidate pheromone receptors and 12 carboxylesterases as candidate acetate degrading enzymes. In addition, we identified enzymes putatively involved in S. nonagrioides pheromone biosynthesis, including a ∆11-desaturase and different acetyltransferases and reductases. RNAseq analyses and RT-PCR were combined to profile gene expression in different tissues. This study constitutes the first large scale description of chemosensory genes in S. nonagrioides. PMID:23781142
Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes.
Kumar, Vikas; Kutschera, Verena E; Nilsson, Maria A; Janke, Axel
2015-08-07
The genus Vulpes (true foxes) comprises numerous species that inhabit a wide range of habitats and climatic conditions, including one species, the Arctic fox (Vulpes lagopus) which is adapted to the arctic region. A close relative to the Arctic fox, the red fox (Vulpes vulpes), occurs in subarctic to subtropical habitats. To study the genetic basis of their adaptations to different environments, transcriptome sequences from two Arctic foxes and one red fox individual were generated and analyzed for signatures of positive selection. In addition, the data allowed for a phylogenetic analysis and divergence time estimate between the two fox species. The de novo assembly of reads resulted in more than 160,000 contigs/transcripts per individual. Approximately 17,000 homologous genes were identified using human and the non-redundant databases. Positive selection analyses revealed several genes involved in various metabolic and molecular processes such as energy metabolism, cardiac gene regulation, apoptosis and blood coagulation to be under positive selection in foxes. Branch site tests identified four genes to be under positive selection in the Arctic fox transcriptome, two of which are fat metabolism genes. In the red fox transcriptome eight genes are under positive selection, including molecular process genes, notably genes involved in ATP metabolism. Analysis of the three transcriptomes and five Sanger re-sequenced genes in additional individuals identified a lower genetic variability within Arctic foxes compared to red foxes, which is consistent with distribution range differences and demographic responses to past climatic fluctuations. A phylogenomic analysis estimated that the Arctic and red fox lineages diverged about three million years ago. Transcriptome data are an economic way to generate genomic resources for evolutionary studies. Despite not representing an entire genome, this transcriptome analysis identified numerous genes that are relevant to arctic adaptation in foxes. Similar to polar bears, fat metabolism seems to play a central role in adaptation of Arctic foxes to the cold climate, as has been identified in the polar bear, another arctic specialist.
Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress
Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming
2017-01-01
The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance. PMID:28417911
Genome-Wide Analyses of the Soybean F-Box Gene Family in Response to Salt Stress.
Jia, Qi; Xiao, Zhi-Xia; Wong, Fuk-Ling; Sun, Song; Liang, Kang-Jing; Lam, Hon-Ming
2017-04-12
The F-box family is one of the largest gene families in plants that regulate diverse life processes, including salt responses. However, the knowledge of the soybean F-box genes and their roles in salt tolerance remains limited. Here, we conducted a genome-wide survey of the soybean F-box family, and their expression analysis in response to salinity via in silico analysis of online RNA-sequencing (RNA-seq) data and quantitative reverse-transcription polymerase chain reaction (qRT-PCR) to predict their potential functions. A total of 725 potential F-box proteins encoded by 509 genes were identified and classified into 9 subfamilies. The gene structures, conserved domains and chromosomal distributions were characterized. There are 76 pairs of duplicate genes identified, including genome-wide segmental and tandem duplication events, which lead to the expansion of the number of F-box genes. The in silico expression analysis showed that these genes would be involved in diverse developmental functions and play an important role in salt response. Our qRT-PCR analysis confirmed 12 salt-responding F-box genes. Overall, our results provide useful information on soybean F-box genes, especially their potential roles in salt tolerance.
Yang, W C; Zhu, L; Zhou, B X; Tania, S; Zhou, Q; Khan, M A; Fu, X L; Cheng, J L; Lv, H B; Fu, J J
2015-09-25
Retinitis pigmentosa (RP) is a retinal degenerative disorder that often causes complete blindness. Mutations of more than 50 genes have been identified as associated with RP, including the CACNA1F gene. In a recent study, by employing next-generation sequencing, we identified a novel mutation in the CACNA1F gene. In this study, we used the amplification refractory mutation system (ARMS) and identified a single nucleotide change c.1555C>T in exon 13 of the CACNA1F gene, leading to the substitution of arginine by tryptophan (p.R519W) in a Chinese individual affected by RP. This study actually confirms this novel mutation, and establishes the ARMS technique for the detection of mutations in RP.
Adamczuk, Marcin; Dziewit, Lukasz
2017-01-01
The draft genome of multidrug-resistant Aeromonas sp. ARM81 isolated from a wastewater treatment plant in Warsaw (Poland) was obtained. Sequence analysis revealed multiple genes conferring resistance to aminoglycosides, β-lactams or tetracycline. Three different β-lactamase genes were identified, including an extended-spectrum β-lactamase gene bla PER-1 . The antibiotic susceptibility was experimentally tested. Genome sequencing also allowed us to investigate the plasmidome and transposable mobilome of ARM81. Four plasmids, of which two carry phenotypic modules (i.e., genes encoding a zinc transporter ZitB and a putative glucosyltransferase), and 28 putative transposase genes were identified. The mobility of three insertion sequences (isoforms of previously identified elements ISAs12, ISKpn9 and ISAs26) was confirmed using trap plasmids.
Lundmark, Anna; Davanian, Haleh; Båge, Tove; Johannsen, Gunnar; Koro, Catalin; Lundeberg, Joakim; Yucel-Lindberg, Tülay
2015-01-01
The multifactorial chronic inflammatory disease periodontitis, which is characterized by destruction of tooth-supporting tissues, has also been implicated as a risk factor for various systemic diseases. Although periodontitis has been studied extensively, neither disease-specific biomarkers nor therapeutic targets have been identified, nor its link with systemic diseases. Here, we analyzed the global transcriptome of periodontitis and compared its gene expression profile with those of other inflammatory conditions, including cardiovascular disease (CVD), rheumatoid arthritis (RA), and ulcerative colitis (UC). Gingival biopsies from 62 patients with periodontitis and 62 healthy subjects were subjected to RNA sequencing. The up-regulated genes in periodontitis were related to inflammation, wounding and defense response, and apoptosis, whereas down-regulated genes were related to extracellular matrix organization and structural support. The most highly up-regulated gene was mucin 4 (MUC4), and its protein product was confirmed to be over-expressed in periodontitis. When comparing the expression profile of periodontitis with other inflammatory diseases, several gene ontology categories, including inflammatory response, cell death, cell motion, and homeostatic processes, were identified as common to all diseases. Only one gene, pleckstrin (PLEK), was significantly overexpressed in periodontitis, CVD, RA, and UC, implicating this gene as an important networking link between these chronic inflammatory diseases. PMID:26686060
Ding, Dong; Lou, Xiaoyan; Hua, Dasong; Yu, Wei; Li, Lisha; Wang, Jun; Gao, Feng; Zhao, Na; Ren, Guoping; Li, Lanjuan; Lin, Biaoyang
2012-01-01
Integration of the viral DNA into host chromosomes was found in most of the hepatitis B virus (HBV)–related hepatocellular carcinomas (HCCs). Here we devised a massive anchored parallel sequencing (MAPS) method using next-generation sequencing to isolate and sequence HBV integrants. Applying MAPS to 40 pairs of HBV–related HCC tissues (cancer and adjacent tissues), we identified 296 HBV integration events corresponding to 286 unique integration sites (UISs) with precise HBV–Human DNA junctions. HBV integration favored chromosome 17 and preferentially integrated into human transcript units. HBV targeted genes were enriched in GO terms: cAMP metabolic processes, T cell differentiation and activation, TGF beta receptor pathway, ncRNA catabolic process, and dsRNA fragmentation and cellular response to dsRNA. The HBV targeted genes include 7 genes (PTPRJ, CNTN6, IL12B, MYOM1, FNDC3B, LRFN2, FN1) containing IPR003961 (Fibronectin, type III domain), 7 genes (NRG3, MASP2, NELL1, LRP1B, ADAM21, NRXN1, FN1) containing IPR013032 (EGF-like region, conserved site), and three genes (PDE7A, PDE4B, PDE11A) containing IPR002073 (3′, 5′-cyclic-nucleotide phosphodiesterase). Enriched pathways include hsa04512 (ECM-receptor interaction), hsa04510 (Focal adhesion), and hsa04012 (ErbB signaling pathway). Fewer integration events were found in cancers compared to cancer-adjacent tissues, suggesting a clonal expansion model in HCC development. Finally, we identified 8 genes that were recurrent target genes by HBV integration including fibronectin 1 (FN1) and telomerase reverse transcriptase (TERT1), two known recurrent target genes, and additional novel target genes such as SMAD family member 5 (SMAD5), phosphatase and actin regulator 4 (PHACTR4), and RNA binding protein fox-1 homolog (C. elegans) 1 (RBFOX1). Integrating analysis with recently published whole-genome sequencing analysis, we identified 14 additional recurrent HBV target genes, greatly expanding the HBV recurrent target list. This global survey of HBV integration events, together with recently published whole-genome sequencing analyses, furthered our understanding of the HBV–related HCC. PMID:23236287
An Eye on Trafficking Genes: Identification of Four Eye Color Mutations in Drosophila
Grant, Paaqua; Maga, Tara; Loshakov, Anna; Singhal, Rishi; Wali, Aminah; Nwankwo, Jennifer; Baron, Kaitlin; Johnson, Diana
2016-01-01
Genes that code for proteins involved in organelle biogenesis and intracellular trafficking produce products that are critical in normal cell function . Conserved orthologs of these are present in most or all eukaryotes, including Drosophila melanogaster. Some of these genes were originally identified as eye color mutants with decreases in both types of pigments found in the fly eye. These criteria were used for identification of such genes, four eye color mutations that are not annotated in the genome sequence: chocolate, maroon, mahogany, and red Malpighian tubules were molecularly mapped and their genome sequences have been evaluated. Mapping was performed using deletion analysis and complementation tests. chocolate is an allele of the VhaAC39-1 gene, which is an ortholog of the Vacuolar H+ ATPase AC39 subunit 1. maroon corresponds to the Vps16A gene and its product is part of the HOPS complex, which participates in transport and organelle fusion. red Malpighian tubule is the CG12207 gene, which encodes a protein of unknown function that includes a LysM domain. mahogany is the CG13646 gene, which is predicted to be an amino acid transporter. The strategy of identifying eye color genes based on perturbations in quantities of both types of eye color pigments has proven useful in identifying proteins involved in trafficking and biogenesis of lysosome-related organelles. Mutants of these genes can form the basis of valuable in vivo models to understand these processes. PMID:27558665
Stankiewicz, Adrian M; Goscik, Joanna; Dyr, Wanda; Juszczak, Grzegorz R; Ryglewicz, Danuta; Swiergiel, Artur H; Wieczorek, Marek; Stefanski, Roman
2015-12-01
Animal models provide opportunity to study neurobiological aspects of human alcoholism. Changes in gene expression have been implicated in mediating brain functions, including reward system and addiction. The current study aimed to identify genes that may underlie differential ethanol preference in Warsaw High Preferring (WHP) and Warsaw Low Preferring (WLP) rats. Microarray analysis comparing gene expression in nucleus accumbens (NAc), hippocampus (HP) and medial prefrontal cortex (mPFC) was performed in male WHP and WLP rats bred for differences in ethanol preference. Differential and stable between biological repeats expression of 345, 254 and 129 transcripts in NAc, HP and mPFC was detected. Identified genes and processes included known mediators of ethanol response (Mx2, Fam111a, Itpr1, Gabra4, Agtr1a, LTP/LTD, renin-angiotensin signaling pathway), toxicity (Sult1c2a, Ces1, inflammatory response), as well as genes involved in regulation of important addiction-related brain systems such as dopamine, tachykinin or acetylcholine (Gng7, Tac4, Slc5a7). The identified candidate genes may underlie differential ethanol preference in an animal model of alcoholism. Names of genes are written in italics, while names of proteins are written in standard font. Names of human genes/proteins are written in all capital letters. Names of rodent genes/proteins are written in capital letter followed by small letters. Copyright © 2015 Elsevier Inc. All rights reserved.
de Jong, Simone; Vidler, Lewis R; Mokrab, Younes; Collier, David A; Breen, Gerome
2016-08-01
Genome-wide association studies (GWAS) have identified thousands of novel genetic associations for complex genetic disorders, leading to the identification of potential pharmacological targets for novel drug development. In schizophrenia, 108 conservatively defined loci that meet genome-wide significance have been identified and hundreds of additional sub-threshold associations harbour information on the genetic aetiology of the disorder. In the present study, we used gene-set analysis based on the known binding targets of chemical compounds to identify the 'drug pathways' most strongly associated with schizophrenia-associated genes, with the aim of identifying potential drug repositioning opportunities and clues for novel treatment paradigms, especially in multi-target drug development. We compiled 9389 gene sets (2496 with unique gene content) and interrogated gene-based p-values from the PGC2-SCZ analysis. Although no single drug exceeded experiment wide significance (corrected p<0.05), highly ranked gene-sets reaching suggestive significance including the dopamine receptor antagonists metoclopramide and trifluoperazine and the tyrosine kinase inhibitor neratinib. This is a proof of principle analysis showing the potential utility of GWAS data of schizophrenia for the direct identification of candidate drugs and molecules that show polypharmacy. © The Author(s) 2016.
Severe Hypertriglyceridemia due to a novel p.Q240H mutation in the Lipoprotein Lipase gene.
Soto, Angela Ganan; McIntyre, Adam; Agrawal, Sungeeta; Bialo, Shara R; Hegele, Robert A; Boney, Charlotte M
2015-09-04
Lipoprotein Lipase (LPL) deficiency is a rare autosomal recessive disorder with a heterogeneous clinical presentation. Several mutations in the LPL gene have been identified to cause decreased activity of the enzyme. An 11-week-old, exclusively breastfed male presented with coffee-ground emesis, melena, xanthomas, lipemia retinalis and chylomicronemia. Genomic DNA analysis identified lipoprotein lipase deficiency due to compound heterozygosity including a novel p.Q240H mutation in exon 5 of the lipoprotein lipase (LPL) gene. His severe hypertriglyceridemia, including xanthomas, resolved with dietary long-chain fat restriction. We describe a novel mutation of the LPL gene causing severe hypertriglyceridemia and report the response to treatment. A review of the current literature regarding LPL deficiency syndrome reveals a few potential new therapies under investigation.
Erickson, Keesha E; Otoupal, Peter B; Chatterjee, Anushree
2017-01-01
Antibiotic-resistant bacteria are an increasingly serious public health concern, as strains emerge that demonstrate resistance to almost all available treatments. One factor that contributes to the crisis is the adaptive ability of bacteria, which exhibit remarkable phenotypic and gene expression heterogeneity in order to gain a survival advantage in damaging environments. This high degree of variability in gene expression across biological populations makes it a challenging task to identify key regulators of bacterial adaptation. Here, we research the regulation of adaptive resistance by investigating transcriptome profiles of Escherichia coli upon adaptation to disparate toxins, including antibiotics and biofuels. We locate potential target genes via conventional gene expression analysis as well as using a new analysis technique examining differential gene expression variability. By investigating trends across the diverse adaptation conditions, we identify a focused set of genes with conserved behavior, including those involved in cell motility, metabolism, membrane structure, and transport, and several genes of unknown function. To validate the biological relevance of the observed changes, we synthetically perturb gene expression using clustered regularly interspaced short palindromic repeat (CRISPR)-dCas9. Manipulation of select genes in combination with antibiotic treatment promotes adaptive resistance as demonstrated by an increased degree of antibiotic tolerance and heterogeneity in MICs. We study the mechanisms by which identified genes influence adaptation and find that select differentially variable genes have the potential to impact metabolic rates, mutation rates, and motility. Overall, this work provides evidence for a complex nongenetic response, encompassing shifts in gene expression and gene expression variability, which underlies adaptive resistance. IMPORTANCE Even initially sensitive bacteria can rapidly thwart antibiotic treatment through stress response processes known as adaptive resistance. Adaptive resistance fosters transient tolerance increases and the emergence of mutations conferring heritable drug resistance. In order to extend the applicable lifetime of new antibiotics, we must seek to hinder the occurrence of bacterial adaptive resistance; however, the regulation of adaptation is difficult to identify due to immense heterogeneity emerging during evolution. This study specifically seeks to generate heterogeneity by adapting bacteria to different stresses and then examines gene expression trends across the disparate populations in order to pinpoint key genes and pathways associated with adaptive resistance. The targets identified here may eventually inform strategies for impeding adaptive resistance and prolonging the effectiveness of antibiotic treatment.
A chronological expression profile of gene activity during embryonic mouse brain development.
Goggolidou, P; Soneji, S; Powles-Glover, N; Williams, D; Sethi, S; Baban, D; Simon, M M; Ragoussis, I; Norris, D P
2013-12-01
The brain is a functionally complex organ, the patterning and development of which are key to adult health. To help elucidate the genetic networks underlying mammalian brain patterning, we conducted detailed transcriptional profiling during embryonic development of the mouse brain. A total of 2,400 genes were identified as showing differential expression between three developmental stages. Analysis of the data identified nine gene clusters to demonstrate analogous expression profiles. A significant group of novel genes of as yet undiscovered biological function were detected as being potentially relevant to brain development and function, in addition to genes that have previously identified roles in the brain. Furthermore, analysis for genes that display asymmetric expression between the left and right brain hemispheres during development revealed 35 genes as putatively asymmetric from a combined data set. Our data constitute a valuable new resource for neuroscience and neurodevelopment, exposing possible functional associations between genes, including novel loci, and encouraging their further investigation in human neurological and behavioural disorders.
Stessman, Holly A. F.; Xiong, Bo; Coe, Bradley P.; Wang, Tianyun; Hoekzema, Kendra; Fenckova, Michaela; Kvarnung, Malin; Gerdts, Jennifer; Trinh, Sandy; Cosemans, Nele; Vives, Laura; Lin, Janice; Turner, Tychele N.; Santen, Gijs; Ruivenkamp, Claudia; Kriek, Marjolein; van Haeringen, Arie; Aten, Emmelien; Friend, Kathryn; Liebelt, Jan; Barnett, Christopher; Haan, Eric; Shaw, Marie; Gecz, Jozef; Anderlid, Britt-Marie; Nordgren, Ann; Lindstrand, Anna; Schwartz, Charles; Kooy, R. Frank; Vandeweyer, Geert; Helsmoortel, Celine; Romano, Corrado; Alberti, Antonino; Vinci, Mirella; Avola, Emanuela; Giusto, Stefania; Courchesne, Eric; Pramparo, Tiziano; Pierce, Karen; Nalabolu, Srinivasa; Amaral, David; Scheffer, Ingrid E.; Delatycki, Martin B.; Lockhart, Paul J.; Hormozdiari, Fereydoun; Harich, Benjamin; Castells-Nobau, Anna; Xia, Kun; Peeters, Hilde; Nordenskjöld, Magnus; Schenck, Annette; Bernier, Raphael A.; Eichler, Evan E.
2017-01-01
Gene-disruptive mutations contribute to the biology of neurodevelopmental disorders (NDDs), but most pathogenic genes are not known. We sequenced 208 candidate genes from >11,730 patients and >2,867 controls. We report 91 genes with an excess of de novo mutations or private disruptive mutations in 5.7% of patients, including 38 novel NDD genes. Drosophila functional assays of a subset bolster their involvement in NDDs. We identify 25 genes that show a bias for autism versus intellectual disability and highlight a network associated with high-functioning autism (FSIQ>100). Clinical follow-up for NAA15, KMT5B, and ASH1L reveals novel syndromic and non-syndromic forms of disease. PMID:28191889
The Genetics of Deafness in Domestic Animals
Strain, George M.
2015-01-01
Although deafness can be acquired throughout an animal’s life from a variety of causes, hereditary deafness, especially congenital hereditary deafness, is a significant problem in several species. Extensive reviews exist of the genetics of deafness in humans and mice, but not for deafness in domestic animals. Hereditary deafness in many species and breeds is associated with loci for white pigmentation, where the cochlear pathology is cochleo-saccular. In other cases, there is no pigmentation association and the cochlear pathology is neuroepithelial. Late onset hereditary deafness has recently been identified in dogs and may be present but not yet recognized in other species. Few genes responsible for deafness have been identified in animals, but progress has been made for identifying genes responsible for the associated pigmentation phenotypes. Across species, the genes identified with deafness or white pigmentation patterns include MITF, PMEL, KIT, EDNRB, CDH23, TYR, and TRPM1 in dog, cat, horse, cow, pig, sheep, ferret, mink, camelid, and rabbit. Multiple causative genes are present in some species. Significant work remains in many cases to identify specific chromosomal deafness genes so that DNA testing can be used to identify carriers of the mutated genes and thereby reduce deafness prevalence. PMID:26664958
A Screen for Modifiers of Hedgehog Signaling in Drosophila melanogaster Identifies swm and mts
Casso, David J.; Liu, Songmei; Iwaki, D. David; Ogden, Stacey K.; Kornberg, Thomas B.
2008-01-01
Signaling by Hedgehog (Hh) proteins shapes most tissues and organs in both vertebrates and invertebrates, and its misregulation has been implicated in many human diseases. Although components of the signaling pathway have been identified, key aspects of the signaling mechanism and downstream targets remain to be elucidated. We performed an enhancer/suppressor screen in Drosophila to identify novel components of the pathway and identified 26 autosomal regions that modify a phenotypic readout of Hh signaling. Three of the regions include genes that contribute constituents to the pathway—patched, engrailed, and hh. One of the other regions includes the gene microtubule star (mts) that encodes a subunit of protein phosphatase 2A. We show that mts is necessary for full activation of Hh signaling. A second region includes the gene second mitotic wave missing (swm). swm is recessive lethal and is predicted to encode an evolutionarily conserved protein with RNA binding and Zn+ finger domains. Characterization of newly isolated alleles indicates that swm is a negative regulator of Hh signaling and is essential for cell polarity. PMID:18245841
Long, Jin; Liu, Zhe; Wu, Xingda; Xu, Yuanhong; Ge, Chunlin
2016-05-01
The present study aimed to screen for potential genes and subnetworks associated with pancreatic cancer (PC) using the gene expression profile. The expression profile GSE 16515 was downloaded from the Gene Expression Omnibus database, which included 36 PC tissue samples and 16 normal samples. Limma package in R language was used to screen differentially expressed genes (DEGs), which were grouped as up‑ and downregulated genes. Then, PFSNet was applied to perform subnetwork analysis for all the DEGs. Moreover, Gene Ontology (GO) and REACTOME pathway enrichment analysis of up‑ and downregulated genes was performed, followed by protein‑protein interaction (PPI) network construction using Search Tool for the Retrieval of Interacting Genes Search Tool for the Retrieval of Interacting Genes. In total, 1,989 DEGs including 1,461 up‑ and 528 downregulated genes were screened out. Subnetworks including pancreatic cancer in PC tissue samples and intercellular adhesion in normal samples were identified, respectively. A total of 8 significant REACTOME pathways for upregulated DEGs, such as hemostasis and cell cycle, mitotic were identified. Moreover, 4 significant REACTOME pathways for downregulated DEGs, including regulation of β‑cell development and transmembrane transport of small molecules were screened out. Additionally, DEGs with high connectivity degrees, such as CCNA2 (cyclin A2) and PBK (PDZ binding kinase), of the module in the protein‑protein interaction network were mainly enriched with cell‑division cycle. CCNA2 and PBK of the module and their relative pathway cell‑division cycle, and two subnetworks (pancreatic cancer and intercellular adhesion subnetworks) may be pivotal for further understanding of the molecular mechanism of PC.
Davis, Richard V N; Lamont, Susan J; Rothschild, Max F; Persia, Michael E; Ashwell, Chris M; Schmidt, Carl J
2015-01-01
Agriculture provides excellent model systems for understanding how selective pressure, as applied by humans, can affect the genomes of plants and animals. One such system is modern poultry breeding in which intensive genetic selection has been applied for meat production in the domesticated chicken. As a result, modern meat-type chickens (broilers) exhibit enhanced growth, especially of the skeletal muscle, relative to their legacy counterparts. Comparative studies of modern and legacy broiler chickens provide an opportunity to identify genes and pathways affected by this human-directed evolution. This study used RNA-seq to compare the transcriptomes of a modern and a legacy broiler line to identify differentially enriched genes in the breast muscle at days 6 and 21 post-hatch. Among the 15,945 genes analyzed, 10,841 were expressed at greater than 0.1 RPKM. At day 6 post-hatch 189 genes, including several regulators of myogenic growth and development, were differentially enriched between the two lines. The transcriptional profiles between lines at day 21 post-hatch identify 193 genes differentially enriched and still include genes associated with myogenic growth. This study identified differentially enriched genes that regulate myogenic growth and differentiation between the modern and legacy broiler lines. Specifically, differences in the ratios of several positive (IGF1, IGF1R, WFIKKN2) and negative (MSTN, ACE) myogenic growth regulators may help explain the differences underlying the enhanced growth characteristics of the modern broilers.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shakoor, N; Nair, R; Crasta, O
2014-01-23
Background: Effective improvement in sorghum crop development necessitates a genomics-based approach to identify functional genes and QTLs. Sequenced in 2009, a comprehensive annotation of the sorghum genome and the development of functional genomics resources is key to enable the discovery and deployment of regulatory and metabolic genes and gene networks for crop improvement. Results: This study utilizes the first commercially available whole-transcriptome sorghum microarray (Sorgh-WTa520972F) to identify tissue and genotype-specific expression patterns for all identified Sorghum bicolor exons and UTRs. The genechip contains 1,026,373 probes covering 149,182 exons (27,577 genes) across the Sorghum bicolor nuclear, chloroplast, and mitochondrial genomes. Specificmore » probesets were also included for putative non-coding RNAs that may play a role in gene regulation (e. g., microRNAs), and confirmed functional small RNAs in related species (maize and sugarcane) were also included in our array design. We generated expression data for 78 samples with a combination of four different tissue types (shoot, root, leaf and stem), two dissected stem tissues (pith and rind) and six diverse genotypes, which included 6 public sorghum lines (R159, Atlas, Fremont, PI152611, AR2400 and PI455230) representing grain, sweet, forage, and high biomass ideotypes. Conclusions: Here we present a summary of the microarray dataset, including analysis of tissue-specific gene expression profiles and associated expression profiles of relevant metabolic pathways. With an aim to enable identification and functional characterization of genes in sorghum, this expression atlas presents a new and valuable resource to the research community.« less
Haralambieva, Iana H.; Oberg, Ann L.; Ovsyannikova, Inna G.; Kennedy, Richard B.; Grill, Diane E.; Middha, Sumit; Bot, Brian M.; Wang, Vivian W.; Smith, David I.; Jacobson, Robert M.; Poland, Gregory A.
2013-01-01
Immune responses to current rubella vaccines demonstrate significant inter-individual variability. We performed mRNA-Seq profiling on PBMCs from high and low antibody responders to rubella vaccination to delineate transcriptional differences upon viral stimulation. Generalized linear models were used to assess the per gene fold change (FC) for stimulated versus unstimulated samples or the interaction between outcome and stimulation. Model results were evaluated by both FC and p-value. Pathway analysis and self-contained gene set tests were performed for assessment of gene group effects. Of 17,566 detected genes, we identified 1,080 highly significant differentially expressed genes upon viral stimulation (p<1.00E−15, FDR<1.00E−14), including various immune function and inflammation-related genes, genes involved in cell signaling, cell regulation and transcription, and genes with unknown function. Analysis by immune outcome and stimulation status identified 27 genes (p≤0.0006 and FDR≤0.30) that responded differently to viral stimulation in high vs. low antibody responders, including major histocompatibility complex (MHC) class I genes (HLA-A, HLA-B and B2M with p = 0.0001, p = 0.0005 and p = 0.0002, respectively), and two genes related to innate immunity and inflammation (EMR3 and MEFV with p = 1.46E−08 and p = 0.0004, respectively). Pathway and gene set analysis also revealed transcriptional differences in antigen presentation and innate/inflammatory gene sets and pathways between high and low responders. Using mRNA-Seq genome-wide transcriptional profiling, we identified antigen presentation and innate/inflammatory genes that may assist in explaining rubella vaccine-induced immune response variations. Such information may provide new scientific insights into vaccine-induced immunity useful in rational vaccine development and immune response monitoring. PMID:23658707
Tian, Xin-Jie; Long, Yan; Wang, Jiao; Zhang, Jing-Wen; Wang, Yan-Yan; Li, Wei-Min; Peng, Yu-Fa; Yuan, Qian-Hua; Pei, Xin-Wu
2015-01-01
The perennial O. rufipogon (common wild rice), which is considered to be the ancestor of Asian cultivated rice species, contains many useful genetic resources, including drought resistance genes. However, few studies have identified the drought resistance and tissue-specific genes in common wild rice. In this study, transcriptome sequencing libraries were constructed, including drought-treated roots (DR) and control leaves (CL) and roots (CR). Using Illumina sequencing technology, we generated 16.75 million bases of high-quality sequence data for common wild rice and conducted de novo assembly and annotation of genes without prior genome information. These reads were assembled into 119,332 unigenes with an average length of 715 bp. A total of 88,813 distinct sequences (74.42% of unigenes) significantly matched known genes in the NCBI NT database. Differentially expressed gene (DEG) analysis showed that 3617 genes were up-regulated and 4171 genes were down-regulated in the CR library compared with the CL library. Among the DEGs, 535 genes were expressed in roots but not in shoots. A similar comparison between the DR and CR libraries showed that 1393 genes were up-regulated and 315 genes were down-regulated in the DR library compared with the CR library. Finally, 37 genes that were specifically expressed in roots were screened after comparing the DEGs identified in the above-described analyses. This study provides a transcriptome sequence resource for common wild rice plants and establishes a digital gene expression profile of wild rice plants under drought conditions using the assembled transcriptome data as a reference. Several tissue-specific and drought-stress-related candidate genes were identified, representing a fully characterized transcriptome and providing a valuable resource for genetic and genomic studies in plants.
Sellamuthu, Rajendran; Umbright, Christina; Li, Shengqiao; Kashon, Michael; Joseph, Pius
2015-01-01
A proper understanding of the mechanisms underlying crystalline silica-induced pulmonary toxicity has implications in the management and potential prevention of the adverse health effects associated with silica exposure including silicosis, cancer and several auto-immune diseases. Human lung type II epithelial cells and rat lungs exposed to crystalline silica were employed as experimental models to determine global gene expression changes in order to understand the molecular mechanisms underlying silica-induced pulmonary toxicity. The differential gene expression profile induced by silica correlated with its toxicity in the A549 cells. The biological processes perturbed by silica exposure in the A549 cells and rat lungs, as identified by the bioinformatics analysis of the differentially expressed genes, demonstrated significant similarity. Functional categorization of the differentially expressed genes identified cancer, cellular movement, cellular growth and proliferation, cell death, inflammatory response, cell cycle, cellular development, and genetic disorder as top ranking biological functions perturbed by silica exposure in A549 cells and rat lungs. Results of our study, in addition to confirming several previously identified molecular targets and mechanisms involved in silica toxicity, identified novel molecular targets and mechanisms potentially involved in silica-induced pulmonary toxicity. Further investigations, including those focused on the novel molecular targets and mechanisms identified in the current study may result in better management and, possibly, reduction and/or prevention of the potential adverse health effects associated with crystalline silica exposure. PMID:22087542
Population- and individual-specific regulatory variation in Sardinia.
Pala, Mauro; Zappala, Zachary; Marongiu, Mara; Li, Xin; Davis, Joe R; Cusano, Roberto; Crobu, Francesca; Kukurba, Kimberly R; Gloudemans, Michael J; Reinier, Frederic; Berutti, Riccardo; Piras, Maria G; Mulas, Antonella; Zoledziewska, Magdalena; Marongiu, Michele; Sorokin, Elena P; Hess, Gaelen T; Smith, Kevin S; Busonero, Fabio; Maschio, Andrea; Steri, Maristella; Sidore, Carlo; Sanna, Serena; Fiorillo, Edoardo; Bassik, Michael C; Sawcer, Stephen J; Battle, Alexis; Novembre, John; Jones, Chris; Angius, Andrea; Abecasis, Gonçalo R; Schlessinger, David; Cucca, Francesco; Montgomery, Stephen B
2017-05-01
Genetic studies of complex traits have mainly identified associations with noncoding variants. To further determine the contribution of regulatory variation, we combined whole-genome and transcriptome data for 624 individuals from Sardinia to identify common and rare variants that influence gene expression and splicing. We identified 21,183 expression quantitative trait loci (eQTLs) and 6,768 splicing quantitative trait loci (sQTLs), including 619 new QTLs. We identified high-frequency QTLs and found evidence of selection near genes involved in malarial resistance and increased multiple sclerosis risk, reflecting the epidemiological history of Sardinia. Using family relationships, we identified 809 segregating expression outliers (median z score of 2.97), averaging 13.3 genes per individual. Outlier genes were enriched for proximal rare variants, providing a new approach to study large-effect regulatory variants and their relevance to traits. Our results provide insight into the effects of regulatory variants and their relationship to population history and individual genetic risk.
Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua
2015-01-01
Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as reveled by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus. PMID:25974068
Genetic Determinants Influencing Human Serum Metabolome among African Americans
Yu, Bing; Zheng, Yan; Alexander, Danny; Morrison, Alanna C.; Coresh, Josef; Boerwinkle, Eric
2014-01-01
Phenotypes proximal to gene action generally reflect larger genetic effect sizes than those that are distant. The human metabolome, a result of multiple cellular and biological processes, are functional intermediate phenotypes proximal to gene action. Here, we present a genome-wide association study of 308 untargeted metabolite levels among African Americans from the Atherosclerosis Risk in Communities (ARIC) Study. Nineteen significant common variant-metabolite associations were identified, including 13 novel loci (p<1.6×10−10). These loci were associated with 7–50% of the difference in metabolite levels per allele, and the variance explained ranged from 4% to 20%. Fourteen genes were identified within the nineteen loci, and four of them contained non-synonymous substitutions in four enzyme-encoding genes (KLKB1, SIAE, CPS1, and NAT8); the other significant loci consist of eight other enzyme-encoding genes (ACE, GATM, ACY3, ACSM2B, THEM4, ADH4, UGT1A, TREH), a transporter gene (SLC6A13) and a polycystin protein gene (PKD2L1). In addition, four potential disease-associated paths were identified, including two direct longitudinal predictive relationships: NAT8 with N-acetylornithine, N-acetyl-1-methylhistidine and incident chronic kidney disease, and TREH with trehalose and incident diabetes. These results highlight the value of using endophenotypes proximal to gene function to discover new insights into biology and disease pathology. PMID:24625756
Virulotyping of Shigella spp. isolated from pediatric patients in Tehran, Iran.
Ranjbar, Reza; Bolandian, Masomeh; Behzadi, Payam
2017-03-01
Shigellosis is a considerable infectious disease with high morbidity and mortality among children worldwide. In this survey the prevalence of four important virulence genes including ial, ipaH, set1A, and set1B were investigated among Shigella strains and the related gene profiles identified in the present investigation, stool specimens were collected from children who were referred to two hospitals in Tehran, Iran. The samples were collected during 3 years (2008-2010) from children who were suspected to shigellosis. Shigella spp. were identified throughout microbiological and serological tests and then subjected to PCR for virulotyping. Shigella sonnei was ranking first (65.5%) followed by Shigella flexneri (25.9%), Shigella boydii (6.9%), and Shigella dysenteriae (1.7%). The ial gene was the most frequent virulence gene among isolated bacterial strains and was followed by ipaH, set1B, and set1A. S. flexneri possessed all of the studied virulence genes (ial 65.51%, ipaH 58.62%, set1A 12.07%, and set1B 22.41%). Moreover, the pattern of virulence gene profiles including ial, ial-ipaH, ial-ipaH-set1B, and ial-ipaH-set1B-set1A was identified for isolated Shigella spp. strains. The pattern of virulence genes is changed in isolated strains of Shigella in this study. So, the ial gene is placed first and the ipaH in second.
Uchiyama, Ikuo
2008-10-31
Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes.
van de Pol, Laura A; Wolf, Nicole I; van Weissenbruch, Mirjam M; Stam, Cornelie J; Weiss, Janneke M; Waisfisz, Quinten; Kevelam, Sietske H; Bugiani, Mariana; van de Kamp, Jiddeke M; van der Knaap, Marjo S
2015-12-01
A variety of pathologies can underlie early-onset severe encephalopathy with epilepsy. To aid the diagnostic process in such patients we present an overview of causes, including the rapidly expanding list of genes involved. When no explanation is found, whole-exome sequencing (WES) can be used in an attempt to identify gene defects in patients suspected to suffer from a genetic form. We describe three siblings, born to consanguineous parents, with a lethal severe epileptic encephalopathy with early-infantile onset, including their magnetic resonance imaging, electroencephalography and, in one case, neuropathological findings. Using WES a homozygous frameshift mutation in the BRAT1 gene, c.638dup p.(Val214Glyfs*189), was identified. We present our cases in the context of all published cases with mutations in the BRAT1 gene and conclude that BRAT1 should be added to the growing list of genes related to early-onset severe encephalopathy with epilepsy. Georg Thieme Verlag KG Stuttgart · New York.
Global and disease-associated genetic variation in the human Fanconi anemia gene family
Rogers, Kai J.; Fu, Wenqing; Akey, Joshua M.; Monnat, Raymond J.
2014-01-01
Fanconi anemia (FA) is a human recessive genetic disease resulting from inactivating mutations in any of 16 FANC (Fanconi) genes. Individuals with FA are at high risk of developmental abnormalities, early bone marrow failure and leukemia. These are followed in the second and subsequent decades by a very high risk of carcinomas of the head and neck and anogenital region, and a small continuing risk of leukemia. In order to characterize base pair-level disease-associated (DA) and population genetic variation in FANC genes and the segregation of this variation in the human population, we identified 2948 unique FANC gene variants including 493 FA DA variants across 57 240 potential base pair variation sites in the 16 FANC genes. We then analyzed the segregation of this variation in the 7578 subjects included in the Exome Sequencing Project (ESP) and the 1000 Genomes Project (1KGP). There was a remarkably high frequency of FA DA variants in ESP/1KGP subjects: at least 1 FA DA variant was identified in 78.5% (5950 of 7578) individuals included in these two studies. Six widely used functional prediction algorithms correctly identified only a third of the known, DA FANC missense variants. We also identified FA DA variants that may be good candidates for different types of mutation-specific therapies. Our results demonstrate the power of direct DNA sequencing to detect, estimate the frequency of and follow the segregation of deleterious genetic variation in human populations. PMID:25104853
A Systems Biology Framework Identifies Molecular Underpinnings of Coronary Heart Disease
Huan, Tianxiao; Zhang, Bin; Wang, Zhi; Joehanes, Roby; Zhu, Jun; Johnson, Andrew D.; Ying, Saixia; Munson, Peter J.; Raghavachari, Nalini; Wang, Richard; Liu, Poching; Courchesne, Paul; Hwang, Shih-Jen; Assimes, Themistocles L.; McPherson, Ruth; Samani, Nilesh J.; Schunkert, Heribert; Meng, Qingying; Suver, Christine; O'Donnell, Christopher J.; Derry, Jonathan; Yang, Xia; Levy, Daniel
2013-01-01
Objective Genetic approaches have identified numerous loci associated with coronary heart disease (CHD). The molecular mechanisms underlying CHD gene-disease associations, however, remain unclear. We hypothesized that genetic variants with both strong and subtle effects drive gene subnetworks that in turn affect CHD. Approach and Results We surveyed CHD-associated molecular interactions by constructing coexpression networks using whole blood gene expression profiles from 188 CHD cases and 188 age- and sex-matched controls. 24 coexpression modules were identified including one case-specific and one control-specific differential module (DM). The DMs were enriched for genes involved in B-cell activation, immune response, and ion transport. By integrating the DMs with altered gene expression associated SNPs (eSNPs) and with results of GWAS of CHD and its risk factors, the control-specific DM was implicated as CHD-causal based on its significant enrichment for both CHD and lipid eSNPs. This causal DM was further integrated with tissue-specific Bayesian networks and protein-protein interaction networks to identify regulatory key driver (KD) genes. Multi-tissue KDs (SPIB and TNFRSF13C) and tissue-specific KDs (e.g. EBF1) were identified. Conclusions Our network-driven integrative analysis not only identified CHD-related genes, but also defined network structure that sheds light on the molecular interactions of genes associated with CHD risk. PMID:23539213
Ahlborn, Gene J; Nelson, Gail M; Ward, William O; Knapp, Geremy; Allen, James W; Ouyang, Ming; Roop, Barbara C; Chen, Yan; O'Brien, Thomas; Kitchin, Kirk T; Delker, Don A
2008-03-15
Chronic drinking water exposure to inorganic arsenic and its metabolites increases tumor frequency in the skin of K6/ODC transgenic mice. To identify potential biomarkers and modes of action for this skin tumorigenicity, we characterized gene expression profiles from analysis of K6/ODC mice administered 0, 0.05, 0.25, 1.0 and 10 ppm sodium arsenite in their drinking water for 4 weeks. Following exposure, total RNA was isolated from mouse skin and processed to biotin-labeled cRNA for microarray analyses. Skin gene expression was analyzed with Affymetrix Mouse Genome 430A 2.0 GeneChips, and pathway analysis was conducted with DAVID (NIH), Ingenuity Systems and MetaCore's GeneGo. Differential expression of several key genes was verified through qPCR. Only the highest dose (10 ppm) resulted in significantly altered KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways, including MAPK, regulation of actin cytoskeleton, Wnt, Jak-Stat, Tight junction, Toll-like, phosphatidylinositol and insulin signaling pathways. Approximately 20 genes exhibited a dose response, including several genes known to be associated with carcinogenesis or tumor progression including cyclin D1, CLIC4, Ephrin A1, STAT3 and DNA methyltransferase 3a. Although transcription changes in all identified genes have not previously been linked to arsenic carcinogenesis, their association with carcinogenesis in other systems suggests that these genes may play a role in the early stages of arsenic-induced skin carcinogenesis and can be considered potential biomarkers.
Genomic Signature of Kin Selection in an Ant with Obligately Sterile Workers
Warner, Michael R.; Mikheyev, Alexander S.
2017-01-01
Abstract Kin selection is thought to drive the evolution of cooperation and conflict, but the specific genes and genome-wide patterns shaped by kin selection are unknown. We identified thousands of genes associated with the sterile ant worker caste, the archetype of an altruistic phenotype shaped by kin selection, and then used population and comparative genomic approaches to study patterns of molecular evolution at these genes. Consistent with population genetic theoretical predictions, worker-upregulated genes experienced reduced selection compared with genes upregulated in reproductive castes. Worker-upregulated genes included more taxonomically restricted genes, indicating that the worker caste has recruited more novel genes, yet these genes also experienced reduced selection. Our study identifies a putative genomic signature of kin selection and helps to integrate emerging sociogenomic data with longstanding social evolution theory. PMID:28419349
Reynolds, Lindsay M.; Lohman, Kurt; Pittman, Gary S.; Barr, R. Graham; Chi, Gloria C.; Kaufman, Joel; Wan, Ma; Bell, Douglas A.; Blaha, Michael J.; Rodriguez, Carlos J.; Liu, Yongmei
2017-01-01
ABSTRACT Alterations in DNA methylation and gene expression in blood leukocytes are potential biomarkers of harm and mediators of the deleterious effects of tobacco exposure. However, methodological issues, including the use of self-reported smoking status and mixed cell types have made previously identified alterations in DNA methylation and gene expression difficult to interpret. In this study, we examined associations of tobacco exposure with DNA methylation and gene expression, utilizing a biomarker of tobacco exposure (urine cotinine) and CD14+ purified monocyte samples from 934 participants of the community-based Multi-Ethnic Study of Atherosclerosis (MESA). Urine cotinine levels were measured using an immunoassay. DNA methylation and gene expression were measured with microarrays. Multivariate linear regression was used to test for associations adjusting for age, sex, race/ethnicity, education, and study site. Urine cotinine levels were associated with methylation of 176 CpGs [false discovery rate (FDR)<0.01]. Four CpGs not previously identified by studies of non-purified blood samples nominally replicated (P value<0.05) with plasma cotinine-associated methylation in 128 independent monocyte samples. Urine cotinine levels associated with expression of 12 genes (FDR<0.01), including increased expression of P2RY6 (Beta ± standard error = 0.078 ± 0.008, P = 1.99 × 10−22), a gene previously identified to be involved in the release of pro-inflammatory cytokines. No cotinine-associated (FDR<0.01) methylation profiles significantly (FDR<0.01) correlated with cotinine-associated (FDR<0.01) gene expression profiles. In conclusion, our findings i) identify potential monocyte-specific smoking-associated methylation patterns and ii) suggest that alterations in methylation may not be a main mechanism regulating gene expression in monocytes in response to cigarette smoking. PMID:29166816
A network-based method for the identification of putative genes related to infertility.
Wang, ShaoPeng; Huang, GuoHua; Hu, Qinghua; Zou, Quan
2016-11-01
Infertility has become one of the major health problems worldwide, with its incidence having risen markedly in recent decades. There is an urgent need to investigate the pathological mechanisms behind infertility and to design effective treatments. However, this is made difficult by the fact that various biological factors have been identified to be related to infertility, including genetic factors. A network-based method was established to identify new genes potentially related to infertility. A network constructed using human protein-protein interactions based on previously validated infertility-related genes enabled the identification of some novel candidate genes. These genes were then filtered by a permutation test and their functional and structural associations with infertility-related genes. Our method identified 23 novel genes, which have strong functional and structural associations with previously validated infertility-related genes. Substantial evidence indicates that the identified genes are strongly related to dysfunction of the four main biological processes of fertility: reproductive development and physiology, gametogenesis, meiosis and recombination, and hormone regulation. The newly discovered genes may provide new directions for investigating infertility. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang. Copyright © 2016 Elsevier B.V. All rights reserved.
Gene Discovery in Bladder Cancer Progression using cDNA Microarrays
Sanchez-Carbayo, Marta; Socci, Nicholas D.; Lozano, Juan Jose; Li, Wentian; Charytonowicz, Elizabeth; Belbin, Thomas J.; Prystowsky, Michael B.; Ortiz, Angel R.; Childs, Geoffrey; Cordon-Cardo, Carlos
2003-01-01
To identify gene expression changes along progression of bladder cancer, we compared the expression profiles of early-stage and advanced bladder tumors using cDNA microarrays containing 17,842 known genes and expressed sequence tags. The application of bootstrapping techniques to hierarchical clustering segregated early-stage and invasive transitional carcinomas into two main clusters. Multidimensional analysis confirmed these clusters and more importantly, it separated carcinoma in situ from papillary superficial lesions and subgroups within early-stage and invasive tumors displaying different overall survival. Additionally, it recognized early-stage tumors showing gene profiles similar to invasive disease. Different techniques including standard t-test, single-gene logistic regression, and support vector machine algorithms were applied to identify relevant genes involved in bladder cancer progression. Cytokeratin 20, neuropilin-2, p21, and p33ING1 were selected among the top ranked molecular targets differentially expressed and validated by immunohistochemistry using tissue microarrays (n = 173). Their expression patterns were significantly associated with pathological stage, tumor grade, and altered retinoblastoma (RB) expression. Moreover, p33ING1 expression levels were significantly associated with overall survival. Analysis of the annotation of the most significant genes revealed the relevance of critical genes and pathways during bladder cancer progression, including the overexpression of oncogenic genes such as DEK in superficial tumors or immune response genes such as Cd86 antigen in invasive disease. Gene profiling successfully classified bladder tumors based on their progression and clinical outcome. The present study has identified molecular biomarkers of potential clinical significance and critical molecular targets associated with bladder cancer progression. PMID:12875971
Composite transcriptome assembly of RNA-seq data in a sheep model for delayed bone healing.
Jäger, Marten; Ott, Claus-Eric; Grünhagen, Johannes; Hecht, Jochen; Schell, Hanna; Mundlos, Stefan; Duda, Georg N; Robinson, Peter N; Lienau, Jasmin
2011-03-24
The sheep is an important model organism for many types of medically relevant research, but molecular genetic experiments in the sheep have been limited by the lack of knowledge about ovine gene sequences. Prior to our study, mRNA sequences for only 1,556 partial or complete ovine genes were publicly available. Therefore, we developed a composite de novo transcriptome assembly method for next-generation sequence data to combine known ovine mRNA and EST sequences, mRNA sequences from mouse and cow, and sequences assembled de novo from short read RNA-Seq data into a composite reference transcriptome, and identified transcripts from over 12 thousand previously undescribed ovine genes. Gene expression analysis based on these data revealed substantially different expression profiles in standard versus delayed bone healing in an ovine tibial osteotomy model. Hundreds of transcripts were differentially expressed between standard and delayed healing and between the time points of the standard and delayed healing groups. We used the sheep sequences to design quantitative RT-PCR assays with which we validated the differential expression of 26 genes that had been identified by RNA-seq analysis. A number of clusters of characteristic expression profiles could be identified, some of which showed striking differences between the standard and delayed healing groups. Gene Ontology (GO) analysis showed that the differentially expressed genes were enriched in terms including extracellular matrix, cartilage development, contractile fiber, and chemokine activity. Our results provide a first atlas of gene expression profiles and differentially expressed genes in standard and delayed bone healing in a large-animal model and provide a number of clues as to the shifts in gene expression that underlie delayed bone healing. In the course of our study, we identified transcripts of 13,987 ovine genes, including 12,431 genes for which no sequence information was previously available. This information will provide a basis for future molecular research involving the sheep as a model organism.
Composite transcriptome assembly of RNA-seq data in a sheep model for delayed bone healing
2011-01-01
Background The sheep is an important model organism for many types of medically relevant research, but molecular genetic experiments in the sheep have been limited by the lack of knowledge about ovine gene sequences. Results Prior to our study, mRNA sequences for only 1,556 partial or complete ovine genes were publicly available. Therefore, we developed a composite de novo transcriptome assembly method for next-generation sequence data to combine known ovine mRNA and EST sequences, mRNA sequences from mouse and cow, and sequences assembled de novo from short read RNA-Seq data into a composite reference transcriptome, and identified transcripts from over 12 thousand previously undescribed ovine genes. Gene expression analysis based on these data revealed substantially different expression profiles in standard versus delayed bone healing in an ovine tibial osteotomy model. Hundreds of transcripts were differentially expressed between standard and delayed healing and between the time points of the standard and delayed healing groups. We used the sheep sequences to design quantitative RT-PCR assays with which we validated the differential expression of 26 genes that had been identified by RNA-seq analysis. A number of clusters of characteristic expression profiles could be identified, some of which showed striking differences between the standard and delayed healing groups. Gene Ontology (GO) analysis showed that the differentially expressed genes were enriched in terms including extracellular matrix, cartilage development, contractile fiber, and chemokine activity. Conclusions Our results provide a first atlas of gene expression profiles and differentially expressed genes in standard and delayed bone healing in a large-animal model and provide a number of clues as to the shifts in gene expression that underlie delayed bone healing. In the course of our study, we identified transcripts of 13,987 ovine genes, including 12,431 genes for which no sequence information was previously available. This information will provide a basis for future molecular research involving the sheep as a model organism. PMID:21435219
Lake, Jennifer; Gravel, Catherine; Koko, Gabriel Koffi D; Robert, Claude; Vandenberg, Grant W
2010-03-01
Phosphorus (P)-responsive genes and how they regulate renal adaptation to phosphorous-deficient diets in animals, including fish, are not well understood. RNA abundance profiling using cDNA microarrays is an efficient approach to study nutrient-gene interactions and identify these dietary P-responsive genes. To test the hypothesis that dietary P-responsive genes are differentially expressed in fish fed varying P levels, rainbow trout were fed a practical high-P diet (R20: 0.96% P) or a low-P diet (R0: 0.38% P) for 7 weeks. The differentially-expressed genes between dietary groups were identified and compared from the kidney by combining suppressive subtractive hybridization (SSH) with cDNA microarray analysis. A number of genes were confirmed by real-time PCR, and correlated with plasma and bone P concentrations. Approximately 54 genes were identified as potential dietary P-responsive after 7 weeks on a diet deficient in P according to cDNA microarray analysis. Of 18 selected genes, 13 genes were confirmed to be P-responsive at 7 weeks by real-time PCR analysis, including: iNOS, cytochrome b, cytochrome c oxidase subunit II , alpha-globin I, beta-globin, ATP synthase, hyperosmotic protein 21, COL1A3, Nkef, NDPK, glucose phosphate isomerase 1, Na+/H+ exchange protein and GDP dissociation inhibitor 2. Many of these dietary P-responsive genes responded in a moderate way (R0/R20 ratio: <2-3 or >0.5) and in a transient manner to dietary P limitation. In summary, renal adaptation to dietary P deficiency in trout involves changes in the expression of several genes, suggesting a profile of metabolic stress, since many of these differentially-expressed candidates are associated with the cellular adaptative responses. Crown Copyright 2009. Published by Elsevier Inc. All rights reserved.
Nehme, A; Zibara, K; Cerutti, C; Bricca, G
2015-06-01
The implication of the renin-angiotensin-aldosterone system (RAAS) in atheroma development is well described. However, a complete view of the local RAAS in atheroma is still missing. In this study we aimed to reveal the organization of RAAS in atheroma at the transcriptomic level and identify the transcriptional regulators behind it. Extended RAAS (extRAAS) was defined as the set of 37 genes coding for classical and novel RAAS participants (Figure 1). Five microarray datasets containing overall 590 samples representing carotid and peripheral atheroma were downloaded from the GEO database. Correlation-based hierarchical clustering (R software) of extRAAS genes within each dataset allowed the identification of modules of co-expressed genes. Reproducible co-expression modules across datasets were then extracted. Transcription factors (TFs) having common binding sites (TFBSs) in the promoters of coordinated genes were identified using the Genomatix database tools and analyzed for their correlation with extRAAS genes in the microarray datasets. Expression data revealed the expressed extRAAS components and their relative abundance displaying the favored pathways in atheroma. Three co-expression modules with more than 80% reproducibility across datasets were extracted. Two of them (M1 and M2) contained genes coding for angiotensin metabolizing enzymes involved in different pathways: M1 included ACE, MME, RNPEP, and DPP3, in addition to 7 other genes; and M2 included CMA1, CTSG, and CPA3. The third module (M3) contained genes coding for receptors known to be implicated in atheroma (AGTR1, MR, GR, LNPEP, EGFR and GPER). M1 and M3 were negatively correlated in 3 of 5 datasets. We identified 19 TFs that have enriched TFBSs in the promoters of genes of M1, and two for M3, but none was found for M2. Among the extracted TFs, ELF1, MAX, and IRF5 showed significant positive correlations with peptidase-coding genes from M1 and negative correlations with receptors-coding genes from M3 (p < 0.05). The identified co-expression modules display the transcriptional organization of local extRAAS in human carotid atheroma. The identification of several TFs potentially associated to extRAAS genes may provide a frame for the discovery of atheroma-specific modulators of extRAAS activity.(Figure is included in full-text article.).
Inferring causal genomic alterations in breast cancer using gene expression data
2011-01-01
Background One of the primary objectives in cancer research is to identify causal genomic alterations, such as somatic copy number variation (CNV) and somatic mutations, during tumor development. Many valuable studies lack genomic data to detect CNV; therefore, methods that are able to infer CNVs from gene expression data would help maximize the value of these studies. Results We developed a framework for identifying recurrent regions of CNV and distinguishing the cancer driver genes from the passenger genes in the regions. By inferring CNV regions across many datasets we were able to identify 109 recurrent amplified/deleted CNV regions. Many of these regions are enriched for genes involved in many important processes associated with tumorigenesis and cancer progression. Genes in these recurrent CNV regions were then examined in the context of gene regulatory networks to prioritize putative cancer driver genes. The cancer driver genes uncovered by the framework include not only well-known oncogenes but also a number of novel cancer susceptibility genes validated via siRNA experiments. Conclusions To our knowledge, this is the first effort to systematically identify and validate drivers for expression based CNV regions in breast cancer. The framework where the wavelet analysis of copy number alteration based on expression coupled with the gene regulatory network analysis, provides a blueprint for leveraging genomic data to identify key regulatory components and gene targets. This integrative approach can be applied to many other large-scale gene expression studies and other novel types of cancer data such as next-generation sequencing based expression (RNA-Seq) as well as CNV data. PMID:21806811
A whole-blood transcriptome meta-analysis identifies gene expression signatures of cigarette smoking
Huan, Tianxiao; Joehanes, Roby; Schurmann, Claudia; Schramm, Katharina; Pilling, Luke C.; Peters, Marjolein J.; Mägi, Reedik; DeMeo, Dawn; O'Connor, George T.; Ferrucci, Luigi; Teumer, Alexander; Homuth, Georg; Biffar, Reiner; Völker, Uwe; Herder, Christian; Waldenberger, Melanie; Peters, Annette; Zeilinger, Sonja; Metspalu, Andres; Hofman, Albert; Uitterlinden, André G.; Hernandez, Dena G.; Singleton, Andrew B.; Bandinelli, Stefania; Munson, Peter J.; Lin, Honghuang; Benjamin, Emelia J.; Esko, Tõnu; Grabe, Hans J.; Prokisch, Holger; van Meurs, Joyce B.J.; Melzer, David; Levy, Daniel
2016-01-01
Abstract Cigarette smoking is a leading modifiable cause of death worldwide. We hypothesized that cigarette smoking induces extensive transcriptomic changes that lead to target-organ damage and smoking-related diseases. We performed a meta-analysis of transcriptome-wide gene expression using whole blood-derived RNA from 10,233 participants of European ancestry in six cohorts (including 1421 current and 3955 former smokers) to identify associations between smoking and altered gene expression levels. At a false discovery rate (FDR) <0.1, we identified 1270 differentially expressed genes in current vs. never smokers, and 39 genes in former vs. never smokers. Expression levels of 12 genes remained elevated up to 30 years after smoking cessation, suggesting that the molecular consequence of smoking may persist for decades. Gene ontology analysis revealed enrichment of smoking-related genes for activation of platelets and lymphocytes, immune response, and apoptosis. Many of the top smoking-related differentially expressed genes, including LRRN3 and GPR15, have DNA methylation loci in promoter regions that were recently reported to be hypomethylated among smokers. By linking differential gene expression with smoking-related disease phenotypes, we demonstrated that stroke and pulmonary function show enrichment for smoking-related gene expression signatures. Mediation analysis revealed the expression of several genes (e.g. ALAS2) to be putative mediators of the associations between smoking and inflammatory biomarkers (IL6 and C-reactive protein levels). Our transcriptomic study provides potential insights into the effects of cigarette smoking on gene expression in whole blood and their relations to smoking-related diseases. The results of such analyses may highlight attractive targets for treating or preventing smoking-related health effects. PMID:28158590
Zinzow-Kramer, W M; Horton, B M; McKee, C D; Michaud, J M; Tharp, G K; Thomas, J W; Tuttle, E M; Yi, S; Maney, D L
2015-11-01
The genome of the white-throated sparrow (Zonotrichia albicollis) contains an inversion polymorphism on chromosome 2 that is linked to predictable variation in a suite of phenotypic traits including plumage color, aggression and parental behavior. Differences in gene expression between the two color morphs, which represent the two common inversion genotypes (ZAL2/ZAL2 and ZAL2/ZAL2(m) ), may therefore advance our understanding of the molecular underpinnings of these phenotypes. To identify genes that are differentially expressed between the two morphs and correlated with behavior, we quantified gene expression and terrirorial aggression, including song, in a population of free-living white-throated sparrows. We analyzed gene expression in two brain regions, the medial amygdala (MeA) and hypothalamus. Both regions are part of a 'social behavior network', which is rich in steroid hormone receptors and previously linked with territorial behavior. Using weighted gene co-expression network analyses, we identified modules of genes that were correlated with both morph and singing behavior. The majority of these genes were located within the inversion, showing the profound effect of the inversion on the expression of genes captured by the rearrangement. These modules were enriched with genes related to retinoic acid signaling and basic cellular functioning. In the MeA, the most prominent pathways were those related to steroid hormone receptor activity. Within these pathways, the only gene encoding such a receptor was ESR1 (estrogen receptor 1), a gene previously shown to predict song rate in this species. The set of candidate genes we identified may mediate the effects of a chromosomal inversion on territorial behavior. © 2015 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Molecular Basis of Sulfonamide and Trimethoprim Resistance in Fish-Pathogenic Aeromonas Isolates ▿
Kadlec, Kristina; von Czapiewski, Ellen; Kaspar, Heike; Wallmann, Jürgen; Michael, Geovana Brenner; Steinacker, Ulrike; Schwarz, Stefan
2011-01-01
Sulfonamide-trimethoprim-resistant Aeromonas salmonicida and motile Aeromonas spp. from diseased fish of the GERM-Vet study carried the sul1 gene together with mostly cassette-borne trimethoprim resistance genes, including the novel gene dfrA28. The seven dfrA and dfrB genes identified were located mostly in class 1 integrons which commonly harbored other gene cassettes. PMID:21764945
Wu, Chengjiang; Zhao, Yangjing; Lin, Yu; Yang, Xinxin; Yan, Meina; Min, Yujiao; Pan, Zihui; Xia, Sheng; Shao, Qixiang
2018-01-01
DNA microarray and high-throughput sequencing have been widely used to identify the differentially expressed genes (DEGs) in systemic lupus erythematosus (SLE). However, the big data from gene microarrays are also challenging to work with in terms of analysis and processing. The presents study combined data from the microarray expression profile (GSE65391) and bioinformatics analysis to identify the key genes and cellular pathways in SLE. Gene ontology (GO) and cellular pathway enrichment analyses of DEGs were performed to investigate significantly enriched pathways. A protein-protein interaction network was constructed to determine the key genes in the occurrence and development of SLE. A total of 310 DEGs were identified in SLE, including 193 upregulated genes and 117 downregulated genes. GO analysis revealed that the most significant biological process of DEGs was immune system process. Kyoto Encyclopedia of Genes and Genome pathway analysis showed that these DEGs were enriched in signaling pathways associated with the immune system, including the RIG-I-like receptor signaling pathway, intestinal immune network for IgA production, antigen processing and presentation and the toll-like receptor signaling pathway. The current study screened the top 10 genes with higher degrees as hub genes, which included 2′-5′-oligoadenylate synthetase 1, MX dynamin like GTPase 2, interferon induced protein with tetratricopeptide repeats 1, interferon regulatory factor 7, interferon induced with helicase C domain 1, signal transducer and activator of transcription 1, ISG15 ubiquitin-like modifier, DExD/H-box helicase 58, interferon induced protein with tetratricopeptide repeats 3 and 2′-5′-oligoadenylate synthetase 2. Module analysis revealed that these hub genes were also involved in the RIG-I-like receptor signaling, cytosolic DNA-sensing, toll-like receptor signaling and ribosome biogenesis pathways. In addition, these hub genes, from different probe sets, exhibited significant co-expressed tendency in multi-experiment microarray datasets (P<0.01). In conclusion, these key genes and cellular pathways may improve the current understanding of the underlying mechanism of development of SLE. These key genes may be potential biomarkers of diagnosis, therapy and prognosis for SLE. PMID:29257335
Systemic bioinformatics analysis of skeletal muscle gene expression profiles of sepsis
Yang, Fang; Wang, Yumei
2018-01-01
Sepsis is a type of systemic inflammatory response syndrome with high morbidity and mortality. Skeletal muscle dysfunction is one of the major complications of sepsis that may also influence the outcome of sepsis. The aim of the present study was to explore and identify potential mechanisms and therapeutic targets of sepsis. Systemic bioinformatics analysis of skeletal muscle gene expression profiles from the Gene Expression Omnibus was performed. Differentially expressed genes (DEGs) in samples from patients with sepsis and control samples were screened out using the limma package. Differential co-expression and coregulation (DCE and DCR, respectively) analysis was performed based on the Differential Co-expression Analysis package to identify differences in gene co-expression and coregulation patterns between the control and sepsis groups. Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways of DEGs were identified using the Database for Annotation, Visualization and Integrated Discovery, and inflammatory, cancer and skeletal muscle development-associated biological processes and pathways were identified. DCE and DCR analysis revealed several potential therapeutic targets for sepsis, including genes and transcription factors. The results of the present study may provide a basis for the development of novel therapeutic targets and treatment methods for sepsis. PMID:29805480
Congenital diaphragmatic hernias: from genes to mechanisms to therapies
McCulley, David J.; Shen, Yufeng; Wynn, Julia; Shang, Linshan; Bogenschutz, Eric; Sun, Xin
2017-01-01
ABSTRACT Congenital diaphragmatic hernias (CDHs) and structural anomalies of the diaphragm are a common class of congenital birth defects that are associated with significant morbidity and mortality due to associated pulmonary hypoplasia, pulmonary hypertension and heart failure. In ∼30% of CDH patients, genomic analyses have identified a range of genetic defects, including chromosomal anomalies, copy number variants and sequence variants. The affected genes identified in CDH patients include transcription factors, such as GATA4, ZFPM2, NR2F2 and WT1, and signaling pathway components, including members of the retinoic acid pathway. Mutations in these genes affect diaphragm development and can have pleiotropic effects on pulmonary and cardiac development. New therapies, including fetal endoscopic tracheal occlusion and prenatal transplacental fetal treatments, aim to normalize lung development and pulmonary vascular tone to prevent and treat lung hypoplasia and pulmonary hypertension, respectively. Studies of the association between particular genetic mutations and clinical outcomes should allow us to better understand the origin of this birth defect and to improve our ability to predict and identify patients most likely to benefit from specialized treatment strategies. PMID:28768736
Analysis of gene expression profile microarray data in complex regional pain syndrome.
Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing
2017-09-01
The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.
Multi-step splicing of sphingomyelin synthase linear and circular RNAs.
Filippenkov, Ivan B; Sudarkina, Olga Yu; Limborska, Svetlana A; Dergunova, Lyudmila V
2018-05-15
The SGMS1 gene encodes the enzyme sphingomyelin synthase 1 (SMS1), which is involved in the regulation of lipid metabolism, apoptosis, intracellular vesicular transport and other significant processes. The SGMS1 gene is located on chromosome 10 and has a size of 320 kb. Previously, we showed that dozens of alternative transcripts of the SGMS1 gene are present in various human tissues. In addition to mRNAs that provide synthesis of the SMS1 protein, this gene participates in the synthesis of non-coding transcripts, including circular RNAs (circRNAs), which include exons of the 5'-untranslated region (5'-UTR) and are highly represented in the brain. In this study, using the high-throughput technology RNA-CaptureSeq, many new SGMS1 transcripts were identified, including both intronic unspliced RNAs (premature RNAs) and RNAs formed via alternative splicing. Recursive exons (RS-exons) that can participate in the multi-step splicing of long introns of the gene were also identified. These exons participate in the formation of circRNAs. Thus, multi-step splicing may provide a variety of linear and circular RNAs of eukaryotic genes in tissues. Copyright © 2018 Elsevier B.V. All rights reserved.
Amano, Ikuko; Kitajima, Sakihito; Suzuki, Hideyuki; Koeduka, Takao
2018-01-01
The biosynthesis of plant secondary metabolites is associated with morphological and metabolic differentiation. As a consequence, gene expression profiles can change drastically, and primary and secondary metabolites, including intermediate and end-products, move dynamically within and between cells. However, little is known about the molecular mechanisms underlying differentiation and transport mechanisms. In this study, we performed a transcriptome analysis of Petunia axillaris subsp. parodii, which produces various volatiles in its corolla limbs and emits metabolites to attract pollinators. RNA-sequencing from leaves, buds, and limbs identified 53,243 unigenes. Analysis of differentially expressed genes, combined with gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses, showed that many biological processes were highly enriched in limbs. These included catabolic processes and signaling pathways of hormones, such as gibberellins, and metabolic pathways, including phenylpropanoids and fatty acids. Moreover, we identified five transporter genes that showed high expression in limbs, and we performed spatiotemporal expression analyses and homology searches to infer their putative functions. Our systematic analysis provides comprehensive transcriptomic information regarding morphological differentiation and metabolite transport in the Petunia flower and lays the foundation for establishing the specific mechanisms that control secondary metabolite biosynthesis in plants. PMID:29902274
Kraus, Cornelia; Hoyer, Juliane; Vasileiou, Georgia; Wunderle, Marius; Lux, Michael P; Fasching, Peter A; Krumbiegel, Mandy; Uebe, Steffen; Reuter, Miriam; Beckmann, Matthias W; Reis, André
2017-01-01
Breast and ovarian cancer (BC/OC) predisposition has been attributed to a number of high- and moderate to low-penetrance susceptibility genes. With the advent of next generation sequencing (NGS) simultaneous testing of these genes has become feasible. In this monocentric study, we report results of panel-based screening of 14 BC/OC susceptibility genes (BRCA1, BRCA2, RAD51C, RAD51D, CHEK2, PALB2, ATM, NBN, CDH1, TP53, MLH1, MSH2, MSH6 and PMS2) in a group of 581 consecutive individuals from a German population with BC and/or OC fulfilling diagnostic criteria for BRCA1 and BRCA2 testing including 179 with a triple-negative tumor. Altogether we identified 106 deleterious mutations in 105 (18%) patients in 10 different genes, including seven different exon deletions. Of these 106 mutations, 16 (15%) were novel and only six were found in BRCA1/2. To further characterize mutations located in or nearby splicing consensus sites we performed RT-PCR analysis which allowed confirmation of pathogenicity in 7 of 9 mutations analyzed. In PALB2, we identified a deleterious variant in six cases. All but one were associated with early onset BC and a positive family history indicating that penetrance for PALB2 mutations is comparable to BRCA2. Overall, extended testing beyond BRCA1/2 identified a deleterious mutation in further 6% of patients. As a downside, 89 variants of uncertain significance were identified highlighting the need for comprehensive variant databases. In conclusion, panel testing yields more accurate information on genetic cancer risk than assessing BRCA1/2 alone and wide-spread testing will help improve penetrance assessment of variants in these risk genes. © 2016 UICC.
Colbourne, John K; Eads, Brian D; Shaw, Joseph; Bohuski, Elizabeth; Bauer, Darren J; Andrews, Justen
2007-01-01
Background Functional and comparative studies of insect genomes have shed light on the complement of genes, which in part, account for shared morphologies, developmental programs and life-histories. Contrasting the gene inventories of insects to those of the nematodes provides insight into the genomic changes responsible for their diversification. However, nematodes have weak relationships to insects, as each belongs to separate animal phyla. A better outgroup to distinguish lineage specific novelties would include other members of Arthropoda. For example, crustaceans are close allies to the insects (together forming Pancrustacea) and their fascinating aquatic lifestyle provides an important comparison for understanding the genetic basis of adaptations to life on land versus life in water. Results This study reports on the first characterization of cDNA libraries and sequences for the model crustacean Daphnia pulex. We analyzed 1,546 ESTs of which 1,414 represent approximately 787 nuclear genes, by measuring their sequence similarities with insect and nematode proteomes. The provisional annotation of genes is supported by expression data from microarray studies described in companion papers. Loci expected to be shared between crustaceans and insects because of their mutual biological features are identified, including genes for reproduction, regulation and cellular processes. We identify genes that are likely derived within Pancrustacea or lost within the nematodes. Moreover, lineage specific gene family expansions are identified, which suggest certain biological demands associated with their ecological setting. In particular, up to seven distinct ferritin loci are found in Daphnia compared to three in most insects. Finally, a substantial fraction of the sampled gene transcripts shares no sequence similarity with those from other arthropods. Genes functioning during development and reproduction are comparatively well conserved between crustaceans and insects. By contrast, genes that were responsive to environmental conditions (metal stress) and not sex-biased included the greatest proportion of genes with no matches to insect proteomes. Conclusion This study along with associated microarray experiments are the initial steps in a coordinated effort by the Daphnia Genomics Consortium to build the necessary genomic platform needed to discover genes that account for the phenotypic diversity within the genus and to gain new insights into crustacean biology. This effort will soon include the first crustacean genome sequence. PMID:17612412
Takeda, Itaru; Umemura, Myco; Koike, Hideaki; Asai, Kiyoshi; Machida, Masayuki
2014-08-01
Despite their biological importance, a significant number of genes for secondary metabolite biosynthesis (SMB) remain undetected due largely to the fact that they are highly diverse and are not expressed under a variety of cultivation conditions. Several software tools including SMURF and antiSMASH have been developed to predict fungal SMB gene clusters by finding core genes encoding polyketide synthase, nonribosomal peptide synthetase and dimethylallyltryptophan synthase as well as several others typically present in the cluster. In this work, we have devised a novel comparative genomics method to identify SMB gene clusters that is independent of motif information of the known SMB genes. The method detects SMB gene clusters by searching for a similar order of genes and their presence in nonsyntenic blocks. With this method, we were able to identify many known SMB gene clusters with the core genes in the genomic sequences of 10 filamentous fungi. Furthermore, we have also detected SMB gene clusters without core genes, including the kojic acid biosynthesis gene cluster of Aspergillus oryzae. By varying the detection parameters of the method, a significant difference in the sequence characteristics was detected between the genes residing inside the clusters and those outside the clusters. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Identifying potential maternal genes of Bombyx mori using digital gene expression profiling
Xu, Pingzhen
2018-01-01
Maternal genes present in mature oocytes play a crucial role in the early development of silkworm. Although maternal genes have been widely studied in many other species, there has been limited research in Bombyx mori. High-throughput next generation sequencing provides a practical method for gene discovery on a genome-wide level. Herein, a transcriptome study was used to identify maternal-related genes from silkworm eggs. Unfertilized eggs from five different stages of early development were used to detect the changing situation of gene expression. The expressed genes showed different patterns over time. Seventy-six maternal genes were annotated according to homology analysis with Drosophila melanogaster. More than half of the differentially expressed maternal genes fell into four expression patterns, while the expression patterns showed a downward trend over time. The functional annotation of these material genes was mainly related to transcription factor activity, growth factor activity, nucleic acid binding, RNA binding, ATP binding, and ion binding. Additionally, twenty-two gene clusters including maternal genes were identified from 18 scaffolds. Altogether, we plotted a profile for the maternal genes of Bombyx mori using a digital gene expression profiling method. This will provide the basis for maternal-specific signature research and improve the understanding of the early development of silkworm. PMID:29462160
Juraeva, Dilafruz; Haenisch, Britta; Zapatka, Marc; Frank, Josef; Witt, Stephanie H; Mühleisen, Thomas W; Treutlein, Jens; Strohmaier, Jana; Meier, Sandra; Degenhardt, Franziska; Giegling, Ina; Ripke, Stephan; Leber, Markus; Lange, Christoph; Schulze, Thomas G; Mössner, Rainald; Nenadic, Igor; Sauer, Heinrich; Rujescu, Dan; Maier, Wolfgang; Børglum, Anders; Ophoff, Roel; Cichon, Sven; Nöthen, Markus M; Rietschel, Marcella; Mattheisen, Manuel; Brors, Benedikt
2014-06-01
In the present study, an integrated hierarchical approach was applied to: (1) identify pathways associated with susceptibility to schizophrenia; (2) detect genes that may be potentially affected in these pathways since they contain an associated polymorphism; and (3) annotate the functional consequences of such single-nucleotide polymorphisms (SNPs) in the affected genes or their regulatory regions. The Global Test was applied to detect schizophrenia-associated pathways using discovery and replication datasets comprising 5,040 and 5,082 individuals of European ancestry, respectively. Information concerning functional gene-sets was retrieved from the Kyoto Encyclopedia of Genes and Genomes, Gene Ontology, and the Molecular Signatures Database. Fourteen of the gene-sets or pathways identified in the discovery dataset were confirmed in the replication dataset. These include functional processes involved in transcriptional regulation and gene expression, synapse organization, cell adhesion, and apoptosis. For two genes, i.e. CTCF and CACNB2, evidence for association with schizophrenia was available (at the gene-level) in both the discovery study and published data from the Psychiatric Genomics Consortium schizophrenia study. Furthermore, these genes mapped to four of the 14 presently identified pathways. Several of the SNPs assigned to CTCF and CACNB2 have potential functional consequences, and a gene in close proximity to CACNB2, i.e. ARL5B, was identified as a potential gene of interest. Application of the present hierarchical approach thus allowed: (1) identification of novel biological gene-sets or pathways with potential involvement in the etiology of schizophrenia, as well as replication of these findings in an independent cohort; (2) detection of genes of interest for future follow-up studies; and (3) the highlighting of novel genes in previously reported candidate regions for schizophrenia.
Frech, Christian; Chen, Nansheng
2011-01-01
Genes underlying important phenotypic differences between Plasmodium species, the causative agents of malaria, are frequently found in only a subset of species and cluster at dynamically evolving subtelomeric regions of chromosomes. We hypothesized that chromosome-internal regions of Plasmodium genomes harbour additional species subset-specific genes that underlie differences in human pathogenicity, human-to-human transmissibility, and human virulence. We combined sequence similarity searches with synteny block analyses to identify species subset-specific genes in chromosome-internal regions of six published Plasmodium genomes, including Plasmodium falciparum, Plasmodium vivax, Plasmodium knowlesi, Plasmodium yoelii, Plasmodium berghei, and Plasmodium chabaudi. To improve comparative analysis, we first revised incorrectly annotated gene models using homology-based gene finders and examined putative subset-specific genes within syntenic contexts. Confirmed subset-specific genes were then analyzed for their role in biological pathways and examined for molecular functions using publicly available databases. We identified 16 genes that are well conserved in the three primate parasites but not found in rodent parasites, including three key enzymes of the thiamine (vitamin B1) biosynthesis pathway. Thirteen genes were found to be present in both human parasites but absent in the monkey parasite P. knowlesi, including genes specifically upregulated in sporozoites or gametocytes that could be linked to parasite transmission success between humans. Furthermore, we propose 15 chromosome-internal P. falciparum-specific genes as new candidate genes underlying increased human virulence and detected a currently uncharacterized cluster of P. vivax-specific genes on chromosome 6 likely involved in erythrocyte invasion. In conclusion, Plasmodium species harbour many chromosome-internal differences in the form of protein-coding genes, some of which are potentially linked to human disease and thus promising leads for future laboratory research. PMID:22215999
Transposon mutagenesis identifies genes that cooperate with mutant Pten in breast cancer progression
Rangel, Roberto; Lee, Song-Choon; Hon-Kim Ban, Kenneth; Guzman-Rojas, Liliana; Mann, Michael B.; Newberg, Justin Y.; McNoe, Leslie A.; Selvanesan, Luxmanan; Ward, Jerrold M.; Rust, Alistair G.; Chin, Kuan-Yew; Black, Michael A.; Jenkins, Nancy A.; Copeland, Neal G.
2016-01-01
Triple-negative breast cancer (TNBC) has the worst prognosis of any breast cancer subtype. To better understand the genetic forces driving TNBC, we performed a transposon mutagenesis screen in a phosphatase and tensin homolog (Pten) mutant mice and identified 12 candidate trunk drivers and a much larger number of progression genes. Validation studies identified eight TNBC tumor suppressor genes, including the GATA-like transcriptional repressor TRPS1. Down-regulation of TRPS1 in TNBC cells promoted epithelial-to-mesenchymal transition (EMT) by deregulating multiple EMT pathway genes, in addition to increasing the expression of SERPINE1 and SERPINB2 and the subsequent migration, invasion, and metastasis of tumor cells. Transposon mutagenesis has thus provided a better understanding of the genetic forces driving TNBC and discovered genes with potential clinical importance in TNBC. PMID:27849608
Aykut, Ayça; Karaca, Emin; Onay, Hüseyin; Gökşen, Damla; Çetinkalp, Şevki; Eren, Erdal; Ersoy, Betül; Çakır, Esra Papatya; Büyükinan, Muammer; Kara, Cengiz; Anık, Ahmet; Kırel, Birgül; Özen, Samim; Atik, Tahir; Darcan, Şükran; Özkınay, Ferda
2018-01-30
Maturity onset diabetes is a genetic form of diabetes mellitus characterized by an early age at onset and several etiologic genes for this form of diabetes have been identified in many patients. Maturity onset diabetes type 2 [MODY2 (#125851)] caused by mutations in the glucokinase gene (GCK). Although its prevalence is not clear, it is estimated that 1%-2% of patients with diabetes have the monogenic form. The aim of this study was to evaluate the molecular spectrum of GCK gene mutations in 177 Turkish MODY type 2 patients. Mutations in the GCK gene were identified in 79 out of 177. All mutant alleles were identified, including 45 different GCK mutations, 20 of which were novel. Copyright © 2017. Published by Elsevier B.V.
NASA Technical Reports Server (NTRS)
Stolc, Viktor; Samanta, Manoj Pratim; Tongprasit, Waraporn; Marshall, Wallace F.
2005-01-01
The important role that cilia and flagella play in human disease creates an urgent need to identify genes involved in ciliary assembly and function. The strong and specific induction of flagellar-coding genes during flagellar regeneration in Chlamydomonas reinhardtii suggests that transcriptional profiling of such cells would reveal new flagella-related genes. We have conducted a genome-wide analysis of RNA transcript levels during flagellar regeneration in Chlamydomonas by using maskless photolithography method-produced DNA oligonucleotide microarrays with unique probe sequences for all exons of the 19,803 predicted genes. This analysis represents previously uncharacterized whole-genome transcriptional activity profiling study in this important model organism. Analysis of strongly induced genes reveals a large set of known flagellar components and also identifies a number of important disease-related proteins as being involved with cilia and flagella, including the zebrafish polycystic kidney genes Qilin, Reptin, and Pontin, as well as the testis-expressed tubby-like protein TULP2.
Chaudhary, Saurabh; Sharma, Prakash C.
2015-01-01
Seabuckthorn (Hippophae rhamnoides L.), an important plant species of Indian Himalayas, is well known for its immense medicinal and nutritional value. The plant has the ability to sustain growth in harsh environments of extreme temperatures, drought and salinity. We employed DeepSAGE, a tag based approach, to identify differentially expressed genes under cold and freeze stress in seabuckthorn. In total 36.2 million raw tags including 13.9 million distinct tags were generated using Illumina sequencing platform for three leaf tissue libraries including control (CON), cold stress (CS) and freeze stress (FS). After discarding low quality tags, 35.5 million clean tags including 7 million distinct clean tags were obtained. In all, 11922 differentially expressed genes (DEGs) including 6539 up regulated and 5383 down regulated genes were identified in three comparative setups i.e. CON vs CS, CON vs FS and CS vs FS. Gene ontology and KEGG pathway analysis were performed to assign gene ontology term to DEGs and ascertain their biological functions. DEGs were mapped back to our existing seabuckthorn transcriptome assembly comprising of 88,297 putative unigenes leading to the identification of 428 cold and freeze stress responsive genes. Expression of randomly selected 22 DEGs was validated using qRT-PCR that further supported our DeepSAGE results. The present study provided a comprehensive view of global gene expression profile of seabuckthorn under cold and freeze stresses. The DeepSAGE data could also serve as a valuable resource for further functional genomics studies aiming selection of candidate genes for development of abiotic stress tolerant transgenic plants. PMID:25803684
Chaudhary, Saurabh; Sharma, Prakash C
2015-01-01
Seabuckthorn (Hippophae rhamnoides L.), an important plant species of Indian Himalayas, is well known for its immense medicinal and nutritional value. The plant has the ability to sustain growth in harsh environments of extreme temperatures, drought and salinity. We employed DeepSAGE, a tag based approach, to identify differentially expressed genes under cold and freeze stress in seabuckthorn. In total 36.2 million raw tags including 13.9 million distinct tags were generated using Illumina sequencing platform for three leaf tissue libraries including control (CON), cold stress (CS) and freeze stress (FS). After discarding low quality tags, 35.5 million clean tags including 7 million distinct clean tags were obtained. In all, 11922 differentially expressed genes (DEGs) including 6539 up regulated and 5383 down regulated genes were identified in three comparative setups i.e. CON vs CS, CON vs FS and CS vs FS. Gene ontology and KEGG pathway analysis were performed to assign gene ontology term to DEGs and ascertain their biological functions. DEGs were mapped back to our existing seabuckthorn transcriptome assembly comprising of 88,297 putative unigenes leading to the identification of 428 cold and freeze stress responsive genes. Expression of randomly selected 22 DEGs was validated using qRT-PCR that further supported our DeepSAGE results. The present study provided a comprehensive view of global gene expression profile of seabuckthorn under cold and freeze stresses. The DeepSAGE data could also serve as a valuable resource for further functional genomics studies aiming selection of candidate genes for development of abiotic stress tolerant transgenic plants.
Lu, Wei; Wise, Michael J.; Tay, Chin Yen; Windsor, Helen M.; Marshall, Barry J.; Peacock, Christopher
2014-01-01
Isolates of Helicobacter pylori can be classified phylogeographically. High genetic diversity and rapid microevolution are a hallmark of H. pylori genomes, a phenomenon that is proposed to play a functional role in persistence and colonization of diverse human populations. To provide further genomic evidence in the lineage of H. pylori and to further characterize diverse strains of this pathogen in different human populations, we report the finished genome sequence of Sahul64, an H. pylori strain isolated from an indigenous Australian. Our analysis identified genes that were highly divergent compared to the 38 publically available genomes, which include genes involved in the biosynthesis and modification of lipopolysaccharide, putative prophage genes, restriction modification components, and hypothetical genes. Furthermore, the virulence-associated vacA locus is a pseudogene and the cag pathogenicity island (cagPAI) is not present. However, the genome does contain a gene cluster associated with pathogenicity, including dupA. Our analysis found that with the addition of Sahul64 to the 38 genomes, the core genome content of H. pylori is reduced by approximately 14% (∼170 genes) and the pan-genome has expanded from 2,070 to 2,238 genes. We have identified three putative horizontally acquired regions, including one that is likely to have been acquired from the closely related Helicobacter cetorum prior to speciation. Our results suggest that Sahul64, with the absence of cagPAI, highly divergent cell envelope proteins, and a predicted nontransportable VacA protein, could be more highly adapted to ancient indigenous Australian people but with lower virulence potential compared to other sequenced and cagPAI-positive H. pylori strains. PMID:24375107
Lu, Wei; Wise, Michael J; Tay, Chin Yen; Windsor, Helen M; Marshall, Barry J; Peacock, Christopher; Perkins, Tim
2014-03-01
Isolates of Helicobacter pylori can be classified phylogeographically. High genetic diversity and rapid microevolution are a hallmark of H. pylori genomes, a phenomenon that is proposed to play a functional role in persistence and colonization of diverse human populations. To provide further genomic evidence in the lineage of H. pylori and to further characterize diverse strains of this pathogen in different human populations, we report the finished genome sequence of Sahul64, an H. pylori strain isolated from an indigenous Australian. Our analysis identified genes that were highly divergent compared to the 38 publically available genomes, which include genes involved in the biosynthesis and modification of lipopolysaccharide, putative prophage genes, restriction modification components, and hypothetical genes. Furthermore, the virulence-associated vacA locus is a pseudogene and the cag pathogenicity island (cagPAI) is not present. However, the genome does contain a gene cluster associated with pathogenicity, including dupA. Our analysis found that with the addition of Sahul64 to the 38 genomes, the core genome content of H. pylori is reduced by approximately 14% (∼170 genes) and the pan-genome has expanded from 2,070 to 2,238 genes. We have identified three putative horizontally acquired regions, including one that is likely to have been acquired from the closely related Helicobacter cetorum prior to speciation. Our results suggest that Sahul64, with the absence of cagPAI, highly divergent cell envelope proteins, and a predicted nontransportable VacA protein, could be more highly adapted to ancient indigenous Australian people but with lower virulence potential compared to other sequenced and cagPAI-positive H. pylori strains.
RIT2: responsible and susceptible gene for neurological and psychiatric disorders.
Daneshmandpour, Yousef; Darvish, Hossein; Emamalizadeh, Babak
2018-06-02
RIT2 gene was recently introduced as a susceptibility gene in neurological disorders, a group of major problems in human society affecting millions of people worldwide. Several variants, including single nucleotide polymorphisms and CNVs, have been identified and studied in different populations. In this review, we have summarized the studies relevant to the RIT2 gene and its related disorders, including Parkinson's disease, schizophrenia, and autism. The protein product of RIT2 is a member of the Ras superfamily that plays important roles in many vital cellular functions, such as differentiation and survival. We have also investigated the protein network of the RIT2 protein and the diseases related to members of this network so as to obtain some clues for future studies by identifying the molecular pathophysiology of neurological disorders and revealing new possible disorders related to RIT2.
The current state of play on the molecular genetics of depression.
Cohen-Woods, S; Craig, I W; McGuffin, P
2013-04-01
It has been well established that both genes and non-shared environment contribute substantially to the underlying aetiology of major depressive disorder (MDD). A comprehensive overview of genetic research in MDD is presented. Method Papers were retrieved from PubMed up to December 2011, using many keywords including: depression, major depressive disorder, genetics, rare variants, gene-environment, whole genome, epigenetics, and specific candidate genes and variants. These were combined in a variety of permutations. Linkage studies have yielded some promising chromosomal regions in MDD. However, there is a continued lack of consistency in association studies, in both candidate gene and genome-wide association studies (GWAS). Numerous factors may account for variable results including the use of different diagnostic approaches, small samples in early studies, population stratification, epigenetic phenomena, copy number variation (CNV), rare variation, and phenotypic and allelic heterogeneity. The conflicting results are also probably, in part, a consequence of environmental factors not being considered or controlled for. Each research group has to identify what issues their sample may best address. We suggest that, where possible, more emphasis should be placed on the environment in molecular behavioural genetics to identify individuals at environmental high risk in addition to genetic high risk. Sequencing should be used to identify rare and alternative variation that may act as a risk factor, and a systems biology approach including gene-gene interactions and pathway analyses would be advantageous. GWAS may require even larger samples with reliably defined (sub)phenotypes.
Scott, Barry; Young, Carolyn A.; Saikia, Sanjay; McMillan, Lisa K.; Monahan, Brendon J.; Koulman, Albert; Astin, Jonathan; Eaton, Carla J.; Bryant, Andrea; Wrenn, Ruth E.; Finch, Sarah C.; Tapper, Brian A.; Parker, Emily J.; Jameson, Geoffrey B.
2013-01-01
The indole-diterpene paxilline is an abundant secondary metabolite synthesized by Penicillium paxilli. In total, 21 genes have been identified at the PAX locus of which six have been previously confirmed to have a functional role in paxilline biosynthesis. A combination of bioinformatics, gene expression and targeted gene replacement analyses were used to define the boundaries of the PAX gene cluster. Targeted gene replacement identified seven genes, paxG, paxA, paxM, paxB, paxC, paxP and paxQ that were all required for paxilline production, with one additional gene, paxD, required for regular prenylation of the indole ring post paxilline synthesis. The two putative transcription factors, PP104 and PP105, were not co-regulated with the pax genes and based on targeted gene replacement, including the double knockout, did not have a role in paxilline production. The relationship of indole dimethylallyl transferases involved in prenylation of indole-diterpenes such as paxilline or lolitrem B, can be found as two disparate clades, not supported by prenylation type (e.g., regular or reverse). This paper provides insight into the P. paxilli indole-diterpene locus and reviews the recent advances identified in paxilline biosynthesis. PMID:23949005
Capsule Production and Glucose Metabolism Dictate Fitness during Serratia marcescens Bacteremia.
Anderson, Mark T; Mitchell, Lindsay A; Zhao, Lili; Mobley, Harry L T
2017-05-23
Serratia marcescens is an opportunistic pathogen that causes a range of human infections, including bacteremia, keratitis, wound infections, and urinary tract infections. Compared to other members of the Enterobacteriaceae family, the genetic factors that facilitate Serratia proliferation within the mammalian host are less well defined. An in vivo screen of transposon insertion mutants identified 212 S. marcescens fitness genes that contribute to bacterial survival in a murine model of bloodstream infection. Among those identified, 11 genes were located within an 18-gene cluster encoding predicted extracellular polysaccharide biosynthesis proteins. A mutation in the wzx gene contained within this locus conferred a loss of fitness in competition infections with the wild-type strain and a reduction in extracellular uronic acids correlating with capsule loss. A second gene, pgm , encoding a phosphoglucomutase exhibited similar capsule-deficient phenotypes, linking central glucose metabolism with capsule production and fitness of Serratia during mammalian infection. Further evidence of the importance of central metabolism was obtained with a pfkA glycolytic mutant that demonstrated reduced replication in human serum and during murine infection. An MgtB magnesium transporter homolog was also among the fitness factors identified, and an S. marcescens mgtB mutant exhibited decreased growth in defined medium containing low concentrations of magnesium and was outcompeted ~10-fold by wild-type bacteria in mice. Together, these newly identified genes provide a more complete understanding of the specific requirements for S. marcescens survival in the mammalian host and provide a framework for further investigation of the means by which S. marcescens causes opportunistic infections. IMPORTANCE Serratia marcescens is a remarkably prolific organism that replicates in diverse environments, including as an opportunistic pathogen in human bacteremia. The genetic requirements for S. marcescens survival in the mammalian bloodstream were defined in this work by transposon insertion sequencing. In total, 212 genes that contribute to bacterial fitness were identified. When sorted via biological function, two of the major fitness categories identified herein were genes encoding capsule polysaccharide biogenesis functions and genes involved in glucose utilization. Further investigation determined that certain glucose metabolism fitness genes are also important for the generation of extracellular polysaccharides. Together, these results identify critical biological processes that allow S. marcescens to colonize the mammalian bloodstream. Copyright © 2017 Anderson et al.
Gao, Haiyan; Yang, Mei; Zhang, Xiaolan
2018-04-01
The present study aimed to investigate potential recurrence-risk biomarkers based on significant pathways for Luminal A breast cancer through gene expression profile analysis. Initially, the gene expression profiles of Luminal A breast cancer patients were downloaded from The Cancer Genome Atlas database. The differentially expressed genes (DEGs) were identified using a Limma package and the hierarchical clustering analysis was conducted for the DEGs. In addition, the functional pathways were screened using Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses and rank ratio calculation. The multigene prognostic assay was exploited based on the statistically significant pathways and its prognostic function was tested using train set and verified using the gene expression data and survival data of Luminal A breast cancer patients downloaded from the Gene Expression Omnibus. A total of 300 DEGs were identified between good and poor outcome groups, including 176 upregulated genes and 124 downregulated genes. The DEGs may be used to effectively distinguish Luminal A samples with different prognoses verified by hierarchical clustering analysis. There were 9 pathways screened as significant pathways and a total of 18 DEGs involved in these 9 pathways were identified as prognostic biomarkers. According to the survival analysis and receiver operating characteristic curve, the obtained 18-gene prognostic assay exhibited good prognostic function with high sensitivity and specificity to both the train and test samples. In conclusion the 18-gene prognostic assay including the key genes, transcription factor 7-like 2, anterior parietal cortex and lymphocyte enhancer factor-1 may provide a new method for predicting outcomes and may be conducive to the promotion of precision medicine for Luminal A breast cancer.
Microarray technology is a powerful tool to investigate the gene expression profiles for thousands of genes simultaneously. In recent years, microarrays have been used to characterize environmental pollutants and identify molecular mode(s) of action of chemicals including endocri...
Popesku, Jason T; Martyniuk, Christopher J; Trudeau, Vance L
2012-01-01
Dopamine (DA) is a major neurotransmitter important for neuroendocrine control and recent studies have described genomic signaling pathways activated and inhibited by DA agonists and antagonists in the goldfish brain. Here we perform a meta-type analysis using microarray datasets from experiments conducted with female goldfish to characterize the gene expression responses that underlie dopaminergic signaling. Sexually mature, pre-spawning [gonadosomatic index (GSI) = 4.5 ± 1.3%] or sexually regressing (GSI = 3 ± 0.4%) female goldfish (15-40 g) injected intraperitoneally with either SKF 38393, LY 171555, SCH 23390, sulpiride, or a combination of 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine and α-methyl-p-tyrosine. Microarray meta-type analysis identified 268 genes in the telencephalon and hypothalamus as having reciprocal (i.e., opposite between agonism and antagonism/depletion) fold change responses, suggesting that these transcripts are likely targets for DA-mediated regulation. Noteworthy genes included ependymin, vimentin, and aromatase, genes that support the significance of DA in neuronal plasticity and tissue remodeling. Sub-network enrichment analysis (SNEA) was used to identify common gene regulators and binding proteins associated with the differentially expressed genes mediated by DA. SNEA analysis identified gene expression targets that were related to three major categories that included cell signaling (STAT3, SP1, SMAD, Jun/Fos), immune response (IL-6, IL-1β, TNFs, cytokine, NF-κB), and cell proliferation and growth (IGF1, TGFβ1). These gene networks are also known to be associated with neurodegenerative disorders such as Parkinsons' disease, well-known to be associated with loss of dopaminergic neurons. This study identifies genes and networks that underlie DA signaling in the vertebrate CNS and provides targets that may be key neuroendocrine regulators. The results provide a foundation for future work on dopaminergic regulation of gene expression in fish model systems.
Popesku, Jason T.; Martyniuk, Christopher J.; Trudeau, Vance L.
2012-01-01
Dopamine (DA) is a major neurotransmitter important for neuroendocrine control and recent studies have described genomic signaling pathways activated and inhibited by DA agonists and antagonists in the goldfish brain. Here we perform a meta-type analysis using microarray datasets from experiments conducted with female goldfish to characterize the gene expression responses that underlie dopaminergic signaling. Sexually mature, pre-spawning [gonadosomatic index (GSI) = 4.5 ± 1.3%] or sexually regressing (GSI = 3 ± 0.4%) female goldfish (15–40 g) injected intraperitoneally with either SKF 38393, LY 171555, SCH 23390, sulpiride, or a combination of 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine and α-methyl-p-tyrosine. Microarray meta-type analysis identified 268 genes in the telencephalon and hypothalamus as having reciprocal (i.e., opposite between agonism and antagonism/depletion) fold change responses, suggesting that these transcripts are likely targets for DA-mediated regulation. Noteworthy genes included ependymin, vimentin, and aromatase, genes that support the significance of DA in neuronal plasticity and tissue remodeling. Sub-network enrichment analysis (SNEA) was used to identify common gene regulators and binding proteins associated with the differentially expressed genes mediated by DA. SNEA analysis identified gene expression targets that were related to three major categories that included cell signaling (STAT3, SP1, SMAD, Jun/Fos), immune response (IL-6, IL-1β, TNFs, cytokine, NF-κB), and cell proliferation and growth (IGF1, TGFβ1). These gene networks are also known to be associated with neurodegenerative disorders such as Parkinsons’ disease, well-known to be associated with loss of dopaminergic neurons. This study identifies genes and networks that underlie DA signaling in the vertebrate CNS and provides targets that may be key neuroendocrine regulators. The results provide a foundation for future work on dopaminergic regulation of gene expression in fish model systems. PMID:23130016
Coyne, Carolyn B; Bozym, Rebecca; Morosky, Stefanie A; Hanna, Sheri L; Mukherjee, Amitava; Tudor, Matthew; Kim, Kwang Sik; Cherry, Sara
2011-01-20
Enteroviruses, including coxsackievirus B (CVB) and poliovirus (PV), can access the CNS through the blood brain barrier (BBB) endothelium to cause aseptic meningitis. To identify cellular components required for CVB and PV infection of human brain microvascular endothelial cells, an in vitro BBB model, we performed comparative RNAi screens and identified 117 genes that influenced infection. Whereas a large proportion of genes whose depletion enhanced infection (17 of 22) were broadly antienteroviral, only 46 of the 95 genes whose depletion inhibited infection were required by both CVB and PV and included components of cell signaling pathways such as adenylate cyclases. Downregulation of genes including Rab GTPases, Src tyrosine kinases, and tyrosine phosphatases displayed specificity in their requirement for either CVB or PV infection. These findings highlight the pathways hijacked by enteroviruses for entry and replication in the BBB endothelium, a specialized and clinically relevant cell type for these viruses. Copyright © 2011 Elsevier Inc. All rights reserved.
An atlas of gene expression and gene co-regulation in the human retina.
Pinelli, Michele; Carissimo, Annamaria; Cutillo, Luisa; Lai, Ching-Hung; Mutarelli, Margherita; Moretti, Maria Nicoletta; Singh, Marwah Veer; Karali, Marianthi; Carrella, Diego; Pizzo, Mariateresa; Russo, Francesco; Ferrari, Stefano; Ponzin, Diego; Angelini, Claudia; Banfi, Sandro; di Bernardo, Diego
2016-07-08
The human retina is a specialized tissue involved in light stimulus transduction. Despite its unique biology, an accurate reference transcriptome is still missing. Here, we performed gene expression analysis (RNA-seq) of 50 retinal samples from non-visually impaired post-mortem donors. We identified novel transcripts with high confidence (Observed Transcriptome (ObsT)) and quantified the expression level of known transcripts (Reference Transcriptome (RefT)). The ObsT included 77 623 transcripts (23 960 genes) covering 137 Mb (35 Mb new transcribed genome). Most of the transcripts (92%) were multi-exonic: 81% with known isoforms, 16% with new isoforms and 3% belonging to new genes. The RefT included 13 792 genes across 94 521 known transcripts. Mitochondrial genes were among the most highly expressed, accounting for about 10% of the reads. Of all the protein-coding genes in Gencode, 65% are expressed in the retina. We exploited inter-individual variability in gene expression to infer a gene co-expression network and to identify genes specifically expressed in photoreceptor cells. We experimentally validated the photoreceptors localization of three genes in human retina that had not been previously reported. RNA-seq data and the gene co-expression network are available online (http://retina.tigem.it). © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A cluster of bacterial genes for anaerobic benzene ring biodegradation
Egland, Paul G.; Pelletier, Dale A.; Dispensa, Marilyn; Gibson, Jane; Harwood, Caroline S.
1997-01-01
A reductive benzoate pathway is the central conduit for the anaerobic biodegradation of aromatic pollutants and lignin monomers. Benzene ring reduction requires a large input of energy and this metabolic capability has, so far, been reported only in bacteria. To determine the molecular basis for this environmentally important process, we cloned and analyzed genes required for the anaerobic degradation of benzoate and related compounds from the phototrophic bacterium, Rhodopseudomonas palustris. A cluster of 24 genes was identified that includes twelve genes likely to be involved in anaerobic benzoate degradation and additional genes that convert the related compounds 4-hydroxybenzoate and cyclohexanecarboxylate to benzoyl-CoA. Genes encoding benzoyl-CoA reductase, a novel enzyme able to overcome the resonance stability of the aromatic ring, were identified by directed mutagenesis. The gene encoding the ring-cleavage enzyme, 2-ketocyclohexanecarboxyl-CoA hydrolase, was identified by assaying the enzymatic activity of the protein expressed in Escherichia coli. Physiological data and DNA sequence analyses indicate that the benzoate pathway consists of unusual enzymes for ring reduction and cleavage interposed among enzymes homologous to those catalyzing fatty acid degradation. The cloned genes should be useful as probes to identify benzoate degradation genes from other metabolically distinct groups of anaerobic bacteria, such as denitrifying bacteria and sulfate-reducing bacteria. PMID:9177244
Image-guided genomic analysis of tissue response to laser-induced thermal stress
NASA Astrophysics Data System (ADS)
Mackanos, Mark A.; Helms, Mike; Kalish, Flora; Contag, Christopher H.
2011-05-01
The cytoprotective response to thermal injury is characterized by transcriptional activation of ``heat shock proteins'' (hsp) and proinflammatory proteins. Expression of these proteins may predict cellular survival. Microarray analyses were performed to identify spatially distinct gene expression patterns responding to thermal injury. Laser injury zones were identified by expression of a transgene reporter comprised of the 70 kD hsp gene and the firefly luciferase coding sequence. Zones included the laser spot, the surrounding region where hsp70-luc expression was increased, and a region adjacent to the surrounding region. A total of 145 genes were up-regulated in the laser irradiated region, while 69 were up-regulated in the adjacent region. At 7 hours the chemokine Cxcl3 was the highest expressed gene in the laser spot (24 fold) and adjacent region (32 fold). Chemokines were the most common up-regulated genes identified. Microarray gene expression was successfully validated using qRT- polymerase chain reaction for selected genes of interest. The early response genes are likely involved in cytoprotection and initiation of the healing response. Their regulatory elements will benefit creating the next generation reporter mice and controlling expression of therapeutic proteins. The identified genes serve as drug development targets that may prevent acute tissue damage and accelerate healing.
Christensen, T; Bisgaard, C F; Wiborg, O
2011-11-24
The aim of the present study was to identify potential biomarkers for depression in the search for novel disease targets and treatment regimens. Furthermore, the study includes a search for biomarkers involved in treatment resistance and stress resilience in order to investigate mechanisms underlying antidepressant drug refraction and stress-coping strategies. Depression-related transcriptomic changes in gene expression profiles were investigated in laser-captured microdissected (LCM) rat hippocampal granular cell layers (GCL) using the chronic mild stress (CMS) rat model of depression and chronic administration of two selective serotonin reuptake inhibitors (SSRIs), escitalopram and sertraline. CMS rats were segregated into diverging groups according to behavioral readouts, and under stringent constraints, the associated differential gene regulations were analyzed. Accordingly, we identified four genes associated with recovery, two genes implicated in treatment resistance, and three genes involved in stress resilience. The identified genes associated with mechanisms of cellular plasticity, including signal transduction, cell proliferation, cell differentiation, and synaptic release. Hierarchical clustering analysis confirmed the subgroup segregation pattern in the CMS model. Thus antidepressant treatment refractors cluster with anhedonic-like rats, and, interestingly, stress-resilient rats cluster with rats undergoing antidepressant-mediated recovery from anhedonia, suggesting antidepressant mechanisms of action to emulate endogenous stress-coping strategies. Copyright © 2011 IBRO. Published by Elsevier Ltd. All rights reserved.
Ma, Yu-Hua; Ye, Gui-Sheng
2018-06-11
In this study, we screened differentially expressed genes in a multidrug-resistant isolate strain of Clostridium perfringens by RNA sequencing. We also separated and identified differentially expressed proteins (DEPs) in the isolate strain by two-dimensional electrophoresis (2-DE) and mass spectrometry (MS). The RNA sequencing results showed that, compared with the control strain, 1128 genes were differentially expressed in the isolate strain, and these included 227 up-regulated genes and 901 down-regulated genes. Bioinformatics analysis identified the following genes and gene categories that are potentially involved in multidrug resistance (MDR) in the isolate strain: drug transport, drug response, hydrolase activity, transmembrane transporter, transferase activity, amidase transmembrane transporter, efflux transmembrane transporter, bacterial chemotaxis, ABC transporter, and others. The results of the 2-DE showed that 70 proteins were differentially expressed in the isolate strain, 45 of which were up-regulated and 25 down-regulated. Twenty-seven DEPs were identified by MS and these included the following protein categories: ribosome, antimicrobial peptide resistance, and ABC transporter, all of which may be involved in MDR in the isolate strain of C. perfringens. The results provide reference data for further investigations on the drug resistant molecular mechanisms of C. perfringens.
Halabi, Najeeb M.; Martinez, Alejandra; Al-Farsi, Halema; Mery, Eliane; Puydenus, Laurence; Pujol, Pascal; Khalak, Hanif G.; McLurcan, Cameron; Ferron, Gwenael; Querleu, Denis; Al-Azwani, Iman; Al-Dous, Eman; Mohamoud, Yasmin A.; Malek, Joel A.; Rafii, Arash
2016-01-01
Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. PMID:26735499
Guo, Nan; Zhang, Nan; Yan, Liqiu; Lian, Zheng; Wang, Jiawang; Lv, Fengfeng; Wang, Yunfei; Cao, Xufen
2018-06-14
Acute myocardial infarction induces ventricular remodeling, which is implicated in dilated heart and heart failure. The pathogenical mechanism of myocardium remodeling remains to be elucidated. The aim of the present study was to identify key genes and networks for myocardium remodeling following ischemia‑reperfusion (IR). First, the mRNA expression data from the National Center for Biotechnology Information database were downloaded to identify differences in mRNA expression of the IR heart at days 2 and 7. Then, weighted gene co‑expression network analysis, hierarchical clustering, protein‑protein interaction (PPI) network, Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway were used to identify key genes and networks for the heart remodeling process following IR. A total of 3,321 differentially expressed genes were identified during the heart remodeling process. A total of 6 modules were identified through gene co‑expression network analysis. GO and KEGG analysis results suggested that each module represented a different biological function and was associated with different pathways. Finally, hub genes of each module were identified by PPI network construction. The present study revealed that heart remodeling following IR is a complicated process, involving extracellular matrix organization, neural development, apoptosis and energy metabolism. The dysregulated genes, including SRC proto‑oncogene, non‑receptor tyrosine kinase, discs large MAGUK scaffold protein 1, ATP citrate lyase, RAN, member RAS oncogene family, tumor protein p53, and polo like kinase 2, may be essential for heart remodeling following IR and may be used as potential targets for the inhibition of heart remodeling following acute myocardial infarction.
Yeger-Lotem, Esti; Riva, Laura; Su, Linhui Julie; Gitler, Aaron D.; Cashikar, Anil; King, Oliver D.; Auluck, Pavan K.; Geddie, Melissa L.; Valastyan, Julie S.; Karger, David R.; Lindquist, Susan; Fraenkel, Ernest
2009-01-01
Cells respond to stimuli by changes in various processes, including signaling pathways and gene expression. Efforts to identify components of these responses increasingly depend on mRNA profiling and genetic library screens, yet the functional roles of the genes identified by these assays often remain enigmatic. By comparing the results of these two assays across various cellular responses, we found that they are consistently distinct. Moreover, genetic screens tend to identify response regulators, while mRNA profiling frequently detects metabolic responses. We developed an integrative approach that bridges the gap between these data using known molecular interactions, thus highlighting major response pathways. We harnessed this approach to reveal cellular pathways related to alpha-synuclein, a small lipid-binding protein implicated in several neurodegenerative disorders including Parkinson disease. For this we screened an established yeast model for alpha-synuclein toxicity to identify genes that when overexpressed alter cellular survival. Application of our algorithm to these data and data from mRNA profiling provided functional explanations for many of these genes and revealed novel relations between alpha-synuclein toxicity and basic cellular pathways. PMID:19234470
Maletzki, Claudia; Huehns, Maja; Bauer, Ingrid; Ripperger, Tim; Mork, Maureen M; Vilar, Eduardo; Klöcking, Sabine; Zettl, Heike; Prall, Friedrich; Linnebacher, Michael
2017-07-01
Mismatch-repair deficient (MMR-D) malignancies include Lynch Syndrome (LS), which is secondary to germline mutations in one of the MMR genes, and the rare childhood-form of constitutional mismatch repair-deficiency (CMMR-D); caused by bi-allelic MMR gene mutations. A hallmark of LS-associated cancers is microsatellite instability (MSI), characterized by coding frameshift mutations (cFSM) in target genes. By contrast, tumors arising in CMMR-D patients are thought to display a somatic mutation pattern differing from LS. This study has the main goal to identify cFSM in MSI target genes relevant in CMMR-D and to compare the spectrum of common somatic mutations, including alterations in DNA polymerases POLE and D1 between LS and CMMR-D. CMMR-D-associated tumors harbored more somatic mutations compared to LS cases, especially in the TP53 gene and in POLE and POLD1, where novel mutations were additionally identified. Strikingly, MSI in classical mononucleotide markers BAT40 and CAT25 was frequent in CMMR-D cases. MSI-target gene analysis revealed mutations in CMMR-D-associated tumors, some of them known to be frequently hit in LS, such as RNaseT2, HT001, and TGFβR2. Our results imply a general role for these cFSM as potential new drivers of MMR-D tumorigenesis. © 2017 Wiley Periodicals, Inc.
Choi, Mi-Jin; Kim, Gun-Do; Kim, Jong-Myoung; Lim, Han Kyu
2015-01-01
The Pacific abalone Haliotis discus hannai is used for commercial aquaculture in Korea. We examined the transcriptome of Pacific abalone Haliotis discus hannai siblings using NGS technology to identify genes associated with high growth rates. Pacific abalones grown for 200 days post-fertilization were divided into small-, medium-, and large-size groups with mean weights of 0.26 ± 0.09 g, 1.43 ± 0.405 g, and 5.24 ± 1.09 g, respectively. RNA isolated from the soft tissues of each group was subjected to RNA sequencing. Approximately 1%–3% of the transcripts were differentially expressed in abalones, depending on the growth rate. RT-PCR was carried out on thirty four genes selected to confirm the relative differences in expression detected by RNA sequencing. Six differentially-expressed genes were identified as associated with faster growth of the Pacific abalone. These include five up-regulated genes (including one specific to females) encoding transcripts homologous to incilarin A, perlucin, transforming growth factor-beta-induced protein immunoglobulin-heavy chain 3 (ig-h3), vitelline envelope zona pellucida domain 4, and defensin, and one down-regulated gene encoding tomoregulin in large abalones. Most of the transcripts were expressed predominantly in the hepatopancreas. The genes identified in this study will lead to development of markers for identification of high-growth-rate abalones and female abalones. PMID:26593905
Choi, Mi-Jin; Kim, Gun-Do; Kim, Jong-Myoung; Lim, Han Kyu
2015-11-18
The Pacific abalone Haliotis discus hannai is used for commercial aquaculture in Korea. We examined the transcriptome of Pacific abalone Haliotis discus hannai siblings using NGS technology to identify genes associated with high growth rates. Pacific abalones grown for 200 days post-fertilization were divided into small-, medium-, and large-size groups with mean weights of 0.26 ± 0.09 g, 1.43 ± 0.405 g, and 5.24 ± 1.09 g, respectively. RNA isolated from the soft tissues of each group was subjected to RNA sequencing. Approximately 1%-3% of the transcripts were differentially expressed in abalones, depending on the growth rate. RT-PCR was carried out on thirty four genes selected to confirm the relative differences in expression detected by RNA sequencing. Six differentially-expressed genes were identified as associated with faster growth of the Pacific abalone. These include five up-regulated genes (including one specific to females) encoding transcripts homologous to incilarin A, perlucin, transforming growth factor-beta-induced protein immunoglobulin-heavy chain 3 (ig-h3), vitelline envelope zona pellucida domain 4, and defensin, and one down-regulated gene encoding tomoregulin in large abalones. Most of the transcripts were expressed predominantly in the hepatopancreas. The genes identified in this study will lead to development of markers for identification of high-growth-rate abalones and female abalones.
Identifying transcription factor functions and targets by phenotypic activation
Chua, Gordon; Morris, Quaid D.; Sopko, Richelle; Robinson, Mark D.; Ryan, Owen; Chan, Esther T.; Frey, Brendan J.; Andrews, Brenda J.; Boone, Charles; Hughes, Timothy R.
2006-01-01
Mapping transcriptional regulatory networks is difficult because many transcription factors (TFs) are activated only under specific conditions. We describe a generic strategy for identifying genes and pathways induced by individual TFs that does not require knowledge of their normal activation cues. Microarray analysis of 55 yeast TFs that caused a growth phenotype when overexpressed showed that the majority caused increased transcript levels of genes in specific physiological categories, suggesting a mechanism for growth inhibition. Induced genes typically included established targets and genes with consensus promoter motifs, if known, indicating that these data are useful for identifying potential new target genes and binding sites. We identified the sequence 5′-TCACGCAA as a binding sequence for Hms1p, a TF that positively regulates pseudohyphal growth and previously had no known motif. The general strategy outlined here presents a straightforward approach to discovery of TF activities and mapping targets that could be adapted to any organism with transgenic technology. PMID:16880382
DOE Office of Scientific and Technical Information (OSTI.GOV)
Williams, Kelly Porter
Key goals towards national biosecurity include methods for analyzing pathogens, predicting their emergence, and developing countermeasures. These goals are served by studying bacterial genes that promote pathogenicity and the pathogenicity islands that mobilize them. Cyberinfrastructure promoting an island database advances this field and enables deeper bioinformatic analysis that may identify novel pathogenicity genes. New automated methods and rich visualizations were developed for identifying pathogenicity islands, based on the principle that islands occur sporadically among closely related strains. The chromosomally-ordered pan-genome organizes all genes from a clade of strains; gaps in this visualization indicate islands, and decorations of the gene matrixmore » facilitate exploration of island gene functions. A %E2%80%9Clearned phyloblocks%E2%80%9D method was developed for automated island identification, that trains on the phylogenetic patterns of islands identified by other methods. Learned phyloblocks better defined termini of previously identified islands in multidrug-resistant Klebsiella pneumoniae ATCC BAA-2146, and found its only antibiotic resistance island.« less
A novel large deletion mutation of FERMT1 gene in a Chinese patient with Kindler syndrome.
Gao, Ying; Bai, Jin-li; Liu, Xiao-yan; Qu, Yu-jin; Cao, Yan-yan; Wang, Jian-cai; Jin, Yu-wei; Wang, Hong; Song, Fang
2015-11-01
Kindler syndrome (KS; OMIM 173650) is a rare autosomal recessive skin disorder, which results in symptoms including blistering, epidermal atrophy, increased risk of cancer, and poor wound healing. The majority of mutations of the disease-determining gene (FERMT1 gene) are single nucleotide substitutions, including missense mutations, nonsense mutations, etc. Large deletion mutations are seldom reported. To determine the mutation in the FERMT1 gene associated with a 7-year-old Chinese patient who presented clinical manifestation of KS, we performed direct sequencing of all the exons of FERMT1 gene. For the exons 2-6 without amplicons, we analyzed the copy numbers using quantitative real-time polymerase chain reaction (qRT-PCR) with specific primers. The deletion breakpoints were sublocalized and the range of deletion was confirmed by PCR and direct sequencing. In this study, we identified a new 17-kb deletion mutation spanning the introns 1-6 of FERMT1 gene in a Chinese patient with severe KS phenotypes. Her parents were carriers of the same mutation. Our study reported a newly identified large deletion mutation of FERMT1 gene involved in KS, which further enriched the mutation spectrum of the FERMT1 gene.
Winterhoff, Boris J; Maile, Makayla; Mitra, Amit Kumar; Sebe, Attila; Bazzaro, Martina; Geller, Melissa A; Abrahante, Juan E; Klein, Molly; Hellweg, Raffaele; Mullany, Sally A; Beckman, Kenneth; Daniel, Jerry; Starr, Timothy K
2017-03-01
The purpose of this study was to determine the level of heterogeneity in high grade serous ovarian cancer (HGSOC) by analyzing RNA expression in single epithelial and cancer associated stromal cells. In addition, we explored the possibility of identifying subgroups based on pathway activation and pre-defined signatures from cancer stem cells and chemo-resistant cells. A fresh, HGSOC tumor specimen derived from ovary was enzymatically digested and depleted of immune infiltrating cells. RNA sequencing was performed on 92 single cells and 66 of these single cell datasets passed quality control checks. Sequences were analyzed using multiple bioinformatics tools, including clustering, principle components analysis, and geneset enrichment analysis to identify subgroups and activated pathways. Immunohistochemistry for ovarian cancer, stem cell and stromal markers was performed on adjacent tumor sections. Analysis of the gene expression patterns identified two major subsets of cells characterized by epithelial and stromal gene expression patterns. The epithelial group was characterized by proliferative genes including genes associated with oxidative phosphorylation and MYC activity, while the stromal group was characterized by increased expression of extracellular matrix (ECM) genes and genes associated with epithelial-to-mesenchymal transition (EMT). Neither group expressed a signature correlating with published chemo-resistant gene signatures, but many cells, predominantly in the stromal subgroup, expressed markers associated with cancer stem cells. Single cell sequencing provides a means of identifying subpopulations of cancer cells within a single patient. Single cell sequence analysis may prove to be critical for understanding the etiology, progression and drug resistance in ovarian cancer. Copyright © 2017 Elsevier Inc. All rights reserved.
Swaminathan, Shanker; Huentelman, Matthew J; Corneveaux, Jason J; Myers, Amanda J; Faber, Kelley M; Foroud, Tatiana; Mayeux, Richard; Shen, Li; Kim, Sungeun; Turk, Mari; Hardy, John; Reiman, Eric M; Saykin, Andrew J
2012-01-01
Copy number variations (CNVs) are genomic regions that have added (duplications) or deleted (deletions) genetic material. They may overlap genes affecting their function and have been shown to be associated with disease. We previously investigated the role of CNVs in late-onset Alzheimer's disease (AD) and mild cognitive impairment using Alzheimer's Disease Neuroimaging Initiative (ADNI) and National Institute of Aging-Late Onset AD/National Cell Repository for AD (NIA-LOAD/NCRAD) Family Study participants, and identified a number of genes overlapped by CNV calls. To confirm the findings and identify other potential candidate regions, we analyzed array data from a unique cohort of 1617 Caucasian participants (1022 AD cases and 595 controls) who were clinically characterized and whose diagnosis was neuropathologically verified. All DNA samples were extracted from brain tissue. CNV calls were generated and subjected to quality control (QC). 728 cases and 438 controls who passed all QC measures were included in case/control association analyses including candidate gene and genome-wide approaches. Rates of deletions and duplications did not significantly differ between cases and controls. Case-control association identified a number of previously reported regions (CHRFAM7A, RELN and DOPEY2) as well as a new gene (HLA-DRA). Meta-analysis of CHRFAM7A indicated a significant association of the gene with AD and/or MCI risk (P = 0.006, odds ratio = 3.986 (95% confidence interval 1.490-10.667)). A novel APP gene duplication was observed in one case sample. Further investigation of the identified genes in independent and larger samples is warranted.
Genome-Wide Association Analysis of Sasang Constitution in the Korean Population
Kim, Bu-Yeo; Jin, Hee-Jeong
2012-01-01
Abstract Objectives Sasang constitutional medicine is a traditional Korean medicine in which an individual is classified into one of four types of constitution: Taeum (TE), Soeum (SE) Soyang (SY), and Taeyang (TY). These constitution types are determined with biologic and physiologic characteristics, so it has been assumed that genetic factors are associated with each constitution type. Identifying the genetic elements underlying each constitution is necessary for the elucidation of the molecular mechanism of Sasang constitutional medicine. Design A total of 341,998 genetic loci across the whole genome were genotyped for 1222 subjects of defined constitution type. The genetic loci associated with each constitution type were identified and the functional connectivity of genes within these loci was analyzed using statistical text mining. Results From the difference in allele frequencies between constitution types, significant genetic loci associated with each type were identified. Chromosomes 3q27.3 (rs10937331, p=2.71×10−6), 15q22.2 (rs7180547, p=1.58×10−6), and 14q22.3 (rs12431592, p=1.31×10−6) were most significantly associated with TE, SE, and SY constitution types, respectively. From the functional relationship analysis using all loci with a p-value≤10−4, genes associated with each constitution type were identified. Fifteen (15) genes, including GPM6A, SYT4, and GRIK1, were significantly associated with the TE constitution type (p<0.05); 12 genes, including DRGX and AKAP11, were significantly associated with the SE constitution type (p<0.05); and 17 genes, including ZFP42, CDH22, ALDH1A2, OTX2, and EN2, were significantly associated with the SY constitution type (p<0.05). Conclusions Genetic loci and genes associated with Sasang constitution types were systematically identified from a genome-wide association study using a large number of subjects. PMID:22394158
Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U.; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N.; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O.
2014-01-01
Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes. PMID:25264628
Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O
2014-01-01
Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes.
Mutation analysis of pre-mRNA splicing genes in Chinese families with retinitis pigmentosa
Pan, Xinyuan; Chen, Xue; Liu, Xiaoxing; Gao, Xiang; Kang, Xiaoli; Xu, Qihua; Chen, Xuejuan; Zhao, Kanxing; Zhang, Xiumei; Chu, Qiaomei; Wang, Xiuying
2014-01-01
Purpose Seven genes involved in precursor mRNA (pre-mRNA) splicing have been implicated in autosomal dominant retinitis pigmentosa (adRP). We sought to detect mutations in all seven genes in Chinese families with RP, to characterize the relevant phenotypes, and to evaluate the prevalence of mutations in splicing genes in patients with adRP. Methods Six unrelated families from our adRP cohort (42 families) and two additional families with RP with uncertain inheritance mode were clinically characterized in the present study. Targeted sequence capture with next-generation massively parallel sequencing (NGS) was performed to screen mutations in 189 genes including all seven pre-mRNA splicing genes associated with adRP. Variants detected with NGS were filtered with bioinformatics analyses, validated with Sanger sequencing, and prioritized with pathogenicity analysis. Results Mutations in pre-mRNA splicing genes were identified in three individual families including one novel frameshift mutation in PRPF31 (p.Leu366fs*1) and two known mutations in SNRNP200 (p.Arg681His and p.Ser1087Leu). The patients carrying SNRNP200 p.R681H showed rapid disease progression, and the family carrying p.S1087L presented earlier onset ages and more severe phenotypes compared to another previously reported family with p.S1087L. In five other families, we identified mutations in other RP-related genes, including RP1 p. Ser781* (novel), RP2 p.Gln65* (novel) and p.Ile137del (novel), IMPDH1 p.Asp311Asn (recurrent), and RHO p.Pro347Leu (recurrent). Conclusions Mutations in splicing genes identified in the present and our previous study account for 9.5% in our adRP cohort, indicating the important role of pre-mRNA splicing deficiency in the etiology of adRP. Mutations in the same splicing gene, or even the same mutation, could correlate with different phenotypic severities, complicating the genotype–phenotype correlation and clinical prognosis. PMID:24940031
Wen, Dong-Yue; Lin, Peng; Pang, Yu-Yan; Chen, Gang; He, Yun; Dang, Yi-Wu; Yang, Hong
2018-05-05
BACKGROUND Long non-coding RNAs (lncRNAs) have a role in physiological and pathological processes, including cancer. The aim of this study was to investigate the expression of the long intergenic non-protein coding RNA 665 (LINC00665) gene and the cell cycle in hepatocellular carcinoma (HCC) using database analysis including The Cancer Genome Atlas (TCGA), the Gene Expression Omnibus (GEO), and quantitative real-time polymerase chain reaction (qPCR). MATERIAL AND METHODS Expression levels of LINC00665 were compared between human tissue samples of HCC and adjacent normal liver, clinicopathological correlations were made using TCGA and the GEO, and qPCR was performed to validate the findings. Other public databases were searched for other genes associated with LINC00665 expression, including The Atlas of Noncoding RNAs in Cancer (TANRIC), the Multi Experiment Matrix (MEM), Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and protein-protein interaction (PPI) networks. RESULTS Overexpression of LINC00665 in patients with HCC was significantly associated with gender, tumor grade, stage, and tumor cell type. Overexpression of LINC00665 in patients with HCC was significantly associated with overall survival (OS) (HR=1.47795%; CI: 1.046-2.086). Bioinformatics analysis identified 469 related genes and further analysis supported a hypothesis that LINC00665 regulates pathways in the cell cycle to facilitate the development and progression of HCC through ten identified core genes: CDK1, BUB1B, BUB1, PLK1, CCNB2, CCNB1, CDC20, ESPL1, MAD2L1, and CCNA2. CONCLUSIONS Overexpression of the lncRNA, LINC00665 may be involved in the regulation of cell cycle pathways in HCC through ten identified hub genes.
2013-01-01
Background Colorectal cancer is the third leading cause of cancer deaths in the United States. The initial assessment of colorectal cancer involves clinical staging that takes into account the extent of primary tumor invasion, determining the number of lymph nodes with metastatic cancer and the identification of metastatic sites in other organs. Advanced clinical stage indicates metastatic cancer, either in regional lymph nodes or in distant organs. While the genomic and genetic basis of colorectal cancer has been elucidated to some degree, less is known about the identity of specific cancer genes that are associated with advanced clinical stage and metastasis. Methods We compiled multiple genomic data types (mutations, copy number alterations, gene expression and methylation status) as well as clinical meta-data from The Cancer Genome Atlas (TCGA). We used an elastic-net regularized regression method on the combined genomic data to identify genetic aberrations and their associated cancer genes that are indicators of clinical stage. We ranked candidate genes by their regression coefficient and level of support from multiple assay modalities. Results A fit of the elastic-net regularized regression to 197 samples and integrated analysis of four genomic platforms identified the set of top gene predictors of advanced clinical stage, including: WRN, SYK, DDX5 and ADRA2C. These genetic features were identified robustly in bootstrap resampling analysis. Conclusions We conducted an analysis integrating multiple genomic features including mutations, copy number alterations, gene expression and methylation. This integrated approach in which one considers all of these genomic features performs better than any individual genomic assay. We identified multiple genes that robustly delineate advanced clinical stage, suggesting their possible role in colorectal cancer metastatic progression. PMID:24308539
Huang, Gangyong; Wei, Yibing; Zhao, Guanglei; Xia, Jun; Wang, Siqun; Wu, Jianguo; Chen, Feiyan; Chen, Jie; Shi, Jingshen
2017-06-01
The underlying mechanisms of glucocorticoid (GC)‑induced avascular necrosis of the femoral head (ANFH) have yet to be fully understood, in particular the mechanisms associated with the change of gene expression pattern. The present study aimed to identify key genes with a differential expression pattern in GC‑induced ANFH. E‑MEXP‑2751 microarray data were downloaded from the ArrayExpress database. Differentially expressed genes (DEGs) were identified in 5 femoral head samples of steroid‑induced ANFH rats compared with 5 placebo‑treated rat samples. Gene Ontology (GO) and pathway enrichment analyses were performed upon these DEGs. A total 93 DEGs (46 upregulated and 47 downregulated genes) were identified in GC‑induced ANFH samples. These DEGs were enriched in different GO terms and pathways, including chondrocyte differentiation and detection of chemical stimuli. The enrichment map revealed that skeletal system development was interconnected with several other GO terms by gene overlap. The literature mined network analysis revealed that 5 upregulated genes were associated with femoral necrosis, including parathyroid hormone receptor 1 (PTHR1), vitamin D (1,25‑Dihydroxyvitamin D3) receptor (VDR), collagen, type II, α1, proprotein convertase subtilisin/kexin type 6 and zinc finger protein 354C (ZFP354C). In addition, ZFP354C and VDR were identified to transcription factors. Furthermore, PTHR1 was revealed to interact with VDR, and α‑2‑macroglobulin (A2M) interacted with fibronectin 1 (FN1) in the PPI network. PTHR1 may be involved in GC‑induced ANFH via interacting with VDR. A2M may also be involved in the development of GC‑induced ANFH through interacting with FN1. An improved understanding of the molecular mechanisms underlying GC‑induced ANFH may provide novel targets for diagnostics and therapeutic treatment.
Huang, Gangyong; Wei, Yibing; Zhao, Guanglei; Xia, Jun; Wang, Siqun; Wu, Jianguo; Chen, Feiyan; Chen, Jie; Shi, Jingshen
2017-01-01
The underlying mechanisms of glucocorticoid (GC)-induced avascular necrosis of the femoral head (ANFH) have yet to be fully understood, in particular the mechanisms associated with the change of gene expression pattern. The present study aimed to identify key genes with a differential expression pattern in GC-induced ANFH. E-MEXP-2751 microarray data were downloaded from the ArrayExpress database. Differentially expressed genes (DEGs) were identified in 5 femoral head samples of steroid-induced ANFH rats compared with 5 placebo-treated rat samples. Gene Ontology (GO) and pathway enrichment analyses were performed upon these DEGs. A total 93 DEGs (46 upregulated and 47 downregulated genes) were identified in GC-induced ANFH samples. These DEGs were enriched in different GO terms and pathways, including chondrocyte differentiation and detection of chemical stimuli. The enrichment map revealed that skeletal system development was interconnected with several other GO terms by gene overlap. The literature mined network analysis revealed that 5 upregulated genes were associated with femoral necrosis, including parathyroid hormone receptor 1 (PTHR1), vitamin D (1,25-Dihydroxyvitamin D3) receptor (VDR), collagen, type II, α1, proprotein convertase subtilisin/kexin type 6 and zinc finger protein 354C (ZFP354C). In addition, ZFP354C and VDR were identified to transcription factors. Furthermore, PTHR1 was revealed to interact with VDR, and α-2-macroglobulin (A2M) interacted with fibronectin 1 (FN1) in the PPI network. PTHR1 may be involved in GC-induced ANFH via interacting with VDR. A2M may also be involved in the development of GC-induced ANFH through interacting with FN1. An improved understanding of the molecular mechanisms underlying GC-induced ANFH may provide novel targets for diagnostics and therapeutic treatment. PMID:28393228
Kreiner, Frederik Flindt; Borup, Rehannah; Nielsen, Finn Cilius; Schjerling, Peter; Galbo, Henrik
2017-08-07
The pathophysiology, including the impact of gene expression, of polymyalgia rheumatica (PMR) remains elusive. We profiled the gene expression in muscle tissue in PMR patients before and after glucocorticoid treatment. Gene expression was measured using Affymetrix Human Genome U133 Plus 2.0 arrays in muscle biopsies from 8 glucocorticoid-naive patients with PMR and 10 controls before and after prednisolone-treatment for 14 days. For 14 genes, quantitative real-time PCR (qRT-PCR, n = 9 in both groups) was used to validate the microarray findings and to further investigate the expression of genes of particular interest. Prednisolone normalized erythrocyte sedimentation rate (ESR) and C-reactive protein (CRP) in PMR patients. A total of 165 putatively clinically relevant, differentially expressed genes were identified (cut-off: fold difference > ±1.2, difference of mean > 30, and p < 0.05); of these, 78 genes differed between patients and controls before treatment, 131 genes responded to treatment in a given direction only in patients, and 44 fulfilled both these criteria. In 43 of the 44 genes, treatment counteracted the initial difference. Functional clustering identified themes of biological function, including regulation of protein biosynthesis, and regulation of transcription and of extracellular matrix processes. Overall, qRT-PCR confirmed the microarray findings: Microarray-detected group differences were confirmed for 9 genes in 17 of 18 comparisons (same magnitude and direction of change); lack of group differences in microarray testing was confirmed for 5 genes in 8 of 10 comparisons. Before treatment, using qRT-PCR, expression of interleukin 6 (IL-6) was found to be 4-fold higher in patients (p < 0.05). This study identifies genes in muscle, the expression of which may impact the pathophysiology of PMR. Moreover, the study adds further evidence of the importance of IL-6 in the disease. Follow-up studies are needed to establish the exact pathophysiological relevance of the identified genes. The study was retrospectively listed on the ISRCTN registry with study ID ISRCTN69503018 and date of registration the 26th of July 2017.
Enamelin/ameloblastin gene polymorphisms in autosomal amelogenesis imperfecta among Syrian families.
Dashash, Mayssoon; Bazrafshani, Mohamed Riza; Poulton, Kay; Jaber, Saaed; Naeem, Emad; Blinkhorn, Anthony Stevenson
2011-02-01
This study was undertaken to investigate whether a single G deletion within a series of seven G residues (codon 196) at the exon 9-intron 9 boundary of the enamelin gene ENAM and a tri-nucleotide deletion at codon 180 in exon 7 (GGA vs deletion) of ameloblastin gene AMBN could have a role in autosomal amelogenesis imperfecta among affected Syrian families. A new technique - size-dependent, deletion screening - was developed to detect nucleotide deletion in ENAM and AMBN genes. Twelve Syrian families with autosomal-dominant or -recessive amelogenesis imperfecta were included. A homozygous/heterozygous mutation in the ENAM gene (152/152, 152/153) was identified in affected members of three families with autosomal-dominant amelogenesis imperfecta and one family with autosomal-recessive amelogenesis imperfecta. A heterozygous mutation (222/225) in the AMBN gene was identified. However, no disease causing mutations was found. The present findings provide useful information for the implication of ENAM gene polymorphism in autosomal-dominant/-recessive amelogenesis imperfecta. Further investigations are required to identify other genes responsible for the various clinical phenotypes. © 2010 Blackwell Publishing Asia Pty Ltd.
Čejková, Darina; Strouhal, Michal; Norris, Steven J; Weinstock, George M; Šmajs, David
2015-01-01
Pathogenic uncultivable treponemes comprise human and animal pathogens including agents of syphilis, yaws, bejel, pinta, and venereal spirochetosis in rabbits and hares. A set of 10 treponemal genome sequences including those of 4 Treponema pallidum ssp. pallidum (TPA) strains (Nichols, DAL-1, Mexico A, SS14), 4 T. p. ssp. pertenue (TPE) strains (CDC-2, Gauthier, Samoa D, Fribourg-Blanc), 1 T. p. ssp. endemicum (TEN) strain (Bosnia A) and one strain (Cuniculi A) of Treponema paraluisleporidarum ecovar Cuniculus (TPLC) were examined with respect to the presence of nucleotide intrastrain heterogeneous sites. The number of identified intrastrain heterogeneous sites in individual genomes ranged between 0 and 7. Altogether, 23 intrastrain heterogeneous sites (in 17 genes) were found in 5 out of 10 investigated treponemal genomes including TPA strains Nichols (n = 5), DAL-1 (n = 4), and SS14 (n = 7), TPE strain Samoa D (n = 1), and TEN strain Bosnia A (n = 5). Although only one heterogeneous site was identified among 4 tested TPE strains, 16 such sites were identified among 4 TPA strains. Heterogeneous sites were mostly strain-specific and were identified in four tpr genes (tprC, GI, I, K), in genes involved in bacterial motility and chemotaxis (fliI, cheC-fliY), in genes involved in cell structure (murC), translation (prfA), general and DNA metabolism (putative SAM dependent methyltransferase, topA), and in seven hypothetical genes. Heterogeneous sites likely represent both the selection of adaptive changes during infection of the host as well as an ongoing diversifying evolutionary process.
Global and disease-associated genetic variation in the human Fanconi anemia gene family.
Rogers, Kai J; Fu, Wenqing; Akey, Joshua M; Monnat, Raymond J
2014-12-20
Fanconi anemia (FA) is a human recessive genetic disease resulting from inactivating mutations in any of 16 FANC (Fanconi) genes. Individuals with FA are at high risk of developmental abnormalities, early bone marrow failure and leukemia. These are followed in the second and subsequent decades by a very high risk of carcinomas of the head and neck and anogenital region, and a small continuing risk of leukemia. In order to characterize base pair-level disease-associated (DA) and population genetic variation in FANC genes and the segregation of this variation in the human population, we identified 2948 unique FANC gene variants including 493 FA DA variants across 57,240 potential base pair variation sites in the 16 FANC genes. We then analyzed the segregation of this variation in the 7578 subjects included in the Exome Sequencing Project (ESP) and the 1000 Genomes Project (1KGP). There was a remarkably high frequency of FA DA variants in ESP/1KGP subjects: at least 1 FA DA variant was identified in 78.5% (5950 of 7578) individuals included in these two studies. Six widely used functional prediction algorithms correctly identified only a third of the known, DA FANC missense variants. We also identified FA DA variants that may be good candidates for different types of mutation-specific therapies. Our results demonstrate the power of direct DNA sequencing to detect, estimate the frequency of and follow the segregation of deleterious genetic variation in human populations. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Shitara, M; Tsuboi, Y; Sekizuka, T; Tazumi, A; Moorei, J E; Millar, B C; Taneike, I; Matsuda, M
2008-01-01
Nucleotide sequences of approximately 3.1 kbp consisting of the full-length open reading frame (ORF) for grpE, a non-coding (NC) region and a putative ORF for the full-length dnaK gene (1860 bp) were identified from a urease-positive thermophilic Campylobacter (UPTC) CF89-12 isolate. Then, following the construction of a new degenerate polymerase chain reaction (PCR) primer pair for amplification of the dnaK structural gene, including the transcription terminator region of C. lari isolates, the dnaK region was amplified successfully, TA-cloned and sequenced in nine C. lari isolates. The dnaK gene sequences commenced with an ATG and terminated with a TAA in all 10 isolates, including CF89-12. In addition, the putative ORFs for the dnaK gene locus from seven UPTC isolates consisted of 1860 bases, and the four urease-negative (UN) C. lari isolates included C. lari RM2100 reference strain 1866. Interestingly, different probable ribosome binding sites and hypothetically intrinsic p-independent terminator structures were identified between the seven UPTC and four UN C. lari isolates, respectively. Moreover, it is interesting to note that 20 out of a total of 28 polymorphic sites occurred among amino acid sequences of the dnaK ORF from 11 C. lari isolates, identified to be alternatively UPTC-specific or UN C. lari-specific. In the neighbour-joining tree based on the nucleotide sequence information of the dnaK gene, C. lari forms two major distinct clusters consisting of UPTC and UN C. lari isolates, respectively, with UN C. lari being more closely related to other thermophilic campylobacters than to UPTC.
Li, Chunquan; Han, Junwei; Yao, Qianlan; Zou, Chendan; Xu, Yanjun; Zhang, Chunlong; Shang, Desi; Zhou, Lingyun; Zou, Chaoxia; Sun, Zeguo; Li, Jing; Zhang, Yunpeng; Yang, Haixiu; Gao, Xu; Li, Xia
2013-05-01
Various 'omics' technologies, including microarrays and gas chromatography mass spectrometry, can be used to identify hundreds of interesting genes, proteins and metabolites, such as differential genes, proteins and metabolites associated with diseases. Identifying metabolic pathways has become an invaluable aid to understanding the genes and metabolites associated with studying conditions. However, the classical methods used to identify pathways fail to accurately consider joint power of interesting gene/metabolite and the key regions impacted by them within metabolic pathways. In this study, we propose a powerful analytical method referred to as Subpathway-GM for the identification of metabolic subpathways. This provides a more accurate level of pathway analysis by integrating information from genes and metabolites, and their positions and cascade regions within the given pathway. We analyzed two colorectal cancer and one metastatic prostate cancer data sets and demonstrated that Subpathway-GM was able to identify disease-relevant subpathways whose corresponding entire pathways might be ignored using classical entire pathway identification methods. Further analysis indicated that the power of a joint genes/metabolites and subpathway strategy based on their topologies may play a key role in reliably recalling disease-relevant subpathways and finding novel subpathways.
The genetics of alcoholism: identifying specific genes through family studies.
Edenberg, Howard J; Foroud, Tatiana
2006-09-01
Alcoholism is a complex disorder with both genetic and environmental risk factors. Studies in humans have begun to elucidate the genetic underpinnings of the risk for alcoholism. Here we briefly review strategies for identifying individual genes in which variations affect the risk for alcoholism and related phenotypes, in the context of one large study that has successfully identified such genes. The Collaborative Study on the Genetics of Alcoholism (COGA) is a family-based study that has collected detailed phenotypic data on individuals in families with multiple alcoholic members. A genome-wide linkage approach led to the identification of chromosomal regions containing genes that influenced alcoholism risk and related phenotypes. Subsequently, single nucleotide polymorphisms (SNPs) were genotyped in positional candidate genes located within the linked chromosomal regions, and analyzed for association with these phenotypes. Using this sequential approach, COGA has detected association with GABRA2, CHRM2 and ADH4; these associations have all been replicated by other researchers. COGA has detected association to additional genes including GABRG3, TAS2R16, SNCA, OPRK1 and PDYN, results that are awaiting confirmation. These successes demonstrate that genes contributing to the risk for alcoholism can be reliably identified using human subjects.
Koltovaya, N A; Guerasimova, A S; Tchekhouta, I A; Devin, A B
2003-08-01
An increase in the mitochondrial rho(-) mutagenesis is a well-known response of yeast cells to mutations in numerous nuclear genes as well as to various kinds of stress. Despite extensive studies for several decades, the biological significance of this response is still not fully understood. The genetic approach to solving this enigma includes a study of genes that are required for the high incidence of spontaneous rho(-) mutants. We have obtained mutations of a few nuclear genes of that sort and found that mutations in certain genes, including CDC28, the central cell-cycle regulation gene, result in a decrease in spontaneous rho(-) mutability and simultaneously affect the maintenance of the yeast chromosomes and plasmids. Two more genes resembling CDC28 in this respect are identified in the present work as a result of the characterization of four new mutants. These two genes are NET1 and HFI1 which mediate important regulatory protein-protein interactions in the yeast cell. The effects of four mutations, including net1-srm and hfi1-srm, on the maintenance of the yeast mitochondrial genome, chromosomes and plasmids, as well as on the cell's sensitivity to ionizing radiation, are also described. The data presented suggest that the pleiotropic srm mutations determining coordinate changes in the fidelity of mitotic transmission of chromosomes, plasmids and mtDNA molecules identify genes that most probably operate high up in the hierarchy of the general genetic regulation of yeast. Copyright 2003 John Wiley & Sons, Ltd.
Rathinam, Elanagai; Rajasekharan, Sivaprakash; Chitturi, Ravi Teja; Declercq, Heidi; Martens, Luc; De Coster, Peter
2016-12-01
The aim of this study was to present a systematic review investigating the gene expression of various cells (other than dental pulp cells) in response to different variants of tricalcium silicate cements (TSCs). A systematic search of the literature was performed by 2 independent reviewers followed by article selection and data extraction. Studies analyzing any cell type except dental pulp stem cells and any variant of tricalcium silicate cement either as the experimental or as the control group were included. A total of 41 relevant articles were included in this review. Among the included studies, ProRoot MTA (Dentsply, Tulsa, OK) was the most commonly studied (69.1%) TSC variant, and 11 cell types were identified, with 13 articles investigating gene expression in osteoblasts. A total of 39 different genes/molecules expressed were found in the selected studies. The experimental group (irrespective of the TSC variant) was identified to express significantly increased gene expression compared with the control group (untreated) in all included studies. Recent studies have provided useful insight into the gene expression and molecular signaling of various cells in response to TSCs, and new elements have been supplied on the pathways activated in this process. TSCs are capable of eliciting a favorable cellular response in periapical regeneration. Copyright © 2016 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
2011-01-01
Background Nocturnal insects such as moths are ideal models to study the molecular bases of olfaction that they use, among examples, for the detection of mating partners and host plants. Knowing how an odour generates a neuronal signal in insect antennae is crucial for understanding the physiological bases of olfaction, and also could lead to the identification of original targets for the development of olfactory-based control strategies against herbivorous moth pests. Here, we describe an Expressed Sequence Tag (EST) project to characterize the antennal transcriptome of the noctuid pest model, Spodoptera littoralis, and to identify candidate genes involved in odour/pheromone detection. Results By targeting cDNAs from male antennae, we biased gene discovery towards genes potentially involved in male olfaction, including pheromone reception. A total of 20760 ESTs were obtained from a normalized library and were assembled in 9033 unigenes. 6530 were annotated based on BLAST analyses and gene prediction software identified 6738 ORFs. The unigenes were compared to the Bombyx mori proteome and to ESTs derived from Lepidoptera transcriptome projects. We identified a large number of candidate genes involved in odour and pheromone detection and turnover, including 31 candidate chemosensory receptor genes, but also genes potentially involved in olfactory modulation. Conclusions Our project has generated a large collection of antennal transcripts from a Lepidoptera. The normalization process, allowing enrichment in low abundant genes, proved to be particularly relevant to identify chemosensory receptors in a species for which no genomic data are available. Our results also suggest that olfactory modulation can take place at the level of the antennae itself. These EST resources will be invaluable for exploring the mechanisms of olfaction and pheromone detection in S. littoralis, and for ultimately identifying original targets to fight against moth herbivorous pests. PMID:21276261
ICan: an integrated co-alteration network to identify ovarian cancer-related genes.
Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan
2015-01-01
Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data.
ICan: An Integrated Co-Alteration Network to Identify Ovarian Cancer-Related Genes
Zhou, Yuanshuai; Liu, Yongjing; Li, Kening; Zhang, Rui; Qiu, Fujun; Zhao, Ning; Xu, Yan
2015-01-01
Background Over the last decade, an increasing number of integrative studies on cancer-related genes have been published. Integrative analyses aim to overcome the limitation of a single data type, and provide a more complete view of carcinogenesis. The vast majority of these studies used sample-matched data of gene expression and copy number to investigate the impact of copy number alteration on gene expression, and to predict and prioritize candidate oncogenes and tumor suppressor genes. However, correlations between genes were neglected in these studies. Our work aimed to evaluate the co-alteration of copy number, methylation and expression, allowing us to identify cancer-related genes and essential functional modules in cancer. Results We built the Integrated Co-alteration network (ICan) based on multi-omics data, and analyzed the network to uncover cancer-related genes. After comparison with random networks, we identified 155 ovarian cancer-related genes, including well-known (TP53, BRCA1, RB1 and PTEN) and also novel cancer-related genes, such as PDPN and EphA2. We compared the results with a conventional method: CNAmet, and obtained a significantly better area under the curve value (ICan: 0.8179, CNAmet: 0.5183). Conclusion In this paper, we describe a framework to find cancer-related genes based on an Integrated Co-alteration network. Our results proved that ICan could precisely identify candidate cancer genes and provide increased mechanistic understanding of carcinogenesis. This work suggested a new research direction for biological network analyses involving multi-omics data. PMID:25803614
GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains
Lu, Zhiyong
2015-01-01
The automatic recognition of gene names and their associated database identifiers from biomedical text has been widely studied in recent years, as these tasks play an important role in many downstream text-mining applications. Despite significant previous research, only a small number of tools are publicly available and these tools are typically restricted to detecting only mention level gene names or only document level gene identifiers. In this work, we report GNormPlus: an end-to-end and open source system that handles both gene mention and identifier detection. We created a new corpus of 694 PubMed articles to support our development of GNormPlus, containing manual annotations for not only gene names and their identifiers, but also closely related concepts useful for gene name disambiguation, such as gene families and protein domains. GNormPlus integrates several advanced text-mining techniques, including SimConcept for resolving composite gene names. As a result, GNormPlus compares favorably to other state-of-the-art methods when evaluated on two widely used public benchmarking datasets, achieving 86.7% F1-score on the BioCreative II Gene Normalization task dataset and 50.1% F1-score on the BioCreative III Gene Normalization task dataset. The GNormPlus source code and its annotated corpus are freely available, and the results of applying GNormPlus to the entire PubMed are freely accessible through our web-based tool PubTator. PMID:26380306
Prophage Integrase Typing Is a Useful Indicator of Genomic Diversity in Salmonella enterica
Colavecchio, Anna; D’Souza, Yasmin; Tompkins, Elizabeth; Jeukens, Julie; Freschi, Luca; Emond-Rheault, Jean-Guillaume; Kukavica-Ibrulj, Irena; Boyle, Brian; Bekal, Sadjia; Tamber, Sandeep; Levesque, Roger C.; Goodridge, Lawrence D.
2017-01-01
Salmonella enterica is a bacterial species that is a major cause of illness in humans and food-producing animals. S. enterica exhibits considerable inter-serovar diversity, as evidenced by the large number of host adapted serovars that have been identified. The development of methods to assess genome diversity in S. enterica will help to further define the limits of diversity in this foodborne pathogen. Thus, we evaluated a PCR assay, which targets prophage integrase genes, as a rapid method to investigate S. enterica genome diversity. To evaluate the PCR prophage integrase assay, 49 isolates of S. enterica were selected, including 19 clinical isolates from clonal serovars (Enteritidis and Heidelberg) that commonly cause human illness, and 30 isolates from food-associated Salmonella serovars that rarely cause human illness. The number of integrase genes identified by the PCR assay was compared to the number of integrase genes within intact prophages identified by whole genome sequencing and phage finding program PHASTER. The PCR assay identified a total of 147 prophage integrase genes within the 49 S. enterica genomes (79 integrase genes in the food-associated Salmonella isolates, 50 integrase genes in S. Enteritidis, and 18 integrase genes in S. Heidelberg). In comparison, whole genome sequencing and PHASTER identified a total of 75 prophage integrase genes within 102 intact prophages in the 49 S. enterica genomes (44 integrase genes in the food-associated Salmonella isolates, 21 integrase genes in S. Enteritidis, and 9 integrase genes in S. Heidelberg). Collectively, both the PCR assay and PHASTER identified the presence of a large diversity of prophage integrase genes in the food-associated isolates compared to the clinical isolates, thus indicating a high degree of diversity in the food-associated isolates, and confirming the clonal nature of S. Enteritidis and S. Heidelberg. Moreover, PHASTER revealed a diversity of 29 different types of prophages and 23 different integrase genes within the food-associated isolates, but only identified four different phages and integrase genes within clonal isolates of S. Enteritidis and S. Heidelberg. These results demonstrate the potential usefulness of PCR based detection of prophage integrase genes as a rapid indicator of genome diversity in S. enterica. PMID:28740489
Prophage Integrase Typing Is a Useful Indicator of Genomic Diversity in Salmonella enterica.
Colavecchio, Anna; D'Souza, Yasmin; Tompkins, Elizabeth; Jeukens, Julie; Freschi, Luca; Emond-Rheault, Jean-Guillaume; Kukavica-Ibrulj, Irena; Boyle, Brian; Bekal, Sadjia; Tamber, Sandeep; Levesque, Roger C; Goodridge, Lawrence D
2017-01-01
Salmonella enterica is a bacterial species that is a major cause of illness in humans and food-producing animals. S. enterica exhibits considerable inter-serovar diversity, as evidenced by the large number of host adapted serovars that have been identified. The development of methods to assess genome diversity in S. enterica will help to further define the limits of diversity in this foodborne pathogen. Thus, we evaluated a PCR assay, which targets prophage integrase genes, as a rapid method to investigate S. enterica genome diversity. To evaluate the PCR prophage integrase assay, 49 isolates of S. enterica were selected, including 19 clinical isolates from clonal serovars (Enteritidis and Heidelberg) that commonly cause human illness, and 30 isolates from food-associated Salmonella serovars that rarely cause human illness. The number of integrase genes identified by the PCR assay was compared to the number of integrase genes within intact prophages identified by whole genome sequencing and phage finding program PHASTER. The PCR assay identified a total of 147 prophage integrase genes within the 49 S. enterica genomes (79 integrase genes in the food-associated Salmonella isolates, 50 integrase genes in S . Enteritidis, and 18 integrase genes in S . Heidelberg). In comparison, whole genome sequencing and PHASTER identified a total of 75 prophage integrase genes within 102 intact prophages in the 49 S. enterica genomes (44 integrase genes in the food-associated Salmonella isolates, 21 integrase genes in S . Enteritidis, and 9 integrase genes in S . Heidelberg). Collectively, both the PCR assay and PHASTER identified the presence of a large diversity of prophage integrase genes in the food-associated isolates compared to the clinical isolates, thus indicating a high degree of diversity in the food-associated isolates, and confirming the clonal nature of S . Enteritidis and S . Heidelberg. Moreover, PHASTER revealed a diversity of 29 different types of prophages and 23 different integrase genes within the food-associated isolates, but only identified four different phages and integrase genes within clonal isolates of S. Enteritidis and S. Heidelberg. These results demonstrate the potential usefulness of PCR based detection of prophage integrase genes as a rapid indicator of genome diversity in S. enterica .
Parodi, Stefano; Manneschi, Chiara; Verda, Damiano; Ferrari, Enrico; Muselli, Marco
2018-03-01
This study evaluates the performance of a set of machine learning techniques in predicting the prognosis of Hodgkin's lymphoma using clinical factors and gene expression data. Analysed samples from 130 Hodgkin's lymphoma patients included a small set of clinical variables and more than 54,000 gene features. Machine learning classifiers included three black-box algorithms ( k-nearest neighbour, Artificial Neural Network, and Support Vector Machine) and two methods based on intelligible rules (Decision Tree and the innovative Logic Learning Machine method). Support Vector Machine clearly outperformed any of the other methods. Among the two rule-based algorithms, Logic Learning Machine performed better and identified a set of simple intelligible rules based on a combination of clinical variables and gene expressions. Decision Tree identified a non-coding gene ( XIST) involved in the early phases of X chromosome inactivation that was overexpressed in females and in non-relapsed patients. XIST expression might be responsible for the better prognosis of female Hodgkin's lymphoma patients.
Marra, Nicholas J; Richards, Vincent P; Early, Angela; Bogdanowicz, Steve M; Pavinski Bitar, Paulina D; Stanhope, Michael J; Shivji, Mahmood S
2017-01-30
Comparative genomic and/or transcriptomic analyses involving elasmobranchs remain limited, with genome level comparisons of the elasmobranch immune system to that of higher vertebrates, non-existent. This paper reports a comparative RNA-seq analysis of heart tissue from seven species, including four elasmobranchs and three teleosts, focusing on immunity, but concomitantly seeking to identify genetic similarities shared by the two lamnid sharks and the single billfish in our study, which could be linked to convergent evolution of regional endothermy. Across seven species, we identified an average of 10,877 Swiss-Prot annotated genes from an average of 32,474 open reading frames within each species' heart transcriptome. About half of these genes were shared between all species while the remainder included functional differences between our groups of interest (elasmobranch vs. teleost and endotherms vs. ectotherms) as revealed by Gene Ontology (GO) and selection analyses. A repeatedly represented functional category, in both the uniquely expressed elasmobranch genes (total of 259) and the elasmobranch GO enrichment results, involved antibody-mediated immunity, either in the recruitment of immune cells (Fc receptors) or in antigen presentation, including such terms as "antigen processing and presentation of exogenous peptide antigen via MHC class II", and such genes as MHC class II, HLA-DPB1. Molecular adaptation analyses identified three genes in elasmobranchs with a history of positive selection, including legumain (LGMN), a gene with roles in both innate and adaptive immunity including producing antigens for presentation by MHC class II. Comparisons between the endothermic and ectothermic species revealed an enrichment of GO terms associated with cardiac muscle contraction in endotherms, with 19 genes expressed solely in endotherms, several of which have significant roles in lipid and fat metabolism. This collective comparative evidence provides the first multi-taxa transcriptomic-based perspective on differences between elasmobranchs and teleosts, and suggests various unique features associated with the adaptive immune system of elasmobranchs, pointing in particular to the potential importance of MHC Class II. This in turn suggests that expanded comparative work involving additional tissues, as well as genome sequencing of multiple elasmobranch species would be productive in elucidating the regulatory and genome architectural hallmarks of elasmobranchs.
Wardell, Christopher P; Fujita, Masashi; Yamada, Toru; Simbolo, Michele; Fassan, Matteo; Karlic, Rosa; Polak, Paz; Kim, Jaegil; Hatanaka, Yutaka; Maejima, Kazuhiro; Lawlor, Rita T; Nakanishi, Yoshitsugu; Mitsuhashi, Tomoko; Fujimoto, Akihiro; Furuta, Mayuko; Ruzzenente, Andrea; Conci, Simone; Oosawa, Ayako; Sasaki-Oku, Aya; Nakano, Kaoru; Tanaka, Hiroko; Yamamoto, Yujiro; Michiaki, Kubo; Kawakami, Yoshiiku; Aikata, Hiroshi; Ueno, Masaki; Hayami, Shinya; Gotoh, Kunihito; Ariizumi, Shun-Ichi; Yamamoto, Masakazu; Yamaue, Hiroki; Chayama, Kazuaki; Miyano, Satoru; Getz, Gad; Scarpa, Aldo; Hirano, Satoshi; Nakamura, Toru; Nakagawa, Hidewaki
2018-05-01
Biliary tract cancers (BTCs) are clinically and pathologically heterogeneous and respond poorly to treatment. Genomic profiling can offer a clearer understanding of their carcinogenesis, classification and treatment strategy. We performed large-scale genome sequencing analyses on BTCs to investigate their somatic and germline driver events and characterize their genomic landscape. We analyzed 412 BTC samples from Japanese and Italian populations, 107 by whole-exome sequencing (WES), 39 by whole-genome sequencing (WGS), and a further 266 samples by targeted sequencing. The subtypes were 136 intrahepatic cholangiocarcinomas (ICCs), 101 distal cholangiocarcinomas (DCCs), 109 peri-hilar type cholangiocarcinomas (PHCs), and 66 gallbladder or cystic duct cancers (GBCs/CDCs). We identified somatic alterations and searched for driver genes in BTCs, finding pathogenic germline variants of cancer-predisposing genes. We predicted cell-of-origin for BTCs by combining somatic mutation patterns and epigenetic features. We identified 32 significantly and commonly mutated genes including TP53, KRAS, SMAD4, NF1, ARID1A, PBRM1, and ATR, some of which negatively affected patient prognosis. A novel deletion of MUC17 at 7q22.1 affected patient prognosis. Cell-of-origin predictions using WGS and epigenetic features suggest hepatocyte-origin of hepatitis-related ICCs. Deleterious germline mutations of cancer-predisposing genes such as BRCA1, BRCA2, RAD51D, MLH1, or MSH2 were detected in 11% (16/146) of BTC patients. BTCs have distinct genetic features including somatic events and germline predisposition. These findings could be useful to establish treatment and diagnostic strategies for BTCs based on genetic information. We here analyzed genomic features of 412 BTC samples from Japanese and Italian populations. A total of 32 significantly and commonly mutated genes were identified, some of which negatively affected patient prognosis, including a novel deletion of MUC17 at 7q22.1. Cell-of-origin predictions using WGS and epigenetic features suggest hepatocyte-origin of hepatitis-related ICCs. Deleterious germline mutations of cancer-predisposing genes were detected in 11% of patients with BTC. BTCs have distinct genetic features including somatic events and germline predisposition. Copyright © 2018 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
Insights into the innate immunome of actiniarians using a comparative genomic approach.
van der Burg, Chloé A; Prentis, Peter J; Surm, Joachim M; Pavasovic, Ana
2016-11-02
Innate immune genes tend to be highly conserved in metazoans, even in early divergent lineages such as Cnidaria (jellyfish, corals, hydroids and sea anemones) and Porifera (sponges). However, constant and diverse selection pressures on the immune system have driven the expansion and diversification of different immune gene families in a lineage-specific manner. To investigate how the innate immune system has evolved in a subset of sea anemone species (Order: Actiniaria), we performed a comprehensive and comparative study using 10 newly sequenced transcriptomes, as well as three publically available transcriptomes, to identify the origins, expansions and contractions of candidate and novel immune gene families. We characterised five conserved genes and gene families, as well as multiple novel innate immune genes, including the newly recognised putative pattern recognition receptor CniFL. Single copies of TLR, MyD88 and NF-κB were found in most species, and several copies of IL-1R-like, NLR and CniFL were found in almost all species. Multiple novel immune genes were identified with domain architectures including the Toll/interleukin-1 receptor (TIR) homology domain, which is well documented as functioning in protein-protein interactions and signal transduction in immune pathways. We hypothesise that these genes may interact as novel proteins in immune pathways of cnidarian species. Novelty in the actiniarian immunome is not restricted to only TIR-domain-containing proteins, as we identify a subset of NLRs which have undergone neofunctionalisation and contain 3-5 N-terminal transmembrane domains, which have so far only been identified in two anthozoan species. This research has significance in understanding the evolution and origin of the core eumetazoan gene set, including how novel innate immune genes evolve. For example, the evolution of transmembrane domain containing NLRs indicates that these NLRs may be membrane-bound, while all other metazoan and plant NLRs are exclusively cytosolic receptors. This is one example of how species without an adaptive immune system may evolve innovative solutions to detect pathogens or interact with native microbiota. Overall, these results provide an insight into the evolution of the innate immune system, and show that early divergent lineages, such as actiniarians, have a diverse repertoire of conserved and novel innate immune genes.
Activation of Ftz-F1-Responsive Genes through Ftz/Ftz-F1 Dependent Enhancers
Field, Amanda; Xiang, Jie; Anderson, W. Ray; Graham, Patricia; Pick, Leslie
2016-01-01
The orphan nuclear receptor Ftz-F1 is expressed in all somatic nuclei in Drosophila embryos, but mutations result in a pair-rule phenotype. This was explained by the interaction of Ftz-F1 with the homeodomain protein Ftz that is expressed in stripes in the primordia of segments missing in either ftz-f1 or ftz mutants. Ftz-F1 and Ftz were shown to physically interact and coordinately activate the expression of ftz itself and engrailed by synergistic binding to composite Ftz-F1/Ftz binding sites. However, attempts to identify additional target genes on the basis of Ftz-F1/ Ftz binding alone has met with only limited success. To discern rules for Ftz-F1 target site selection in vivo and to identify additional target genes, a microarray analysis was performed comparing wildtype and ftz-f1 mutant embryos. Ftz-F1-responsive genes most highly regulated included engrailed and nine additional genes expressed in patterns dependent on both ftz and ftz-f1. Candidate enhancers for these genes were identified by combining BDTNP Ftz ChIP-chip data with a computational search for Ftz-F1 binding sites. Of eight enhancer reporter genes tested in transgenic embryos, six generated expression patterns similar to the corresponding endogenous gene and expression was lost in ftz mutants. These studies identified a new set of Ftz-F1 targets, all of which are co-regulated by Ftz. Comparative analysis of enhancers containing Ftz/Ftz-F1 binding sites that were or were not bona fide targets in vivo suggested that GAF negatively regulates enhancers that contain Ftz/Ftz-F1 binding sites but are not actually utilized. These targets include other regulatory factors as well as genes involved directly in morphogenesis, providing insight into how pair-rule genes establish the body pattern. PMID:27723822
MicroRNA profiling in the dentate gyrus in epileptic rats: The role of miR-187-3p.
Zhang, Suya; Kou, Yubin; Hu, Chunmei; Han, Yan
2017-06-01
This study aimed to explore the role of aberrant miRNA expression in epilepsy and to identify more potential genes associated with epileptogenesis.The miRNA expression profile of GSE49850, which included 20 samples from the rat epileptic dentate gyrus at 7, 14, 30, and 90 days after electrical stimulation and 20 additional samples from sham time-matched controls, was downloaded from the Gene Expression Omnibus database. The significantly differentially expressed miRNAs were identified in stimulated samples at each time point compared to time-matched controls, respectively. The target genes of consistently differentially expressed miRNAs were screened from miRDB and microRNA.org databases, followed by Gene Ontology (GO) and pathway enrichment analysis and regulatory network construction. The overlapping target genes for consistently differentially expressed miRNAs were also identified from these 2 databases. Furthermore, the potential binding sites of miRNAs and their target genes were analyzed.Rno-miR-187-3p was consistently downregulated in stimulated groups compared with time-matched controls. The predicted target genes of rno-miR-187-3p were enriched in different GO terms and pathways. In addition, 7 overlapping target genes of rno-miR-187-3p were identified, including NFS1, PAQR4, CAND1, DCLK1, PRKAR2A, AKAP3, and KCNK10. These 7 overlapping target genes were determined to have a different number of matched binding sites with rno-miR-187-3p.Our study suggests that miR-187-3p may play an important role in epilepsy development and progression via regulating numerous target genes, such as NFS1, CAND1, DCLK1, AKAP3, and KCNK10. Determining the underlying mechanism of the role of miR-187-3p in epilepsy may make it a potential therapeutic option.
Johansen, Ilona; Andreassen, Rune
2014-12-23
MicroRNAs (miRNAs) are an abundant class of endogenous small RNA molecules that downregulate gene expression at the post-transcriptional level. They play important roles by regulating genes that control multiple biological processes, and recent years there has been an increased interest in studying miRNA genes and miRNA gene expression. The most common method applied to study gene expression of single genes is quantitative PCR (qPCR). However, before expression of mature miRNAs can be studied robust qPCR methods (miRNA-qPCR) must be developed. This includes identification and validation of suitable reference genes. We are particularly interested in Atlantic salmon (Salmo salar). This is an economically important aquaculture species, but no reference genes dedicated for use in miRNA-qPCR methods has been validated for this species. Our aim was, therefore, to identify suitable reference genes for miRNA-qPCR methods in Salmo salar. We used a systematic approach where we utilized similar studies in other species, some biological criteria, results from deep sequencing of small RNAs and, finally, experimental validation of candidate reference genes by qPCR to identify the most suitable reference genes. Ssa-miR-25-3p was identified as most suitable single reference gene. The best combinations of two reference genes were ssa-miR-25-3p and ssa-miR-455-5p. These two genes were constitutively and stably expressed across many different tissues. Furthermore, infectious salmon anaemia did not seem to affect their expression levels. These genes were amplified with high specificity, good efficiency and the qPCR assays showed a good linearity when applying a simple cybergreen miRNA-PCR method using miRNA gene specific forward primers. We have identified suitable reference genes for miRNA-qPCR in Atlantic salmon. These results will greatly facilitate further studies on miRNA genes in this species. The reference genes identified are conserved genes that are identical in their mature sequence in many aquaculture species. Therefore, they may also be suitable as reference genes in other teleosts. Finally, the systematic approach used in our study successfully identified suitable reference genes, suggesting that this may be a useful strategy to apply in similar validation studies in other aquaculture species.
Capturing novel mouse genes encoding chromosomal and other nuclear proteins.
Tate, P; Lee, M; Tweedie, S; Skarnes, W C; Bickmore, W A
1998-09-01
The burgeoning wealth of gene sequences contrasts with our ignorance of gene function. One route to assigning function is by determining the sub-cellular location of proteins. We describe the identification of mouse genes encoding proteins that are confined to nuclear compartments by splicing endogeneous gene sequences to a promoterless betageo reporter, using a gene trap approach. Mouse ES (embryonic stem) cell lines were identified that express betageo fusions located within sub-nuclear compartments, including chromosomes, the nucleolus and foci containing splicing factors. The sequences of 11 trapped genes were ascertained, and characterisation of endogenous protein distribution in two cases confirmed the validity of the approach. Three novel proteins concentrated within distinct chromosomal domains were identified, one of which appears to be a serine/threonine kinase. The sequence of a gene whose product co-localises with splicesome components suggests that this protein may be an E3 ubiquitin-protein ligase. The majority of the other genes isolated represent novel genes. This approach is shown to be a powerful tool for identifying genes encoding novel proteins with specific sub-nuclear localisations and exposes our ignorance of the protein composition of the nucleus. Motifs in two of the isolated genes suggest new links between cellular regulatory mechanisms (ubiquitination and phosphorylation) and mRNA splicing and chromosome structure/function.
Construct and Compare Gene Coexpression Networks with DAPfinder and DAPview.
Skinner, Jeff; Kotliarov, Yuri; Varma, Sudhir; Mine, Karina L; Yambartsev, Anatoly; Simon, Richard; Huyen, Yentram; Morgun, Andrey
2011-07-14
DAPfinder and DAPview are novel BRB-ArrayTools plug-ins to construct gene coexpression networks and identify significant differences in pairwise gene-gene coexpression between two phenotypes. Each significant difference in gene-gene association represents a Differentially Associated Pair (DAP). Our tools include several choices of filtering methods, gene-gene association metrics, statistical testing methods and multiple comparison adjustments. Network results are easily displayed in Cytoscape. Analyses of glioma experiments and microarray simulations demonstrate the utility of these tools. DAPfinder is a new friendly-user tool for reconstruction and comparison of biological networks.
Mutation spectrum of genes associated with steroid-resistant nephrotic syndrome in Chinese children.
Wang, Ying; Dang, Xiqiang; He, Qingnan; Zhen, Yan; He, Xiaoxie; Yi, Zhuwen; Zhu, Kuichun
2017-08-20
Approximately 20% of children with idiopathic nephrotic syndrome do not respond to steroid therapy. More than 30 genes have been identified as disease-causing genes for the steroid-resistant nephrotic syndrome (SRNS). Few reports were from the Chinese population. The coding regions of genes commonly associated with SRNS were analyzed to characterize the gene mutation spectrum in children with SRNS in central China. The first phase study involved 38 children with five genes (NPHS1, NPHS2, PLCE1, WT1, and TRPC6) by Sanger sequencing. The second phase study involved 33 children with 17 genes by next generation DNA sequencing (NGS. 22 new patients, and 11 patients from first phase study but without positive findings). Overall deleterious or putatively deleterious gene variants were identified in 19 patients (31.7%), including four NPHS1 variants among five patients and three PLCE1 variants among four other patients. Variants in COL4A3, COL4A4, or COL4A5 were found in six patients. Eight novel variants were identified, including two in NPHS1, two in PLCE1, one in NPHS2, LAMB2, COL4A3, and COL4A4, respectively. 55.6% of the children with variants failed to respond to immunosuppressive agent therapy, while the resistance rate in children without variants was 44.4%. Our results show that screening for deleterious variants in some common genes in children clinically suspected with SRNS might be helpful for disease diagnosis as well as prediction of treatment efficacy and prognosis. Copyright © 2017 Elsevier B.V. All rights reserved.
Katz, Laura A.
2015-01-01
While there is compelling evidence for the impact of endosymbiotic gene transfer (EGT; transfer from either mitochondrion or chloroplast to the nucleus) on genome evolution in eukaryotes, the role of interdomain transfer from bacteria and/or archaea (i.e. prokaryotes) is less clear. Lateral gene transfers (LGTs) have been argued to be potential sources of phylogenetic information, particularly for reconstructing deep nodes that are difficult to recover with traditional phylogenetic methods. We sought to identify interdomain LGTs by using a phylogenomic pipeline that generated 13 465 single gene trees and included up to 487 eukaryotes, 303 bacteria and 118 archaea. Our goals include searching for LGTs that unite major eukaryotic clades, and describing the relative contributions of LGT and EGT across the eukaryotic tree of life. Given the difficulties in interpreting single gene trees that aim to capture the approximately 1.8 billion years of eukaryotic evolution, we focus on presence–absence data to identify interdomain transfer events. Specifically, we identify 1138 genes found only in prokaryotes and representatives of three or fewer major clades of eukaryotes (e.g. Amoebozoa, Archaeplastida, Excavata, Opisthokonta, SAR and orphan lineages). The majority of these genes have phylogenetic patterns that are consistent with recent interdomain LGTs and, with the notable exception of EGTs involving photosynthetic eukaryotes, we detect few ancient interdomain LGTs. These analyses suggest that LGTs have probably occurred throughout the history of eukaryotes, but that ancient events are not maintained unless they are associated with endosymbiotic gene transfer among photosynthetic lineages. PMID:26323756
Machuca, Mayra Alejandra; Sosa, Luis Miguel; González, Clara Isabel
2013-01-01
Background Staphylococcus aureus is among the most common global nosocomial pathogens. The emergence and spread of methicillin-resistant Staphylococcus aureus (MRSA) is a public health problem worldwide that causes nosocomial and community infections. The goals of this study were to establish the clonal complexes (CC) of the isolates of MRSA obtained from pediatric patients in a university hospital in Colombia and to investigate its molecular characteristics based on the virulence genes and the genes of staphylococcal toxins and adhesins. Methods A total of 53 MRSA isolates from pediatric patients with local or systemic infections were collected. The MRSA isolates were typed based on the SCCmec, MLST, spa and agr genes. The molecular characterization included the detection of Panton-Valentine Leukocidin, superantigenic and exfoliative toxins, and adhesin genes. The correlation between the molecular types identified and the profile of virulence factors was determined for all isolates. Results Four CC were identified, including CC8, CC5, CC80 and CC78. The ST8-MRSA-IVc-agrI was the predominant clone among the isolates, followed by the ST5-MRSA-I-agrII and ST5-MRSA-IVc-agrII clones. Twelve spa types were identified, of which t10796 and t10799 were new repeat sequences. The isolates were carriers of toxin genes, and hlg (100%), sek (92%) and pvl (88%) were the most frequent. Ten toxin gene profiles were observed, and the most frequent were seq-sek-hlg (22.6%), sek-hlg (22.6%), seb-seq-sek-hlg (18.9%) and seb-sek-hlg (15.1%). The adhesion genes were present in most of the MRSA isolates, including the following: clf-A (89%), clf-B (87%), fnb-A (83%) and ica (83%). The majority of the strains carried SCCmec-IVc and were identified as causing nosocomial infection. No significant association between a molecular type and the virulence factors was found. Conclusion Four major MRSA clone complexes were identified among the isolates. ST8-MRSA-IVc-agrI pvl+ (USA300-LV) was the most frequent, confirming the presence of community-associated MRSA in Colombian hospitals. PMID:24058415
Rotival, Maxime; Zeller, Tanja; Wild, Philipp S; Maouche, Seraya; Szymczak, Silke; Schillert, Arne; Castagné, Raphaele; Deiseroth, Arne; Proust, Carole; Brocheton, Jessy; Godefroy, Tiphaine; Perret, Claire; Germain, Marine; Eleftheriadis, Medea; Sinning, Christoph R; Schnabel, Renate B; Lubos, Edith; Lackner, Karl J; Rossmann, Heidi; Münzel, Thomas; Rendon, Augusto; Erdmann, Jeanette; Deloukas, Panos; Hengstenberg, Christian; Diemert, Patrick; Montalescot, Gilles; Ouwehand, Willem H; Samani, Nilesh J; Schunkert, Heribert; Tregouet, David-Alexandre; Ziegler, Andreas; Goodall, Alison H; Cambien, François; Tiret, Laurence; Blankenberg, Stefan
2011-12-01
One major expectation from the transcriptome in humans is to characterize the biological basis of associations identified by genome-wide association studies. So far, few cis expression quantitative trait loci (eQTLs) have been reliably related to disease susceptibility. Trans-regulating mechanisms may play a more prominent role in disease susceptibility. We analyzed 12,808 genes detected in at least 5% of circulating monocyte samples from a population-based sample of 1,490 European unrelated subjects. We applied a method of extraction of expression patterns-independent component analysis-to identify sets of co-regulated genes. These patterns were then related to 675,350 SNPs to identify major trans-acting regulators. We detected three genomic regions significantly associated with co-regulated gene modules. Association of these loci with multiple expression traits was replicated in Cardiogenics, an independent study in which expression profiles of monocytes were available in 758 subjects. The locus 12q13 (lead SNP rs11171739), previously identified as a type 1 diabetes locus, was associated with a pattern including two cis eQTLs, RPS26 and SUOX, and 5 trans eQTLs, one of which (MADCAM1) is a potential candidate for mediating T1D susceptibility. The locus 12q24 (lead SNP rs653178), which has demonstrated extensive disease pleiotropy, including type 1 diabetes, hypertension, and celiac disease, was associated to a pattern strongly correlating to blood pressure level. The strongest trans eQTL in this pattern was CRIP1, a known marker of cellular proliferation in cancer. The locus 12q15 (lead SNP rs11177644) was associated with a pattern driven by two cis eQTLs, LYZ and YEATS4, and including 34 trans eQTLs, several of them tumor-related genes. This study shows that a method exploiting the structure of co-expressions among genes can help identify genomic regions involved in trans regulation of sets of genes and can provide clues for understanding the mechanisms linking genome-wide association loci to disease.
The long tail of oncogenic drivers in prostate cancer.
Armenia, Joshua; Wankowicz, Stephanie A M; Liu, David; Gao, Jianjiong; Kundra, Ritika; Reznik, Ed; Chatila, Walid K; Chakravarty, Debyani; Han, G Celine; Coleman, Ilsa; Montgomery, Bruce; Pritchard, Colin; Morrissey, Colm; Barbieri, Christopher E; Beltran, Himisha; Sboner, Andrea; Zafeiriou, Zafeiris; Miranda, Susana; Bielski, Craig M; Penson, Alexander V; Tolonen, Charlotte; Huang, Franklin W; Robinson, Dan; Wu, Yi Mi; Lonigro, Robert; Garraway, Levi A; Demichelis, Francesca; Kantoff, Philip W; Taplin, Mary-Ellen; Abida, Wassim; Taylor, Barry S; Scher, Howard I; Nelson, Peter S; de Bono, Johann S; Rubin, Mark A; Sawyers, Charles L; Chinnaiyan, Arul M; Schultz, Nikolaus; Van Allen, Eliezer M
2018-05-01
Comprehensive genomic characterization of prostate cancer has identified recurrent alterations in genes involved in androgen signaling, DNA repair, and PI3K signaling, among others. However, larger and uniform genomic analysis may identify additional recurrently mutated genes at lower frequencies. Here we aggregate and uniformly analyze exome sequencing data from 1,013 prostate cancers. We identify and validate a new class of E26 transformation-specific (ETS)-fusion-negative tumors defined by mutations in epigenetic regulators, as well as alterations in pathways not previously implicated in prostate cancer, such as the spliceosome pathway. We find that the incidence of significantly mutated genes (SMGs) follows a long-tail distribution, with many genes mutated in less than 3% of cases. We identify a total of 97 SMGs, including 70 not previously implicated in prostate cancer, such as the ubiquitin ligase CUL3 and the transcription factor SPEN. Finally, comparing primary and metastatic prostate cancer identifies a set of genomic markers that may inform risk stratification.
Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong
2018-03-01
Functional disruptions of susceptibility genes by large genomic structure variant (SV) deletions in germlines are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (> 30x) whole-genome sequencing (WGS) data generated in blood samples from 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early cancer onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of BRCA1 (with approximately 80kb in two patients), and TP53 genes (with ∼1.6 kb in two patients), and of intronic regions (∼1 kb) of the PALB2 (one patient), PTEN (three patients) and RAD51C genes (one patient). We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes and the identification of such SV deletions may improve clinical testing.
Assessment of gene order computing methods for Alzheimer's disease
2013-01-01
Background Computational genomics of Alzheimer disease (AD), the most common form of senile dementia, is a nascent field in AD research. The field includes AD gene clustering by computing gene order which generates higher quality gene clustering patterns than most other clustering methods. However, there are few available gene order computing methods such as Genetic Algorithm (GA) and Ant Colony Optimization (ACO). Further, their performance in gene order computation using AD microarray data is not known. We thus set forth to evaluate the performances of current gene order computing methods with different distance formulas, and to identify additional features associated with gene order computation. Methods Using different distance formulas- Pearson distance and Euclidean distance, the squared Euclidean distance, and other conditions, gene orders were calculated by ACO and GA (including standard GA and improved GA) methods, respectively. The qualities of the gene orders were compared, and new features from the calculated gene orders were identified. Results Compared to the GA methods tested in this study, ACO fits the AD microarray data the best when calculating gene order. In addition, the following features were revealed: different distance formulas generated a different quality of gene order, and the commonly used Pearson distance was not the best distance formula when used with both GA and ACO methods for AD microarray data. Conclusion Compared with Pearson distance and Euclidean distance, the squared Euclidean distance generated the best quality gene order computed by GA and ACO methods. PMID:23369541
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A
2016-06-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3' end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. Copyright © 2016 Larson et al.
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A.
2016-01-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3ʹ end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. PMID:27172183
Colaprico, Antonio; Bontempi, Gianluca; Castiglioni, Isabella
2018-01-01
Like other cancer diseases, prostate cancer (PC) is caused by the accumulation of genetic alterations in the cells that drives malignant growth. These alterations are revealed by gene profiling and copy number alteration (CNA) analysis. Moreover, recent evidence suggests that also microRNAs have an important role in PC development. Despite efforts to profile PC, the alterations (gene, CNA, and miRNA) and biological processes that correlate with disease development and progression remain partially elusive. Many gene signatures proposed as diagnostic or prognostic tools in cancer poorly overlap. The identification of co-expressed genes, that are functionally related, can identify a core network of genes associated with PC with a better reproducibility. By combining different approaches, including the integration of mRNA expression profiles, CNAs, and miRNA expression levels, we identified a gene signature of four genes overlapping with other published gene signatures and able to distinguish, in silico, high Gleason-scored PC from normal human tissue, which was further enriched to 19 genes by gene co-expression analysis. From the analysis of miRNAs possibly regulating this network, we found that hsa-miR-153 was highly connected to the genes in the network. Our results identify a four-gene signature with diagnostic and prognostic value in PC and suggest an interesting gene network that could play a key regulatory role in PC development and progression. Furthermore, hsa-miR-153, controlling this network, could be a potential biomarker for theranostics in high Gleason-scored PC. PMID:29562723
Identification of druggable cancer driver genes amplified across TCGA datasets.
Chen, Ying; McGee, Jeremy; Chen, Xianming; Doman, Thompson N; Gong, Xueqian; Zhang, Youyan; Hamm, Nicole; Ma, Xiwen; Higgs, Richard E; Bhagwat, Shripad V; Buchanan, Sean; Peng, Sheng-Bin; Staschke, Kirk A; Yadav, Vipin; Yue, Yong; Kouros-Mehr, Hosein
2014-01-01
The Cancer Genome Atlas (TCGA) projects have advanced our understanding of the driver mutations, genetic backgrounds, and key pathways activated across cancer types. Analysis of TCGA datasets have mostly focused on somatic mutations and translocations, with less emphasis placed on gene amplifications. Here we describe a bioinformatics screening strategy to identify putative cancer driver genes amplified across TCGA datasets. We carried out GISTIC2 analysis of TCGA datasets spanning 16 cancer subtypes and identified 486 genes that were amplified in two or more datasets. The list was narrowed to 75 cancer-associated genes with potential "druggable" properties. The majority of the genes were localized to 14 amplicons spread across the genome. To identify potential cancer driver genes, we analyzed gene copy number and mRNA expression data from individual patient samples and identified 42 putative cancer driver genes linked to diverse oncogenic processes. Oncogenic activity was further validated by siRNA/shRNA knockdown and by referencing the Project Achilles datasets. The amplified genes represented a number of gene families, including epigenetic regulators, cell cycle-associated genes, DNA damage response/repair genes, metabolic regulators, and genes linked to the Wnt, Notch, Hedgehog, JAK/STAT, NF-KB and MAPK signaling pathways. Among the 42 putative driver genes were known driver genes, such as EGFR, ERBB2 and PIK3CA. Wild-type KRAS was amplified in several cancer types, and KRAS-amplified cancer cell lines were most sensitive to KRAS shRNA, suggesting that KRAS amplification was an independent oncogenic event. A number of MAP kinase adapters were co-amplified with their receptor tyrosine kinases, such as the FGFR adapter FRS2 and the EGFR family adapters GRB2 and GRB7. The ubiquitin-like ligase DCUN1D1 and the histone methyltransferase NSD3 were also identified as novel putative cancer driver genes. We discuss the patient tailoring implications for existing cancer drug targets and we further discuss potential novel opportunities for drug discovery efforts.
Identification of Druggable Cancer Driver Genes Amplified across TCGA Datasets
Chen, Ying; McGee, Jeremy; Chen, Xianming; Doman, Thompson N.; Gong, Xueqian; Zhang, Youyan; Hamm, Nicole; Ma, Xiwen; Higgs, Richard E.; Bhagwat, Shripad V.; Buchanan, Sean; Peng, Sheng-Bin; Staschke, Kirk A.; Yadav, Vipin; Yue, Yong; Kouros-Mehr, Hosein
2014-01-01
The Cancer Genome Atlas (TCGA) projects have advanced our understanding of the driver mutations, genetic backgrounds, and key pathways activated across cancer types. Analysis of TCGA datasets have mostly focused on somatic mutations and translocations, with less emphasis placed on gene amplifications. Here we describe a bioinformatics screening strategy to identify putative cancer driver genes amplified across TCGA datasets. We carried out GISTIC2 analysis of TCGA datasets spanning 14 cancer subtypes and identified 461 genes that were amplified in two or more datasets. The list was narrowed to 73 cancer-associated genes with potential “druggable” properties. The majority of the genes were localized to 14 amplicons spread across the genome. To identify potential cancer driver genes, we analyzed gene copy number and mRNA expression data from individual patient samples and identified 40 putative cancer driver genes linked to diverse oncogenic processes. Oncogenic activity was further validated by siRNA/shRNA knockdown and by referencing the Project Achilles datasets. The amplified genes represented a number of gene families, including epigenetic regulators, cell cycle-associated genes, DNA damage response/repair genes, metabolic regulators, and genes linked to the Wnt, Notch, Hedgehog, JAK/STAT, NF-KB and MAPK signaling pathways. Among the 40 putative driver genes were known driver genes, such as EGFR, ERBB2 and PIK3CA. Wild-type KRAS was amplified in several cancer types, and KRAS-amplified cancer cell lines were most sensitive to KRAS shRNA, suggesting that KRAS amplification was an independent oncogenic event. A number of MAP kinase adapters were co-amplified with their receptor tyrosine kinases, such as the FGFR adapter FRS2 and the EGFR family adapter GRB7. The ubiquitin-like ligase DCUN1D1 and the histone methyltransferase NSD3 were also identified as novel putative cancer driver genes. We discuss the patient tailoring implications for existing cancer drug targets and we further discuss potential novel opportunities for drug discovery efforts. PMID:24874471
Hinchcliff, Monique; Huang, Chiang-Ching; Wood, Tammara A.; Mahoney, J. Matthew; Martyanov, Viktor; Bhattacharyya, Swati; Tamaki, Zenshiro; Lee, Jungwha; Carns, Mary; Podlusky, Sofia; Sirajuddin, Arlene; Shah, Sanjiv J; Chang, Rowland W.; Lafyatis, Robert; Varga, John; Whitfield, Michael L.
2013-01-01
Heterogeneity in systemic sclerosis/SSc confounds clinical trials. We previously identified ‘intrinsic’ gene expression subsets by analysis of SSc skin. Here we test the hypotheses that skin gene expression signatures including intrinsic subset are associated with skin score/MRSS improvement during mycophenolate mofetil (MMF) treatment. Gene expression and intrinsic subset assignment were measured in 12 SSc patients’ biopsies and ten controls at baseline, and from serial biopsies of one cyclophosphamide-treated patient, and nine MMF-treated patients. Gene expression changes during treatment were determined using paired t-tests corrected for multiple hypothesis testing. MRSS improved in four of seven MMF-treated patients classified as the inflammatory intrinsic subset. Three patients without MRSS improvement were classified as normal-like or fibroproliferative intrinsic subsets. 321 genes (FDR <5%) were differentially expressed at baseline between patients with and without MRSS improvement during treatment. Expression of 571 genes (FDR <10%) changed between pre- and post-MMF treatment biopsies for patients demonstrating MRSS improvement. Gene expression changes in skin are only seen in patients with MRSS improvement. Baseline gene expression in skin, including intrinsic subset assignment, may identify SSc patients whose MRSS will improve during MMF treatment, suggesting that gene expression in skin may allow targeted treatment in SSc. PMID:23677167
A cluster of novel serotonin receptor 3-like genes on human chromosome 3.
Karnovsky, Alla M; Gotow, Lisa F; McKinley, Denise D; Piechan, Julie L; Ruble, Cara L; Mills, Cynthia J; Schellin, Kathleen A B; Slightom, Jerry L; Fitzgerald, Laura R; Benjamin, Christopher W; Roberds, Steven L
2003-11-13
The ligand-gated ion channel family includes receptors for serotonin (5-hydroxytryptamine, 5-HT), acetylcholine, GABA, and glutamate. Drugs targeting subtypes of these receptors have proven useful for the treatment of various neuropsychiatric and neurological disorders. To identify new ligand-gated ion channels as potential therapeutic targets, drafts of human genome sequence were interrogated. Portions of four novel genes homologous to 5-HT(3A) and 5-HT(3B) receptors were identified within human sequence databases. We named the genes 5-HT(3C1)-5-HT(3C4). Radiation hybrid (RH) mapping localized these genes to chromosome 3q27-28. All four genes shared similar intron-exon organizations and predicted protein secondary structure with 5-HT(3A) and 5-HT(3B). Orthologous genes were detected by Southern blotting in several species including dog, cow, and chicken, but not in rodents, suggesting that these novel genes are not present in rodents or are very poorly conserved. Two of the novel genes are predicted to be pseudogenes, but two other genes are transcribed and spliced to form appropriate open reading frames. The 5-HT(3C1) transcript is expressed almost exclusively in small intestine and colon, suggesting a possible role in the serotonin-responsiveness of the gut.
Lockyer, Anne E; Noble, Leslie R; Rollinson, David; Jones, Catherine S
2004-01-01
The freshwater tropical snail Biomphalaria glabrata is an intermediate host for Schistosoma mansoni, the causative agent of human intestinal schistosomiasis, and strains differ in their susceptibility to parasite infection. Changes in gene expression in response to parasite infection have been simultaneously examined in a susceptible strain (NHM1742) and a resistant strain (NHM1981) using a newly developed fluorescent-based differential display method. Such RNA profiling techniques allow the examination of changes in gene expression in response to parasite infection, without requiring previous sequence knowledge, or selecting candidate genes that may be involved in the complex neuroendocrine or defence systems of the snail. Thus, novel genes may be identified. Ten transcripts were initially identified, present only in the profiles derived from snails of the resistant strain when exposed to infection. The differential expression of five of these genes, including HSP70 and several novel transcripts with one containing at least two globin-like domains, has been confirmed by semi-quantitative RT-PCR.
Comprehensive Molecular Characterization of Urothelial Bladder Carcinoma
2014-01-01
Urothelial carcinoma of the bladder is a common malignancy that causes approximately 150,000 deaths per year worldwide. To date, no molecularly targeted agents have been approved for the disease. As part of The Cancer Genome Atlas project, we report here an integrated analysis of 131 urothelial carcinomas to provide a comprehensive landscape of molecular alterations. There were statistically significant recurrent mutations in 32 genes, including multiple genes involved in cell cycle regulation, chromatin regulation, and kinase signaling pathways, as well as 9 genes not previously reported as significantly mutated in any cancer. RNA sequencing revealed four expression subtypes, two of which (papillary-like and basal/squamous-like) were also evident in miRNA sequencing and protein data. Whole-genome and RNA sequencing identified recurrent in-frame activating FGFR3-TACC3 fusions and expression or integration of several viruses (including HPV16) that are associated with gene inactivation. Our analyses identified potential therapeutic targets in 69% of the tumours, including 42% with targets in the PI3K/AKT/mTOR pathway and 45% with targets (including ERBB2) in the RTK/MAPK pathway. Chromatin regulatory genes were more frequently mutated in urothelial carcinoma than in any common cancer studied to date, suggesting the future possibility of targeted therapy for chromatin abnormalities. PMID:24476821
Zhu, Hong; Xia, Wei; Mo, Xing-Bo; Lin, Xiang; Qiu, Ying-Hua; Yi, Neng-Jun; Zhang, Yong-Hong; Deng, Fei-Yan; Lei, Shu-Feng
2016-01-01
Rheumatoid arthritis (RA) is a complex autoimmune disease. Using a gene-based association research strategy, the present study aims to detect unknown susceptibility to RA and to address the ethnic differences in genetic susceptibility to RA between European and Asian populations. Gene-based association analyses were performed with KGG 2.5 by using publicly available large RA datasets (14,361 RA cases and 43,923 controls of European subjects, 4,873 RA cases and 17,642 controls of Asian Subjects). For the newly identified RA-associated genes, gene set enrichment analyses and protein-protein interactions analyses were carried out with DAVID and STRING version 10.0, respectively. Differential expression verification was conducted using 4 GEO datasets. The expression levels of three selected 'highly verified' genes were measured by ELISA among our in-house RA cases and controls. A total of 221 RA-associated genes were newly identified by gene-based association study, including 71'overlapped', 76 'European-specific' and 74 'Asian-specific' genes. Among them, 105 genes had significant differential expressions between RA patients and health controls at least in one dataset, especially for 20 genes including 11 'overlapped' (ABCF1, FLOT1, HLA-F, IER3, TUBB, ZKSCAN4, BTN3A3, HSP90AB1, CUTA, BRD2, HLA-DMA), 5 'European-specific' (PHTF1, RPS18, BAK1, TNFRSF14, SUOX) and 4 'Asian-specific' (RNASET2, HFE, BTN2A2, MAPK13) genes whose differential expressions were significant at least in three datasets. The protein expressions of two selected genes FLOT1 (P value = 1.70E-02) and HLA-DMA (P value = 4.70E-02) in plasma were significantly different in our in-house samples. Our study identified 221 novel RA-associated genes and especially highlighted the importance of 20 candidate genes on RA. The results addressed ethnic genetic background differences for RA susceptibility between European and Asian populations and detected a long list of overlapped or ethnic specific RA genes. The study not only greatly increases our understanding of genetic susceptibility to RA, but also provides important insights into the ethno-genetic homogeneity and heterogeneity of RA in both ethnicities.
DGEM--a microarray gene expression database for primary human disease tissues.
Xia, Yuni; Campen, Andrew; Rigsby, Dan; Guo, Ying; Feng, Xingdong; Su, Eric W; Palakal, Mathew; Li, Shuyu
2007-01-01
Gene expression patterns can reflect gene regulations in human tissues under normal or pathologic conditions. Gene expression profiling data from studies of primary human disease samples are particularly valuable since these studies often span many years in order to collect patient clinical information and achieve a large sample size. Disease-to-Gene Expression Mapper (DGEM) provides a beneficial community resource to access and analyze these data; it currently includes Affymetrix oligonucleotide array datasets for more than 40 human diseases and 1400 samples. The data are normalized to the same scale and stored in a relational database. A statistical-analysis pipeline was implemented to identify genes abnormally expressed in disease tissues or genes whose expressions are associated with clinical parameters such as cancer patient survival. Data-mining results can be queried through a web-based interface at http://dgem.dhcp.iupui.edu/. The query tool enables dynamic generation of graphs and tables that are further linked to major gene and pathway resources that connect the data to relevant biology, including Entrez Gene and Kyoto Encyclopedia of Genes and Genomes (KEGG). In summary, DGEM provides scientists and physicians a valuable tool to study disease mechanisms, to discover potential disease biomarkers for diagnosis and prognosis, and to identify novel gene targets for drug discovery. The source code is freely available for non-profit use, on request to the authors.
Expression of drought tolerance genes in tropical upland rice cultivars (Oryza sativa).
Silveira, R D D; Abreu, F R M; Mamidi, S; McClean, P E; Vianello, R P; Lanna, A C; Carneiro, N P; Brondani, C
2015-07-27
Gene expression related to drought response in the leaf tissues of two Brazilian upland cultivars, the drought-tolerant Douradão and the drought-sensitive Primavera, was analyzed. RNA-seq identified 27,618 transcripts in the Douradão cultivar, with 24,090 (87.2%) homologous to the rice database, and 27,221 transcripts in the Primavera cultivar, with 23,663 (86.9%) homologous to the rice database. Gene-expression analysis between control and water-deficient treatments revealed 493 and 1154 differentially expressed genes in Douradão and Primavera cultivars, respectively. Genes exclusively expressed under drought were identified for Douradão, including two genes of particular interest coding for the protein peroxidase precursor, which is involved in three distinct metabolic pathways. Comparisons between the two drought-exposed cultivars revealed 2314 genes were differentially expressed (978 upregulated, 1336 downregulated in Douradão). Six genes distributed across 4 different transcription factor families (bHLH, MYB, NAC, and WRKY) were identified, all of which were upregulated in Douradão compared to Primavera during drought. Most of the genes identified in Douradão activate metabolic pathways responsible for production of secondary metabolites and genes coding for enzymatically active signaling receptors. Quantitative PCR validation showed that most gene expression was in agreement with computational prediction of these transcripts. The transcripts identified here will define molecular markers for identification of Cis-acting elements to search for allelic variants of these genes through analysis of polymorphic SNPs in GenBank accessions of upland rice, aiming to develop cultivars with the best combination of these alleles, resulting in materials with high yield potential in the event of drought during the reproductive phase.
Qi, Jingjing; Yu, Yong; Akilli Öztürk, Özlem; Holland, Jane D; Besser, Daniel; Fritzmann, Johannes; Wulf-Goldenberg, Annika; Eckert, Klaus; Fichtner, Iduna; Birchmeier, Walter
2016-10-01
We have previously identified a 115-gene signature that characterises the metastatic potential of human primary colon cancers. The signature included the canonical Wnt target gene BAMBI, which promoted experimental metastasis in mice. Here, we identified three new direct Wnt target genes from the signature, and studied their functions in epithelial-mesenchymal transition (EMT), cell migration and experimental metastasis. We examined experimental liver metastases following injection of selected tumour cells into spleens of NOD/SCID mice. Molecular and cellular techniques were used to identify direct transcription target genes of Wnt/β-catenin signals. Microarray analyses and experiments that interfered with cell migration through inhibitors were performed to characterise downstream signalling systems. Three new genes from the colorectal cancer (CRC) metastasis signature, BOP1, CKS2 and NFIL3, were identified as direct transcription targets of β-catenin/TCF4. Overexpression and knocking down of these genes in CRC cells promoted and inhibited, respectively, experimental metastasis in mice, EMT and cell motility in culture. Cell migration was repressed by interfering with distinct signalling systems through inhibitors of PI3K, JNK, p38 mitogen-activated protein kinase and/or mTOR. Gene expression profiling identified a series of migration-promoting genes, which were induced by BOP1, CKS2 and NFIL3, and could be repressed by inhibitors that are specific to these pathways. We identified new direct Wnt/β-catenin target genes, BOP1, CKS2 and NFIL3, which induced EMT, cell migration and experimental metastasis of CRC cells. These genes crosstalk with different downstream signalling systems, and activate migration-promoting genes. These pathways and downstream genes may serve as therapeutic targets in the treatment of CRC metastasis. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Zhu, Xinyu; Ma, Hong; Chen, Zhiduan
2011-03-09
Plants contain numerous Su(var)3-9 homologues (SUVH) and related (SUVR) genes, some of which await functional characterization. Although there have been studies on the evolution of plant Su(var)3-9 SET genes, a systematic evolutionary study including major land plant groups has not been reported. Large-scale phylogenetic and evolutionary analyses can help to elucidate the underlying molecular mechanisms and contribute to improve genome annotation. Putative orthologs of plant Su(var)3-9 SET protein sequences were retrieved from major representatives of land plants. A novel clustering that included most members analyzed, henceforth referred to as core Su(var)3-9 homologues and related (cSUVHR) gene clade, was identified as well as all orthologous groups previously identified. Our analysis showed that plant Su(var)3-9 SET proteins possessed a variety of domain organizations, and can be classified into five types and ten subtypes. Plant Su(var)3-9 SET genes also exhibit a wide range of gene structures among different paralogs within a family, even in the regions encoding conserved PreSET and SET domains. We also found that the majority of SUVH members were intronless and formed three subclades within the SUVH clade. A detailed phylogenetic analysis of the plant Su(var)3-9 SET genes was performed. A novel deep phylogenetic relationship including most plant Su(var)3-9 SET genes was identified. Additional domains such as SAR, ZnF_C2H2 and WIYLD were early integrated into primordial PreSET/SET/PostSET domain organization. At least three classes of gene structures had been formed before the divergence of Physcomitrella patens (moss) from other land plants. One or multiple retroposition events might have occurred among SUVH genes with the donor genes leading to the V-2 orthologous group. The structural differences among evolutionary groups of plant Su(var)3-9 SET genes with different functions were described, contributing to the design of further experimental studies.
DNA methylome signature in rheumatoid arthritis.
Nakano, Kazuhisa; Whitaker, John W; Boyle, David L; Wang, Wei; Firestein, Gary S
2013-01-01
Epigenetics can influence disease susceptibility and severity. While DNA methylation of individual genes has been explored in autoimmunity, no unbiased systematic analyses have been reported. Therefore, a genome-wide evaluation of DNA methylation loci in fibroblast-like synoviocytes (FLS) isolated from the site of disease in rheumatoid arthritis (RA) was performed. Genomic DNA was isolated from six RA and five osteoarthritis (OA) FLS lines and evaluated using the Illumina HumanMethylation450 chip. Cluster analysis of data was performed and corrected using Benjamini-Hochberg adjustment for multiple comparisons. Methylation was confirmed by pyrosequencing and gene expression was determined by qPCR. Pathway analysis was performed using the Kyoto Encyclopedia of Genes and Genomes. RA and control FLS segregated based on DNA methylation, with 1859 differentially methylated loci. Hypomethylated loci were identified in key genes relevant to RA, such as CHI3L1, CASP1, STAT3, MAP3K5, MEFV and WISP3. Hypermethylation was also observed, including TGFBR2 and FOXO1. Hypomethylation of individual genes was associated with increased gene expression. Grouped analysis identified 207 hypermethylated or hypomethylated genes with multiple differentially methylated loci, including COL1A1, MEFV and TNF. Hypomethylation was increased in multiple pathways related to cell migration, including focal adhesion, cell adhesion, transendothelial migration and extracellular matrix interactions. Confirmatory studies with OA and normal FLS also demonstrated segregation of RA from control FLS based on methylation pattern. Differentially methylated genes could alter FLS gene expression and contribute to the pathogenesis of RA. DNA methylation of critical genes suggests that RA FLS are imprinted and implicate epigenetic contributions to inflammatory arthritis.
Ram, Chet; Koramutla, Murali Krishna; Bhattacharya, Ramcharan
2017-07-01
Brassica juncea is a chief oil yielding crop in many parts of the world including India. With advancement of molecular techniques, RT-qPCR based study of gene-expression has become an integral part of experimentations in crop breeding. In RT-qPCR, use of appropriate reference gene(s) is pivotal. The virtue of the reference genes, being constant in expression throughout the experimental treatments, needs to be validated case by case. Appropriate reference gene(s) for normalization of gene-expression data in B. juncea during the biotic stress of aphid infestation is not known. In the present investigation, 11 reference genes identified from microarray database of Arabidopsis-aphid interaction at a cut off FDR ≤0.1, along with two known reference genes of B. juncea, were analyzed for their expression stability upon aphid infestation. These included 6 frequently used and 5 newly identified reference genes. Ranking orders of the reference genes in terms of expression stability were calculated using advanced statistical approaches such as geNorm, NormFinder, delta Ct and BestKeeper. The analysis suggested CAC, TUA and DUF179 as the most suitable reference genes. Further, normalization of the gene-expression data of STP4 and PR1 by the most and the least stable reference gene, respectively has demonstrated importance and applicability of the recommended reference genes in aphid infested samples of B. juncea. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
SSTAR, a Stand-Alone Easy-To-Use Antimicrobial Resistance Gene Predictor.
de Man, Tom J B; Limbago, Brandi M
2016-01-01
We present the easy-to-use Sequence Search Tool for Antimicrobial Resistance, SSTAR. It combines a locally executed BLASTN search against a customizable database with an intuitive graphical user interface for identifying antimicrobial resistance (AR) genes from genomic data. Although the database is initially populated from a public repository of acquired resistance determinants (i.e., ARG-ANNOT), it can be customized for particular pathogen groups and resistance mechanisms. For instance, outer membrane porin sequences associated with carbapenem resistance phenotypes can be added, and known intrinsic mechanisms can be included. Unique about this tool is the ability to easily detect putative new alleles and truncated versions of existing AR genes. Variants and potential new alleles are brought to the attention of the user for further investigation. For instance, SSTAR is able to identify modified or truncated versions of porins, which may be of great importance in carbapenemase-negative carbapenem-resistant Enterobacteriaceae. SSTAR is written in Java and is therefore platform independent and compatible with both Windows and Unix operating systems. SSTAR and its manual, which includes a simple installation guide, are freely available from https://github.com/tomdeman-bio/Sequence-Search-Tool-for-Antimicrobial-Resistance-SSTAR-. IMPORTANCE Whole-genome sequencing (WGS) is quickly becoming a routine method for identifying genes associated with antimicrobial resistance (AR). However, for many microbiologists, the use and analysis of WGS data present a substantial challenge. We developed SSTAR, software with a graphical user interface that enables the identification of known AR genes from WGS and has the unique capacity to easily detect new variants of known AR genes, including truncated protein variants. Current software solutions do not notify the user when genes are truncated and, therefore, likely nonfunctional, which makes phenotype predictions less accurate. SSTAR users can apply any AR database of interest as a reference comparator and can manually add genes that impact resistance, even if such genes are not resistance determinants per se (e.g., porins and efflux pumps).
The chemokine receptor CCR1 is identified in mast cell-derived exosomes.
Liang, Yuting; Qiao, Longwei; Peng, Xia; Cui, Zelin; Yin, Yue; Liao, Huanjin; Jiang, Min; Li, Li
2018-01-01
Mast cells are important effector cells of the immune system, and mast cell-derived exosomes carrying RNAs play a role in immune regulation. However, the molecular function of mast cell-derived exosomes is currently unknown, and here, we identify differentially expressed genes (DEGs) in mast cells and exosomes. We isolated mast cells derived exosomes through differential centrifugation and screened the DEGs from mast cell-derived exosomes, using the GSE25330 array dataset downloaded from the Gene Expression Omnibus database. Biochemical pathways were analyzed by Gene ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway on the online tool DAVID. DEGs-associated protein-protein interaction networks (PPIs) were constructed using the STRING database and Cytoscape software. The genes identified from these bioinformatics analyses were verified by qRT-PCR and Western blot in mast cells and exosomes. We identified 2121 DEGs (843 up and 1278 down-regulated genes) in HMC-1 cell-derived exosomes and HMC-1 cells. The up-regulated DEGs were classified into two significant modules. The chemokine receptor CCR1 was screened as a hub gene and enriched in cytokine-mediated signaling pathway in module one. Seven genes, including CCR1, CD9, KIT, TGFBR1, TLR9, TPSAB1 and TPSB2 were screened and validated through qRT-PCR analysis. We have achieved a comprehensive view of the pivotal genes and pathways in mast cells and exosomes and identified CCR1 as a hub gene in mast cell-derived exosomes. Our results provide novel clues with respect to the biological processes through which mast cell-derived exosomes modulate immune responses.
Korashy, Hesham M; Attafi, Ibraheem M; Famulski, Konrad S; Bakheet, Saleh A; Hafez, Mohammed M; Alsaad, Abdulaziz M S; Al-Ghadeer, Abdul Rahman M
2017-02-01
Heavy metals are the most commonly encountered toxic substances that increase susceptibility to various diseases after prolonged exposure. We have previously shown that healthy volunteers living near a mining area had significant contamination with heavy metals associated with significant changes in the expression of some detoxifying genes, xenobiotic metabolizing enzymes, and DNA repair genes. However, alterations of most of the molecular target genes associated with diseases are still unknown. Thus, the aims of this study were to (a) evaluate the gene expression profile and (b) identify the toxicities and potentially relevant human disease outcomes associated with long-term human exposure to environmental heavy metals in mining area using microarray analysis. For this purpose, 40 healthy male volunteers who were residents of a heavy metal-polluted area (Mahd Al-Dhahab city, Saudi Arabia) and 20 healthy male volunteers who were residents of a non-heavy metal-polluted area were included in the study. Total RNA was isolated from whole blood using PAXgene Blood RNA tubes and then reversed transcribed and hybridized to the gene array using the Affymetrix U219 GeneChip. Microarray analysis showed about 2129 genes were identified and differentially altered, among which a shared set of 425 genes was differentially expressed in the heavy metal-exposed groups. Ingenuity pathway analysis revealed that the most altered gene-regulated diseases in heavy metal-exposed groups included hematological and developmental disorders and mostly renal and urological diseases. Quantitative real-time polymerase chain reaction closely matched the microarray data for some genes tested. Importantly, changes in gene-related diseases were attributed to alterations in the genes encoded for protein synthesis. Renal and urological diseases were the diseases that were most frequently associated with the heavy metal-exposed group. Therefore, there is a need for further studies to validate these genes, which could be used as early biomarkers to prevent renal injury. Copyright © 2016 Elsevier Ltd. All rights reserved.
Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng
2017-08-01
Neuroticism is a fundamental personality trait with significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data of genome wide association study (GWAS) and expression quantitative trait locus (eQTL) study. GWAS summary data was driven from published studies of neuroticism, totally involving 170,906 subjects. eQTL dataset containing 927,753 eQTLs were obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted by summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10 -10 ), MGC57346 (p value=6.92×10 -7 ), BLK (p value=1.01×10 -6 ), XKR6 (p value=1.11×10 -6 ), C17ORF69 (p value=1.12×10 -6 ) and KIAA1267 (p value=4.00×10 -6 ). Gene set enrichment analysis observed significant association for Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.
A 15-gene signature for prediction of colon cancer recurrence and prognosis based on SVM.
Xu, Guangru; Zhang, Minghui; Zhu, Hongxing; Xu, Jinhua
2017-03-10
To screen the gene signature for distinguishing patients with high risks from those with low-risks for colon cancer recurrence and predicting their prognosis. Five microarray datasets of colon cancer samples were collected from Gene Expression Omnibus database and one was obtained from The Cancer Genome Atlas (TCGA). After preprocessing, data in GSE17537 were analyzed using the Linear Models for Microarray data (LIMMA) method to identify the differentially expressed genes (DEGs). The DEGs further underwent PPI network-based neighborhood scoring and support vector machine (SVM) analyses to screen the feature genes associated with recurrence and prognosis, which were then validated by four datasets GSE38832, GSE17538, GSE28814 and TCGA using SVM and Cox regression analyses. A total of 1207 genes were identified as DEGs between recurrence and no-recurrence samples, including 726 downregulated and 481 upregulated genes. Using SVM analysis and five gene expression profile data confirmation, a 15-gene signature (HES5, ZNF417, GLRA2, OR8D2, HOXA7, FABP6, MUSK, HTR6, GRIP2, KLRK1, VEGFA, AKAP12, RHEB, NCRNA00152 and PMEPA1) were identified as a predictor of recurrence risk and prognosis for colon cancer patients. Our identified 15-gene signature may be useful to classify colon cancer patients with different prognosis and some genes in this signature may represent new therapeutic targets. Copyright © 2016. Published by Elsevier B.V.
Davidson, Ben; Stavnes, Helene Tuft; Holth, Arild; Chen, Xu; Yang, Yanqin; Shih, Ie-Ming; Wang, Tian-Li
2011-01-01
Abstract Ovarian/primary peritoneal carcinoma and breast carcinoma are the gynaecological cancers that most frequently involve the serosal cavities. With the objective of improving on the limited diagnostic panel currently available for the differential diagnosis of these two malignancies, as well as to define tumour-specific biological targets, we compared their global gene expression patterns. Gene expression profiles of 10 serous ovarian/peritoneal and eight ductal breast carcinoma effusions were analysed using the HumanRef-8 BeadChip from Illumina. Differentially expressed candidate genes were validated using quantitative real-time PCR and immunohistochemistry. Unsupervised hierarchical clustering using all 54,675 genes in the array separated ovarian from breast carcinoma samples. We identified 288 unique probes that were significantly differentially expressed in the two cancers by greater than 3.5-fold, of which 81 and 207 were overexpressed in breast and ovarian/peritoneal carcinoma, respectively. SAM analysis identified 1078 differentially expressed probes with false discovery rate less than 0.05. Genes overexpressed in breast carcinoma included TFF1, TFF3, FOXA1, CA12, GATA3, SDC1, PITX1, TH, EHFD1, EFEMP1, TOB1 and KLF2. Genes overexpressed in ovarian/peritoneal carcinoma included SPON1, RBP1, MFGE8, TM4SF12, MMP7, KLK5/6/7, FOLR1/3, PAX8, APOL2 and NRCAM. The differential expression of 14 genes was validated by quantitative real-time PCR, and differences in 5 gene products were confirmed by immunohistochemistry. Expression profiling distinguishes ovarian/peritoneal carcinoma from breast carcinoma and identifies genes that are differentially expressed in these two tumour types. The molecular signatures unique to these cancers may facilitate their differential diagnosis and may provide a molecular basis for therapeutic target discovery. PMID:20132413
Comparative Analysis of AhR-Mediated TCDD-Elicited Gene Expression in Human Liver Adult Stem Cells
Kim, Suntae; Dere, Edward; Burgoon, Lyle D.; Chang, Chia-Cheng; Zacharewski, Timothy R.
2009-01-01
Time course and dose-response studies were conducted in HL1-1 cells, a human liver cell line with stem cell–like characteristics, to assess the differential gene expression elicited by 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) compared with other established models. Cells were treated with 0.001, 0.01, 0.1, 1, 10, or 100nM TCDD or dimethyl sulfoxide vehicle control for 12 h for the dose-response study, or with 10nM TCDD or vehicle for 1, 2, 4, 8, 12, 24, or 48 h for the time course study. Elicited changes were monitored using a human cDNA microarray with 6995 represented genes. Empirical Bayes analysis identified 144 genes differentially expressed at one or more time points following treatment. Most genes exhibited dose-dependent responses including CYP1A1, CYP1B1, ALDH1A3, and SLC7A5 genes. Comparative analysis of HL1-1 differential gene expression to human HepG2 data identified 74 genes with comparable temporal expression profiles including 12 putative primary responses. HL1-1–specific changes were related to lipid metabolism and immune responses, consistent with effects elicited in vivo. Furthermore, comparative analysis of HL1-1 cells with mouse Hepa1c1c7 hepatoma cell lines and C57BL/6 hepatic tissue identified 18 and 32 commonly regulated orthologous genes, respectively, with functions associated with signal transduction, transcriptional regulation, metabolism and transport. Although some common pathways are affected, the results suggest that TCDD elicits species- and model-specific gene expression profiles. PMID:19684285
Woods, Stephanie E; Lieberman, Mia T; Lebreton, Francois; Trowel, Elise; de la Fuente-Núñez, César; Dzink-Fox, Joanne; Gilmore, Michael S; Fox, James G
2017-01-01
Nonhuman primates are commonly used for cognitive neuroscience research and often surgically implanted with cephalic recording chambers for electrophysiological recording. Aerobic bacterial cultures from 25 macaques identified 72 bacterial isolates, including 15 Enterococcus faecalis isolates. The E. faecalis isolates displayed multi-drug resistant phenotypes, with resistance to ciprofloxacin, enrofloxacin, trimethoprim-sulfamethoxazole, tetracycline, chloramphenicol, bacitracin, and erythromycin, as well as high-level aminoglycoside resistance. Multi-locus sequence typing showed that most belonged to two E. faecalis sequence types (ST): ST 4 and ST 55. The genomes of three representative isolates were sequenced to identify genes encoding antimicrobial resistances and other traits. Antimicrobial resistance genes identified included aac(6')-aph(2"), aph(3')-III, str, ant(6)-Ia, tetM, tetS, tetL, ermB, bcrABR, cat, and dfrG, and polymorphisms in parC (S80I) and gyrA (S83I) were observed. These isolates also harbored virulence factors including the cytolysin toxin genes in ST 4 isolates, as well as multiple biofilm-associated genes (esp, agg, ace, SrtA, gelE, ebpABC), hyaluronidases (hylA, hylB), and other survival genes (ElrA, tpx). Crystal violet biofilm assays confirmed that ST 4 isolates produced more biofilm than ST 55 isolates. The abundance of antimicrobial resistance and virulence factor genes in the ST 4 isolates likely relates to the loss of CRISPR-cas. This macaque colony represents a unique model for studying E. faecalis infection associated with indwelling devices, and provides an opportunity to understand the basis of persistence of this pathogen in a healthcare setting.
Ye, Bang-Ce; Zhang, Yan; Yu, Hui; Yu, Wen-Bang; Liu, Bao-Hong; Yin, Bin-Cheng; Yin, Chun-Yun; Li, Yuan-Yuan; Chu, Ju; Zhang, Si-Liang
2009-01-01
Microorganisms can restructure their transcriptional output to adapt to environmental conditions by sensing endogenous metabolite pools. In this paper, an Agilent customized microarray representing 4,106 genes was used to study temporal transcript profiles of Bacillus subtilis in response to valine, glutamate and glutamine pulses over 24 h. A total of 673, 835, and 1135 amino-acid-regulated genes were identified having significantly changed expression at one or more time points in response to valine, glutamate, and glutamine, respectively, including genes involved in cell wall, cellular import, metabolism of amino-acids and nucleotides, transcriptional regulation, flagellar motility, chemotaxis, phage proteins, sporulation, and many genes of unknown function. Different amino acid treatments were compared in terms of both the global temporal profiles and the 5-minute quick regulations, and between-experiment differential genes were identified. The highlighted genes were analyzed based on diverse sources of gene functions using a variety of computational tools, including T-profiler analysis, and hierarchical clustering. The results revealed the common and distinct modes of action of these three amino acids, and should help to elucidate the specific signaling mechanism of each amino acid as an effector. PMID:19763274
A global interaction network maps a wiring diagram of cellular function
Costanzo, Michael; VanderSluis, Benjamin; Koch, Elizabeth N.; Baryshnikova, Anastasia; Pons, Carles; Tan, Guihong; Wang, Wen; Usaj, Matej; Hanchard, Julia; Lee, Susan D.; Pelechano, Vicent; Styles, Erin B.; Billmann, Maximilian; van Leeuwen, Jolanda; van Dyk, Nydia; Lin, Zhen-Yuan; Kuzmin, Elena; Nelson, Justin; Piotrowski, Jeff S.; Srikumar, Tharan; Bahr, Sondra; Chen, Yiqun; Deshpande, Raamesh; Kurat, Christoph F.; Li, Sheena C.; Li, Zhijian; Usaj, Mojca Mattiazzi; Okada, Hiroki; Pascoe, Natasha; Luis, Bryan-Joseph San; Sharifpoor, Sara; Shuteriqi, Emira; Simpkins, Scott W.; Snider, Jamie; Suresh, Harsha Garadi; Tan, Yizhao; Zhu, Hongwei; Malod-Dognin, Noel; Janjic, Vuk; Przulj, Natasa; Troyanskaya, Olga G.; Stagljar, Igor; Xia, Tian; Ohya, Yoshikazu; Gingras, Anne-Claude; Raught, Brian; Boutros, Michael; Steinmetz, Lars M.; Moore, Claire L.; Rosebrock, Adam P.; Caudy, Amy A.; Myers, Chad L.; Andrews, Brenda; Boone, Charles
2017-01-01
We generated a global genetic interaction network for Saccharomyces cerevisiae, constructing over 23 million double mutants, identifying ~550,000 negative and ~350,000 positive genetic interactions. This comprehensive network maps genetic interactions for essential gene pairs, highlighting essential genes as densely connected hubs. Genetic interaction profiles enabled assembly of a hierarchical model of cell function, including modules corresponding to protein complexes and pathways, biological processes, and cellular compartments. Negative interactions connected functionally related genes, mapped core bioprocesses, and identified pleiotropic genes, whereas positive interactions often mapped general regulatory connections among gene pairs, rather than shared functionality. The global network illustrates how coherent sets of genetic interactions connect protein complex and pathway modules to map a functional wiring diagram of the cell. PMID:27708008
Knowledge-guided gene prioritization reveals new insights into the mechanisms of chemoresistance.
Emad, Amin; Cairns, Junmei; Kalari, Krishna R; Wang, Liewei; Sinha, Saurabh
2017-08-11
Identification of genes whose basal mRNA expression predicts the sensitivity of tumor cells to cytotoxic treatments can play an important role in individualized cancer medicine. It enables detailed characterization of the mechanism of action of drugs. Furthermore, screening the expression of these genes in the tumor tissue may suggest the best course of chemotherapy or a combination of drugs to overcome drug resistance. We developed a computational method called ProGENI to identify genes most associated with the variation of drug response across different individuals, based on gene expression data. In contrast to existing methods, ProGENI also utilizes prior knowledge of protein-protein and genetic interactions, using random walk techniques. Analysis of two relatively new and large datasets including gene expression data on hundreds of cell lines and their cytotoxic responses to a large compendium of drugs reveals a significant improvement in prediction of drug sensitivity using genes identified by ProGENI compared to other methods. Our siRNA knockdown experiments on ProGENI-identified genes confirmed the role of many new genes in sensitivity to three chemotherapy drugs: cisplatin, docetaxel, and doxorubicin. Based on such experiments and extensive literature survey, we demonstrate that about 73% of our top predicted genes modulate drug response in selected cancer cell lines. In addition, global analysis of genes associated with groups of drugs uncovered pathways of cytotoxic response shared by each group. Our results suggest that knowledge-guided prioritization of genes using ProGENI gives new insight into mechanisms of drug resistance and identifies genes that may be targeted to overcome this phenomenon.
Chaillou, Thomas; Jackson, Janna R; England, Jonathan H; Kirby, Tyler J; Richards-White, Jena; Esser, Karyn A; Dupont-Versteegden, Esther E; McCarthy, John J
2015-01-01
The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. Copyright © 2015 the American Physiological Society.
Chaillou, Thomas; Jackson, Janna R.; England, Jonathan H.; Kirby, Tyler J.; Richards-White, Jena; Esser, Karyn A.; Dupont-Versteegden, Esther E.
2014-01-01
The purpose of this study was to compare the gene expression profile of mouse skeletal muscle undergoing two forms of growth (hypertrophy and regrowth) with the goal of identifying a conserved set of differentially expressed genes. Expression profiling by microarray was performed on the plantaris muscle subjected to 1, 3, 5, 7, 10, and 14 days of hypertrophy or regrowth following 2 wk of hind-limb suspension. We identified 97 differentially expressed genes (≥2-fold increase or ≥50% decrease compared with control muscle) that were conserved during the two forms of muscle growth. The vast majority (∼90%) of the differentially expressed genes was upregulated and occurred at a single time point (64 out of 86 genes), which most often was on the first day of the time course. Microarray analysis from the conserved upregulated genes showed a set of genes related to contractile apparatus and stress response at day 1, including three genes involved in mechanotransduction and four genes encoding heat shock proteins. Our analysis further identified three cell cycle-related genes at day and several genes associated with extracellular matrix (ECM) at both days 3 and 10. In conclusion, we have identified a core set of genes commonly upregulated in two forms of muscle growth that could play a role in the maintenance of sarcomere stability, ECM remodeling, cell proliferation, fast-to-slow fiber type transition, and the regulation of skeletal muscle growth. These findings suggest conserved regulatory mechanisms involved in the adaptation of skeletal muscle to increased mechanical loading. PMID:25554798
Uncovering co-expression gene network modules regulating fruit acidity in diverse apples.
Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Zhong, Gan-Yuan; Xu, Kenong
2015-08-16
Acidity is a major contributor to fruit quality. Several organic acids are present in apple fruit, but malic acid is predominant and determines fruit acidity. The trait is largely controlled by the Malic acid (Ma) locus, underpinning which Ma1 that putatively encodes a vacuolar aluminum-activated malate transporter1 (ALMT1)-like protein is a strong candidate gene. We hypothesize that fruit acidity is governed by a gene network in which Ma1 is key member. The goal of this study is to identify the gene network and the potential mechanisms through which the network operates. Guided by Ma1, we analyzed the transcriptomes of mature fruit of contrasting acidity from six apple accessions of genotype Ma_ (MaMa or Mama) and four of mama using RNA-seq and identified 1301 fruit acidity associated genes, among which 18 were most significant acidity genes (MSAGs). Network inferring using weighted gene co-expression network analysis (WGCNA) revealed five co-expression gene network modules of significant (P < 0.001) correlation with malate. Of these, the Ma1 containing module (Turquoise) of 336 genes showed the highest correlation (0.79). We also identified 12 intramodular hub genes from each of the five modules and 18 enriched gene ontology (GO) terms and MapMan sub-bines, including two GO terms (GO:0015979 and GO:0009765) and two MapMap sub-bins (1.3.4 and 1.1.1.1) related to photosynthesis in module Turquoise. Using Lemon-Tree algorithms, we identified 12 regulator genes of probabilistic scores 35.5-81.0, including MDP0000525602 (a LLR receptor kinase), MDP0000319170 (an IQD2-like CaM binding protein) and MDP0000190273 (an EIN3-like transcription factor) of greater interest for being one of the 18 MSAGs or one of the 12 intramodular hub genes in Turquoise, and/or a regulator to the cluster containing Ma1. The most relevant finding of this study is the identification of the MSAGs, intramodular hub genes, enriched photosynthesis related processes, and regulator genes in a WGCNA module Turquoise that not only encompasses Ma1 but also shows the highest modular correlation with acidity. Overall, this study provides important insight into the Ma1-mediated gene network controlling acidity in mature apple fruit of diverse genetic background.
D'Addabbo, Annarita; Palmieri, Orazio; Maglietta, Rosalia; Latiano, Anna; Mukherjee, Sayan; Annese, Vito; Ancona, Nicola
2011-08-01
A meta-analysis has re-analysed previous genome-wide association scanning definitively confirming eleven genes and further identifying 21 new loci. However, the identified genes/loci still explain only the minority of genetic predisposition of Crohn's disease. To identify genes weakly involved in disease predisposition by analysing chromosomal regions enriched of single nucleotide polymorphisms with modest statistical association. We utilized the WTCCC data set evaluating 1748 CD and 2938 controls. The identification of candidate genes/loci was performed by a two-step procedure: first of all chromosomal regions enriched of weak association signals were localized; subsequently, weak signals clustered in gene regions were identified. The statistical significance was assessed by non parametric permutation tests. The cytoband enrichment analysis highlighted 44 regions (P≤0.05) enriched with single nucleotide polymorphisms significantly associated with the trait including 23 out of 31 previously confirmed and replicated genes. Importantly, we highlight further 20 novel chromosomal regions carrying approximately one hundred genes/loci with modest association. Amongst these we find compelling functional candidate genes such as MAPT, GRB2 and CREM, LCT, and IL12RB2. Our study suggests a different statistical perspective to discover genes weakly associated with a given trait, although further confirmatory functional studies are needed. Copyright © 2011 Editrice Gastroenterologica Italiana S.r.l. All rights reserved.
Morris, Katrina M; Wright, Belinda; Grueber, Catherine E; Hogg, Carolyn; Belov, Katherine
2015-08-01
The Tasmanian devil (Sarcophilus harrisii) is threatened with extinction due to the spread of devil facial tumour disease. Polymorphisms in immune genes can provide adaptive potential to resist diseases. Previous studies in diversity at immune loci in wild species have almost exclusively focused on genes of the major histocompatibility complex (MHC); however, these genes only account for a fraction of immune gene diversity. Devils lack diversity at functionally important immunity loci, including MHC and Toll-like receptor genes. Whether there are polymorphisms at devil immune genes outside these two families is unknown. Here, we identify polymorphisms in a wide range of key immune genes, and develop assays to type single nucleotide polymorphisms (SNPs) within a subset of these genes. A total of 167 immune genes were examined, including cytokines, chemokines and natural killer cell receptors. Using genome-level data from ten devils, SNPs within coding regions, introns and 10 kb flanking genes of interest were identified. We found low polymorphism across 167 immune genes examined bioinformatically using whole-genome data. From this data, we developed long amplicon assays to target nine genes. These amplicons were sequenced in 29-220 devils and found to contain 78 SNPs, including eight SNPS within exons. Despite the extreme paucity of genetic diversity within these genes, signatures of balancing selection were exhibited by one chemokine gene, suggesting that remaining diversity may hold adaptive potential. The low functional diversity may leave devils highly vulnerable to infectious disease, and therefore, monitoring and preserving remaining diversity will be critical for the long-term management of this species. Examining genetic variation in diverse immune genes should be a priority for threatened wildlife species. This study can act as a model for broad-scale immunogenetic diversity analysis in threatened species. © 2015 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou
2011-11-01
Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.
Senthivel, Vivek Raj; Sturrock, Marc; Piedrafita, Gabriel; Isalan, Mark
2016-12-16
Nonlinear responses to signals are widespread natural phenomena that affect various cellular processes. Nonlinearity can be a desirable characteristic for engineering living organisms because it can lead to more switch-like responses, similar to those underlying the wiring in electronics. Steeper functions are described as ultrasensitive, and can be applied in synthetic biology by using various techniques including receptor decoys, multiple co-operative binding sites, and sequential positive feedbacks. Here, we explore the inherent non-linearity of a biological signaling system to identify functions that can potentially be exploited using cell genome engineering. For this, we performed genome-wide transcription profiling to identify genes with ultrasensitive response functions to Hepatocyte Growth Factor (HGF). We identified 3,527 genes that react to increasing concentrations of HGF, in Madin-Darby canine kidney (MDCK) cells, grown as cysts in 3D collagen cell culture. By fitting a generic Hill function to the dose-responses of these genes we obtained a measure of the ultrasensitivity of HGF-responsive genes, identifying a subset with higher apparent Hill coefficients (e.g. MMP1, TIMP1, SNORD75, SNORD86 and ERRFI1). The regulatory regions of these genes are potential candidates for future engineering of synthetic mammalian gene circuits requiring nonlinear responses to HGF signalling.
Yang, Jialiang; Qiu, Jing; Wang, Kejing; Zhu, Lijuan; Fan, Jingjing; Zheng, Deyin; Meng, Xiaodi; Yang, Jiasheng; Peng, Lihong; Fu, Yu; Zhang, Dahan; Peng, Shouneng; Huang, Haiyun; Zhang, Yi
2017-01-01
Obesity is a primary risk factor for many diseases such as certain cancers. In this study, we have developed three algorithms including a random-walk based method OBNet, a shortest-path based method OBsp and a direct-overlap method OBoverlap, to reveal obesity-disease connections at protein-interaction subnetworks corresponding to thousands of biological functions and pathways. Through literature mining, we also curated an obesity-associated disease list, by which we compared the methods. As a result, OBNet outperforms other two methods. OBNet can predict whether a disease is obesity-related based on its associated genes. Meanwhile, OBNet identifies extensive connections between obesity genes and genes associated with a few diseases at various functional modules and pathways. Using breast cancer and Type 2 diabetes as two examples, OBNet identifies meaningful genes that may play key roles in connecting obesity and the two diseases. For example, TGFB1 and VEGFA are inferred to be the top two key genes mediating obesity-breast cancer connection in modules associated with brain development. Finally, the top modules identified by OBNet in breast cancer significantly overlap with modules identified from TCGA breast cancer gene expression study, revealing the power of OBNet in identifying biological processes involved in the disease. PMID:29156709
Smith, Milo R; Glicksberg, Benjamin S; Li, Li; Chen, Rong; Morishita, Hirofumi; Dudley, Joel T
2018-01-01
High and increasing prevalence of neurodevelopmental disorders place enormous personal and economic burdens on society. Given the growing realization that the roots of neurodevelopmental disorders often lie in early childhood, there is an urgent need to identify childhood risk factors. Neurodevelopment is marked by periods of heightened experience-dependent neuroplasticity wherein neural circuitry is optimized by the environment. If these critical periods are disrupted, development of normal brain function can be permanently altered, leading to neurodevelopmental disorders. Here, we aim to systematically identify human variants in neuroplasticity-related genes that confer risk for neurodevelopmental disorders. Historically, this knowledge has been limited by a lack of techniques to identify genes related to neurodevelopmental plasticity in a high-throughput manner and a lack of methods to systematically identify mutations in these genes that confer risk for neurodevelopmental disorders. Using an integrative genomics approach, we determined loss-of-function (LOF) variants in putative plasticity genes, identified from transcriptional profiles of brain from mice with elevated plasticity, that were associated with neurodevelopmental disorders. From five shared differentially expressed genes found in two mouse models of juvenile-like elevated plasticity (juvenile wild-type or adult Lynx1-/- relative to adult wild-type) that were also genotyped in the Mount Sinai BioMe Biobank we identified multiple associations between LOF genes and increased risk for neurodevelopmental disorders across 10,510 patients linked to the Mount Sinai Electronic Medical Records (EMR), including epilepsy and schizophrenia. This work demonstrates a novel approach to identify neurodevelopmental risk genes and points toward a promising avenue to discover new drug targets to address the unmet therapeutic needs of neurodevelopmental disease.
Qiu, Ying-Hua; Deng, Fei-Yan; Li, Min-Jing; Lei, Shu-Feng
2014-11-01
Type 1 diabetes mellitus is a serious disorder characterized by destruction of pancreatic β-cells, culminating in absolute insulin deficiency. Genetic factors contribute to the susceptibility of type 1 diabetes mellitus. The aim of the present study was to identify more susceptibility genes of type 1 diabetes mellitus. We carried out an initial gene-based genome-wide association study in a total of 4,075 type 1 diabetes mellitus cases and 2,604 controls by using the Gene-based Association Test using Extended Simes procedure. Furthermore, we carried out replication studies, differential expression analysis and functional annotation clustering analysis to support the significance of the identified susceptibility genes. We identified 452 genes associated with type 1 diabetes mellitus, even after adapting the genome-wide threshold for significance (P < 9.05E-04). Among these genes, 171 were newly identified for type 1 diabetes mellitus, which were ignored in single-nucleotide polymorphism-based association analysis and were not previously reported. We found that 53 genes have supportive evidence from replication studies and/or differential expression studies. In particular, seven genes including four non-human leukocyte antigen (HLA) genes (RASIP1, STRN4, BCAR1 and MYL2) are replicated in at least one independent population and also differentially expressed in peripheral blood mononuclear cells or monocytes. Furthermore, the associated genes tend to enrich in immune-related pathways or Gene Ontology project terms. The present results suggest the high power of gene-based association analysis in detecting disease-susceptibility genes. Our findings provide more insights into the genetic basis of type 1 diabetes mellitus.
Mining biological databases for candidate disease genes
NASA Astrophysics Data System (ADS)
Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.
2001-07-01
The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).
Lanktree, Matthew B; Hegele, Robert A
2009-02-26
Despite the recent success of genome-wide association studies (GWASs) in identifying loci consistently associated with coronary artery disease (CAD), a large proportion of the genetic components of CAD and its metabolic risk factors, including plasma lipids, type 2 diabetes and body mass index, remain unattributed. Gene-gene and gene-environment interactions might produce a meaningful improvement in quantification of the genetic determinants of CAD. Testing for gene-gene and gene-environment interactions is thus a new frontier for large-scale GWASs of CAD. There are several anecdotal examples of monogenic susceptibility to CAD in which the phenotype was worsened by an adverse environment. In addition, small-scale candidate gene association studies with functional hypotheses have identified gene-environment interactions. For future evaluation of gene-gene and gene-environment interactions to achieve the same success as the single gene associations reported in recent GWASs, it will be important to pre-specify agreed standards of study design and statistical power, environmental exposure measurement, phenomic characterization and analytical strategies. Here we discuss these issues, particularly in relation to the investigation and potential clinical utility of gene-gene and gene-environment interactions in CAD.
Zhang, Xianglan; Cha, In-Ho; Kim, Ki-Yeol
2017-12-26
In this study, we investigated the consensus gene modules in head and neck cancer (HNC) and cervical cancer (CC). We used a publicly available gene expression dataset, GSE6791, which included 42 HNC, 14 normal head and neck, 20 CC and 8 normal cervical tissue samples. To exclude bias because of different human papilloma virus (HPV) types, we analyzed HPV16-positive samples only. We identified 3824 genes common to HNC and CC samples. Among these, 977 genes showed high connectivity and were used to construct consensus modules. We demonstrated eight consensus gene modules for HNC and CC using the dissimilarity measure and average linkage hierarchical clustering methods. These consensus modules included genes with significant biological functions, including ATP binding and extracellular exosome. Eigengen network analysis revealed the consensus modules were highly preserved with high connectivity. These findings demonstrate that HPV16-positive head and neck and cervical cancers share highly preserved consensus gene modules with common potentially therapeutic targets.
Zhang, Xianglan; Cha, In-Ho; Kim, Ki-Yeol
2017-01-01
In this study, we investigated the consensus gene modules in head and neck cancer (HNC) and cervical cancer (CC). We used a publicly available gene expression dataset, GSE6791, which included 42 HNC, 14 normal head and neck, 20 CC and 8 normal cervical tissue samples. To exclude bias because of different human papilloma virus (HPV) types, we analyzed HPV16-positive samples only. We identified 3824 genes common to HNC and CC samples. Among these, 977 genes showed high connectivity and were used to construct consensus modules. We demonstrated eight consensus gene modules for HNC and CC using the dissimilarity measure and average linkage hierarchical clustering methods. These consensus modules included genes with significant biological functions, including ATP binding and extracellular exosome. Eigengen network analysis revealed the consensus modules were highly preserved with high connectivity. These findings demonstrate that HPV16-positive head and neck and cervical cancers share highly preserved consensus gene modules with common potentially therapeutic targets. PMID:29371966
Tumor gene expression and prognosis in breast cancer patients with 10 or more positive lymph nodes.
Cobleigh, Melody A; Tabesh, Bita; Bitterman, Pincas; Baker, Joffre; Cronin, Maureen; Liu, Mei-Lan; Borchik, Russell; Mosquera, Juan-Miguel; Walker, Michael G; Shak, Steven
2005-12-15
This study, along with two others, was done to develop the 21-gene Recurrence Score assay (Oncotype DX) that was validated in a subsequent independent study and is used to aid decision making about chemotherapy in estrogen receptor (ER)-positive, node-negative breast cancer patients. Patients with >or=10 nodes diagnosed from 1979 to 1999 were identified. RNA was extracted from paraffin blocks, and expression of 203 candidate genes was quantified using reverse transcription-PCR (RT-PCR). Seventy-eight patients were studied. As of August 2002, 77% of patients had distant recurrence or breast cancer death. Univariate Cox analysis of clinical and immunohistochemistry variables indicated that HER2/immunohistochemistry, number of involved nodes, progesterone receptor (PR)/immunohistochemistry (% cells), and ER/immunohistochemistry (% cells) were significantly associated with distant recurrence-free survival (DRFS). Univariate Cox analysis identified 22 genes associated with DRFS. Higher expression correlated with shorter DRFS for the HER2 adaptor GRB7 and the macrophage marker CD68. Higher expression correlated with longer DRFS for tumor protein p53-binding protein 2 (TP53BP2) and the ER axis genes PR and Bcl2. Multivariate methods, including stepwise variable selection and bootstrap resampling of the Cox proportional hazards regression model, identified several genes, including TP53BP2 and Bcl2, as significant predictors of DRFS. Tumor gene expression profiles of archival tissues, some more than 20 years old, provide significant information about risk of distant recurrence even among patients with 10 or more nodes.
Johnson, Keven R; Nicodemus-Johnson, Jessie; Spindler, Mathew J; Carnegie, Graeme K
2015-01-01
In the heart, scaffolding proteins such as A-Kinase Anchoring Proteins (AKAPs) play a crucial role in normal cellular function by serving as a signaling hub for multiple protein kinases including protein kinase D1 (PKD1). Under cardiac hypertrophic conditions AKAP13 anchored PKD1 activates the transcription factor MEF2 leading to subsequent fetal gene activation and hypertrophic response. We used an expression microarray to identify the global transcriptional response in the hearts of wild-type mice expressing the native form of AKAP13 compared to a gene-trap mouse model expressing a truncated form of AKAP13 that is unable to bind PKD1 (AKAP13-ΔPKD1). Microarray analysis showed that AKAP13-ΔPKD1 mice broadly failed to exhibit the transcriptional profile normally associated with compensatory cardiac hypertrophy following trans-aortic constriction (TAC). The identified differentially expressed genes in WT and AKAP13-ΔPKD1 hearts are vital for the compensatory hypertrophic response to pressure-overload and include myofilament, apoptotic, and cell growth/differentiation genes in addition to genes not previously identified as affected by AKAP13-anchored PKD1. Our results show that AKAP13-PKD1 signaling is critical for transcriptional regulation of key contractile, cell death, and metabolic pathways during the development of compensatory hypertrophy in vivo.
Johnson, Keven R.; Nicodemus-Johnson, Jessie; Spindler, Mathew J.
2015-01-01
In the heart, scaffolding proteins such as A-Kinase Anchoring Proteins (AKAPs) play a crucial role in normal cellular function by serving as a signaling hub for multiple protein kinases including protein kinase D1 (PKD1). Under cardiac hypertrophic conditions AKAP13 anchored PKD1 activates the transcription factor MEF2 leading to subsequent fetal gene activation and hypertrophic response. We used an expression microarray to identify the global transcriptional response in the hearts of wild-type mice expressing the native form of AKAP13 compared to a gene-trap mouse model expressing a truncated form of AKAP13 that is unable to bind PKD1 (AKAP13-ΔPKD1). Microarray analysis showed that AKAP13-ΔPKD1 mice broadly failed to exhibit the transcriptional profile normally associated with compensatory cardiac hypertrophy following trans-aortic constriction (TAC). The identified differentially expressed genes in WT and AKAP13-ΔPKD1 hearts are vital for the compensatory hypertrophic response to pressure-overload and include myofilament, apoptotic, and cell growth/differentiation genes in addition to genes not previously identified as affected by AKAP13-anchored PKD1. Our results show that AKAP13-PKD1 signaling is critical for transcriptional regulation of key contractile, cell death, and metabolic pathways during the development of compensatory hypertrophy in vivo. PMID:26192751
SEA: a super-enhancer archive.
Wei, Yanjun; Zhang, Shumei; Shang, Shipeng; Zhang, Bin; Li, Song; Wang, Xinyu; Wang, Fang; Su, Jianzhong; Wu, Qiong; Liu, Hongbo; Zhang, Yan
2016-01-04
Super-enhancers are large clusters of transcriptional enhancers regarded as having essential roles in driving the expression of genes that control cell identity during development and tumorigenesis. The construction of a genome-wide super-enhancer database is urgently needed to better understand super-enhancer-directed gene expression regulation for a given biology process. Here, we present a specifically designed web-accessible database, Super-Enhancer Archive (SEA, http://sea.edbc.org). SEA focuses on integrating super-enhancers in multiple species and annotating their potential roles in the regulation of cell identity gene expression. The current release of SEA incorporates 83 996 super-enhancers computationally or experimentally identified in 134 cell types/tissues/diseases, including human (75 439, three of which were experimentally identified), mouse (5879, five of which were experimentally identified), Drosophila melanogaster (1774) and Caenorhabditis elegans (904). To facilitate data extraction, SEA supports multiple search options, including species, genome location, gene name, cell type/tissue and super-enhancer name. The response provides detailed (epi)genetic information, incorporating cell type specificity, nearby genes, transcriptional factor binding sites, CRISPR/Cas9 target sites, evolutionary conservation, SNPs, H3K27ac, DNA methylation, gene expression and TF ChIP-seq data. Moreover, analytical tools and a genome browser were developed for users to explore super-enhancers and their roles in defining cell identity and disease processes in depth. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Zhang, Jin; Wang, Bing; Dong, Shuanglin; Cao, Depan; Dong, Junfeng; Walker, William B.; Liu, Yang; Wang, Guirong
2015-01-01
To better understand the olfactory mechanisms in the two lepidopteran pest model species, the Helicoverpa armigera and H. assulta, we conducted transcriptome analysis of the adult antennae using Illumina sequencing technology and compared the chemosensory genes between these two related species. Combined with the chemosensory genes we had identified previously in H. armigera by 454 sequencing, we identified 133 putative chemosensory unigenes in H. armigera including 60 odorant receptors (ORs), 19 ionotropic receptors (IRs), 34 odorant binding proteins (OBPs), 18 chemosensory proteins (CSPs), and 2 sensory neuron membrane proteins (SNMPs). Consistent with these results, 131 putative chemosensory genes including 64 ORs, 19 IRs, 29 OBPs, 17 CSPs, and 2 SNMPs were identified through male and female antennal transcriptome analysis in H. assulta. Reverse Transcription-PCR (RT-PCR) was conducted in H. assulta to examine the accuracy of the assembly and annotation of the transcriptome and the expression profile of these unigenes in different tissues. Most of the ORs, IRs and OBPs were enriched in adult antennae, while almost all the CSPs were expressed in antennae as well as legs. We compared the differences of the chemosensory genes between these two species in detail. Our work will surely provide valuable information for further functional studies of pheromones and host volatile recognition genes in these two related species. PMID:25659090
Guerra, Susana; López-Fernández, Luis A.; Conde, Raquel; Pascual-Montano, Alberto; Harshman, Keith; Esteban, Mariano
2004-01-01
The potential use of the modified vaccinia virus Ankara (MVA) strain as a live recombinant vector to deliver antigens and elicit protective immune responses against infectious diseases demands a comprehensive understanding of the effect of MVA infection on human host gene expression. We used microarrays containing more than 15,000 human cDNAs to identify gene expression changes in human HeLa cell cultures at 2, 6, and 16 h postinfection. Clustering of the 410 differentially regulated genes identified 11 discrete gene clusters with altered expression patterns after MVA infection. Clusters 1 and 2 (accounting for 16.59% [68 of 410] of the genes) contained 68 transcripts showing a robust induction pattern that was maintained during the course of infection. Changes in cellular gene transcription detected by microarrays after MVA infection were confirmed for selected genes by Northern blot analysis and by real-time reverse transcription-PCR. Upregulated transcripts in clusters 1 and 2 included 20 genes implicated in immune responses, including interleukin 1A (IL-1A), IL-6, IL-7, IL-8, and IL-15 genes. MVA infection also stimulated the expression of NF-κB and components of the NF-κB signal transduction pathway, including p50 and TRAF-interacting protein. A marked increase in the expression of histone family members was also induced during MVA infection. Expression of the Wiskott-Aldrich syndrome family members WAS, WASF1, and the small GTP-binding protein RAC-1, which are involved in actin cytoskeleton reorganization, was enhanced after MVA infection. This study demonstrates that MVA infection triggered the induction of groups of genes, some of which may be involved in host resistance and immune modulation during virus infection. PMID:15140980
Is There a Genetic Predisposition to Anterior Cruciate Ligament Tear? A Systematic Review.
John, Rakesh; Dhillon, Mandeep Singh; Sharma, Siddhartha; Prabhakar, Sharad; Bhandari, Mohit
2016-12-01
Injuries to the anterior cruciate ligament (ACL) are among the most common knee ligament injuries and frequently warrant reconstruction. The etiopathogenesis of these injuries has focused mainly on mechanism of trauma, patient sex, and anatomic factors as predisposing causes. Several genetic factors that could predispose to an ACL tear have recently been reported. This systematic review summarizes the current evidence for a genetic predisposition to ACL tears. The principal research question was to identify genetic factors, based on the available literature, that could predispose an individual to an ACL tear. Systematic review. The PubMed, EMBASE, Cochrane, and HuGE databases were searched; the search was run from the period of inception until June 21, 2015. A secondary search was performed by screening the references of full-text articles obtained and by manually searching selected journals. Articles were screened with prespecified inclusion criteria. The quality of studies included in the review was assessed for risk of bias by 2 reviewers using the Newcastle-Ottawa Scale. A total of 994 records were identified by the search, out of which 17 studies (16 case-control studies and 1 cross-sectional study) were included in the final review. Two studies observed a familial predisposition to an ACL tear. Fourteen studies looked at specific gene polymorphisms in 20 genes, from which different polymorphisms in 10 genes were positively associated with an ACL tear. In addition to these polymorphisms, 8 haplotypes were associated with ACL tear. One study looked at gene expression analysis. Although specific gene polymorphisms and haplotypes have been identified, it is difficult to come to a conclusion on the basis of the existing literature. Several sources of bias have been identified in these studies, and the results cannot be extrapolated to the general population. More studies are needed in larger populations of different ethnicities. Gene-gene interactions and gene expression studies in the future may delineate the exact role of these gene polymorphisms in ACL tears. © 2016 The Author(s).
The human cumulus--oocyte complex gene-expression profile
Assou, Said; Anahory, Tal; Pantesco, Véronique; Le Carrour, Tanguy; Pellestor, Franck; Klein, Bernard; Reyftmann, Lionel; Dechaud, Hervé; De Vos, John; Hamamah, Samir
2006-01-01
BACKGROUND The understanding of the mechanisms regulating human oocyte maturation is still rudimentary. We have identified transcripts differentially expressed between immature and mature oocytes, and cumulus cells. METHODS Using oligonucleotides microarrays, genome wide gene expression was studied in pooled immature and mature oocytes or cumulus cells from patients who underwent IVF. RESULTS In addition to known genes such as DAZL, BMP15 or GDF9, oocytes upregulated 1514 genes. We show that PTTG3 and AURKC are respectively the securin and the Aurora kinase preferentially expressed during oocyte meiosis. Strikingly, oocytes overexpressed previously unreported growth factors such as TNFSF13/APRIL, FGF9, FGF14, and IL4, and transcription factors including OTX2, SOX15 and SOX30. Conversely, cumulus cells, in addition to known genes such as LHCGR or BMPR2, overexpressed cell-tocell signaling genes including TNFSF11/RANKL, numerous complement components, semaphorins (SEMA3A, SEMA6A, SEMA6D) and CD genes such as CD200. We also identified 52 genes progressively increasing during oocyte maturation, comprising CDC25A and SOCS7. CONCLUSION The identification of genes up and down regulated during oocyte maturation greatly improves our understanding of oocyte biology and will provide new markers that signal viable and competent oocytes. Furthermore, genes found expressed in cumulus cells are potential markers of granulosa cell tumors. PMID:16571642
Sharma, Akshay; Easow Mathew, Manu; Sriganesh, Vasumathi; Neely, Jessica A; Kalipatnapu, Sasank
2014-11-14
Haemophilia is a genetic disorder which is characterized by spontaneous or provoked, often uncontrolled, bleeding into joints, muscles and other soft tissues. Current methods of treatment are expensive, challenging and involve regular administration of clotting factors. Gene therapy has recently been prompted as a curative treatment modality. To evaluate the safety and efficacy of gene therapy for treating people with haemophilia A or B. We searched the Cochrane Cystic Fibrosis & Genetic Disorders Group's Coagulopathies Trials Register, compiled from electronic database searches and handsearching of journals and conference abstract books. We also searched the reference lists of relevant articles and reviews.Date of last search: 06 November 2014. Eligible trials included randomised or quasi-randomised clinical trials, including controlled clinical trials comparing gene therapy (with or without standard treatment) with standard treatment (factor replacement) or other 'curative' treatment such as stem cell transplantation individuals with haemophilia A or B of all ages who do not have inhibitors to factor VIII or IX. No trials of gene therapy for haemophilia were found. No trials of gene therapy for haemophilia were identified. No randomised or quasi-randomised clinical trials of gene therapy for haemophilia were identified. Thus, we are unable to determine the effects of gene therapy for haemophilia. Gene therapy for haemophilia is still in its nascent stages and there is a need for well-designed clinical trials to assess the long-term feasibility, success and risks of gene therapy for people with haemophilia.
Gonzalez, S M; Ferland, L H; Robert, B; Abdelhay, E
1998-06-01
Vertebrate Msx genes are related to one of the most divergent homeobox genes of Drosophila, the muscle segment homeobox (msh) gene, and are expressed in a well-defined pattern at sites of tissue interactions. This pattern of expression is conserved in vertebrates as diverse as quail, zebrafish, and mouse in a range of sites including neural crest, appendages, and craniofacial structures. In the present work, we performed structural and functional analyses in order to identify potential cis-acting elements that may be regulating Msx1 gene expression. To this end, a 4.9-kb segment of the 5'-flanking region was sequenced and analyzed for transcription-factor binding sites. Four regions showing a high concentration of these sites were identified. Transfection assays with fragments of regulatory sequences driving the expression of the bacterial lacZ reporter gene showed that a region of 4 kb upstream of the transcription start site contains positive and negative elements responsible for controlling gene expression. Interestingly, a fragment of 130 bp seems to contain the minimal elements necessary for gene expression, as its removal completely abolishes gene expression in cultured cells. These results are reinforced by comparison of this region with the human Msx1 gene promoter, which shows extensive conservation, including many consensus binding sites, suggesting a regulatory role for them.
Lee, Ann-Ying; Chen, Chun-Yi; Chang, Yao-Chien Alex; Chao, Ya-Ting; Shih, Ming-Che
2013-01-01
Previously we developed genomic resources for orchids, including transcriptomic analyses using next-generation sequencing techniques and construction of a web-based orchid genomic database. Here, we report a modified molecular model of flower development in the Orchidaceae based on functional analysis of gene expression profiles in Phalaenopsis aphrodite (a moth orchid) that revealed novel roles for the transcription factors involved in floral organ pattern formation. Phalaenopsis orchid floral organ-specific genes were identified by microarray analysis. Several critical transcription factors including AP3, PI, AP1 and AGL6, displayed distinct spatial distribution patterns. Phylogenetic analysis of orchid MADS box genes was conducted to infer the evolutionary relationship among floral organ-specific genes. The results suggest that gene duplication MADS box genes in orchid may have resulted in their gaining novel functions during evolution. Based on these analyses, a modified model of orchid flowering was proposed. Comparison of the expression profiles of flowers of a peloric mutant and wild-type Phalaenopsis orchid further identified genes associated with lip morphology and peloric effects. Large scale investigation of gene expression profiles revealed that homeotic genes from the ABCDE model of flower development classes A and B in the Phalaenopsis orchid have novel functions due to evolutionary diversification, and display differential expression patterns. PMID:24265826
Identifying novel genetic determinants of hemostatic balance.
Ginsburg, D
2005-08-01
Incomplete penetrance and variable expressivity confound the diagnosis and therapy of most inherited thrombotic and hemorrhagic disorders. For many of these diseases, some or most of this variability is determined by genetic modifiers distinct from the primary disease gene itself. Clues toward identifying such modifier genes may come from studying rare Mendelian disorders of hemostasis. Examples include identification of the cause of combined factor V and VIII deficiency as mutations in the ER Golgi intermediate compartment proteins LMAN1 and MCFD2. These proteins form a cargo receptor that facilitates the transport of factors V and VIII, and presumably other proteins, from the ER to the Golgi. A similar positional cloning approach identified ADAMTS-13 as the gene responsible for familial TTP. Along with the work of many other groups, these findings identified VWF proteolysis by ADAMTS-13 as a key regulatory pathway for hemostasis. Recent advances in mouse genetics also provide powerful tools for the identification of novel genes contributing to hemostatic balance. Genetic studies of inbred mouse lines with unusually high and unusually low plasma VWF levels identified polymorphic variation in the expression of a glycosyltransferase gene, Galgt2, as an important determinant of plasma VWF levels in the mouse. Ongoing studies in mice genetically engineered to carry the factor V Leiden mutation may similarly identify novel genes contributing to thrombosis risk in humans.
Exploring of the molecular mechanism of rhinitis via bioinformatics methods
Song, Yufen; Yan, Zhaohui
2018-01-01
The aim of this study was to analyze gene expression profiles for exploring the function and regulatory network of differentially expressed genes (DEGs) in pathogenesis of rhinitis by a bioinformatics method. The gene expression profile of GSE43523 was downloaded from the Gene Expression Omnibus database. The dataset contained 7 seasonal allergic rhinitis samples and 5 non-allergic normal samples. DEGs between rhinitis samples and normal samples were identified via the limma package of R. The webGestal database was used to identify enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of the DEGs. The differentially co-expressed pairs of the DEGs were identified via the DCGL package in R, and the differential co-expression network was constructed based on these pairs. A protein-protein interaction (PPI) network of the DEGs was constructed based on the Search Tool for the Retrieval of Interacting Genes database. A total of 263 DEGs were identified in rhinitis samples compared with normal samples, including 125 downregulated ones and 138 upregulated ones. The DEGs were enriched in 7 KEGG pathways. 308 differential co-expression gene pairs were obtained. A differential co-expression network was constructed, containing 212 nodes. In total, 148 PPI pairs of the DEGs were identified, and a PPI network was constructed based on these pairs. Bioinformatics methods could help us identify significant genes and pathways related to the pathogenesis of rhinitis. Steroid biosynthesis pathway and metabolic pathways might play important roles in the development of allergic rhinitis (AR). Genes such as CDC42 effector protein 5, solute carrier family 39 member A11 and PR/SET domain 10 might be also associated with the pathogenesis of AR, which provided references for the molecular mechanisms of AR. PMID:29257233
Genome-scale analysis of positionally relocated genes
Bhutkar, Arjun; Russo, Susan M.; Smith, Temple F.; Gelbart, William M.
2007-01-01
During evolution, genome reorganization includes large-scale events such as inversions, translocations, and segmental or even whole-genome duplications, as well as fine-scale events such as the relocation of individual genes. This latter category, which we will refer to as positionally relocated genes (PRGs), is the subject of this report. Assessment of the magnitude of such PRGs and of possible contributing mechanisms is aided by a comparative analysis of related genomes, where conserved chromosomal organization can aid in identifying genes that have acquired a new location in a lineage of these genomes. Here we utilize two methods to comprehensively identify relocated protein-coding genes in the recently sequenced genomes of 12 species of genus Drosophila. We use exceptions to the general rule of maintenance of chromosome arm (Muller element) association for most Drosophila genes to identify one major class of PRGs. We also identify a partially overlapping set of PRGs among “embedded genes,” located within the extents of other surrounding genes. We provide evidence that PRG movements have at least two different origins: Some events occur via retrotransposition of processed RNAs and others via a DNA-based transposition mechanism. Overall, we identify several hundred PRGs that arose within a lineage of the genus Drosophila phylogeny and provide suggestive evidence that a few thousand such events have occurred within the radiation of the insect order Diptera, thereby illustrating the magnitude of the contribution of PRG movement to chromosomal reorganization during evolution. PMID:17989252
Identification of olfactory receptor genes in the Japanese grenadier anchovy Coilia nasus.
Zhu, Guoli; Wang, Liangjiang; Tang, Wenqiao; Wang, Xiaomei; Wang, Cong
2017-01-01
Olfaction is essential for fish to detect odorant elements in the environment and plays a critical role in navigating, locating food and detecting predators. Olfactory function is produced by the olfactory transduction pathway and is activated by olfactory receptors (ORs) through the binding of odorant elements. Recently, four types of olfactory receptors have been identified in vertebrate olfactory epithelium, including main odorant receptors (MORs), vomeronasal type receptors (VRs), trace-amine associated receptors (TAARs) and formyl peptide receptors (FPRs). It has been hypothesized that migratory fish, which have the ability to perform spawning migration, use olfactory cues to return to natal rivers. Therefore, obtaining OR genes from migratory fish will provide a resource for the study of molecular mechanisms that underlie fish spawning migration behaviors. Previous studies of OR genes have mainly focused on genomic data, however little information has been gained at the transcript level. In this study, we identified the OR genes of an economically important commercial fish Coilia nasus through searching for olfactory epithelium transcriptomes. A total of 142 candidate MOR, 52 V2R/OlfC, 32 TAAR and two FPR putative genes were identified. In addition, through genomic analysis we identified several MOR genes containing introns, which is unusual for vertebrate MOR genes. The transcriptome-scale mining strategy proved to be fruitful in identifying large sets of OR genes from species whose genome information is unavailable. Our findings lay the foundation for further research into the possible molecular mechanisms underlying the spawning migration behavior in C. nasus .
USDA-ARS?s Scientific Manuscript database
Background: Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to deve...
SNP identification, genetic mapping and tissue expression of the rainbow trout TLR9 gene
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) in Toll-like receptor (TLR) genes have been reported to be associated with disease resistance in human and livestock. A number of TLR genes have been identified in rainbow trout including TLR2, TLR3, TLR5, TLR20, TLR22 and TLR23. The rainbow trout (Oncorhynch...
USDA-ARS?s Scientific Manuscript database
Asian longhorned beetle (ALB), Anoplophora glabripennis, is a serious invasive forest pest in several countries including the United States, Canada, and Europe. RNA interference (RNAi)technology is being developed as a novel method for pest management. Here, we identified the ALB core RNAi genes in...
Diverse growth hormone receptor gene mutations in Laron syndrome.
Berg, M A; Argente, J; Chernausek, S; Gracia, R; Guevara-Aguirre, J; Hopp, M; Pérez-Jurado, L; Rosenbloom, A; Toledo, S P; Francke, U
1993-01-01
To better understand the molecular genetic basis and genetic epidemiology of Laron syndrome (growth-hormone insensitivity syndrome), we analyzed the growth-hormone receptor (GHR) genes of seven unrelated affected individuals from the United States, South America, Europe, and Africa. We amplified all nine GHR gene exons and splice junctions from these individuals by PCR and screened the products for mutations by using denaturing gradient gel electrophoresis (DGGE). We identified a single GHR gene fragment with abnormal DGGE results for each affected individual, sequenced this fragment, and, in each case, identified a mutation likely to cause Laron syndrome, including two nonsense mutations (R43X and R217X), two splice-junction mutations, (189-1 G to T and 71 + 1 G to A), and two frameshift mutations (46 del TT and 230 del TA or AT). Only one of these mutations, R43X, has been previously reported. Using haplotype analysis, we determined that this mutation, which involves a CpG dinucleotide hot spot, likely arose as a separate event in this case, relative to the two prior reports of R43X. Aside from R43X, the mutations we identified are unique to patients from particular geographic regions. Ten GHR gene mutations have now been described in this disorder. We conclude that Laron syndrome is caused by diverse GHR gene mutations, including deletions, RNA processing defects, translational stop codons, and missense codons. All the identified mutations involve the extracellular domain of the receptor, and most are unique to particular families or geographic areas. Images Figure 1 Figure 2 PMID:8488849
Choi, Hoseong; Jo, Yeonhwa; Lian, Sen; Jo, Kyoung-Min; Chu, Hyosub; Yoon, Ju-Yeon; Choi, Seung-Kook; Kim, Kook-Hyung; Cho, Won Kyong
2015-06-01
The chrysanthemum is one of popular flowers in the world and a host for several viruses. So far, molecular interaction studies between the chrysanthemum and viruses are limited. In this study, we carried out a transcriptome analysis of chrysanthemum in response to three different viruses including Cucumber mosaic virus (CMV), Tomato spotted wilt virus (TSWV) and Potato virus X (PVX). A chrysanthemum 135K microarray derived from expressed sequence tags was successfully applied for the expression profiles of the chrysanthemum at early stage of virus infection. Finally, we identified a total of 125, 70 and 124 differentially expressed genes (DEGs) for CMV, TSWV and PVX, respectively. Many DEGs were virus specific; however, 33 DEGs were commonly regulated by three viruses. Gene ontology (GO) enrichment analysis identified a total of 132 GO terms, and of them, six GO terms related stress response and MCM complex were commonly identified for three viruses. Several genes functioning in stress response such as chitin response and ethylene mediated signaling pathway were up-regulated indicating their involvement in establishment of host immune system. In particular, TSWV infection significantly down-regulated genes related to DNA metabolic process including DNA replication, chromatin organization, histone modification and cytokinesis, and they are mostly targeted to nucleosome and MCM complex. Taken together, our comparative transcriptome analysis revealed several genes related to hormone mediated viral stress response and DNA modification. The identified chrysanthemums genes could be good candidates for further functional study associated with resistant to various plant viruses.
Rohde, Palle Duun; Gaertner, Bryn; Ward, Kirsty; Sørensen, Peter; Mackay, Trudy F C
2017-08-01
Human psychiatric disorders such as schizophrenia, bipolar disorder, and attention-deficit/hyperactivity disorder often include adverse behaviors including increased aggressiveness. Individuals with psychiatric disorders often exhibit social withdrawal, which can further increase the probability of conducting a violent act. Here, we used the inbred, sequenced lines of the Drosophila Genetic Reference Panel (DGRP) to investigate the genetic basis of variation in male aggressive behavior for flies reared in a socialized and socially isolated environment. We identified genetic variation for aggressive behavior, as well as significant genotype-by-social environmental interaction (GSEI); i.e. , variation among DGRP genotypes in the degree to which social isolation affected aggression. We performed genome-wide association (GWA) analyses to identify genetic variants associated with aggression within each environment. We used genomic prediction to partition genetic variants into gene ontology (GO) terms and constituent genes, and identified GO terms and genes with high prediction accuracies in both social environments and for GSEI. The top predictive GO terms significantly increased the proportion of variance explained, compared to prediction models based on all segregating variants. We performed genomic prediction across environments, and identified genes in common between the social environments that turned out to be enriched for genome-wide associated variants. A large proportion of the associated genes have previously been associated with aggressive behavior in Drosophila and mice. Further, many of these genes have human orthologs that have been associated with neurological disorders, indicating partially shared genetic mechanisms underlying aggression in animal models and human psychiatric disorders. Copyright © 2017 by the Genetics Society of America.
Harshavardhan Doddapaneni; Venkataramanan Subramanian; Bolei Fu; Dan Cullen
2013-01-01
The oxidative enzymatic machinery for degradation of organic substrates in Agaricus bisporus (Ab) is at the core of the carbon recycling mechanisms in this fungus. To date, 156 genes have been tentatively identified as part of this oxidative enzymatic machinery, which includes 26 peroxidase encoding genes, nine copper radical oxidase [including three...
Fine mapping of regulatory loci for mammalian gene expression using radiation hybrids
Park, Christopher C; Ahn, Sangtae; Bloom, Joshua S; Lin, Andy; Wang, Richard T; Wu, Tongtong; Sekar, Aswin; Khan, Arshad H; Farr, Christine J; Lusis, Aldons J; Leahy, Richard M; Lange, Kenneth; Smith, Desmond J
2010-01-01
We mapped regulatory loci for nearly all protein-coding genes in mammals using comparative genomic hybridization and expression array measurements from a panel of mouse–hamster radiation hybrid cell lines. The large number of breaks in the mouse chromosomes and the dense genotyping of the panel allowed extremely sharp mapping of loci. As the regulatory loci result from extra gene dosage, we call them copy number expression quantitative trait loci, or ceQTLs. The −2log10P support interval for the ceQTLs was <150 kb, containing an average of <2–3 genes. We identified 29,769 trans ceQTLs with −log10P > 4, including 13 hotspots each regulating >100 genes in trans. Further, this work identifies 2,761 trans ceQTLs harboring no known genes, and provides evidence for a mode of gene expression autoregulation specific to the X chromosome. PMID:18362883
Warburton, Marilyn L; Williams, William Paul; Hawkins, Leigh; Bridges, Susan; Gresham, Cathy; Harper, Jonathan; Ozkan, Seval; Mylroie, J Erik; Shan, Xueyan
2011-07-01
A public candidate gene testing pipeline for resistance to aflatoxin accumulation or Aspergillus flavus infection in maize is presented here. The pipeline consists of steps for identifying, testing, and verifying the association of selected maize gene sequences with resistance under field conditions. Resources include a database of genetic and protein sequences associated with the reduction in aflatoxin contamination from previous studies; eight diverse inbred maize lines for polymorphism identification within any maize gene sequence; four Quantitative Trait Loci (QTL) mapping populations and one association mapping panel, all phenotyped for aflatoxin accumulation resistance and associated phenotypes; and capacity for Insertion/Deletion (InDel) and SNP genotyping in the population(s) for mapping. To date, ten genes have been identified as possible candidate genes and put through the candidate gene testing pipeline, and results are presented here to demonstrate the utility of the pipeline.
Hao, Xinyuan; Horvath, David P.; Chao, Wun S.; Yang, Yajun; Wang, Xinchao; Xiao, Bin
2014-01-01
Reliable reference selection for the accurate quantification of gene expression under various experimental conditions is a crucial step in qRT-PCR normalization. To date, only a few housekeeping genes have been identified and used as reference genes in tea plant. The validity of those reference genes are not clear since their expression stabilities have not been rigorously examined. To identify more appropriate reference genes for qRT-PCR studies on tea plant, we examined the expression stability of 11 candidate reference genes from three different sources: the orthologs of Arabidopsis traditional reference genes and stably expressed genes identified from whole-genome GeneChip studies, together with three housekeeping gene commonly used in tea plant research. We evaluated the transcript levels of these genes in 94 experimental samples. The expression stabilities of these 11 genes were ranked using four different computation programs including geNorm, Normfinder, BestKeeper, and the comparative ∆CT method. Results showed that the three commonly used housekeeping genes of CsTUBULIN1, CsACINT1 and Cs18S rRNA1 together with CsUBQ1 were the most unstable genes in all sample ranking order. However, CsPTB1, CsEF1, CsSAND1, CsCLATHRIN1 and CsUBC1 were the top five appropriate reference genes for qRT-PCR analysis in complex experimental conditions. PMID:25474086
Gouré, Julien; Findlay, Wendy A; Deslandes, Vincent; Bouevitch, Anne; Foote, Simon J; MacInnes, Janet I; Coulton, James W; Nash, John HE; Jacques, Mario
2009-01-01
Background Actinobacillus pleuropneumoniae, the causative agent of porcine pleuropneumonia, is a highly contagious respiratory pathogen that causes severe losses to the swine industry worldwide. Current commercially-available vaccines are of limited value because they do not induce cross-serovar immunity and do not prevent development of the carrier state. Microarray-based comparative genomic hybridizations (M-CGH) were used to estimate whole genomic diversity of representative Actinobacillus pleuropneumoniae strains. Our goal was to identify conserved genes, especially those predicted to encode outer membrane proteins and lipoproteins because of their potential for the development of more effective vaccines. Results Using hierarchical clustering, our M-CGH results showed that the majority of the genes in the genome of the serovar 5 A. pleuropneumoniae L20 strain were conserved in the reference strains of all 15 serovars and in representative field isolates. Fifty-eight conserved genes predicted to encode for outer membrane proteins or lipoproteins were identified. As well, there were several clusters of diverged or absent genes including those associated with capsule biosynthesis, toxin production as well as genes typically associated with mobile elements. Conclusion Although A. pleuropneumoniae strains are essentially clonal, M-CGH analysis of the reference strains of the fifteen serovars and representative field isolates revealed several classes of genes that were divergent or absent. Not surprisingly, these included genes associated with capsule biosynthesis as the capsule is associated with sero-specificity. Several of the conserved genes were identified as candidates for vaccine development, and we conclude that M-CGH is a valuable tool for reverse vaccinology. PMID:19239696
Shea, Patrick R; Virtaneva, Kimmo; Kupko, John J; Porcella, Stephen F; Barry, William T; Wright, Fred A; Kobayashi, Scott D; Carmody, Aaron; Ireland, Robin M; Sturdevant, Daniel E; Ricklefs, Stacy M; Babar, Imran; Johnson, Claire A; Graham, Morag R; Gardner, Donald J; Bailey, John R; Parnell, Michael J; Deleo, Frank R; Musser, James M
2010-03-09
Relatively little is understood about the dynamics of global host-pathogen transcriptome changes that occur during bacterial infection of mucosal surfaces. To test the hypothesis that group A Streptococcus (GAS) infection of the oropharynx provokes a distinct host transcriptome response, we performed genome-wide transcriptome analysis using a nonhuman primate model of experimental pharyngitis. We also identified host and pathogen biological processes and individual host and pathogen gene pairs with correlated patterns of expression, suggesting interaction. For this study, 509 host genes and seven biological pathways were differentially expressed throughout the entire 32-day infection cycle. GAS infection produced an initial widespread significant decrease in expression of many host genes, including those involved in cytokine production, vesicle formation, metabolism, and signal transduction. This repression lasted until day 4, at which time a large increase in expression of host genes was observed, including those involved in protein translation, antigen presentation, and GTP-mediated signaling. The interactome analysis identified 73 host and pathogen gene pairs with correlated expression levels. We discovered significant correlations between transcripts of GAS genes involved in hyaluronic capsule production and host endocytic vesicle formation, GAS GTPases and host fibrinolytic genes, and GAS response to interaction with neutrophils. We also identified a strong signal, suggesting interaction between host gammadelta T cells and genes in the GAS mevalonic acid synthesis pathway responsible for production of isopentenyl-pyrophosphate, a short-chain phospholipid that stimulates these T cells. Taken together, our results are unique in providing a comprehensive understanding of the host-pathogen interactome during mucosal infection by a bacterial pathogen.
Zhang, Xinxin; Ma, Dehua; Zou, Wei; Ding, Yibing; Zhu, Chengchu; Min, Haiyan; Zhang, Bin; Wang, Wei; Chen, Baofu; Ye, Minhua; Cai, Minghui; Pan, Yanqing; Cao, Lei; Wan, Yueming; Jin, Yu; Gao, Qian; Yi, Long
2016-05-27
Primary spontaneous pneumothorax (PSP) or pulmonary cysts is one of the manifestations of Birt-Hogg-Dube syndrome (BHDS) that is caused by heterozygous mutations in FLCN gene. Most of the mutations are SNVs and small indels, and there are also approximately 10 % large intragenic deletions and duplications of the mutations. These molecular findings are generally obtained by disparate methods including Sanger sequencing and Multiple Ligation-dependent Probe Amplification in the clinical laboratory. In addition, as a genetically heterogeneous disorder, PSP may be caused by mutations in multiple genes include FBN1, COL3A1, CBS, SERPINA1 and TSC1/TSC2 genes. For differential diagnosis, these genes should also be screened which makes the diagnostic procedure more time-consuming and labor-intensive. Forty PSP patients were divided into 2 groups. Nineteen patients with different pathogenic mutations of FLCN previously identified by conventional Sanger sequencing and MLPA were included in test group, 21 random PSP patients without any genetic screening were included in blinded sample group. 7 PSP genes including FLCN, FBN1, COL3A1, CBS, SERPINA1 and TSC1/TSC2 were designed and enriched by Haloplex system, sequenced on a Miseq platform and analyzed in the 40 patients to evaluate the performance of the targeted-NGS method. We demonstrated that the full spectrum of genes associated with pneumothorax including FLCN gene mutations can be identified simultaneously in multiplexed sequence data. Noteworthy, by our in-house copy number analysis of the sequence data, we could not only detect intragenic deletions, but also determine approximate deletion junctions simultaneously. NGS based Haloplex target enrichment technology is proved to be a rapid and cost-effective screening strategy for the comprehensive molecular diagnosis of BHDS in PSP patients, as it can replace Sanger sequencing and MLPA by simultaneously detecting exonic and intronic SNVs, small indels, large intragenic deletions and determining deletion junctions in PSP-related genes.
Wang, Yimin; Du, Xiaonan; Bin, Rao; Yu, Shanshan; Xia, Zhezhi; Zheng, Guo; Zhong, Jianmin; Zhang, Yunjian; Jiang, Yong-hui; Wang, Yi
2017-01-01
Genetic factors play a major role in the etiology of epilepsy disorders. Recent genomics studies using next generation sequencing (NGS) technique have identified a large number of genetic variants including copy number (CNV) and single nucleotide variant (SNV) in a small set of genes from individuals with epilepsy. These discoveries have contributed significantly to evaluate the etiology of epilepsy in clinic and lay the foundation to develop molecular specific treatment. However, the molecular basis for a majority of epilepsy patients remains elusive, and furthermore, most of these studies have been conducted in Caucasian children. Here we conducted a targeted exome-sequencing of 63 trios of Chinese epilepsy families using a custom-designed NGS panel that covers 412 known and candidate genes for epilepsy. We identified pathogenic and likely pathogenic variants in 15 of 63 (23.8%) families in known epilepsy genes including SCN1A, CDKL5, STXBP1, CHD2, SCN3A, SCN9A, TSC2, MBD5, POLG and EFHC1. More importantly, we identified likely pathologic variants in several novel candidate genes such as GABRE, MYH1, and CLCN6. Our results provide the evidence supporting the application of custom-designed NGS panel in clinic and indicate a conserved genetic susceptibility for epilepsy between Chinese and Caucasian children. PMID:28074849
Transcriptional profiling of the parr–smolt transformation in Atlantic salmon
Robertson, Laura S.; McCormick, Stephen D.
2012-01-01
The parr–smolt transformation in Atlantic salmon (Salmo salar) is a complex developmental process that culminates in the ability to migrate to and live in seawater. We used GRASP 16K cDNA microarrays to identify genes that are differentially expressed in the liver, gill, hypothalamus, pituitary, and olfactory rosettes of smolts compared to parr. Smolts had higher levels of gill Na+/K+-ATPase activity, plasma cortisol and plasma thyroid hormones relative to parr. Across all five tissues, stringent microarray analyses identified 48 features that were differentially expressed in smolts compared to parr. Using a less stringent method we found 477 features that were differentially expressed at least 1.2-fold in smolts, including 172 features in the gill. Smolts had higher mRNA levels of genes involved in transcription, protein biosynthesis and folding, electron transport, oxygen transport, and sensory perception and lower mRNA levels for genes involved in proteolysis. Quantitative RT-PCR was used to confirm differential expression in select genes identified by microarray analyses and to quantify expression of other genes known to be involved in smolting. This study expands our understanding of the molecular processes that underlie smolting in Atlantic salmon and identifies genes for further investigation.
Cracking the genomic piggy bank: identifying secrets of the pig genome.
Mote, B E; Rothschild, M F
2006-01-01
Though researchers are uncovering valuable information about the pig genome at unprecedented speed, the porcine genome community is barely scratching the surface as to understanding interactions of the biological code. The pig genetic linkage map has nearly 5,000 loci comprised of genes, microsatellites, and amplified fragment length polymorphism markers. Likewise, the physical map is becoming denser with nearly 6,000 markers. The long awaited sequencing efforts are providing multidimensional benefits with sequence available for comparative genomics and identifying single nucleotide polymorphisms for use in linkage and trait association studies. Scientists are using exotic and commercial breeds for quantitative trait loci scans. Additionally, candidate gene studies continue to identify chromosomal regions or genes associated with economically important traits such as growth rate, leanness, feed intake, meat quality, litter size, and disease resistance. The commercial pig industry is actively incorporating these markers in marker-assisted selection along with traditional performance information to improve said traits. Researchers are utilizing novel tools including pig microarrays along with advanced bioinformatics to identify new candidate genes, understand gene function, and piece together gene networks involved in important biological processes. Advances in pig genomics and implications to the pork industry as well as human health are reviewed.
Identification of Susceptibility Loci and Genes for Colorectal Cancer Risk
Zeng, Chenjie; Matsuda, Koichi; Jia, Wei-Hua; Chang, Jiang; Kweon, Sun-Seog; Xiang, Yong-Bing; Shin, Aesun; Jee, Sun Ha; Kim, Dong-Hyun; Zhang, Ben; Cai, Qiuyin; Guo, Xingyi; Long, Jirong; Wang, Nan; Courtney, Regina; Pan, Zhi-Zhong; Wu, Chen; Takahashi, Atsushi; Shin, Min-Ho; Matsuo, Keitaro; Matsuda, Fumihiko; Gao, Yu-Tang; Oh, Jae Hwan; Kim, Soriul; Jung, Keum Ji; Ahn, Yoon-Ok; Ren, Zefang; Li, Hong-Lan; Wu, Jie; Shi, Jiajun; Wen, Wanqing; Yang, Gong; Li, Bingshan; Ji, Bu-Tian; Brenner, Hermann; Schoen, Robert E.; Küry, Sébastien; Gruber, Stephen B.; Schumacher, Fredrick R.; Stenzel, Stephanie L.; Casey, Graham; Hopper, John L.; Jenkins, Mark A.; Kim, Hyeong-Rok; Jeong, Jin-Young; Park, Ji Won; Tajima, Kazuo; Cho, Sang-Hee; Kubo, Michiaki; Shu, Xiao-Ou; Lin, Dongxin; Zeng, Yi-Xin; Zheng, Wei
2016-01-01
Background & Aims Known Genetic factors explain only a small fraction of genetic variation in colorectal cancer (CRC). We conducted a genome-wide association study (GWAS) to identify risk loci for CRC. Methods This discovery stage included 8027 cases and 22577 controls of East-Asian ancestry. Promising variants were evaluated in studies including as many as 11044 cases and 12047 controls. Tumor-adjacent normal tissues from 188 patients were analyzed to evaluate correlations of risk variants with expression levels of nearby genes. Potential functionality of risk variants were evaluated using public genomic and epigenomic databases. Results We identified 4 loci associated with CRC risk; P values for the most significant variant in each locus ranged from 3.92×10−8 to 1.24×10−12: 6p21.1 (rs4711689), 8q23.3 (rs2450115, rs6469656), 10q24.3 (rs4919687), and 12p13.3 (rs11064437). We also identified 2 risk variants at loci previously associated with CRC: 10q25.2 (rs10506868) and 20q13.3 (rs6061231). These risk variants, conferring an approximate 10%–18% increase in risk per allele, are located either inside or near protein-coding genes that include TFEB (lysosome biogenesis and autophagy), EIF3H (initiation of translation), CYP17A1 (steroidogenesis), SPSB2 (proteasome degradation), and RPS21 (ribosome biogenesis). Gene expression analyses showed a significant association (P <.05) for rs4711689 with TFEB, rs6469656 with EIF3H, rs11064437 with SPSB2, and rs6061231 with RPS21. Conclusions We identified susceptibility loci and genes associated with CRC risk, linking CRC predisposition to steroid hormone, protein synthesis and degradation, and autophagy pathways and providing added insight into the mechanism of CRC pathogenesis. PMID:26965516
Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.
Borodovsky, M; Rudd, K E; Koonin, E V
1994-01-01
The unannotated regions of the Escherichia coli genome DNA sequence from the EcoSeq6 database, totaling 1,278 'intergenic' sequences of the combined length of 359,279 basepairs, were analyzed using computer-assisted methods with the aim of identifying putative unknown genes. The proposed strategy for finding new genes includes two key elements: i) prediction of expressed open reading frames (ORFs) using the GeneMark method based on Markov chain models for coding and non-coding regions of Escherichia coli DNA, and ii) search for protein sequence similarities using programs based on the BLAST algorithm and programs for motif identification. A total of 354 putative expressed ORFs were predicted by GeneMark. Using the BLASTX and TBLASTN programs, it was shown that 208 ORFs located in the unannotated regions of the E. coli chromosome are significantly similar to other protein sequences. Identification of 182 ORFs as probable genes was supported by GeneMark and BLAST, comprising 51.4% of the GeneMark 'hits' and 87.5% of the BLAST 'hits'. 73 putative new genes, comprising 20.6% of the GeneMark predictions, belong to ancient conserved protein families that include both eubacterial and eukaryotic members. This value is close to the overall proportion of highly conserved sequences among eubacterial proteins, indicating that the majority of the putative expressed ORFs that are predicted by GeneMark, but have no significant BLAST hits, nevertheless are likely to be real genes. The majority of the putative genes identified by BLAST search have been described since the release of the EcoSeq6 database, but about 70 genes have not been detected so far. Among these new identifications are genes encoding proteins with a variety of predicted functions including dehydrogenases, kinases, several other metabolic enzymes, ATPases, rRNA methyltransferases, membrane proteins, and different types of regulatory proteins. Images PMID:7984428
Yang, Xue-Dong; Tan, Hua-Wei; Zhu, Wei-Min
2016-01-01
Spinach (Spinacia oleracea L.), which originated in central and western Asia, belongs to the family Amaranthaceae. Spinach is one of most important leafy vegetables with a high nutritional value as well as being a perfect research material for plant sex chromosome models. As the completion of genome assembly and gene prediction of spinach, we developed SpinachDB (http://222.73.98.124/spinachdb) to store, annotate, mine and analyze genomics and genetics datasets efficiently. In this study, all of 21702 spinach genes were annotated. A total of 15741 spinach genes were catalogued into 4351 families, including identification of a substantial number of transcription factors. To construct a high-density genetic map, a total of 131592 SSRs and 1125743 potential SNPs located in 548801 loci of spinach genome were identified in 11 cultivated and wild spinach cultivars. The expression profiles were also performed with RNA-seq data using the FPKM method, which could be used to compare the genes. Paralogs in spinach and the orthologous genes in Arabidopsis, grape, sugar beet and rice were identified for comparative genome analysis. Finally, the SpinachDB website contains seven main sections, including the homepage; the GBrowse map that integrates genome, genes, SSR and SNP marker information; the Blast alignment service; the gene family classification search tool; the orthologous and paralogous gene pairs search tool; and the download and useful contact information. SpinachDB will be continually expanded to include newly generated robust genomics and genetics data sets along with the associated data mining and analysis tools.
Yan, Bo; Neilson, Karen M.; Ranganathan, Ramya; Maynard, Thomas; Streit, Andrea; Moody, Sally A.
2014-01-01
Background Six1 plays an important role in the development of several vertebrate organs, including cranial sensory placodes, somites and kidney. Although Six1 mutations cause one form of Branchio-Otic Syndrome (BOS), the responsible gene in many patients has not been identified; genes that act downstream of Six1 are potential BOS candidates. Results We sought to identify novel genes expressed during placode, somite and kidney development by comparing gene expression between control and Six1-expressing ectodermal explants. The expression patterns of 19 of the significantly up-regulated and 11 of the significantly down-regulated genes were assayed from cleavage to larval stages. 28/30 genes are expressed in the otocyst, a structure that is functionally disrupted in BOS, and 26/30 genes are expressed in the nephric mesoderm, a structure that is functionally disrupted in the related Branchio-Otic-Renal (BOR) syndrome. We also identified the chick homologues of 5 genes and show that they have conserved expression patterns. Conclusions Of the 30 genes selected for expression analyses, all are expressed at many of the developmental times and appropriate tissues to be regulated by Six1. Many have the potential to play a role in the disruption of hearing and kidney function seen in BOS/BOR patients. PMID:25403746
Han, Yahui; Ding, Ting; Su, Bo; Jiang, Haiyang
2016-01-01
Members of the chalcone synthase (CHS) family participate in the synthesis of a series of secondary metabolites in plants, fungi and bacteria. The metabolites play important roles in protecting land plants against various environmental stresses during the evolutionary process. Our research was conducted on comprehensive investigation of CHS genes in maize (Zea mays L.), including their phylogenetic relationships, gene structures, chromosomal locations and expression analysis. Fourteen CHS genes (ZmCHS01–14) were identified in the genome of maize, representing one of the largest numbers of CHS family members identified in one organism to date. The gene family was classified into four major classes (classes I–IV) based on their phylogenetic relationships. Most of them contained two exons and one intron. The 14 genes were unevenly located on six chromosomes. Two segmental duplication events were identified, which might contribute to the expansion of the maize CHS gene family to some extent. In addition, quantitative real-time PCR and microarray data analyses suggested that ZmCHS genes exhibited various expression patterns, indicating functional diversification of the ZmCHS genes. Our results will contribute to future studies of the complexity of the CHS gene family in maize and provide valuable information for the systematic analysis of the functions of the CHS gene family. PMID:26828478
Kang, Hye-Min; Lee, Jin-Sol; Kim, Min-Sub; Lee, Young Hwan; Jung, Jee-Hyun; Hagiwara, Atsushi; Zhou, Bingsheng; Lee, Jae-Seong; Jeong, Chang-Bum
2018-05-30
Autophagy originated from the common ancestor of all life forms, and its function is highly conserved from yeast to humans. Autophagy plays a key role in various fundamental biological processes including defense, and has developed through serial interactions of multiple gene sets referred to as autophagy-related (Atg) genes. Despite their significance in metazoan life and evolution, few studies have been conducted to identify these genes in aquatic invertebrates. In this study, we identified whole Atg genes in four Brachionus rotifer spp., namely B. calyciflorus, B. koreanus, B. plicatilis, and B. rotundiformis, through searches of their entire genomes; and we annotated them according to the yeast nomenclature. Twenty-four genes orthologous to yeast genes were present in all of the Brachionus spp. while three additional gene duplicates were identified in the genome of B. koreanus, indicating that these genes had diversified during the speciation. Also, their transcriptional responses to cadmium exposure indicated regulation by cadmium-induced oxidative-stress-related signaling pathways. This study provides valuable information on 99 conserved Atg genes involved in autophagosome formation in Brachionus spp., with transcriptional modulation in response to cadmium, in the context of the role of autophagy in the damage response. Copyright © 2018 Elsevier B.V. All rights reserved.
Tohge, Takayuki; Fernie, Alisdair R.
2014-01-01
Whole genome sequencing and the relative ease of transcript profiling have facilitated the collection and data warehousing of immense quantities of expression data. However, a substantial proportion of genes are not yet functionally annotated a problem which is particularly acute for transport proteins. In Arabidopsis, for example, only a minor fraction of the estimated 700 intracellular transporters have been identified at the molecular genetic level. Furthermore it is only within the last couple of years that critical genes such as those encoding the final transport step required for the long distance transport of sucrose and the first transporter of the core photorespiratory pathway have been identified. Here we will describe how transcriptional coordination between genes of known function and non-annotated genes allows the identification of putative transporters on the premise that such co-expressed genes tend to be functionally related. We will additionally extend this to include the expansion of this approach to include phenotypic information from other levels of cellular organization such as proteomic and metabolomic data and provide case studies wherein this approach has successfully been used to fill knowledge gaps in important metabolic pathways and physiological processes. PMID:24672529
Taste receptors and gustatory associated G proteins in channel catfish, Ictalurus punctatus.
Gao, Sen; Liu, Shikai; Yao, Jun; Zhou, Tao; Li, Ning; Li, Qi; Dunham, Rex; Liu, Zhanjiang
2017-03-01
Taste sensation plays a pivotal role in nutrient identification and acquisition. This is particularly true for channel catfish (Ictalurus punctatus) that live in turbid waters with limited visibility. This biological process is mainly mediated by taste receptors expressed in taste buds that are distributed in several organs and tissues, including the barbels and skin. In the present study, we identified a complete repertoire of taste receptor and gustatory associated G protein genes in the channel catfish genome. A total of eight taste receptor genes were identified, including five type I and three type II taste receptor genes. Their genomic locations, phylogenetic relations, orthologies and expression were determined. Phylogenetic and collinear analyses provided understanding of the evolution dynamics of this gene family. Furthermore, the motif and dN/dS analyses indicated that selection pressures of different degrees were imposed on these receptors. Additionally, four genes of gustatory associated G proteins were also identified. It was indicated that expression patterns of catfish taste receptors and gustatory associated G proteins across organs mirror the distribution of taste buds across organs. Finally, the expression comparison between catfish and zebrafish organs provided evidence of potential roles of catfish skin and gill involved in taste sensation. Copyright © 2016 Elsevier Inc. All rights reserved.
A Penalized Robust Method for Identifying Gene-Environment Interactions
Shi, Xingjie; Liu, Jin; Huang, Jian; Zhou, Yong; Xie, Yang; Ma, Shuangge
2015-01-01
In high-throughput studies, an important objective is to identify gene-environment interactions associated with disease outcomes and phenotypes. Many commonly adopted methods assume specific parametric or semiparametric models, which may be subject to model mis-specification. In addition, they usually use significance level as the criterion for selecting important interactions. In this study, we adopt the rank-based estimation, which is much less sensitive to model specification than some of the existing methods and includes several commonly encountered data and models as special cases. Penalization is adopted for the identification of gene-environment interactions. It achieves simultaneous estimation and identification and does not rely on significance level. For computation feasibility, a smoothed rank estimation is further proposed. Simulation shows that under certain scenarios, for example with contaminated or heavy-tailed data, the proposed method can significantly outperform the existing alternatives with more accurate identification. We analyze a lung cancer prognosis study with gene expression measurements under the AFT (accelerated failure time) model. The proposed method identifies interactions different from those using the alternatives. Some of the identified genes have important implications. PMID:24616063
2012-01-01
Background The fetal and adult globin genes in the human β-globin cluster on chromosome 11 are sequentially expressed to achieve normal hemoglobin switching during human development. The pharmacological induction of fetal γ-globin (HBG) to replace abnormal adult sickle βS-globin is a successful strategy to treat sickle cell disease; however the molecular mechanism of γ-gene silencing after birth is not fully understood. Therefore, we performed global gene expression profiling using primary erythroid progenitors grown from human peripheral blood mononuclear cells to characterize gene expression patterns during the γ-globin to β-globin (γ/β) switch observed throughout in vitro erythroid differentiation. Results We confirmed erythroid maturation in our culture system using cell morphologic features defined by Giemsa staining and the γ/β-globin switch by reverse transcription-quantitative PCR (RT-qPCR) analysis. We observed maximal γ-globin expression at day 7 with a switch to a predominance of β-globin expression by day 28 and the γ/β-globin switch occurred around day 21. Expression patterns for transcription factors including GATA1, GATA2, KLF1 and NFE2 confirmed our system produced the expected pattern of expression based on the known function of these factors in globin gene regulation. Subsequent gene expression profiling was performed with RNA isolated from progenitors harvested at day 7, 14, 21, and 28 in culture. Three major gene profiles were generated by Principal Component Analysis (PCA). For profile-1 genes, where expression decreased from day 7 to day 28, we identified 2,102 genes down-regulated > 1.5-fold. Ingenuity pathway analysis (IPA) for profile-1 genes demonstrated involvement of the Cdc42, phospholipase C, NF-Kβ, Interleukin-4, and p38 mitogen activated protein kinase (MAPK) signaling pathways. Transcription factors known to be involved in γ-and β-globin regulation were identified. The same approach was used to generate profile-2 genes where expression was up-regulated over 28 days in culture. IPA for the 2,437 genes with > 1.5-fold induction identified the mitotic roles of polo-like kinase, aryl hydrocarbon receptor, cell cycle control, and ATM (Ataxia Telangiectasia Mutated Protein) signaling pathways; transcription factors identified included KLF1, GATA1 and NFE2 among others. Finally, profile-3 was generated from 1,579 genes with maximal expression at day 21, around the time of the γ/β-globin switch. IPA identified associations with cell cycle control, ATM, and aryl hydrocarbon receptor signaling pathways. Conclusions The transcriptome analysis completed with erythroid progenitors grown in vitro identified groups of genes with distinct expression profiles, which function in metabolic pathways associated with cell survival, hematopoiesis, blood cells activation, and inflammatory responses. This study represents the first report of a transcriptome analysis in human primary erythroid progenitors to identify transcription factors involved in hemoglobin switching. Our results also demonstrate that the in vitro liquid culture system is an excellent model to define mechanisms of global gene expression and the DNA-binding protein and signaling pathways involved in globin gene regulation. PMID:22537182
O'Brien, Greg; Maricic, Natalie; Kesterson, Alexandria; Grace, Megan
2017-01-01
ABSTRACT A network of genes and at least two peptide signaling molecules tightly control when Streptococcus mutans becomes competent to take up DNA from its environment. Widespread changes in the expression of genes occur when S. mutans is presented with competence signal peptides in vitro, including the increased production of the alternative sigma factor, ComX, which activates late competence genes. Still, the way that gene products that are regulated by competence peptides influence DNA uptake and cellular physiology are not well understood. Here, we developed and employed comprehensive transposon mutagenesis of the S. mutans genome, with a screen to identify mutants that aberrantly expressed comX, coupled with transposon sequencing (Tn-seq) to gain a more thorough understanding of the factors modulating comX expression and progression to the competent state. The screens effectively identified genes known to affect competence, e.g., comR, comS, comD, comE, cipB, clpX, rcrR, and ciaH, but disclosed an additional 20 genes that were not previously competence associated. The competence phenotypes of mutants were characterized, including by fluorescence microscopy to determine at which stage the mutants were impaired for comX activation. Among the novel genes studied were those implicated in cell division, the sensing of cell envelope stress, cell envelope biogenesis, and RNA stability. Our results provide a platform for determining the specific chemical and physical cues that are required for genetic competence in S. mutans, while highlighting the effectiveness of using Tn-seq in S. mutans to discover and study novel biological processes. IMPORTANCE Streptococcus mutans acquires DNA from its environment by becoming genetically competent, a physiologic state triggered by cell-cell communication using secreted peptides. Competence is important for acquiring novel genetic traits and has a strong influence on the expression of virulence-associated traits of S. mutans. Here, we used transposon mutagenesis and genomic technologies to identify novel genes involved in competence development. In addition to identifying genes previously known to be required for comX expression, 20 additional genes were identified and characterized. The findings create opportunities to diminish the pathogenic potential of S. mutans, while validating technologies that can rapidly advance our understanding of the physiology, biology, and genetics of S. mutans and related pathogens. PMID:29109185
Shields, Robert C; O'Brien, Greg; Maricic, Natalie; Kesterson, Alexandria; Grace, Megan; Hagen, Stephen J; Burne, Robert A
2017-11-06
A network of genes and at least two peptide signaling molecules tightly control when Streptococcus mutans becomes competent to take up DNA from its environment. Widespread changes in the expression of genes occur when S. mutans is presented with competence signal peptides in vitro , including increased production of the alternative sigma factor, ComX, which activates late competence genes. Still, the way that gene products that are regulated by competence peptides influence DNA uptake and cellular physiology are not well understood. Here, we developed and employed comprehensive transposon mutagenesis of the S. mutans genome with a screen to identify mutants that aberrantly expressed comX , coupled with transposon sequencing (Tn-seq) to gain a more thorough understanding of the factors modulating comX expression and progression to the competent state. The screens effectively identified genes known to affect competence, e.g. comR , comS , comD , comE , cipB , clpX , rcrR , ciaH , but disclosed an additional 20 genes that were not previously competence-associated. The competence phenotypes of mutants were characterized, including using fluorescence microscopy to determine at which stage the mutants were impaired for comX activation. Among the novel genes studied were those implicated in cell division, sensing of cell envelope stress, cell envelope biogenesis, and RNA stability. Our results provide a platform for determining the specific chemical and physical cues that are required for genetic competence in S. mutans , while highlighting the effectiveness of using Tn-seq in S. mutans to discover and study novel biological processes. IMPORTANCE Streptococcus mutans acquires DNA from its environment by becoming genetically competent, a physiologic state triggered by cell-cell communication using secreted peptides. Competence is important for acquiring novel genetic traits and has a strong influence on the expression of virulence-associated traits of S. mutans Here, we used transposon mutagenesis and genomic technologies to identify novel genes involved in competence development. In addition to identifying genes previously known to be required for comX expression, 20 additional genes were identified and characterized. The findings create opportunities to diminish the pathogenic potential of S. mutans , while validating technologies that can rapidly advance our understanding of the physiology, biology and genetics of S. mutans and related pathogens. Copyright © 2017 American Society for Microbiology.
Li, Chunquan; Han, Junwei; Yao, Qianlan; Zou, Chendan; Xu, Yanjun; Zhang, Chunlong; Shang, Desi; Zhou, Lingyun; Zou, Chaoxia; Sun, Zeguo; Li, Jing; Zhang, Yunpeng; Yang, Haixiu; Gao, Xu; Li, Xia
2013-01-01
Various ‘omics’ technologies, including microarrays and gas chromatography mass spectrometry, can be used to identify hundreds of interesting genes, proteins and metabolites, such as differential genes, proteins and metabolites associated with diseases. Identifying metabolic pathways has become an invaluable aid to understanding the genes and metabolites associated with studying conditions. However, the classical methods used to identify pathways fail to accurately consider joint power of interesting gene/metabolite and the key regions impacted by them within metabolic pathways. In this study, we propose a powerful analytical method referred to as Subpathway-GM for the identification of metabolic subpathways. This provides a more accurate level of pathway analysis by integrating information from genes and metabolites, and their positions and cascade regions within the given pathway. We analyzed two colorectal cancer and one metastatic prostate cancer data sets and demonstrated that Subpathway-GM was able to identify disease-relevant subpathways whose corresponding entire pathways might be ignored using classical entire pathway identification methods. Further analysis indicated that the power of a joint genes/metabolites and subpathway strategy based on their topologies may play a key role in reliably recalling disease-relevant subpathways and finding novel subpathways. PMID:23482392
Irf8-Regulated Genomic Responses Drive Pathological Inflammation during Cerebral Malaria
Radovanovic, Irena; Tam, Mifong; MacMicking, John D.; Stevenson, Mary M.; Gros, Philippe
2013-01-01
Interferon Regulatory Factor 8 (IRF8) is required for development, maturation and expression of anti-microbial defenses of myeloid cells. BXH2 mice harbor a severely hypomorphic allele at Irf8 (Irf8R294C) that causes susceptibility to infection with intracellular pathogens including Mycobacterium tuberculosis. We report that BXH2 are completely resistant to the development of cerebral malaria (ECM) following Plasmodium berghei ANKA infection. Comparative transcriptional profiling of brain RNA as well as chromatin immunoprecipitation and high-throughput sequencing (ChIP-seq) was used to identify IRF8-regulated genes whose expression is associated with pathological acute neuroinflammation. Genes increased by infection were strongly enriched for IRF8 binding sites, suggesting that IRF8 acts as a transcriptional activator in inflammatory programs. These lists were enriched for myeloid-specific pathways, including interferon responses, antigen presentation and Th1 polarizing cytokines. We show that inactivation of several of these downstream target genes (including the Irf8 transcription partner Irf1) confers protection against ECM. ECM-resistance in Irf8 and Irf1 mutants is associated with impaired myeloid and lymphoid cells function, including production of IL12p40 and IFNγ. We note strong overlap between genes bound and regulated by IRF8 during ECM and genes regulated in the lungs of M. tuberculosis infected mice. This IRF8-dependent network contains several genes recently identified as risk factors in acute and chronic human inflammatory conditions. We report a common core of IRF8-bound genes forming a critical inflammatory host-response network. PMID:23853600
Sleeping Beauty mutagenesis reveals cooperating mutations and pathways in pancreatic adenocarcinoma
Mann, Karen M.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Kovochich, Anne; Dawson, David W.; Black, Michael A.; Brett, Benjamin T.; Sheetz, Todd E.; Dupuy, Adam J.; Chang, David K.; Biankin, Andrew V.; Waddell, Nicola; Kassahn, Karin S.; Grimmond, Sean M.; Rust, Alistair G.; Adams, David J.; Jenkins, Nancy A.; Copeland, Neal G.
2012-01-01
Pancreatic cancer is one of the most deadly cancers affecting the Western world. Because the disease is highly metastatic and difficult to diagnosis until late stages, the 5-y survival rate is around 5%. The identification of molecular cancer drivers is critical for furthering our understanding of the disease and development of improved diagnostic tools and therapeutics. We have conducted a mutagenic screen using Sleeping Beauty (SB) in mice to identify new candidate cancer genes in pancreatic cancer. By combining SB with an oncogenic Kras allele, we observed highly metastatic pancreatic adenocarcinomas. Using two independent statistical methods to identify loci commonly mutated by SB in these tumors, we identified 681 loci that comprise 543 candidate cancer genes (CCGs); 75 of these CCGs, including Mll3 and Ptk2, have known mutations in human pancreatic cancer. We identified point mutations in human pancreatic patient samples for another 11 CCGs, including Acvr2a and Map2k4. Importantly, 10% of the CCGs are involved in chromatin remodeling, including Arid4b, Kdm6a, and Nsd3, and all SB tumors have at least one mutated gene involved in this process; 20 CCGs, including Ctnnd1, Fbxo11, and Vgll4, are also significantly associated with poor patient survival. SB mutagenesis provides a rich resource of mutations in potential cancer drivers for cross-comparative analyses with ongoing sequencing efforts in human pancreatic adenocarcinoma. PMID:22421440
Massingham, Lauren J; Johnson, Kirby L; Scholl, Thomas M; Slonim, Donna K; Wick, Heather C; Bianchi, Diana W
2014-09-01
Turner syndrome is a sex chromosome aneuploidy with characteristic malformations. Amniotic fluid, a complex biological material, could contribute to the understanding of Turner syndrome pathogenesis. In this pilot study, global gene expression analysis of cell-free RNA in amniotic fluid supernatant was utilized to identify specific genes/organ systems that may play a role in Turner syndrome pathophysiology. Cell-free RNA from amniotic fluid of five mid-trimester Turner syndrome fetuses and five euploid female fetuses matched for gestational age was extracted, amplified, and hybridized onto Affymetrix(®) U133 Plus 2.0 arrays. Significantly differentially regulated genes were identified using paired t tests. Biological interpretation was performed using Ingenuity Pathway Analysis and BioGPS gene expression atlas. There were 470 statistically significantly differentially expressed genes identified. They were widely distributed across the genome. XIST was significantly down-regulated (p < 0.0001); SHOX was not differentially expressed. One of the most highly represented organ systems was the hematologic/immune system, distinguishing the Turner syndrome transcriptome from other aneuploidies we previously studied. Manual curation of the differentially expressed gene list identified genes of possible pathologic significance, including NFATC3, IGFBP5, and LDLR. Transcriptomic differences in the amniotic fluid of Turner syndrome fetuses are due to genome-wide dysregulation. The hematologic/immune system differences may play a role in early-onset autoimmune dysfunction. Other genes identified with possible pathologic significance are associated with cardiac and skeletal systems, which are known to be affected in females with Turner syndrome. The discovery-driven approach described here may be useful in elucidating novel mechanisms of disease in Turner syndrome.
Luo, Xiongjian; Huang, Liang; Han, Leng; Luo, Zhenwu; Hu, Fang; Tieu, Roger; Gan, Lin
2014-01-01
Schizophrenia is a common mental disorder with high heritability and strong genetic heterogeneity. Common disease-common variants hypothesis predicts that schizophrenia is attributable in part to common genetic variants. However, recent studies have clearly demonstrated that copy number variations (CNVs) also play pivotal roles in schizophrenia susceptibility and explain a proportion of missing heritability. Though numerous CNVs have been identified, many of the regions affected by CNVs show poor overlapping among different studies, and it is not known whether the genes disrupted by CNVs contribute to the risk of schizophrenia. By using cumulative scoring, we systematically prioritized the genes affected by CNVs in schizophrenia. We identified 8 top genes that are frequently disrupted by CNVs, including NRXN1, CHRNA7, BCL9, CYFIP1, GJA8, NDE1, SNAP29, and GJA5. Integration of genes affected by CNVs with known schizophrenia susceptibility genes (from previous genetic linkage and association studies) reveals that many genes disrupted by CNVs are also associated with schizophrenia. Further protein-protein interaction (PPI) analysis indicates that protein products of genes affected by CNVs frequently interact with known schizophrenia-associated proteins. Finally, systematic integration of CNVs prioritization data with genetic association and PPI data identifies key schizophrenia candidate genes. Our results provide a global overview of genes impacted by CNVs in schizophrenia and reveal a densely interconnected molecular network of de novo CNVs in schizophrenia. Though the prioritized top genes represent promising schizophrenia risk genes, further work with different prioritization methods and independent samples is needed to confirm these findings. Nevertheless, the identified key candidate genes may have important roles in the pathogenesis of schizophrenia, and further functional characterization of these genes may provide pivotal targets for future therapeutics and diagnostics. PMID:24664977
Mutations in the collagen XII gene define a new form of extracellular matrix-related myopathy.
Hicks, Debbie; Farsani, Golara Torabi; Laval, Steven; Collins, James; Sarkozy, Anna; Martoni, Elena; Shah, Ashoke; Zou, Yaqun; Koch, Manuel; Bönnemann, Carsten G; Roberts, Mark; Lochmüller, Hanns; Bushby, Kate; Straub, Volker
2014-05-01
Bethlem myopathy (BM) [MIM 158810] is a slowly progressive muscle disease characterized by contractures and proximal weakness, which can be caused by mutations in one of the collagen VI genes (COL6A1, COL6A2 and COL6A3). However, there may be additional causal genes to identify as in ∼50% of BM cases no mutations in the COL6 genes are identified. In a cohort of -24 patients with a BM-like phenotype, we first sequenced 12 candidate genes based on their function, including genes for known binding partners of collagen VI, and those enzymes involved in its correct post-translational modification, assembly and secretion. Proceeding to whole-exome sequencing (WES), we identified mutations in the COL12A1 gene, a member of the FACIT collagens (fibril-associated collagens with interrupted triple helices) in five individuals from two families. Both families showed dominant inheritance with a clinical phenotype resembling classical BM. Family 1 had a single-base substitution that led to the replacement of one glycine residue in the triple-helical domain, breaking the Gly-X-Y repeating pattern, and Family 2 had a missense mutation, which created a mutant protein with an unpaired cysteine residue. Abnormality at the protein level was confirmed in both families by the intracellular retention of collagen XII in patient dermal fibroblasts. The mutation in Family 2 leads to the up-regulation of genes associated with the unfolded protein response (UPR) pathway and swollen, dysmorphic rough-ER. We conclude that the spectrum of causative genes in extracellular matrix (ECM)-related myopathies be extended to include COL12A1.
Ham, Seungmin; de Kretser, David; Southwick, Graeme; Sprung, Carl N.
2013-01-01
Dupuytren's disease (DD) is a classic example of pathological fibrosis which results in a debilitating disorder affecting a large sector of the human population. It is characterized by excessive local proliferation of fibroblasts and over-production of collagen and other components of extracellular matrix (ECM) in the palmar fascia. The fibrosis progressively results in contracture of elements between the palmar fascia and skin causing flexion deformity or clawing of the fingers and a severe reduction in hand function. While much is known about the pathogenesis and surgical treatment of DD, little is known about the factors that cause its onset and progression, despite many years of research. Gene expression patterns in DD patients now offers the potential to identify genes that direct the pathogenesis of DD. In this study we used primary cultures of fibroblasts derived from excisional biopsies of fibrotic tissue from DD patients to compare the gene expression profiles on a genome-wide basis with normal control fibroblasts. Our investigations have identified genes that may be involved with DD pathogenesis including some which are directly relevant to fibrosis. In particular, these include significantly reduced expression levels of three matrix metallopeptidases (MMP1, MMP3, MMP16), follistatin, and STAT1, and significantly increased expression levels of fibroblast growth factors (FGF9, FGF11), a number of collagen genes and other ECM genes in DD patient samples. Many of these gene products are known to be involved in fibrosis, tumour formation and in the normal processes of tissue remodelling. In addition, alternative splicing was identified in some DD associated genes. These highly sensitive genomic investigations provide new insight into the molecular mechanisms that may underpin the development and progression of DD. PMID:23554969
Whole Gene Capture Analysis of 15 CRC Susceptibility Genes in Suspected Lynch Syndrome Patients.
Jansen, Anne M L; Geilenkirchen, Marije A; van Wezel, Tom; Jagmohan-Changur, Shantie C; Ruano, Dina; van der Klift, Heleen M; van den Akker, Brendy E W M; Laros, Jeroen F J; van Galen, Michiel; Wagner, Anja; Letteboer, Tom G W; Gómez-García, Encarna B; Tops, Carli M J; Vasen, Hans F; Devilee, Peter; Hes, Frederik J; Morreau, Hans; Wijnen, Juul T
2016-01-01
Lynch Syndrome (LS) is caused by pathogenic germline variants in one of the mismatch repair (MMR) genes. However, up to 60% of MMR-deficient colorectal cancer cases are categorized as suspected Lynch Syndrome (sLS) because no pathogenic MMR germline variant can be identified, which leads to difficulties in clinical management. We therefore analyzed the genomic regions of 15 CRC susceptibility genes in leukocyte DNA of 34 unrelated sLS patients and 11 patients with MLH1 hypermethylated tumors with a clear family history. Using targeted next-generation sequencing, we analyzed the entire non-repetitive genomic sequence, including intronic and regulatory sequences, of 15 CRC susceptibility genes. In addition, tumor DNA from 28 sLS patients was analyzed for somatic MMR variants. Of 1979 germline variants found in the leukocyte DNA of 34 sLS patients, one was a pathogenic variant (MLH1 c.1667+1delG). Leukocyte DNA of 11 patients with MLH1 hypermethylated tumors was negative for pathogenic germline variants in the tested CRC susceptibility genes and for germline MLH1 hypermethylation. Somatic DNA analysis of 28 sLS tumors identified eight (29%) cases with two pathogenic somatic variants, one with a VUS predicted to pathogenic and LOH, and nine cases (32%) with one pathogenic somatic variant (n = 8) or one VUS predicted to be pathogenic (n = 1). This is the first study in sLS patients to include the entire genomic sequence of CRC susceptibility genes. An underlying somatic or germline MMR gene defect was identified in ten of 34 sLS patients (29%). In the remaining sLS patients, the underlying genetic defect explaining the MMRdeficiency in their tumors might be found outside the genomic regions harboring the MMR and other known CRC susceptibility genes.
Doi, Ayano; Ichinohe, Risa; Ikuyo, Yoriko; Takahashi, Teruyoshi; Marui, Shigetaka; Yasuhara, Koji; Nakamura, Tetsuro; Sugita, Shintaro; Sakamoto, Hiromi; Yoshida, Teruhiko; Hasegawa, Tadashi
2014-01-01
The diagnosis and treatment of soft tissue sarcomas (STS) have been difficult. Of the diverse histological subtypes, undifferentiated pleomorphic sarcoma (UPS) is particularly difficult to diagnose accurately, and its classification per se is still controversial. Recent advances in genomic technologies provide an excellent way to address such problems. However, it is often difficult, if not impossible, to identify definitive disease-associated genes using genome-wide analysis alone, primarily because of multiple testing problems. In the present study, we analyzed microarray data from 88 STS patients using a combination method that used knowledge-based filtering and a simulation based on the integration of multiple statistics to reduce multiple testing problems. We identified 25 genes, including hypoxia-related genes (e.g., MIF, SCD1, P4HA1, ENO1, and STAT1) and cell cycle- and DNA repair-related genes (e.g., TACC3, PRDX1, PRKDC, and H2AFY). These genes showed significant differential expression among histological subtypes, including UPS, and showed associations with overall survival. STAT1 showed a strong association with overall survival in UPS patients (logrank p = 1.84×10−6 and adjusted p value 2.99×10−3 after the permutation test). According to the literature, the 25 genes selected are useful not only as markers of differential diagnosis but also as prognostic/predictive markers and/or therapeutic targets for STS. Our combination method can identify genes that are potential prognostic/predictive factors and/or therapeutic targets in STS and possibly in other cancers. These disease-associated genes deserve further preclinical and clinical validation. PMID:25188299
Jia, Peilin; Wang, Lily; Fanous, Ayman H.; Pato, Carlos N.; Edwards, Todd L.; Zhao, Zhongming
2012-01-01
With the recent success of genome-wide association studies (GWAS), a wealth of association data has been accomplished for more than 200 complex diseases/traits, proposing a strong demand for data integration and interpretation. A combinatory analysis of multiple GWAS datasets, or an integrative analysis of GWAS data and other high-throughput data, has been particularly promising. In this study, we proposed an integrative analysis framework of multiple GWAS datasets by overlaying association signals onto the protein-protein interaction network, and demonstrated it using schizophrenia datasets. Building on a dense module search algorithm, we first searched for significantly enriched subnetworks for schizophrenia in each single GWAS dataset and then implemented a discovery-evaluation strategy to identify module genes with consistent association signals. We validated the module genes in an independent dataset, and also examined them through meta-analysis of the related SNPs using multiple GWAS datasets. As a result, we identified 205 module genes with a joint effect significantly associated with schizophrenia; these module genes included a number of well-studied candidate genes such as DISC1, GNA12, GNA13, GNAI1, GPR17, and GRIN2B. Further functional analysis suggested these genes are involved in neuronal related processes. Additionally, meta-analysis found that 18 SNPs in 9 module genes had P meta<1×10−4, including the gene HLA-DQA1 located in the MHC region on chromosome 6, which was reported in previous studies using the largest cohort of schizophrenia patients to date. These results demonstrated our bi-directional network-based strategy is efficient for identifying disease-associated genes with modest signals in GWAS datasets. This approach can be applied to any other complex diseases/traits where multiple GWAS datasets are available. PMID:22792057
Identification of novel Cyclooxygenase-2-dependent genes in Helicobacter pylori infection in vivo
Walduck, Anna K; Weber, Matthias; Wunder, Christian; Juettner, Stefan; Stolte, Manfred; Vieth, Michael; Wiedenmann, Bertram; Meyer, Thomas F; Naumann, Michael; Hoecker, Michael
2009-01-01
Background Helicobacter pylori is a crucial determining factor in the pathogenesis of benign and neoplastic gastric diseases. Cyclooxygenase-2 (Cox-2) is the inducible key enzyme of arachidonic acid metabolism and is a central mediator in inflammation and cancer. Expression of the Cox-2 gene is up-regulated in the gastric mucosa during H. pylori infection but the pathobiological consequences of this enhanced Cox-2 expression are not yet characterized. The aim of this study was to identify novel genes down-stream of Cox-2 in an in vivo model, thereby identifying potential targets for the study of the role of Cox- 2 in H. pylori pathogenesis and the initiation of pre- cancerous changes. Results Gene expression profiles in the gastric mucosa of mice treated with a specific Cox-2 inhibitor (NS398) or vehicle were analysed at different time points (6, 13 and 19 wk) after H. pylori infection. H. pylori infection affected the expression of 385 genes over the experimental period, including regulators of gastric physiology, proliferation, apoptosis and mucosal defence. Under conditions of Cox-2 inhibition, 160 target genes were regulated as a result of H. pylori infection. The Cox-2 dependent subset included those influencing gastric physiology (Gastrin, Galr1), epithelial barrier function (Tjp1, connexin45, Aqp5), inflammation (Icam1), apoptosis (Clu) and proliferation (Gdf3, Igf2). Treatment with NS398 alone caused differential expression of 140 genes, 97 of which were unique, indicating that these genes are regulated under conditions of basal Cox-2 expression. Conclusion This study has identified a panel of novel Cox-2 dependent genes influenced under both normal and the inflammatory conditions induced by H. pylori infection. These data provide important new links between Cox-2 and inflammatory processes, epithelial repair and integrity. PMID:19317916
Liu, Ranran; Sun, Yanfa; Zhao, Guiping; Wang, Fangjie; Wu, Dan; Zheng, Maiqing; Chen, Jilan; Zhang, Lei; Hu, Yaodong; Wen, Jie
2013-01-01
Body composition and meat quality traits are important economic traits of chickens. The development of high-throughput genotyping platforms and relevant statistical methods have enabled genome-wide association studies in chickens. In order to identify molecular markers and candidate genes associated with body composition and meat quality traits, genome-wide association studies were conducted using the Illumina 60 K SNP Beadchip to genotype 724 Beijing-You chickens. For each bird, a total of 16 traits were measured, including carcass weight (CW), eviscerated weight (EW), dressing percentage, breast muscle weight (BrW) and percentage (BrP), thigh muscle weight and percentage, abdominal fat weight and percentage, dry matter and intramuscular fat contents of breast and thigh muscle, ultimate pH, and shear force of the pectoralis major muscle at 100 d of age. The SNPs that were significantly associated with the phenotypic traits were identified using both simple (GLM) and compressed mixed linear (MLM) models. For nine of ten body composition traits studied, SNPs showing genome wide significance (P<2.59E-6) have been identified. A consistent region on chicken (Gallus gallus) chromosome 4 (GGA4), including seven significant SNPs and four candidate genes (LCORL, LAP3, LDB2, TAPT1), were found to be associated with CW and EW. Another 0.65 Mb region on GGA3 for BrW and BrP was identified. After measuring the mRNA content in beast muscle for five genes located in this region, the changes in GJA1 expression were found to be consistent with that of breast muscle weight across development. It is highly possible that GJA1 is a functional gene for breast muscle development in chickens. For meat quality traits, several SNPs reaching suggestive association were identified and possible candidate genes with their functions were discussed.
Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset
2012-01-01
Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA) with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO) correctly identified (p < 0.05) microarray data in which genes annotated to differentially expressed GO terms are upregulated. We found that GSEA + MIMGO was slightly less effective than, or comparable to, GSEA (Pearson), a method that uses Pearson’s correlation as a metric, at detecting true differentially expressed GO terms. However, unlike other methods including GSEA (Pearson), GSEA + MIMGO can comprehensively identify the microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively. PMID:23232071
Transcriptomic analysis links gene expression to unilateral pollen-pistil reproductive barriers.
Broz, Amanda K; Guerrero, Rafael F; Randle, April M; Baek, You Soon; Hahn, Matthew W; Bedinger, Patricia A
2017-04-24
Unilateral incompatibility (UI) is an asymmetric reproductive barrier that unidirectionally prevents gene flow between species and/or populations. UI is characterized by a compatible interaction between partners in one direction, but in the reciprocal cross fertilization fails, generally due to pollen tube rejection by the pistil. Although UI has long been observed in crosses between different species, the underlying molecular mechanisms are only beginning to be characterized. The wild tomato relative Solanum habrochaites provides a unique study system to investigate the molecular basis of this reproductive barrier, as populations within the species exhibit both interspecific and interpopulation UI. Here we utilized a transcriptomic approach to identify genes in both pollen and pistil tissues that may be key players in UI. We confirmed UI at the pollen-pistil level between a self-incompatible population and a self-compatible population of S. habrochaites. A comparison of gene expression between pollinated styles exhibiting the incompatibility response and unpollinated controls revealed only a small number of differentially expressed transcripts. Many more differences in transcript profiles were identified between UI-competent versus UI-compromised reproductive tissues. A number of intriguing candidate genes were highly differentially expressed, including a putative pollen arabinogalactan protein, a stylar Kunitz family protease inhibitor, and a stylar peptide hormone Rapid ALkalinization Factor. Our data also provide transcriptomic evidence that fundamental processes including reactive oxygen species (ROS) signaling are likely key in UI pollen-pistil interactions between both populations and species. Gene expression analysis of reproductive tissues allowed us to better understand the molecular basis of interpopulation incompatibility at the level of pollen-pistil interactions. Our transcriptomic analysis highlighted specific genes, including those in ROS signaling pathways that warrant further study in investigations of UI. To our knowledge, this is the first report to identify candidate genes involved in unilateral barriers between populations within a species.
Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome.
Nicholson, Matthew J; Theodorou, Michael K; Brookman, Jayne L
2005-01-01
The anaerobic gut fungi occupy a unique niche in the intestinal tract of large herbivorous animals and are thought to act as primary colonizers of plant material during digestion. They are the only known obligately anaerobic fungi but molecular analysis of this group has been hampered by difficulties in their culture and manipulation, and by their extremely high A+T nucleotide content. This study begins to answer some of the fundamental questions about the structure and organization of the anaerobic gut fungal genome. Directed plasmid libraries using genomic DNA digested with highly or moderately rich AT-specific restriction enzymes (VspI and EcoRI) were prepared from a polycentric Orpinomyces isolate. Clones were sequenced from these libraries and the breadth of genomic inserts, both genic and intergenic, was characterized. Genes encoding numerous functions not previously characterized for these fungi were identified, including cytoskeletal, secretory pathway and transporter genes. A peptidase gene with no introns and having sequence similarity to a gene encoding a bacterial peptidase was also identified, extending the range of metabolic enzymes resulting from apparent trans-kingdom transfer from bacteria to fungi, as previously characterized largely for genes encoding plant-degrading enzymes. This paper presents the first thorough analysis of the genic, intergenic and rDNA regions of a variety of genomic segments from an anaerobic gut fungus and provides observations on rules governing intron boundaries, the codon biases observed with different types of genes, and the sequence of only the second anaerobic gut fungal promoter reported. Large numbers of retrotransposon sequences of different types were found and the authors speculate on the possible consequences of any such transposon activity in the genome. The coding sequences identified included several orphan gene sequences, including one with regions strongly suggestive of structural proteins such as collagens and lampirin. This gene was present as a single copy in Orpinomyces, was expressed during vegetative growth and was also detected in genomes from another gut fungal genus, Neocallimastix.
Barcode Sequencing Screen Identifies SUB1 as a Regulator of Yeast Pheromone Inducible Genes
Sliva, Anna; Kuang, Zheng; Meluh, Pamela B.; Boeke, Jef D.
2016-01-01
The yeast pheromone response pathway serves as a valuable model of eukaryotic mitogen-activated protein kinase (MAPK) pathways, and transcription of their downstream targets. Here, we describe application of a screening method combining two technologies: fluorescence-activated cell sorting (FACS), and barcode analysis by sequencing (Bar-Seq). Using this screening method, and pFUS1-GFP as a reporter for MAPK pathway activation, we readily identified mutants in known mating pathway components. In this study, we also include a comprehensive analysis of the FUS1 induction properties of known mating pathway mutants by flow cytometry, featuring single cell analysis of each mutant population. We also characterized a new source of false positives resulting from the design of this screen. Additionally, we identified a deletion mutant, sub1Δ, with increased basal expression of pFUS1-GFP. Here, in the first ChIP-Seq of Sub1, our data shows that Sub1 binds to the promoters of about half the genes in the genome (tripling the 991 loci previously reported), including the promoters of several pheromone-inducible genes, some of which show an increase upon pheromone induction. Here, we also present the first RNA-Seq of a sub1Δ mutant; the majority of genes have no change in RNA, but, of the small subset that do, most show decreased expression, consistent with biochemical studies implicating Sub1 as a positive transcriptional regulator. The RNA-Seq data also show that certain pheromone-inducible genes are induced less in the sub1Δ mutant relative to the wild type, supporting a role for Sub1 in regulation of mating pathway genes. The sub1Δ mutant has increased basal levels of a small subset of other genes besides FUS1, including IMD2 and FIG1, a gene encoding an integral membrane protein necessary for efficient mating. PMID:26837954
Haakensen, Vilde D; Biong, Margarethe; Lingjærde, Ole Christian; Holmen, Marit Muri; Frantzen, Jan Ole; Chen, Ying; Navjord, Dina; Romundstad, Linda; Lüders, Torben; Bukholm, Ida K; Solvang, Hiroko K; Kristensen, Vessela N; Ursin, Giske; Børresen-Dale, Anne-Lise; Helland, Aslaug
2010-01-01
Mammographic density (MD), as assessed from film screen mammograms, is determined by the relative content of adipose, connective and epithelial tissue in the female breast. In epidemiological studies, a high percentage of MD confers a four to six fold risk elevation of developing breast cancer, even after adjustment for other known breast cancer risk factors. However, the biologic correlates of density are little known. Gene expression analysis using whole genome arrays was performed on breast biopsies from 143 women; 79 women with no malignancy (healthy women) and 64 newly diagnosed breast cancer patients, both included from mammographic centres. Percent MD was determined using a previously validated, computerized method on scanned mammograms. Significance analysis of microarrays (SAM) was performed to identify genes influencing MD and a linear regression model was used to assess the independent contribution from different variables to MD. SAM-analysis identified 24 genes differentially expressed between samples from breasts with high and low MD. These genes included three uridine 5'-diphospho-glucuronosyltransferase (UGT) genes and the oestrogen receptor gene (ESR1). These genes were down-regulated in samples with high MD compared to those with low MD. The UGT gene products, which are known to inactivate oestrogen metabolites, were also down-regulated in tumour samples compared to samples from healthy individuals. Several single nucleotide polymorphisms (SNPs) in the UGT genes associated with the expression of UGT and other genes in their vicinity were identified. Three UGT enzymes were lower expressed both in breast tissue biopsies from healthy women with high MD and in biopsies from newly diagnosed breast cancers. The association was strongest amongst young women and women using hormonal therapy. UGT2B10 predicts MD independently of age, hormone therapy and parity. Our results indicate that down-regulation of UGT genes in women exposed to female sex hormones is associated with high MD and might increase the risk of breast cancer.
2010-01-01
Introduction Mammographic density (MD), as assessed from film screen mammograms, is determined by the relative content of adipose, connective and epithelial tissue in the female breast. In epidemiological studies, a high percentage of MD confers a four to six fold risk elevation of developing breast cancer, even after adjustment for other known breast cancer risk factors. However, the biologic correlates of density are little known. Methods Gene expression analysis using whole genome arrays was performed on breast biopsies from 143 women; 79 women with no malignancy (healthy women) and 64 newly diagnosed breast cancer patients, both included from mammographic centres. Percent MD was determined using a previously validated, computerized method on scanned mammograms. Significance analysis of microarrays (SAM) was performed to identify genes influencing MD and a linear regression model was used to assess the independent contribution from different variables to MD. Results SAM-analysis identified 24 genes differentially expressed between samples from breasts with high and low MD. These genes included three uridine 5'-diphospho-glucuronosyltransferase (UGT) genes and the oestrogen receptor gene (ESR1). These genes were down-regulated in samples with high MD compared to those with low MD. The UGT gene products, which are known to inactivate oestrogen metabolites, were also down-regulated in tumour samples compared to samples from healthy individuals. Several single nucleotide polymorphisms (SNPs) in the UGT genes associated with the expression of UGT and other genes in their vicinity were identified. Conclusions Three UGT enzymes were lower expressed both in breast tissue biopsies from healthy women with high MD and in biopsies from newly diagnosed breast cancers. The association was strongest amongst young women and women using hormonal therapy. UGT2B10 predicts MD independently of age, hormone therapy and parity. Our results indicate that down-regulation of UGT genes in women exposed to female sex hormones is associated with high MD and might increase the risk of breast cancer. PMID:20799965
Namani, Akhileshwar; Matiur Rahaman, Md; Chen, Ming; Tang, Xiuwen
2018-01-06
NRF2 is the key regulator of oxidative stress in normal cells and aberrant expression of the NRF2 pathway due to genetic alterations in the KEAP1 (Kelch-like ECH-associated protein 1)-NRF2 (nuclear factor erythroid 2 like 2)-CUL3 (cullin 3) axis leads to tumorigenesis and drug resistance in many cancers including head and neck squamous cell cancer (HNSCC). The main goal of this study was to identify specific genes regulated by the KEAP1-NRF2-CUL3 axis in HNSCC patients, to assess the prognostic value of this gene signature in different cohorts, and to reveal potential biomarkers. RNA-Seq V2 level 3 data from 279 tumor samples along with 37 adjacent normal samples from patients enrolled in the The Cancer Genome Atlas (TCGA)-HNSCC study were used to identify upregulated genes using two methods (altered KEAP1-NRF2-CUL3 versus normal, and altered KEAP1-NRF2-CUL3 versus wild-type). We then used a new approach to identify the combined gene signature by integrating both datasets and subsequently tested this signature in 4 independent HNSCC datasets to assess its prognostic value. In addition, functional annotation using the DAVID v6.8 database and protein-protein interaction (PPI) analysis using the STRING v10 database were performed on the signature. A signature composed of a subset of 17 genes regulated by the KEAP1-NRF2-CUL3 axis was identified by overlapping both the upregulated genes of altered versus normal (251 genes) and altered versus wild-type (25 genes) datasets. We showed that increased expression was significantly associated with poor survival in 4 independent HNSCC datasets, including the TCGA-HNSCC dataset. Furthermore, Gene Ontology, Kyoto Encyclopedia of Genes and Genomes, and PPI analysis revealed that most of the genes in this signature are associated with drug metabolism and glutathione metabolic pathways. Altogether, our study emphasizes the discovery of a gene signature regulated by the KEAP1-NRF2-CUL3 axis which is strongly associated with tumorigenesis and drug resistance in HNSCC. This 17-gene signature provides potential biomarkers and therapeutic targets for HNSCC cases in which the NRF2 pathway is activated.
A limited role for gene duplications in the evolution of platypus venom.
Wong, Emily S W; Papenfuss, Anthony T; Whittington, Camilla M; Warren, Wesley C; Belov, Katherine
2012-01-01
Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the "venome" of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation.
A Limited Role for Gene Duplications in the Evolution of Platypus Venom
Wong, Emily S. W.; Papenfuss, Anthony T.; Whittington, Camilla M.; Warren, Wesley C.; Belov, Katherine
2012-01-01
Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date, no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species. However, gene duplications alone do not explain the “venome” of the platypus. Other mechanisms, such as alternative splicing and mutation, may be important in venom innovation. PMID:21816864
The chemokine receptor CCR1 is identified in mast cell-derived exosomes
Liang, Yuting; Qiao, Longwei; Peng, Xia; Cui, Zelin; Yin, Yue; Liao, Huanjin; Jiang, Min; Li, Li
2018-01-01
Mast cells are important effector cells of the immune system, and mast cell-derived exosomes carrying RNAs play a role in immune regulation. However, the molecular function of mast cell-derived exosomes is currently unknown, and here, we identify differentially expressed genes (DEGs) in mast cells and exosomes. We isolated mast cells derived exosomes through differential centrifugation and screened the DEGs from mast cell-derived exosomes, using the GSE25330 array dataset downloaded from the Gene Expression Omnibus database. Biochemical pathways were analyzed by Gene ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway on the online tool DAVID. DEGs-associated protein-protein interaction networks (PPIs) were constructed using the STRING database and Cytoscape software. The genes identified from these bioinformatics analyses were verified by qRT-PCR and Western blot in mast cells and exosomes. We identified 2121 DEGs (843 up and 1278 down-regulated genes) in HMC-1 cell-derived exosomes and HMC-1 cells. The up-regulated DEGs were classified into two significant modules. The chemokine receptor CCR1 was screened as a hub gene and enriched in cytokine-mediated signaling pathway in module one. Seven genes, including CCR1, CD9, KIT, TGFBR1, TLR9, TPSAB1 and TPSB2 were screened and validated through qRT-PCR analysis. We have achieved a comprehensive view of the pivotal genes and pathways in mast cells and exosomes and identified CCR1 as a hub gene in mast cell-derived exosomes. Our results provide novel clues with respect to the biological processes through which mast cell-derived exosomes modulate immune responses. PMID:29511430
Kucerova, Eva; Clifton, Sandra W.; Xia, Xiao-Qin; Long, Fred; Porwollik, Steffen; Fulton, Lucinda; Fronick, Catrina; Minx, Patrick; Kyung, Kim; Warren, Wesley; Fulton, Robert; Feng, Dongyan; Wollam, Aye; Shah, Neha; Bhonagiri, Veena; Nash, William E.; Hallsworth-Pepin, Kymberlie; Wilson, Richard K.
2010-01-01
Background The genus Cronobacter (formerly called Enterobacter sakazakii) is composed of five species; C. sakazakii, C. malonaticus, C. turicensis, C. muytjensii, and C. dublinensis. The genus includes opportunistic human pathogens, and the first three species have been associated with neonatal infections. The most severe diseases are caused in neonates and include fatal necrotizing enterocolitis and meningitis. The genetic basis of the diversity within the genus is unknown, and few virulence traits have been identified. Methodology/Principal Findings We report here the first sequence of a member of this genus, C. sakazakii strain BAA-894. The genome of Cronobacter sakazakii strain BAA-894 comprises a 4.4 Mb chromosome (57% GC content) and two plasmids; 31 kb (51% GC) and 131 kb (56% GC). The genome was used to construct a 387,000 probe oligonucleotide tiling DNA microarray covering the whole genome. Comparative genomic hybridization (CGH) was undertaken on five other C. sakazakii strains, and representatives of the four other Cronobacter species. Among 4,382 annotated genes inspected in this study, about 55% of genes were common to all C. sakazakii strains and 43% were common to all Cronobacter strains, with 10–17% absence of genes. Conclusions/Significance CGH highlighted 15 clusters of genes in C. sakazakii BAA-894 that were divergent or absent in more than half of the tested strains; six of these are of probable prophage origin. Putative virulence factors were identified in these prophage and in other variable regions. A number of genes unique to Cronobacter species associated with neonatal infections (C. sakazakii, C. malonaticus and C. turicensis) were identified. These included a copper and silver resistance system known to be linked to invasion of the blood-brain barrier by neonatal meningitic strains of Escherichia coli. In addition, genes encoding for multidrug efflux pumps and adhesins were identified that were unique to C. sakazakii strains from outbreaks in neonatal intensive care units. PMID:20221447
Antunes, Patrícia; Machado, Jorge; Sousa, João Carlos; Peixe, Luísa
2005-01-01
In 200 sulfonamide-resistant Portuguese Salmonella isolates, 152 sul1, 74 sul2, and 14 sul3 genes were detected. Class 1 integrons were always associated with sul genes, including sul3 alone in some isolates. The sul3 gene has been identified in isolates from different sources and serotypes, which also carried a class 1 integron with aadA and dfrA gene cassettes. PMID:15673783
A genome-wide scan for signatures of selection in Azeri and Khuzestani buffalo breeds.
Mokhber, Mahdi; Moradi-Shahrbabak, Mohammad; Sadeghi, Mostafa; Moradi-Shahrbabak, Hossein; Stella, Alessandra; Nicolzzi, Ezequiel; Rahmaninia, Javad; Williams, John L
2018-06-11
Identification of genomic regions that have been targets of selection may shed light on the genetic history of livestock populations and help to identify variation controlling commercially important phenotypes. The Azeri and Kuzestani buffalos are the most common indigenous Iranian breeds which have been subjected to divergent selection and are well adapted to completely different regions. Examining the genetic structure of these populations may identify genomic regions associated with adaptation to the different environments and production goals. A set of 385 water buffalo samples from Azeri (N = 262) and Khuzestani (N = 123) breeds were genotyped using the Axiom® Buffalo Genotyping 90 K Array. The unbiased fixation index method (F ST ) was used to detect signatures of selection. In total, 13 regions with outlier F ST values (0.1%) were identified. Annotation of these regions using the UMD3.1 Bos taurus Genome Assembly was performed to find putative candidate genes and QTLs within the selected regions. Putative candidate genes identified include FBXO9, NDFIP1, ACTR3, ARHGAP26, SERPINF2, BOLA-DRB3, BOLA-DQB, CLN8, and MYOM2. Candidate genes identified in regions potentially under selection were associated with physiological pathways including milk production, cytoskeleton organization, growth, metabolic function, apoptosis and domestication-related changes include immune and nervous system development. The QTL identified are involved in economically important traits in buffalo related to milk composition, udder structure, somatic cell count, meat quality, and carcass and body weight.
Kankare, Maaria; Parker, Darren J.; Merisalo, Mikko; Salminen, Tiina S.; Hoikkala, Anneli
2016-01-01
Background A wide range of insects living at higher latitudes enter diapause at the end of the warm season, which increases their chances of survival through harsh winter conditions. In this study we used RNA sequencing to identify genes involved in adult reproductive diapause in a northern fly species, Drosophila montana. Both diapausing and non-diapausing flies were reared under a critical day length and temperature, where about half of the emerging females enter diapause enabling us to eliminate the effects of varying environmental conditions on gene expression patterns of the two types of female flies. Results RNA sequencing revealed large differences between gene expression patterns of diapausing and non-diapausing females, especially in genes involved with metabolism, fatty acid biosynthesis, and metal and nucleotide binding. Differently expressed genes included several gene groups, including myosin, actin and cytochromeP450 genes, which have been previously associated with diapause. This study also identified new candidate genes, including some involved in cuticular hydrocarbon synthesis or regulation (desat1 and desat2), and acyl-CoA Δ11-desaturase activity (CG9747), and few odorant-binding protein genes (e.g. Obp44A). Also, several transposable elements (TEs) showed differential expression between the two female groups motivating future research on their roles in diapause. Conclusions Our results demonstrate that the adult reproductive diapause in D. montana involves changes in the expression level of a variety of genes involved in key processes (e.g. metabolism and fatty acid biosynthesis) which help diapausing females to cope with overwintering. This is consistent with the view that diapause is a complex adaptive phenotype where not only sexual maturation is arrested, but also changes in adult physiology are required in order to survive over the winter. PMID:27571415
Takeda, Kojiro; Mori, Ayaka; Yanagida, Mitsuhiro
2011-01-01
Bortezomib/PS-341/Velcade, a proteasome inhibitor, is widely used to treat multiple myeloma. While several mechanisms of the cytotoxicity of the drug were proposed, the actual mechanism remains elusive. We aimed to identify genes affecting the cytotoxicity of Bortezomib in the fission yeast S.pombe as the drug inhibits this organism's cell division cycle like proteasome mutants. Among the 2815 genes screened (covering 56% of total ORFs), 19 genes, whose deletions induce strong synthetic lethality with Bortezomib, were identified. The products of the 19 genes included four ubiquitin enzymes and one nuclear proteasome factor, and 13 of them are conserved in humans. Our results will provide useful information for understanding the actions of Bortezomib within cells. PMID:21760946
Ni, ZhouXian; Ye, YouJu; Bai, Tiandao; Xu, Meng; Xu, Li-An
2017-09-11
The chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a typical quadripartite structure that includes large single copy (LSC) (65,563 bp), small single copy (SSC) (53,230 bp) and two IRs (IRa and IRb, 485 bp). The 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in CPG were mononucleotides motifs of A/T types and located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between "IRa" and "IRb". The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that the whole CPG sequences could be used as a powerful tool in phylogenetic analyses.
Tey, S; Ahmad-Annuar, A; Drew, A P; Shahrizaila, N; Nicholson, G A; Kennerson, M L
2016-08-01
The cytoplasmic dynein-dynactin genes are attractive candidates for neurodegenerative disorders given their functional role in retrograde transport along neurons. The cytoplasmic dynein heavy chain (DYNC1H1) gene has been implicated in various neurodegenerative disorders, and dynactin 1 (DCTN1) genes have been implicated in a wide spectrum of disorders including motor neuron disease, Parkinson's disease, spinobulbar muscular atrophy and hereditary spastic paraplegia. However, the involvement of other dynactin genes with inherited peripheral neuropathies (IPN) namely, hereditary sensory neuropathy, hereditary motor neuropathy and Charcot-Marie-Tooth disease is under reported. We screened eight genes; DCTN1-6 and ACTR1A and ACTR1B in 136 IPN patients using whole-exome sequencing and high-resolution melt (HRM) analysis. Eight non-synonymous variants (including one novel variant) and three synonymous variants were identified. Four variants have been reported previously in other studies, however segregation analysis within family members excluded them from causing IPN in these families. No variants of disease significance were identified in this study suggesting the dynactin genes are unlikely to be a common cause of IPNs. However, with the ease of querying gene variants from exome data, these genes remain worthwhile candidates to assess unsolved IPN families for variants that may affect the function of the proteins. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
2011-01-01
Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene clusters with differential expression during the differentiation toward megakaryocyte were identified. Conclusions TRAM is designed to create, and statistically analyze, quantitative transcriptome maps, based on gene expression data from multiple sources. The release includes FileMaker Pro database management runtime application and it is freely available at http://apollo11.isto.unibo.it/software/, along with preconfigured implementations for mapping of human, mouse and zebrafish transcriptomes. PMID:21333005
Cancer genome characterization efforts now provide an initial view of the somatic alterations in primary tumors. However, most point mutations occur at low frequency, and the function of these alleles remains undefined. We have developed a scalable systematic approach to interrogate the function of cancer-associated gene variants. We subjected 474 mutant alleles curated from 5,338 tumors to pooled in vivo tumor formation assays and gene expression profiling. We identified 12 transforming alleles, including two in genes (PIK3CB, POT1) that have not been shown to be tumorigenic.
[Progress in research on pathogenic genes and gene therapy for inherited retinal diseases].
Zhu, Ling; Cao, Cong; Sun, Jiji; Gao, Tao; Liang, Xiaoyang; Nie, Zhipeng; Ji, Yanchun; Jiang, Pingping; Guan, Minxin
2017-02-10
Inherited retinal diseases (IRDs), including retinitis pigmentosa, Usher syndrome, Cone-Rod degenerations, inherited macular dystrophy, Leber's congenital amaurosis, Leber's hereditary optic neuropathy are the most common and severe types of hereditary ocular diseases. So far more than 200 pathogenic genes have been identified. With the growing knowledge of the genetics and mechanisms of IRDs, a number of gene therapeutic strategies have been developed in the laboratory or even entered clinical trials. Here the progress of IRD research on the pathogenic genes and therapeutic strategies, particularly gene therapy, are reviewed.
Hicks, Chindo; Kumar, Ranjit; Pannuti, Antonio; Miele, Lucio
2012-01-01
Variable response and resistance to tamoxifen treatment in breast cancer patients remains a major clinical problem. To determine whether genes and biological pathways containing SNPs associated with risk for breast cancer are dysregulated in response to tamoxifen treatment, we performed analysis combining information from 43 genome-wide association studies with gene expression data from 298 ER(+) breast cancer patients treated with tamoxifen and 125 ER(+) controls. We identified 95 genes which distinguished tamoxifen treated patients from controls. Additionally, we identified 54 genes which stratified tamoxifen treated patients into two distinct groups. We identified biological pathways containing SNPs associated with risk for breast cancer, which were dysregulated in response to tamoxifen treatment. Key pathways identified included the apoptosis, P53, NFkB, DNA repair and cell cycle pathways. Combining GWAS with transcription profiling provides a unified approach for associating GWAS findings with response to drug treatment and identification of potential drug targets.
Common Viral Integration Sites Identified in Avian Leukosis Virus-Induced B-Cell Lymphomas
Justice, James F.; Morgan, Robin W.
2015-01-01
ABSTRACT Avian leukosis virus (ALV) induces B-cell lymphoma and other neoplasms in chickens by integrating within or near cancer genes and perturbing their expression. Four genes—MYC, MYB, Mir-155, and TERT—have previously been identified as common integration sites in these virus-induced lymphomas and are thought to play a causal role in tumorigenesis. In this study, we employ high-throughput sequencing to identify additional genes driving tumorigenesis in ALV-induced B-cell lymphomas. In addition to the four genes implicated previously, we identify other genes as common integration sites, including TNFRSF1A, MEF2C, CTDSPL, TAB2, RUNX1, MLL5, CXorf57, and BACH2. We also analyze the genome-wide ALV integration landscape in vivo and find increased frequency of ALV integration near transcriptional start sites and within transcripts. Previous work has shown ALV prefers a weak consensus sequence for integration in cultured human cells. We confirm this consensus sequence for ALV integration in vivo in the chicken genome. PMID:26670384
Modena, Brian D; Bleecker, Eugene R; Busse, William W; Erzurum, Serpil C; Gaston, Benjamin M; Jarjour, Nizar N; Meyers, Deborah A; Milosevic, Jadranka; Tedrow, John R; Wu, Wei; Kaminski, Naftali; Wenzel, Sally E
2017-06-01
Severe asthma (SA) is a heterogeneous disease with multiple molecular mechanisms. Gene expression studies of bronchial epithelial cells in individuals with asthma have provided biological insight and underscored possible mechanistic differences between individuals. Identify networks of genes reflective of underlying biological processes that define SA. Airway epithelial cell gene expression from 155 subjects with asthma and healthy control subjects in the Severe Asthma Research Program was analyzed by weighted gene coexpression network analysis to identify gene networks and profiles associated with SA and its specific characteristics (i.e., pulmonary function tests, quality of life scores, urgent healthcare use, and steroid use), which potentially identified underlying biological processes. A linear model analysis confirmed these findings while adjusting for potential confounders. Weighted gene coexpression network analysis constructed 64 gene network modules, including modules corresponding to T1 and T2 inflammation, neuronal function, cilia, epithelial growth, and repair mechanisms. Although no network selectively identified SA, genes in modules linked to epithelial growth and repair and neuronal function were markedly decreased in SA. Several hub genes of the epithelial growth and repair module were found located at the 17q12-21 locus, near a well-known asthma susceptibility locus. T2 genes increased with severity in those treated with corticosteroids but were also elevated in untreated, mild-to-moderate disease compared with healthy control subjects. T1 inflammation, especially when associated with increased T2 gene expression, was elevated in a subgroup of younger patients with SA. In this hypothesis-generating analysis, gene expression networks in relation to asthma severity provided potentially new insight into biological mechanisms associated with the development of SA and its phenotypes.
Modena, Brian D.; Bleecker, Eugene R.; Busse, William W.; Erzurum, Serpil C.; Gaston, Benjamin M.; Jarjour, Nizar N.; Meyers, Deborah A.; Milosevic, Jadranka; Tedrow, John R.; Wu, Wei; Kaminski, Naftali
2017-01-01
Rationale: Severe asthma (SA) is a heterogeneous disease with multiple molecular mechanisms. Gene expression studies of bronchial epithelial cells in individuals with asthma have provided biological insight and underscored possible mechanistic differences between individuals. Objectives: Identify networks of genes reflective of underlying biological processes that define SA. Methods: Airway epithelial cell gene expression from 155 subjects with asthma and healthy control subjects in the Severe Asthma Research Program was analyzed by weighted gene coexpression network analysis to identify gene networks and profiles associated with SA and its specific characteristics (i.e., pulmonary function tests, quality of life scores, urgent healthcare use, and steroid use), which potentially identified underlying biological processes. A linear model analysis confirmed these findings while adjusting for potential confounders. Measurements and Main Results: Weighted gene coexpression network analysis constructed 64 gene network modules, including modules corresponding to T1 and T2 inflammation, neuronal function, cilia, epithelial growth, and repair mechanisms. Although no network selectively identified SA, genes in modules linked to epithelial growth and repair and neuronal function were markedly decreased in SA. Several hub genes of the epithelial growth and repair module were found located at the 17q12–21 locus, near a well-known asthma susceptibility locus. T2 genes increased with severity in those treated with corticosteroids but were also elevated in untreated, mild-to-moderate disease compared with healthy control subjects. T1 inflammation, especially when associated with increased T2 gene expression, was elevated in a subgroup of younger patients with SA. Conclusions: In this hypothesis-generating analysis, gene expression networks in relation to asthma severity provided potentially new insight into biological mechanisms associated with the development of SA and its phenotypes. PMID:27984699
Singh, Himanshu Narayan; Rajeswari, Moganty R
2016-01-01
Purine repeat sequences present in a gene are unique as they have high propensity to form unusual DNA-triple helix structures. Friedreich's ataxia is the only human disease that is well known to be associated with DNA-triplexes formed by purine repeats. The purpose of this study was to recognize the expanded purine repeats (EPRs) in human genome and find their correlation with cancer pathogenesis. We developed "PuRepeatFinder.pl" algorithm to identify non-overlapping EPRs without pyrimidine interruptions in the human genome and customized for searching repeat lengths, n ≥ 200. A total of 1158 EPRs were identified in the genome which followed Wakeby distribution. Two hundred and ninety-six EPRs were found in geneic regions of 282 genes (EPR-genes). Gene clustering of EPR-genes was done based on their cellular function and a large number of EPR-genes were found to be enzymes/enzyme modulators. Meta-analysis of 282 EPR-genes identified only 63 EPR-genes in association with cancer, mostly in breast, lung, and blood cancers. Protein-protein interaction network analysis of all 282 EPR-genes identified proteins including those in cadherins and VEGF. The two observations, that EPRs can induce mutations under malignant conditions and that identification of some EPR-gene products in vital cell signaling-mediated pathways, together suggest the crucial role of EPRs in carcinogenesis. The new link between EPR-genes and their functionally interacting proteins throws a new dimension in the present understanding of cancer pathogenesis and can help in planning therapeutic strategies. Validation of present results using techniques like NGS is required to establish the role of the EPR genes in cancer pathology.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Josse, Rozenn; Dumont, Julie; Fautrel, Alain
Gene expression profiling has recently emerged as a promising approach to identify early target genes and discriminate genotoxic carcinogens from non-genotoxic carcinogens and non-carcinogens. However, early gene changes induced by genotoxic compounds in human liver remain largely unknown. Primary human hepatocytes and differentiated HepaRG cells were exposed to aflatoxin B1 (AFB1) that induces DNA damage following enzyme-mediated bioactivation. Gene expression profile changes induced by a 24 h exposure of these hepatocyte models to 0.05 and 0.25 μM AFB1 were analyzed by using oligonucleotide pangenomic microarrays. The main altered signaling pathway was the p53 pathway and related functions such as cellmore » cycle, apoptosis and DNA repair. Direct involvement of the p53 protein in response to AFB1 was verified by using siRNA directed against p53. Among the 83 well-annotated genes commonly modulated in two pools of three human hepatocyte populations and HepaRG cells, several genes were identified as altered by AFB1 for the first time. In addition, a subset of 10 AFB1-altered genes, selected upon basis of their function or tumor suppressor role, was tested in four human hepatocyte populations and in response to other chemicals. Although they exhibited large variable inter-donor fold-changes, several of these genes, particularly FHIT, BCAS3 and SMYD3, were found to be altered by various direct and other indirect genotoxic compounds and unaffected by non-genotoxic compounds. Overall, this comprehensive analysis of early gene expression changes induced by AFB1 in human hepatocytes identified a gene subset that included several genes representing potential biomarkers of genotoxic compounds. -- Highlights: ► Gene expression profile changes induced by aflatoxin B1 in human hepatocytes. ► AFB1 modulates various genes including tumor suppressor genes and proto-oncogenes. ► Important inter-individual variations in the response to AFB1. ► Some genes also altered by other genotoxic compounds requiring or not bioactivation.« less
Novel genes associated with enhanced motility of Escherichia coli ST131
Kakkanat, Asha; Phan, Minh-Duy; Lo, Alvin W.; Beatson, Scott A.
2017-01-01
Uropathogenic Escherichia coli (UPEC) is the cause of ~75% of all urinary tract infections (UTIs) and is increasingly associated with multidrug resistance. This includes UPEC strains from the recently emerged and globally disseminated sequence type 131 (ST131), which is now the dominant fluoroquinolone-resistant UPEC clone worldwide. Most ST131 strains are motile and produce H4-type flagella. Here, we applied a combination of saturated Tn5 mutagenesis and transposon directed insertion site sequencing (TraDIS) as a high throughput genetic screen and identified 30 genes associated with enhanced motility of the reference ST131 strain EC958. This included 12 genes that repress motility of E. coli K-12, four of which (lrhA, ihfA, ydiV, lrp) were confirmed in EC958. Other genes represented novel factors that impact motility, and we focused our investigation on characterisation of the mprA, hemK and yjeA genes. Mutation of each of these genes in EC958 led to increased transcription of flagellar genes (flhD and fliC), increased expression of the FliC flagellin, enhanced flagella synthesis and a hyper-motile phenotype. Complementation restored all of these properties to wild-type level. We also identified Tn5 insertions in several intergenic regions (IGRs) on the EC958 chromosome that were associated with enhanced motility; this included flhDC and EC958_1546. In both of these cases, the Tn5 insertions were associated with increased transcription of the downstream gene(s), which resulted in enhanced motility. The EC958_1546 gene encodes a phage protein with similarity to esterase/deacetylase enzymes involved in the hydrolysis of sialic acid derivatives found in human mucus. We showed that over-expression of EC958_1546 led to enhanced motility of EC958 as well as the UPEC strains CFT073 and UTI89, demonstrating its activity affects the motility of different UPEC strains. Overall, this study has identified and characterised a number of novel factors associated with enhanced UPEC motility. PMID:28489862
Tamm-Rosenstein, Karin; Simm, Jaak; Suhorutshenko, Marina; Salumets, Andres; Metsis, Madis
2013-01-01
Background Estrogen (E2) and progesterone (P4) are key players in the maturation of the human endometrium. The corresponding steroid hormone modulators, tamoxifen (TAM) and mifepristone (RU486) are widely used in breast cancer therapy and for contraception purposes, respectively. Methodology/Principal findings Gene expression profiling of the human endometrial Ishikawa cancer cell line treated with E2 and P4 for 3 h and 12 h, and TAM and RU486 for 12 h, was performed using RNA-sequencing. High levels of mRNA were detected for genes, including PSAP, ATP5G2, ATP5H, and GNB2L1 following E2 or P4 treatment. A total of 82 biomarkers for endometrial biology were identified among E2 induced genes, and 93 among P4 responsive genes. Identified biomarkers included: EZH2, MDK, MUC1, SLIT2, and IL6ST, which are genes previously associated with endometrial receptivity. Moreover, 98.8% and 98.6% of E2 and P4 responsive genes in Ishikawa cells, respectively, were also detected in two human mid-secretory endometrial biopsy samples. TAM treatment exhibited both antagonistic and agonistic effects of E2, and also regulated a subset of genes independently. The cell cycle regulator cyclin D1 (CCND1) showed significant up-regulation following treatment with TAM. RU486 did not appear to act as a pure antagonist of P4 and a functional analysis of RU486 response identified genes related to adhesion and apoptosis, including down-regulated genes associated with cell-cell contacts and adhesion as CTNND1, JUP, CDH2, IQGAP1, and COL2A1. Conclusions Significant changes in gene expression by the Ishikawa cell line were detected after treatments with E2, P4, TAM, and RU486. These transcriptome data provide valuable insight into potential biomarkers related to endometrial receptivity, and also facilitate an understanding of the molecular changes that take place in the endometrium in the early stages of breast cancer treatment and contraception usage. PMID:23874806
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Yongbaek; Thai-Vu Ton; De Angelo, Anthony B.
2006-07-15
This study was performed to characterize the gene expression profile and to identify the major carcinogenic pathways involved in rat peritoneal mesothelioma (RPM) formation following treatment of Fischer 344 rats with o-nitrotoluene (o-NT) or bromochloracetic acid (BCA). Oligo arrays, with over 20,000 target genes, were used to evaluate o-NT- and BCA-induced RPMs, when compared to a non-transformed mesothelial cell line (Fred-PE). Analysis using Ingenuity Pathway Analysis software revealed 169 cancer-related genes that were categorized into binding activity, growth and proliferation, cell cycle progression, apoptosis, and invasion and metastasis. The microarray data were validated by positive correlation with quantitative real-time RT-PCRmore » on 16 selected genes including igf1, tgfb3 and nov. Important carcinogenic pathways involved in RPM formation included insulin-like growth factor 1 (IGF-1), p38 MAPkinase, Wnt/{beta}-catenin and integrin signaling pathways. This study demonstrated that mesotheliomas in rats exposed to o-NT- and BCA were similar to mesotheliomas in humans, at least at the cellular and molecular level.« less
The genetic basis of female reproductive disorders: Etiology and clinical testing ☆
Layman, Lawrence C.
2013-01-01
With the advent of improved molecular biology techniques, the genetic basis of an increasing number of reproductive disorders has been elucidated. Mutations in at least 20 genes cause hypogonadotropic hypogonadism including Kallmann syndrome in about 35–40% of patients. The two most commonly involved genes are FGFR1 and CHD7. When combined pituitary hormone deficiency includes hypogonadotropic hypogonadism as a feature, PROP1 mutations are the most common of the six genes involved. For hypergonadotropic hypogonadism, mutations in 14 genes cause gonadal failure in 15% of affected females, most commonly in FMR1. In eugonadal disorders, activating FSHR mutations have been identified for spontaneous ovarian hyperstimulation syndrome; and WNT4 mutations have been described in mullerian aplasia. For other eugonadal disorders, such as endometriosis, polycystic ovary syndrome, and leiomyomata, specific germline gene mutations have not been identified, but some chromosomal regions are associated with the corresponding phenotype. Practical genetic testing is possible to perform in both hypogonadotropic and hypergonadotropic hypogonadism and spontaneous ovarian hyperstimulation syndrome. However, clinical testing for endometriosis, polycystic ovary syndrome, and leiomyomata is not currently practical for the clinician. PMID:23499866
Carr, Paul D.; Tuckwell, Danny; Hey, Peter M.; Simon, Laurence; d'Enfert, Christophe; Birch, Mike; Oliver, Jason D.; Bromley, Michael J.
2010-01-01
Genes that are essential for viability represent potential targets for the development of anti-infective agents. However, relatively few have been determined in the filamentous fungal pathogen Aspergillus fumigatus. A novel solution employing parasexual genetics coupled with transposon mutagenesis using the Fusarium oxysporum transposon impala had previously enabled the identification of 20 essential genes from A. fumigatus; however, further use of this system required a better understanding of the mode of action of the transposon itself. Examination of a range of conditions indicated that impala is activated by prolonged exposure to low temperatures. This newly identified property was then harnessed to identify 96 loci that are critical for viability in A. fumigatus, including genes required for RNA metabolism, organelle organization, protein transport, ribosome biogenesis, and transcription, as well as a number of noncoding RNAs. A number of these genes represent potential targets for much-needed novel antifungal drugs. PMID:20097738
Axon Regeneration Genes Identified by RNAi Screening in C. elegans
Nix, Paola; Hammarlund, Marc; Hauth, Linda; Lachnit, Martina; Jorgensen, Erik M.
2014-01-01
Axons of the mammalian CNS lose the ability to regenerate soon after development due to both an inhibitory CNS environment and the loss of cell-intrinsic factors necessary for regeneration. The complex molecular events required for robust regeneration of mature neurons are not fully understood, particularly in vivo. To identify genes affecting axon regeneration in Caenorhabditis elegans, we performed both an RNAi-based screen for defective motor axon regeneration in unc-70/β-spectrin mutants and a candidate gene screen. From these screens, we identified at least 50 conserved genes with growth-promoting or growth-inhibiting functions. Through our analysis of mutants, we shed new light on certain aspects of regeneration, including the role of β-spectrin and membrane dynamics, the antagonistic activity of MAP kinase signaling pathways, and the role of stress in promoting axon regeneration. Many gene candidates had not previously been associated with axon regeneration and implicate new pathways of interest for therapeutic intervention. PMID:24403161
Novel mutations in the SOX10 gene in the first two Chinese cases of type IV Waardenburg syndrome.
Jiang, Lu; Chen, Hongsheng; Jiang, Wen; Hu, Zhengmao; Mei, Lingyun; Xue, Jingjie; He, Chufeng; Liu, Yalan; Xia, Kun; Feng, Yong
2011-05-20
We analyzed the clinical features and family-related gene mutations for the first two Chinese cases of type IV Waardenburg syndrome (WS4). Two families were analyzed in this study. The analysis included a medical history, clinical analysis, a hearing test and a physical examination. In addition, the EDNRB, EDN3 and SOX10 genes were sequenced in order to identify the pathogenic mutation responsible for the WS4 observed in these patients. The two WS4 cases presented with high phenotypic variability. Two novel heterozygous mutations (c.254G>A and c.698-2A>T) in the SOX10 gene were detected. The mutations identified in the patients were not found in unaffected family members or in 200 unrelated control subjects. This is the first report of WS4 in Chinese patients. In addition, two novel mutations in SOX10 gene have been identified. Crown Copyright © 2011. Published by Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Donovan, Jenny; Venville, Grady
2014-01-01
Previous research showed that primary school children held several misconceptions about genetics of concern for their future lives. Included were beliefs that genes and DNA are separate substances, with genes causing family resemblance and DNA identifying suspects at crime scenes. Responses to this work "blamed" the mass media for these…
A genomics based discovery of secondary metabolite biosynthetic gene clusters in Aspergillus ustus.
Pi, Borui; Yu, Dongliang; Dai, Fangwei; Song, Xiaoming; Zhu, Congyi; Li, Hongye; Yu, Yunsong
2015-01-01
Secondary metabolites (SMs) produced by Aspergillus have been extensively studied for their crucial roles in human health, medicine and industrial production. However, the resulting information is almost exclusively derived from a few model organisms, including A. nidulans and A. fumigatus, but little is known about rare pathogens. In this study, we performed a genomics based discovery of SM biosynthetic gene clusters in Aspergillus ustus, a rare human pathogen. A total of 52 gene clusters were identified in the draft genome of A. ustus 3.3904, such as the sterigmatocystin biosynthesis pathway that was commonly found in Aspergillus species. In addition, several SM biosynthetic gene clusters were firstly identified in Aspergillus that were possibly acquired by horizontal gene transfer, including the vrt cluster that is responsible for viridicatumtoxin production. Comparative genomics revealed that A. ustus shared the largest number of SM biosynthetic gene clusters with A. nidulans, but much fewer with other Aspergilli like A. niger and A. oryzae. These findings would help to understand the diversity and evolution of SM biosynthesis pathways in genus Aspergillus, and we hope they will also promote the development of fungal identification methodology in clinic.
Spectrum of mutations in leiomyosarcomas identified by clinical targeted next-generation sequencing.
Lee, Paul J; Yoo, Naomi S; Hagemann, Ian S; Pfeifer, John D; Cottrell, Catherine E; Abel, Haley J; Duncavage, Eric J
2017-02-01
Recurrent genomic mutations in uterine and non-uterine leiomyosarcomas have not been well established. Using a next generation sequencing (NGS) panel of common cancer-associated genes, 25 leiomyosarcomas arising from multiple sites were examined to explore genetic alterations, including single nucleotide variants (SNV), small insertions/deletions (indels), and copy number alterations (CNA). Sequencing showed 86 non-synonymous, coding region somatic variants within 151 gene targets in 21 cases, with a mean of 4.1 variants per case; 4 cases had no putative mutations in the panel of genes assayed. The most frequently altered genes were TP53 (36%), ATM and ATRX (16%), and EGFR and RB1 (12%). CNA were identified in 85% of cases, with the most frequent copy number losses observed in chromosomes 10 and 13 including PTEN and RB1; the most frequent gains were seen in chromosomes 7 and 17. Our data show that deletions in canonical cancer-related genes are common in leiomyosarcomas. Further, the spectrum of gene mutations observed shows that defects in DNA repair and chromosomal maintenance are central to the biology of leiomyosarcomas, and that activating mutations observed in other common cancer types are rare in leiomyosarcomas. Copyright © 2017 Elsevier Inc. All rights reserved.
A Genomics Based Discovery of Secondary Metabolite Biosynthetic Gene Clusters in Aspergillus ustus
Pi, Borui; Yu, Dongliang; Dai, Fangwei; Song, Xiaoming; Zhu, Congyi; Li, Hongye; Yu, Yunsong
2015-01-01
Secondary metabolites (SMs) produced by Aspergillus have been extensively studied for their crucial roles in human health, medicine and industrial production. However, the resulting information is almost exclusively derived from a few model organisms, including A. nidulans and A. fumigatus, but little is known about rare pathogens. In this study, we performed a genomics based discovery of SM biosynthetic gene clusters in Aspergillus ustus, a rare human pathogen. A total of 52 gene clusters were identified in the draft genome of A. ustus 3.3904, such as the sterigmatocystin biosynthesis pathway that was commonly found in Aspergillus species. In addition, several SM biosynthetic gene clusters were firstly identified in Aspergillus that were possibly acquired by horizontal gene transfer, including the vrt cluster that is responsible for viridicatumtoxin production. Comparative genomics revealed that A. ustus shared the largest number of SM biosynthetic gene clusters with A. nidulans, but much fewer with other Aspergilli like A. niger and A. oryzae. These findings would help to understand the diversity and evolution of SM biosynthesis pathways in genus Aspergillus, and we hope they will also promote the development of fungal identification methodology in clinic. PMID:25706180
Digital transcriptome analysis of putative sex-determination genes in papaya (Carica papaya).
Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo
2012-01-01
Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Y(h)) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Y(h) chromosome, implying a loss of many genes on the Y(h) chromosome. Nevertheless, candidate Y(h) chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya.
Digital Transcriptome Analysis of Putative Sex-Determination Genes in Papaya (Carica papaya)
Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo
2012-01-01
Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Yh) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Yh chromosome, implying a loss of many genes on the Yh chromosome. Nevertheless, candidate Yh chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya. PMID:22815863
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-05-26
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes.
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-01-01
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414
Gassó, Patricia; Mas, Sergi; Rodríguez, Natalia; Boloc, Daniel; García-Cerro, Susana; Bernardo, Miquel; Lafuente, Amalia; Parellada, Eduard
2017-12-01
Schizophrenia (SZ) is a chronic psychiatric disorder whose onset of symptoms occurs in late adolescence and early adulthood. The etiology is complex and involves important gene-environment interactions. Microarray gene-expression studies on SZ have identified alterations in several biological processes. The heterogeneity in the results can be attributed to the use of different sample types and other important confounding factors including age, illness chronicity and antipsychotic exposure. The aim of the present microarray study was to analyze, for the first time to our knowledge, differences in gene expression profiles in 18 fibroblast (FCLs) and 14 lymphoblastoid cell lines (LCLs) from antipsychotic-naïve first-episode schizophrenia (FES) patients and healthy controls. We used an analytical approach based on protein-protein interaction network construction and functional annotation analysis to identify the biological processes that are altered in SZ. Significant differences in the expression of 32 genes were found when LCLs were assessed. The network and gene set enrichment approach revealed the involvement of similar biological processes in FCLs and LCLs, including apoptosis and related biological terms such as cell cycle, autophagy, cytoskeleton organization and response to stress and stimulus. Metabolism and other processes, including signal transduction, kinase activity and phosphorylation, were also identified. These results were replicated in two independent cohorts using the same analytical approach. This provides more evidence for altered apoptotic processes in antipsychotic-naïve FES patients and other important biological functions such as cytoskeleton organization and metabolism. The convergent results obtained in both peripheral cell models support their usefulness for transcriptome studies on SZ. Copyright © 2017 Elsevier Ltd. All rights reserved.
Dai, Wei; Siddiq, Afshan; Walley, Andrew J; Limpaiboon, Temduang; Brown, Robert
2013-01-01
Genetic abnormalities of cholangiocarcinoma have been widely studied; however, epigenomic changes related to cholangiocarcinogenesis have been less well characterised. We have profiled the DNA methylomes of 28 primary cholangiocarcinoma and six matched adjacent normal tissues using Infinium’s HumanMethylation27 BeadChips with the aim of identifying gene sets aberrantly epigenetically regulated in this tumour type. Using a linear model for microarray data we identified 1610 differentially methylated autosomal CpG sites with 809 CpG sites (representing 603 genes) being hypermethylated and 801 CpG sites (representing 712 genes) being hypomethylated in cholangiocarcinoma versus adjacent normal tissues (false discovery rate ≤ 0.05). Gene ontology and gene set enrichment analyses identified gene sets significantly associated with hypermethylation at linked CpG sites in cholangiocarcinoma including homeobox genes and target genes of PRC2, EED, SUZ12 and histone H3 trimethylation at lysine 27. We confirmed frequent hypermethylation at the homeobox genes HOXA9 and HOXD9 by bisulfite pyrosequencing in a larger cohort of cholangiocarcinoma (n = 102). Our findings indicate a key role for hypermethylation of multiple CpG sites at genes associated with a stem cell-like phenotype as a common molecular aberration in cholangiocarcinoma. These data have implications for cholangiocarcinogenesis, as well as possible novel treatment options using histone methyltransferase inhibitors. PMID:24089088
Charlesworth, Jac C; Peralta, Juan M; Drigalenko, Eugene; Göring, Harald Hh; Almasy, Laura; Dyer, Thomas D; Blangero, John
2009-12-15
Gene identification using linkage, association, or genome-wide expression is often underpowered. We propose that formal combination of information from multiple gene-identification approaches may lead to the identification of novel loci that are missed when only one form of information is available. Firstly, we analyze the Genetic Analysis Workshop 16 Framingham Heart Study Problem 2 genome-wide association data for HDL-cholesterol using a "gene-centric" approach. Then we formally combine the association test results with genome-wide transcriptional profiling data for high-density lipoprotein cholesterol (HDL-C), from the San Antonio Family Heart Study, using a Z-transform test (Stouffer's method). We identified 39 genes by the joint test at a conservative 1% false-discovery rate, including 9 from the significant gene-based association test and 23 whose expression was significantly correlated with HDL-C. Seven genes identified as significant in the joint test were not independently identified by either the association or expression tests. This combined approach has increased power and leads to the direct nomination of novel candidate genes likely to be involved in the determination of HDL-C levels. Such information can then be used as justification for a more exhaustive search for functional sequence variation within the nominated genes. We anticipate that this type of analysis will improve our speed of identification of regulatory genes causally involved in disease risk.
Differential gene expression in human abdominal aortic aneurysm and aortic occlusive disease
Moran, Corey S.; Schreurs, Charlotte; Lindeman, Jan H. N.; Walker, Philip J.; Nataatmadja, Maria; West, Malcolm; Holdt, Lesca M.; Hinterseher, Irene; Pilarsky, Christian; Golledge, Jonathan
2015-01-01
Abdominal aortic aneurysm (AAA) and aortic occlusive disease (AOD) represent common causes of morbidity and mortality in elderly populations which were previously believed to have common aetiologies. The aim of this study was to assess the gene expression in human AAA and AOD. We performed microarrays using aortic specimen obtained from 20 patients with small AAAs (≤ 55mm), 29 patients with large AAAs (> 55mm), 9 AOD patients, and 10 control aortic specimens obtained from organ donors. Some differentially expressed genes were validated by quantitative-PCR (qRT-PCR)/immunohistochemistry. We identified 840 and 1,014 differentially expressed genes in small and large AAAs, respectively. Immune-related pathways including cytokine-cytokine receptor interaction and T-cell-receptor signalling were upregulated in both small and large AAAs. Examples of validated genes included CTLA4 (2.01-fold upregulated in small AAA, P = 0.002), NKTR (2.37-and 2.66-fold upregulated in small and large AAA with P = 0.041 and P = 0.015, respectively), and CD8A (2.57-fold upregulated in large AAA, P = 0.004). 1,765 differentially expressed genes were identified in AOD. Pathways upregulated in AOD included metabolic and oxidative phosphorylation categories. The UCP2 gene was downregulated in AOD (3.73-fold downregulated, validated P = 0.017). In conclusion, the AAA and AOD transcriptomes were very different suggesting that AAA and AOD have distinct pathogenic mechanisms. PMID:25944698
Magalhães, Alexandre P.; Verde, Nuno; Reis, Francisca; Martins, Inês; Costa, Daniela; Lino-Neto, Teresa; Castro, Pedro H.; Tavares, Rui M.; Azevedo, Herlânder
2016-01-01
Quercus suber (cork oak) is a West Mediterranean species of key economic interest, being extensively explored for its ability to generate cork. Like other Mediterranean plants, Q. suber is significantly threatened by climatic changes, imposing the need to quickly understand its physiological and molecular adaptability to drought stress imposition. In the present report, we uncovered the differential transcriptome of Q. suber roots exposed to long-term drought, using an RNA-Seq approach. 454-sequencing reads were used to de novo assemble a reference transcriptome, and mapping of reads allowed the identification of 546 differentially expressed unigenes. These were enriched in both effector genes (e.g., LEA, chaperones, transporters) as well as regulatory genes, including transcription factors (TFs) belonging to various different classes, and genes associated with protein turnover. To further extend functional characterization, we identified the orthologs of differentially expressed unigenes in the model species Arabidopsis thaliana, which then allowed us to perform in silico functional inference, including gene network analysis for protein function, protein subcellular localization and gene co-expression, and in silico enrichment analysis for TFs and cis-elements. Results indicated the existence of extensive transcriptional regulatory events, including activation of ABA-responsive genes and ABF-dependent signaling. We were then able to establish that a core ABA-signaling pathway involving PP2C-SnRK2-ABF components was induced in stressed Q. suber roots, identifying a key mechanism in this species’ response to drought. PMID:26793200
Janecko, Nicol; Halova, Dana; Jamborova, Ivana; Papousek, Ivo; Masarikova, Martina; Dolejska, Monika; Literak, Ivan
2018-04-19
The spread of antimicrobial resistance from human activity derived sources to natural habitats implicates wildlife as potential vectors of antimicrobial resistance transfer. Wild birds, including corvid species can disseminate mobile genetic resistance determinants through feces. This study aimed to determine the occurrence of plasmid-mediated quinolone resistance (PMQR) genes in Escherichia coli and Klebsiella spp. isolates obtained from winter roosting sites of American crows (Corvus brachyrhynchos) and common ravens (Corvus corax) in Canada. Fecal swabs were collected at five roosting sites across Canada. Selective media isolation and multiplex PCR screening was utilized to identify PMQR genes followed by gene sequencing, PFGE and MLST to characterize isolates. Despite the low prevalence of E. coli containing PMQR (1.3%, 6/449), qnrS1, qnrB19, qnrC, oqxAB and aac(6')-Ib-cr genes were found in five sequence types (ST), including E. coli ST 131. Conversely, one isolate of Klebsiella pneumoniae contained the plasmid-mediated resistance gene qnrB19. Five different K. pneumoniae STs were identified, including two novel types. The occurrence of PMQR genes and STs of public health significance in E. coli and Klebsiella pneumoniae recovered from corvids gives further evidence of the anthropogenic derived dissemination of antimicrobial resistance determinants at the human activity-wildlife-environment interface. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Genetic association of impulsivity in young adults: a multivariate study
Khadka, S; Narayanan, B; Meda, S A; Gelernter, J; Han, S; Sawyer, B; Aslanzadeh, F; Stevens, M C; Hawkins, K A; Anticevic, A; Potenza, M N; Pearlson, G D
2014-01-01
Impulsivity is a heritable, multifaceted construct with clinically relevant links to multiple psychopathologies. We assessed impulsivity in young adult (N~2100) participants in a longitudinal study, using self-report questionnaires and computer-based behavioral tasks. Analysis was restricted to the subset (N=426) who underwent genotyping. Multivariate association between impulsivity measures and single-nucleotide polymorphism data was implemented using parallel independent component analysis (Para-ICA). Pathways associated with multiple genes in components that correlated significantly with impulsivity phenotypes were then identified using a pathway enrichment analysis. Para-ICA revealed two significantly correlated genotype–phenotype component pairs. One impulsivity component included the reward responsiveness subscale and behavioral inhibition scale of the Behavioral-Inhibition System/Behavioral-Activation System scale, and the second impulsivity component included the non-planning subscale of the Barratt Impulsiveness Scale and the Experiential Discounting Task. Pathway analysis identified processes related to neurogenesis, nervous system signal generation/amplification, neurotransmission and immune response. We identified various genes and gene regulatory pathways associated with empirically derived impulsivity components. Our study suggests that gene networks implicated previously in brain development, neurotransmission and immune response are related to impulsive tendencies and behaviors. PMID:25268255
Nguyen, Thong T; Suryamohan, Kushal; Kuriakose, Boney; Janakiraman, Vasantharajan; Reichelt, Mike; Chaudhuri, Subhra; Guillory, Joseph; Divakaran, Neethu; Rabins, P E; Goel, Ridhi; Deka, Bhabesh; Sarkar, Suman; Ekka, Preety; Tsai, Yu-Chih; Vargas, Derek; Santhosh, Sam; Mohan, Sangeetha; Chin, Chen-Shan; Korlach, Jonas; Thomas, George; Babu, Azariah; Seshagiri, Somasekar
2018-06-12
We sequenced the Hyposidra talaca NPV (HytaNPV) double stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculovirus viruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV that infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis oblique, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motif consistent with the temporal expression of the genes observed in the RNA-seq data.
Buchner, Peter; Hawkesford, Malcolm J.
2014-01-01
NPF (formerly referred to as low-affinity NRT1) and ‘high-affinity’ NRT2 nitrate transporter genes are involved in nitrate uptake by the root, and transport and distribution of nitrate within the plant. The NPF gene family consists of 53 members in Arabidopsis thaliana, however only 11 of these have been functionally characterized. Although homologous genes have been identified in genomes of different plant species including some cereals, there is little information available for wheat (Triticum aestivum). Sixteen genes were identified in wheat homologous to characterized Arabidopsis low-affinity nitrate transporter NPF genes, suggesting a complex wheat NPF gene family. The regulation of wheat NFP genes by plant N-status indicated involvement of these transporters in substrate transport in relation to N-metabolism. The complex expression pattern in relation to tissue specificity, nitrate availability and senescence may be associated with the complex growth patterns of wheat depending on sink/source demands, as well as remobilization during grain filling. PMID:24913625
A duplicated PLP gene causing Pelizaeus-Merzbacher disease detected by comparative multiplex PCR
DOE Office of Scientific and Technical Information (OSTI.GOV)
Inoue, K.; Sugiyama, N.; Kawanishi, C.
1996-07-01
Pelizaeus-Merzbacher disease (PMD) is an X-linked dysmyelinating disorder caused by abnormalities in the proteolipid protein (PLP) gene, which is essential for oligodendrocyte differentiation and CNS myelin formation. Although linkage analysis has shown the homogeneity at the PLP locus in patients with PMD, exonic mutations in the PLP gene have been identified in only 10% - 25% of all cases, which suggests the presence of other genetic aberrations, including gene duplication. In this study, we examined five families with PMD not carrying exonic mutations in PLP gene, using comparative multiplex PCR (CM-PCR) as a semiquantitative assay of gene dosage. PLP genemore » duplications were identified in four families by CM-PCR and confirmed in three families by densitometric RFLP analysis. Because a homologous myelin protein gene, PMP22, is duplicated in the majority of patients with Charcot-Marie-Tooth 1A, PLP gene overdosage may be an important genetic abnormality in PMD and affect myelin formation. 38 ref., 5 figs., 2 tabs.« less
Feature genes predicting the FLT3/ITD mutation in acute myeloid leukemia
LI, CHENGLONG; ZHU, BIAO; CHEN, JIAO; HUANG, XIAOBING
2016-01-01
In the present study, gene expression profiles of acute myeloid leukemia (AML) samples were analyzed to identify feature genes with the capacity to predict the mutation status of FLT3/ITD. Two machine learning models, namely the support vector machine (SVM) and random forest (RF) methods, were used for classification. Four datasets were downloaded from the European Bioinformatics Institute, two of which (containing 371 samples, including 281 FLT3/ITD mutation-negative and 90 mutation-positive samples) were randomly defined as the training group, while the other two datasets (containing 488 samples, including 350 FLT3/ITD mutation-negative and 138 mutation-positive samples) were defined as the test group. Differentially expressed genes (DEGs) were identified by significance analysis of the micro-array data by using the training samples. The classification efficiency of the SCM and RF methods was evaluated using the following parameters: Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and the area under the receiver operating characteristic curve. Functional enrichment analysis was performed for the feature genes with DAVID. A total of 585 DEGs were identified in the training group, of which 580 were upregulated and five were downregulated. The classification accuracy rates of the two methods for the training group, the test group and the combined group using the 585 feature genes were >90%. For the SVM and RF methods, the rates of correct determination, specificity and PPV were >90%, while the sensitivity and NPV were >80%. The SVM method produced a slightly better classification effect than the RF method. A total of 13 biological pathways were overrepresented by the feature genes, mainly involving energy metabolism, chromatin organization and translation. The feature genes identified in the present study may be used to predict the mutation status of FLT3/ITD in patients with AML. PMID:27177049
Shestov, Maksim; Ontañón, Santiago; Tozeren, Aydin
2015-10-13
Bacterial infections comprise a global health challenge as the incidences of antibiotic resistance increase. Pathogenic potential of bacteria has been shown to be context dependent, varying in response to environment and even within the strains of the same genus. We used the KEGG repository and extensive literature searches to identify among the 2527 bacterial genomes in the literature those implicated as pathogenic to the host, including those which show pathogenicity in a context dependent manner. Using data on the gene contents of these genomes, we identified sets of genes highly abundant in pathogenic but relatively absent in commensal strains and vice versa. In addition, we carried out genome comparison within a genus for the seventeen largest genera in our genome collection. We projected the resultant lists of ortholog genes onto KEGG bacterial pathways to identify clusters and circuits, which can be linked to either pathogenicity or synergy. Gene circuits relatively abundant in nonpathogenic bacteria often mediated biosynthesis of antibiotics. Other synergy-linked circuits reduced drug-induced toxicity. Pathogen-abundant gene circuits included modules in one-carbon folate, two-component system, type-3 secretion system, and peptidoglycan biosynthesis. Antibiotics-resistant bacterial strains possessed genes modulating phagocytosis, vesicle trafficking, cytoskeletal reorganization, and regulation of the inflammatory response. Our study also identified bacterial genera containing a circuit, elements of which were previously linked to Alzheimer's disease. Present study produces for the first time, a signature, in the form of a robust list of gene circuitry whose presence or absence could potentially define the pathogenicity of a microbiome. Extensive literature search substantiated a bulk majority of the commensal and pathogenic circuitry in our predicted list. Scanning microbiome libraries for these circuitry motifs will provide further insights into the complex and context dependent pathogenicity of bacteria.
Feature genes predicting the FLT3/ITD mutation in acute myeloid leukemia.
Li, Chenglong; Zhu, Biao; Chen, Jiao; Huang, Xiaobing
2016-07-01
In the present study, gene expression profiles of acute myeloid leukemia (AML) samples were analyzed to identify feature genes with the capacity to predict the mutation status of FLT3/ITD. Two machine learning models, namely the support vector machine (SVM) and random forest (RF) methods, were used for classification. Four datasets were downloaded from the European Bioinformatics Institute, two of which (containing 371 samples, including 281 FLT3/ITD mutation-negative and 90 mutation‑positive samples) were randomly defined as the training group, while the other two datasets (containing 488 samples, including 350 FLT3/ITD mutation-negative and 138 mutation-positive samples) were defined as the test group. Differentially expressed genes (DEGs) were identified by significance analysis of the microarray data by using the training samples. The classification efficiency of the SCM and RF methods was evaluated using the following parameters: Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and the area under the receiver operating characteristic curve. Functional enrichment analysis was performed for the feature genes with DAVID. A total of 585 DEGs were identified in the training group, of which 580 were upregulated and five were downregulated. The classification accuracy rates of the two methods for the training group, the test group and the combined group using the 585 feature genes were >90%. For the SVM and RF methods, the rates of correct determination, specificity and PPV were >90%, while the sensitivity and NPV were >80%. The SVM method produced a slightly better classification effect than the RF method. A total of 13 biological pathways were overrepresented by the feature genes, mainly involving energy metabolism, chromatin organization and translation. The feature genes identified in the present study may be used to predict the mutation status of FLT3/ITD in patients with AML.
Novel Myopia Genes and Pathways Identified From Syndromic Forms of Myopia
Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.
2018-01-01
Purpose To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia and the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1, COL9A2, ERBB3, FBN1, GJA1, GNPTG, IFIH1, KIF11, LTBP2, OCA2, POLR3B, POMT1, PTPN11, TFAP2A, ZNF469). Conclusions Common genetic variants within or nearby genes that cause syndromic myopia are enriched for variants that cause nonsyndromic, common myopia. Analysis of syndromic forms of refractive errors can provide new insights into the etiology of myopia and additional potential targets for therapeutic interventions. PMID:29346494
Zhou, Qingyuan; Jia, Junting; Huang, Xing; Yan, Xueqing; Cheng, Liqin; Chen, Shuangyan; Li, Xiaoxia; Peng, Xianjun; Liu, Gongshe
2014-05-26
Many Poaceae species show a gametophytic self-incompatibility (GSI) system, which is controlled by at least two independent and multiallelic loci, S and Z. Until currently, the gene products for S and Z were unknown. Grass SI plant stigmas discriminate between pollen grains that land on its surface and support compatible pollen tube growth and penetration into the stigma, whereas recognizing incompatible pollen and thus inhibiting pollination behaviors. Leymus chinensis (Trin.) Tzvel. (sheepgrass) is a Poaceae SI species. A comprehensive analysis of sheepgrass stigma transcriptome may provide valuable information for understanding the mechanism of pollen-stigma interactions and grass SI. The transcript abundance profiles of mature stigmas, mature ovaries and leaves were examined using high-throughput next generation sequencing technology. A comparative transcriptomic analysis of these tissues identified 1,025 specifically or preferentially expressed genes in sheepgrass stigmas. These genes contained a significant proportion of genes predicted to function in cell-cell communication and signal transduction. We identified 111 putative transcription factors (TFs) genes and the most abundant groups were MYB, C2H2, C3H, FAR1, MADS. Comparative analysis of the sheepgrass, rice and Arabidopsis stigma-specific or preferential datasets showed broad similarities and some differences in the proportion of genes in the Gene Ontology (GO) functional categories. Potential SI candidate genes identified in other grasses were also detected in the sheepgrass stigma-specific or preferential dataset. Quantitative real-time PCR experiments validated the expression pattern of stigma preferential genes including homologous grass SI candidate genes. This study represents the first large-scale investigation of gene expression in the stigmas of an SI grass species. We uncovered many notable genes that are potentially involved in pollen-stigma interactions and SI mechanisms, including genes encoding receptor-like protein kinases (RLK), CBL (calcineurin B-like proteins) interacting protein kinases, calcium-dependent protein kinase, expansins, pectinesterase, peroxidases and various transcription factors. The availability of a pool of stigma-specific or preferential genes for L. chinensis offers an opportunity to elucidate the mechanisms of SI in Poaceae.
A gene expression signature associated with survival in metastatic melanoma
Mandruzzato, Susanna; Callegaro, Andrea; Turcatel, Gianluca; Francescato, Samuela; Montesco, Maria C; Chiarion-Sileni, Vanna; Mocellin, Simone; Rossi, Carlo R; Bicciato, Silvio; Wang, Ena; Marincola, Francesco M; Zanovello, Paola
2006-01-01
Background Current clinical and histopathological criteria used to define the prognosis of melanoma patients are inadequate for accurate prediction of clinical outcome. We investigated whether genome screening by means of high-throughput gene microarray might provide clinically useful information on patient survival. Methods Forty-three tumor tissues from 38 patients with stage III and stage IV melanoma were profiled with a 17,500 element cDNA microarray. Expression data were analyzed using significance analysis of microarrays (SAM) to identify genes associated with patient survival, and supervised principal components (SPC) to determine survival prediction. Results SAM analysis revealed a set of 80 probes, corresponding to 70 genes, associated with survival, i.e. 45 probes characterizing longer and 35 shorter survival times, respectively. These transcripts were included in a survival prediction model designed using SPC and cross-validation which allowed identifying 30 predicting probes out of the 80 associated with survival. Conclusion The longer-survival group of genes included those expressed in immune cells, both innate and acquired, confirming the interplay between immunological mechanisms and the natural history of melanoma. Genes linked to immune cells were totally lacking in the poor-survival group, which was instead associated with a number of genes related to highly proliferative and invasive tumor cells. PMID:17129373
Wang, Fang; Jia, Yongfang; Wang, Po; Yang, Qianwen; Du, QiYan; Chang, ZhongJie
2017-04-28
MicroRNAs (miRNAs) are endogenous small non-coding RNAs that regulate gene expression by targeting specific mRNAs. However, the possible role of miRNAs in the ovary differentiation and development of fish is not well understood. In this study, we examined the expression profiles and differential expression of miRNAs during three key stages of ovarian development and different developmental stages in common carp Cyprinus carpio. A total of 8765 miRNAs were identified, including 2155 conserved miRNAs highly conserved among various species, 145 miRNAs registered in miRBase for common carp, and 6505 novel miRNAs identified in common carp for the first time. Comparison of miRNA expression profiles among the five libraries identified 714 co-expressed and 2382 specific expressed miRNAs. Overall, 150, 628, and 431 specifically expressed miRNAs were identified in primordial gonad, juvenile ovary, and adult ovary, respectively. MiR-6758-3p, miR-3050-5p, and miR-2985-3p were highly expressed in primordial gonad, miR-3544-5p, miR-6877-3p, and miR-9086-5p were highly expressed in juvenile ovary, and miR-154-3p, miR-5307-5p, and miR-3958-3p were highly expressed in adult ovary. Predicted target genes of specific miRNAs in primordial gonad were involved in many reproductive biology signaling pathways, including transforming growth factor-β, Wnt, oocyte meiosis, mitogen-activated protein kinase, Notch, p53, and gonadotropin-releasing hormone pathways. Target-gene prediction revealed upward trends in miRNAs targeting male-bias genes, including dmrt1, atm, gsdf, and sox9, and downward trends in miRNAs targeting female-bias genes including foxl2, smad3, and smad4. Other sex-related genes such as sf1 were also predicted to be miRNA target genes. This comprehensive miRNA transcriptome analysis demonstrated differential expression profiles of miRNAs during ovary development in common carp. These results could facilitate future exploitation of the sex-regulatory roles and mechanisms of miRNAs, especially in primordial gonads, while the specifically expressed miRNAs represent candidates for studying the mechanisms of ovary determination in Yellow River carp.
[Identification of lactic acid bacteria in commercial yogurt and their antibiotic resistance].
Qin, Yuxuan; Li, Jing; Wang, Qiuya; Gao, Kexin; Zhu, Baoli; Lv, Na
2013-08-04
To identify lactic acid bacteria (LAB) in commercial yogurts and investigate their antibiotic resistance. LABs were cultured from 5 yogurt brands and the isolates were identified at the species level by 16S rRNA sequence. Genotyping was performed by repetitive extragenic palindromic PCR (rep-PCR). The sensitivity to 7 antibiotics was tested for all LAB isolates by Kirby-Bauer paper diffusion (K-B method). Meanwhile, 9 antibiotic resistance genes (ARGs), including erythromycin resistance genes (ermA and ermB) and tetracycline resistance genes (tetM, tetK, tetS, tetQ, tetO, tetL and tetW), were detected by PCR amplification in the identified LAB isolates. The PCR products were confirmed by sequencing. Total 100 LABs were isolated, including 23 Lactobacillus delbrueckii ssp. bulgaricus, 26 Lactobacillus casei, 30 Streptococcus thermophilus, 5 Lactobacillus acidophilus, 6 Lactobacillus plantarum, and 10 Lactobacillus paracasei. The drug susceptibility test shows that all 100 isolates were resistant to gentamicin and streptomycin, 42 isolates were resistant to vancomycin, and on the contrary all were sensitive to cefalexin, erythromycin, tetracycline and oxytetracycline. Moreover, 5 ARGs were found in the 28 sequencing confirmed isolates, ermB gene was detected in 8 isolates, tet K in 4 isolates, tetL in 2 isolates, tetM in 4 isolates, tetO in 2 isolates. erm A, tet S, tet Q and tet W genes were not detected in the isolates. Antibiotic resistance genes were found in 53.57% (15/28) sequenced isolates, 2 -3 antibiotic resistance genes were detected in 4 isolates of L. delbrueckii ssp. bulgaricus. Some LABs were not labeled in commercial yogurt products. Antibiotic resistance genes tend to be found in the starter culture of L. delbrueckii ssp. Bulgaricus and S. thermophilus. All the LAB isolates were sensitive to erythromycin and tetracycline, even though some carried erythromycin and/or tetracycline resistance genes. We proved again that LAB could carry antibiotic resistance gene(s) though it is sensitive to antibiotics.
NCI-60 Whole Exome Sequencing and Pharmacological CellMiner Analyses
Reinhold, William C.; Varma, Sudhir; Sousa, Fabricio; Sunshine, Margot; Abaan, Ogan D.; Davis, Sean R.; Reinhold, Spencer W.; Kohn, Kurt W.; Morris, Joel; Meltzer, Paul S.; Doroshow, James H.; Pommier, Yves
2014-01-01
Exome sequencing provides unprecedented insights into cancer biology and pharmacological response. Here we assess these two parameters for the NCI-60, which is among the richest genomic and pharmacological publicly available cancer cell line databases. Homozygous genetic variants that putatively affect protein function were identified in 1,199 genes (approximately 6% of all genes). Variants that are either enriched or depleted compared to non-cancerous genomes, and thus may be influential in cancer progression and differential drug response were identified for 2,546 genes. Potential gene knockouts are made available. Assessment of cell line response to 19,940 compounds, including 110 FDA-approved drugs, reveals ≈80-fold range in resistance versus sensitivity response across cell lines. 103,422 gene variants were significantly correlated with at least one compound (at p<0.0002). These include genes of known pharmacological importance such as IGF1R, BRAF, RAD52, MTOR, STAT2 and TSC2 as well as a large number of candidate genes such as NOM1, TLL2, and XDH. We introduce two new web-based CellMiner applications that enable exploration of variant-to-compound relationships for a broad range of researchers, especially those without bioinformatics support. The first tool, “Genetic variant versus drug visualization”, provides a visualization of significant correlations between drug activity-gene variant combinations. Examples are given for the known vemurafenib-BRAF, and novel ifosfamide-RAD52 pairings. The second, “Genetic variant summation” allows an assessment of cumulative genetic variations for up to 150 combined genes together; and is designed to identify the variant burden for molecular pathways or functional grouping of genes. An example of its use is provided for the EGFR-ERBB2 pathway gene variant data and the identification of correlated EGFR, ERBB2, MTOR, BRAF, MEK and ERK inhibitors. The new tools are implemented as an updated web-based CellMiner version, for which the present publication serves as a compendium. PMID:25032700
Transcriptome Analysis for Abnormal Spike Development of the Wheat Mutant dms
Zhu, Xin-Xin; Li, Qiao-Yun; Shen, Chun-Cai; Duan, Zong-Biao; Yu, Dong-Yan; Niu, Ji-Shan; Ni, Yong-Jing; Jiang, Yu-Mei
2016-01-01
Background Wheat (Triticum aestivum L.) spike development is the foundation for grain yield. We obtained a novel wheat mutant, dms, characterized as dwarf, multi-pistil and sterility. Although the genetic changes are not clear, the heredity of traits suggests that a recessive gene locus controls the two traits of multi-pistil and sterility in self-pollinating populations of the medium plants (M), such that the dwarf genotype (D) and tall genotype (T) in the progeny of the mutant are ideal lines for studies regarding wheat spike development. The objective of this study was to explore the molecular basis for spike abnormalities of dwarf genotype. Results Four unigene libraries were assembled by sequencing the mRNAs of the super-bulked differentiating spikes and stem tips of the D and T plants. Using integrative analysis, we identified 419 genes highly expressed in spikes, including nine typical homeotic genes of the MADS-box family and the genes TaAP2, TaFL and TaDL. We also identified 143 genes that were significantly different between young spikes of T and D, and 26 genes that were putatively involved in spike differentiation. The result showed that the expression levels of TaAP1-2, TaAP2, and other genes involved in the majority of biological processes such as transcription, translation, cell division, photosynthesis, carbohydrate transport and metabolism, and energy production and conversion were significantly lower in D than in T. Conclusions We identified a set of genes related to wheat floral organ differentiation, including typical homeotic genes. Our results showed that the major causal factors resulting in the spike abnormalities of dms were the lower expression homeotic genes, hormonal imbalance, repressed biological processes, and deficiency of construction materials and energy. We performed a series of studies on the homeotic genes, however the other three causal factors for spike abnormal phenotype of dms need further study. PMID:26982202
2014-01-01
Background Bean anthracnose is caused by the fungus Colletotrichum lindemuthianum (Sacc. & Magnus) Lams.- Scrib. Resistance to C. lindemuthianum in common bean (Phaseolus vulgaris L.) generally follows a qualitative mode of inheritance. The pathogen shows extensive pathogenic variation and up to 20 anthracnose resistance loci (named Co-), conferring resistance to specific races, have been described. Anthracnose resistance has generally been investigated by analyzing a limited number of isolates or races in segregating populations. In this work, we analyzed the response against eleven C. lindemuthianum races in a recombinant inbred line (RIL) common bean population derived from the cross Xana × Cornell 49242 in which a saturated linkage map was previously developed. Results A systematic genetic analysis was carried out to dissect the complex resistance segregations observed, which included contingency analyses, subpopulations and genetic mapping. Twenty two resistance genes were identified, some with a complementary mode of action. The Cornell 49242 genotype carries a complex cluster of resistance genes at the end of linkage group (LG) Pv11 corresponding to the previously described anthracnose resistance cluster Co-2. In this position, specific resistance genes to races 3, 6, 7, 19, 38, 39, 65, 357, 449 and 453 were identified, with one of them showing a complementary mode of action. In addition, Cornell 49242 had an independent gene on LG Pv09 showing a complementary mode of action for resistance to race 453. Resistance genes in genotype Xana were located on three regions involving LGs Pv01, Pv02 and Pv04. All resistance genes identified in Xana showed a complementary mode of action, except for two controlling resistance to races 65 and 73 located on LG Pv01, in the position of the previously described anthracnose resistance cluster Co-1. Conclusions Results shown herein reveal a complex and specific interaction between bean and fungus genotypes leading to anthracnose resistance. Organization of specific resistance genes in clusters including resistance genes with different modes of action (dominant and complementary genes) was also confirmed. Finally, new locations for anthracnose resistance genes were identified in LG Pv09. PMID:24779442
DOE Office of Scientific and Technical Information (OSTI.GOV)
Villiers, Etienne P. de, E-mail: e.villiers@cgiar.or; Gallardo, Carmina; Arias, Marisa
Viral molecular epidemiology has traditionally analyzed variation in single genes. Whole genome phylogenetic analysis of 123 concatenated genes from 11 ASFV genomes, including E75, a newly sequenced virulent isolate from Spain, identified two clusters. One contained South African isolates from ticks and warthog, suggesting derivation from a sylvatic transmission cycle. The second contained isolates from West Africa and the Iberian Peninsula. Two isolates, from Kenya and Malawi, were outliers. Of the nine genomes within the clusters, seven were within p72 genotype 1. The 11 genomes sequenced comprised only 5 of the 22 p72 genotypes. Comparison of synonymous and non-synonymous mutationsmore » at the genome level identified 20 genes subject to selection pressure for diversification. A novel gene of the E75 virus evolved by the fusion of two genes within the 360 multicopy family. Comparative genomics reveals high diversity within a limited sample of the ASFV viral gene pool.« less
Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits.
Wu, Yang; Zeng, Jian; Zhang, Futao; Zhu, Zhihong; Qi, Ting; Zheng, Zhili; Lloyd-Jones, Luke R; Marioni, Riccardo E; Martin, Nicholas G; Montgomery, Grant W; Deary, Ian J; Wray, Naomi R; Visscher, Peter M; McRae, Allan F; Yang, Jian
2018-03-02
The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.
Bowman, Shaun M; Piwowar, Amy; Ciocca, Maria; Free, Stephen J
2005-01-01
Two Neurospora mutants with a phenotype that includes a tight colonial growth pattern, an inability to form conidia and an inability to form protoperithecia have been isolated and characterized. The relevant mutations were mapped to the same locus on the sequenced Neurospora genome. The mutations responsible for the mutant phenotype then were identified by examining likely candidate genes from the mutant genomes at the mapped locus with PCR amplification and a sequencing assay. The results demonstrate that a map and sequence strategy is a feasible way to identify mutant genes in Neurospora. The gene responsible for the phenotype is a putative alpha-1,2-mannosyltransferase gene. The mutant cell wall has an altered composition demonstrating that the gene functions in cell wall biosynthesis. The results demonstrate that the mnt-1 gene is required for normal cell wall biosynthesis, morphology and for the regulation of asexual development.
Filatov, Victor; Dowdle, John; Smirnoff, Nicholas; Ford-Lloyd, Brian; Newbury, H John; Macnair, Mark R
2006-09-01
One of the challenges of comparative genomics is to identify specific genetic changes associated with the evolution of a novel adaptation or trait. We need to be able to disassociate the genes involved with a particular character from all the other genetic changes that take place as lineages diverge. Here we show that by comparing the transcriptional profile of segregating families with that of parent species differing in a novel trait, it is possible to narrow down substantially the list of potential target genes. In addition, by assuming synteny with a related model organism for which the complete genome sequence is available, it is possible to use the cosegregation of markers differing in transcription level to identify regions of the genome which probably contain quantitative trait loci (QTLs) for the character. This novel combination of genomics and classical genetics provides a very powerful tool to identify candidate genes. We use this methodology to investigate zinc hyperaccumulation in Arabidopsis halleri, the sister species to the model plant, Arabidopsis thaliana. We compare the transcriptional profile of A. halleri with that of its sister nonaccumulator species, Arabidopsis petraea, and between accumulator and nonaccumulator F(3)s derived from the cross between the two species. We identify eight genes which consistently show greater expression in accumulator phenotypes in both roots and shoots, including two metal transporter genes (NRAMP3 and ZIP6), and cytoplasmic aconitase, a gene involved in iron homeostasis in mammals. We also show that there appear to be two QTLs for zinc accumulation, on chromosomes 3 and 7.
Zhang, Shuwei; Ding, Feng; He, Xinhua; Luo, Cong; Huang, Guixiang; Hu, Ying
2015-02-01
Seedlessness is a desirable character in lemons and other citrus species. Seedless fruit can be induced in many ways, including through self-incompatibility (SI). SI is widely used as an intraspecific reproductive barrier that prevents self-fertilization in flowering plants. Although there have been many studies on SI, its mechanism remains unclear. The 'Xiangshui' lemon is an important seedless cultivar whose seedlessness has been caused by SI. It is essential to identify genes involved in SI in 'Xiangshui' lemon to clarify its molecular mechanism. In this study, candidate genes associated with SI were identified using high-throughput Illumina RNA sequencing (RNA-seq). A total of 61,224 unigenes were obtained (average, 948 bp; N50 of 1,457 bp), among which 47,260 unigenes were annotated by comparison to six public databases (Nr, Nt, Swiss-Prot, KEGG, COG, and GO). Differentially expressed genes were identified by comparing the transcriptomes of no-, self-, and cross-pollinated stigmas with styles of the 'Xiangshui' lemon. Several differentially expressed genes that might be associated with SI were identified, such as those involved in pollen tube growth, programmed cell death, signal transduction, and transcription. NADPH oxidase genes associated with apoptosis were highly upregulated in the self-pollinated transcriptome. The expression pattern of 12 genes was analyzed by quantitative real-time polymerase chain reaction. A putative S-RNase gene was identified that had not been previously associated with self-pollen rejection in lemon or citrus. This study provided a transcriptome dataset for further studies of SI and seedless lemon breeding.
Comparative transcriptome profiling of upland (VS16) and lowland (AP13) ecotypes of switchgrass.
Ayyappan, Vasudevan; Saha, Malay C; Thimmapuram, Jyothi; Sripathi, Venkateswara R; Bhide, Ketaki P; Fiedler, Elizabeth; Hayford, Rita K; Kalavacharla, Venu Kal
2017-01-01
Transcriptomes of two switchgrass genotypes representing the upland and lowland ecotypes will be key tools in switchgrass genome annotation and biotic and abiotic stress functional genomics. Switchgrass (Panicum virgatum L.) is an important bioenergy feedstock for cellulosic ethanol production. We report genome-wide transcriptome profiling of two contrasting tetraploid switchgrass genotypes, VS16 and AP13, representing the upland and lowland ecotypes, respectively. A total of 268 million Illumina short reads (50 nt) were generated, of which, 133 million were obtained in AP13 and the rest 135 million in VS16. More than 90% of these reads were mapped to the switchgrass reference genome (V1.1). We identified 6619 and 5369 differentially expressed genes in VS16 and AP13, respectively. Gene ontology and KEGG pathway analysis identified key genes that regulate important pathways including C4 photosynthesis, photorespiration and phenylpropanoid metabolism. A series of genes (33) involved in photosynthetic pathway were up-regulated in AP13 but only two genes showed higher expression in VS16. We identified three dicarboxylate transporter homologs that were highly expressed in AP13. Additionally, genes that mediate drought, heat, and salinity tolerance were also identified. Vesicular transport proteins, syntaxin and signal recognition particles were seen to be up-regulated in VS16. Analyses of selected genes involved in biosynthesis of secondary metabolites, plant-pathogen interaction, membrane transporters, heat, drought and salinity stress responses confirmed significant variation in the relative expression reflected in RNA-Seq data between VS16 and AP13 genotypes. The phenylpropanoid pathway genes identified here are potential targets for biofuel conversion.
Huang, Huiyan; Zhu, Yong; Eliot, Melissa N; Knopik, Valerie S; McGeary, John E; Carskadon, Mary A; Hart, Anne C
2017-06-01
We aimed to test a combined approach to identify conserved genes regulating sleep and to explore the association between DNA methylation and sleep length. We identified candidate genes associated with shorter versus longer sleep duration in college students based on DNA methylation using Illumina Infinium HumanMethylation450 BeadChip arrays. Orthologous genes in Caenorhabditis elegans were identified, and we examined whether their loss of function affected C. elegans sleep. For genes whose perturbation affected C. elegans sleep, we subsequently undertook a small pilot study to re-examine DNA methylation in an independent set of human participants with shorter versus longer sleep durations. Eighty-seven out of 485,577 CpG sites had significant differential methylation in young adults with shorter versus longer sleep duration, corresponding to 52 candidate genes. We identified 34 C. elegans orthologs, including NPY/flp-18 and flp-21, which are known to affect sleep. Loss of five additional genes alters developmentally timed C. elegans sleep (B4GALT6/bre-4, DOCK180/ced-5, GNB2L1/rack-1, PTPRN2/ida-1, ZFYVE28/lst-2). For one of these genes, ZFYVE28 (also known as hLst2), the pilot replication study again found decreased DNA methylation associated with shorter sleep duration at the same two CpG sites in the first intron of ZFYVE28. Using an approach that combines human epigenetics and C. elegans sleep studies, we identified five genes that play previously unidentified roles in C. elegans sleep. We suggest sleep duration in humans may be associated with differential DNA methylation at specific sites and that the conserved genes identified here likely play roles in C. elegans sleep and in other species. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.
Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming
2017-01-01
Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici ( Pst ), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapid evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in seretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76 . We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst . The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst , which will enable us to understand molecular mechanisms underlying Pst -wheat interactions, to determine the effectiveness of resistance genes and further to develop durable resistance to stripe rust.
Emmanuel, Catherine; Gava, Natalie; Kennedy, Catherine; Balleine, Rosemary L.; Sharma, Raghwa; Wain, Gerard; Brand, Alison; Hogg, Russell; Etemadmoghadam, Dariush; George, Joshy; Birrer, Michael J.; Clarke, Christine L.; Chenevix-Trench, Georgia; Bowtell, David D. L.; Harnett, Paul R.; deFazio, Anna
2011-01-01
Molecular events leading to epithelial ovarian cancer are poorly understood but ovulatory hormones and a high number of life-time ovulations with concomitant proliferation, apoptosis, and inflammation, increases risk. We identified genes that are regulated during the estrous cycle in murine ovarian surface epithelium and analysed these profiles to identify genes dysregulated in human ovarian cancer, using publically available datasets. We identified 338 genes that are regulated in murine ovarian surface epithelium during the estrous cycle and dysregulated in ovarian cancer. Six of seven candidates selected for immunohistochemical validation were expressed in serous ovarian cancer, inclusion cysts, ovarian surface epithelium and in fallopian tube epithelium. Most were overexpressed in ovarian cancer compared with ovarian surface epithelium and/or inclusion cysts (EpCAM, EZH2, BIRC5) although BIRC5 and EZH2 were expressed as highly in fallopian tube epithelium as in ovarian cancer. We prioritised the 338 genes for those likely to be important for ovarian cancer development by in silico analyses of copy number aberration and mutation using publically available datasets and identified genes with established roles in ovarian cancer as well as novel genes for which we have evidence for involvement in ovarian cancer. Chromosome segregation emerged as an important process in which genes from our list of 338 were over-represented including two (BUB1, NCAPD2) for which there is evidence of amplification and mutation. NUAK2, upregulated in ovarian surface epithelium in proestrus and predicted to have a driver mutation in ovarian cancer, was examined in a larger cohort of serous ovarian cancer where patients with lower NUAK2 expression had shorter overall survival. In conclusion, defining genes that are activated in normal epithelium in the course of ovulation that are also dysregulated in cancer has identified a number of pathways and novel candidate genes that may contribute to the development of ovarian cancer. PMID:21423607
Terrados, Gloria; Finkernagel, Florian; Stielow, Bastian; Sadic, Dennis; Neubert, Juliane; Herdt, Olga; Krause, Michael; Scharfe, Maren; Jarek, Michael; Suske, Guntram
2012-01-01
The transcription factor Sp2 is essential for early mouse development and for proliferation of mouse embryonic fibroblasts in culture. Yet its mechanisms of action and its target genes are largely unknown. In this study, we have combined RNA interference, in vitro DNA binding, chromatin immunoprecipitation sequencing and global gene-expression profiling to investigate the role of Sp2 for cellular functions, to define target sites and to identify genes regulated by Sp2. We show that Sp2 is important for cellular proliferation that it binds to GC-boxes and occupies proximal promoters of genes essential for vital cellular processes including gene expression, replication, metabolism and signalling. Moreover, we identified important key target genes and cellular pathways that are directly regulated by Sp2. Most significantly, Sp2 binds and activates numerous sequence-specific transcription factor and co-activator genes, and represses the whole battery of cholesterol synthesis genes. Our results establish Sp2 as a sequence-specific regulator of vitally important genes. PMID:22684502
Identification of giant Mimivirus protein functions using RNA interference
Sobhy, Haitham; Scola, Bernard La; Pagnier, Isabelle; Raoult, Didier; Colson, Philippe
2015-01-01
Genomic analysis of giant viruses, such as Mimivirus, has revealed that more than half of the putative genes have no known functions (ORFans). We knocked down Mimivirus genes using short interfering RNA as a proof of concept to determine the functions of giant virus ORFans. As fibers are easy to observe, we targeted a gene encoding a protein absent in a Mimivirus mutant devoid of fibers as well as three genes encoding products identified in a protein concentrate of fibers, including one ORFan and one gene of unknown function. We found that knocking down these four genes was associated with depletion or modification of the fibers. Our strategy of silencing ORFan genes in giant viruses opens a way to identify its complete gene repertoire and may clarify the role of these genes, differentiating between junk DNA and truly used genes. Using this strategy, we were able to annotate four proteins in Mimivirus and 30 homologous proteins in other giant viruses. In addition, we were able to annotate >500 proteins from cellular organisms and 100 from metagenomic databases. PMID:25972846
Charles, Peter C; Alder, Brian D; Hilliard, Eleanor G; Schisler, Jonathan C; Lineberger, Robert E; Parker, Joel S; Mapara, Sabeen; Wu, Samuel S; Portbury, Andrea; Patterson, Cam; Stouffer, George A
2008-01-01
Background Strong epidemiologic evidence correlates tobacco use with a variety of serious adverse health effects, but the biological mechanisms that produce these effects remain elusive. Results We analyzed gene transcription data to identify expression spectra related to tobacco use in circulating leukocytes of 67 Caucasian male subjects. Levels of cotinine, a nicotine metabolite, were used as a surrogate marker for tobacco exposure. Significance Analysis of Microarray and Gene Set Analysis identified 109 genes in 16 gene sets whose transcription levels were differentially regulated by nicotine exposure. We subsequently analyzed this gene set by hyperclustering, a technique that allows the data to be clustered by both expression ratio and gene annotation (e.g. Gene Ontologies). Conclusion Our results demonstrate that tobacco use affects transcription of groups of genes that are involved in proliferation and apoptosis in circulating leukocytes. These transcriptional effects include a repertoire of transcriptional changes likely to increase the incidence of neoplasia through an altered expression of genes associated with transcription and signaling, interferon responses and repression of apoptotic pathways. PMID:18710571
Zhou, Lili; Bryant, Camron D.; Loudon, Andrew; Palmer, Abraham A.; Vitaterna, Martha Hotz; Turek, Fred W.
2014-01-01
Study Objectives: Efforts to identify the genetic basis of mammalian sleep have included quantitative trait locus (QTL) mapping and gene targeting of known core circadian clock genes. We combined three different genetic approaches to identify and test a positional candidate sleep gene — the circadian gene casein kinase 1 epsilon (Csnk1e), which is located in a QTL we identified for rapid eye movement (REM) sleep on chromosome 15. Measurements and Results: Using electroencephalographic (EEG) and electromyographic (EMG) recordings, baseline sleep was examined in a 12-h light:12-h dark (LD 12:12) cycle in mice of seven genotypes, including Csnk1etau/tau and Csnk1e-/- mutant mice, Csnk1eB6.D2 and Csnk1eD2.B6 congenic mice, and their respective wild-type littermate control mice. Additionally, Csnk1etau/tau and wild-type mice were examined in constant darkness (DD). Csnk1etau/tau mutant mice and both Csnk1eB6.D2 and Csnk1eD2.B6 congenic mice showed significantly higher proportion of sleep time spent in REM sleep during the dark period than wild-type controls — the original phenotype for which the QTL on chromosome 15 was identified. This phenotype persisted in Csnk1etau/tau mice while under free-running DD conditions. Other sleep phenotypes observed in Csnk1etau/tau mice and congenics included a decreased number of bouts of nonrapid eye movement (NREM) sleep and an increased average NREM sleep bout duration. Conclusions: These results demonstrate a role for Csnk1e in regulating not only the timing of sleep, but also the REM sleep amount and NREM sleep architecture, and support Csnk1e as a causal gene in the sleep QTL on chromosome 15. Citation: Zhou L; Bryant CD; Loudon A; Palmer AA; Vitaterna MH; Turek FW. The circadian clock gene Csnk1e regulates rapid eye movement sleep amount, and nonrapid eye movement sleep architecture in mice. SLEEP 2014;37(4):785-793. PMID:24744456
Gutiérrez, Rodrigo A; Stokes, Trevor L; Thum, Karen; Xu, Xiaodong; Obertello, Mariana; Katari, Manpreet S; Tanurdzic, Milos; Dean, Alexis; Nero, Damion C; McClung, C Robertson; Coruzzi, Gloria M
2008-03-25
Understanding how nutrients affect gene expression will help us to understand the mechanisms controlling plant growth and development as a function of nutrient availability. Nitrate has been shown to serve as a signal for the control of gene expression in Arabidopsis. There is also evidence, on a gene-by-gene basis, that downstream products of nitrogen (N) assimilation such as glutamate (Glu) or glutamine (Gln) might serve as signals of organic N status that in turn regulate gene expression. To identify genome-wide responses to such organic N signals, Arabidopsis seedlings were transiently treated with ammonium nitrate in the presence or absence of MSX, an inhibitor of glutamine synthetase, resulting in a block of Glu/Gln synthesis. Genes that responded to organic N were identified as those whose response to ammonium nitrate treatment was blocked in the presence of MSX. We showed that some genes previously identified to be regulated by nitrate are under the control of an organic N-metabolite. Using an integrated network model of molecular interactions, we uncovered a subnetwork regulated by organic N that included CCA1 and target genes involved in N-assimilation. We validated some of the predicted interactions and showed that regulation of the master clock control gene CCA1 by Glu or a Glu-derived metabolite in turn regulates the expression of key N-assimilatory genes. Phase response curve analysis shows that distinct N-metabolites can advance or delay the CCA1 phase. Regulation of CCA1 by organic N signals may represent a novel input mechanism for N-nutrients to affect plant circadian clock function.
Global Fitness Profiling Identifies Arsenic and Cadmium Tolerance Mechanisms in Fission Yeast.
Guo, Lan; Ganguly, Abantika; Sun, Lingling; Suo, Fang; Du, Li-Lin; Russell, Paul
2016-10-13
Heavy metals and metalloids such as cadmium [Cd(II)] and arsenic [As(III)] are widespread environmental toxicants responsible for multiple adverse health effects in humans. However, the molecular mechanisms underlying metal-induced cytotoxicity and carcinogenesis, as well as the detoxification and tolerance pathways, are incompletely understood. Here, we use global fitness profiling by barcode sequencing to quantitatively survey the Schizosaccharomyces pombe haploid deletome for genes that confer tolerance of cadmium or arsenic. We identified 106 genes required for cadmium resistance and 110 genes required for arsenic resistance, with a highly significant overlap of 36 genes. A subset of these 36 genes account for almost all proteins required for incorporating sulfur into the cysteine-rich glutathione and phytochelatin peptides that chelate cadmium and arsenic. A requirement for Mms19 is explained by its role in directing iron-sulfur cluster assembly into sulfite reductase as opposed to promoting DNA repair, as DNA damage response genes were not enriched among those required for cadmium or arsenic tolerance. Ubiquinone, siroheme, and pyridoxal 5'-phosphate biosynthesis were also identified as critical for Cd/As tolerance. Arsenic-specific pathways included prefoldin-mediated assembly of unfolded proteins and protein targeting to the peroxisome, whereas cadmium-specific pathways included plasma membrane and vacuolar transporters, as well as Spt-Ada-Gcn5-acetyltransferase (SAGA) transcriptional coactivator that controls expression of key genes required for cadmium tolerance. Notable differences are apparent with corresponding screens in the budding yeast Saccharomyces cerevisiae, underscoring the utility of analyzing toxic metal defense mechanisms in both organisms. Copyright © 2016 Guo et al.
2014-01-01
Background The rhizome, the original stem of land plants, enables species to invade new territory and is a critical component of perenniality, especially in grasses. Red rice (Oryza longistaminata) is a perennial wild rice species with many valuable traits that could be used to improve cultivated rice cultivars, including rhizomatousness, disease resistance and drought tolerance. Despite these features, little is known about the molecular mechanisms that contribute to rhizome growth, development and function in this plant. Results We used an integrated approach to compare the transcriptome, proteome and metabolome of the rhizome to other tissues of red rice. 116 Gb of transcriptome sequence was obtained from various tissues and used to identify rhizome-specific and preferentially expressed genes, including transcription factors and hormone metabolism and stress response-related genes. Proteomics and metabolomics approaches identified 41 proteins and more than 100 primary metabolites and plant hormones with rhizome preferential accumulation. Of particular interest was the identification of a large number of gene transcripts from Magnaportha oryzae, the fungus that causes rice blast disease in cultivated rice, even though the red rice plants showed no sign of disease. Conclusions A significant set of genes, proteins and metabolites appear to be specifically or preferentially expressed in the rhizome of O. longistaminata. The presence of M. oryzae gene transcripts at a high level in apparently healthy plants suggests that red rice is resistant to this pathogen, and may be able to provide genes to cultivated rice that will enable resistance to rice blast disease. PMID:24521476
Cheng, Yi-Qiang; Yang, Min; Matter, Andrea M
2007-06-01
A gene cluster responsible for the biosynthesis of anticancer agent FK228 has been identified, cloned, and partially characterized in Chromobacterium violaceum no. 968. First, a genome-scanning approach was applied to identify three distinctive C. violaceum no. 968 genomic DNA clones that code for portions of nonribosomal peptide synthetase and polyketide synthase. Next, a gene replacement system developed originally for Pseudomonas aeruginosa was adapted to inactivate the genomic DNA-associated candidate natural product biosynthetic genes in vivo with high efficiency. Inactivation of a nonribosomal peptide synthetase-encoding gene completely abolished FK228 production in mutant strains. Subsequently, the entire FK228 biosynthetic gene cluster was cloned and sequenced. This gene cluster is predicted to encompass a 36.4-kb DNA region that includes 14 genes. The products of nine biosynthetic genes are proposed to constitute an unusual hybrid nonribosomal peptide synthetase-polyketide synthase-nonribosomal peptide synthetase assembly line including accessory activities for the biosynthesis of FK228. In particular, a putative flavin adenine dinucleotide-dependent pyridine nucleotide-disulfide oxidoreductase is proposed to catalyze disulfide bond formation between two sulfhydryl groups of cysteine residues as the final step in FK228 biosynthesis. Acquisition of the FK228 biosynthetic gene cluster and acclimation of an efficient genetic system should enable genetic engineering of the FK228 biosynthetic pathway in C. violaceum no. 968 for the generation of structural analogs as anticancer drug candidates.
Sharma, Akshay; Easow Mathew, Manu; Sriganesh, Vasumathi; Reiss, Ulrike M
2016-12-20
Haemophilia is a genetic disorder characterized by spontaneous or provoked, often uncontrolled, bleeding into joints, muscles and other soft tissues. Current methods of treatment are expensive, challenging and involve regular administration of clotting factors. Gene therapy has recently been prompted as a curative treatment modality. This is an update of a published Cochrane Review. To evaluate the safety and efficacy of gene therapy for treating people with haemophilia A or B. We searched the Cochrane Cystic Fibrosis & Genetic Disorders Group's Coagulopathies Trials Register, compiled from electronic database searches and handsearching of journals and conference abstract books. We also searched the reference lists of relevant articles and reviews.Date of last search: 18 August 2016. Eligible trials include randomised or quasi-randomised clinical trials, including controlled clinical trials comparing gene therapy (with or without standard treatment) with standard treatment (factor replacement) or other 'curative' treatment such as stem cell transplantation for individuals with haemophilia A or B of all ages who do not have inhibitors to factor VIII or IX. No trials of gene therapy for haemophilia were found. No trials of gene therapy for haemophilia were identified. No randomised or quasi-randomised clinical trials of gene therapy for haemophilia were identified. Thus, we are unable to determine the safety and efficacy of gene therapy for haemophilia. Gene therapy for haemophilia is still in its nascent stages and there is a need for well-designed clinical trials to assess the long-term feasibility, success and risks of gene therapy for people with haemophilia.
Fields, Randall R.; Zhou, Guimei; Huang, Dali; Davis, Jack R.; Möller, Claes; Jacobson, Samuel G.; Kimberling, William J.; Sumegi, Janos
2002-01-01
Usher syndrome type III is an autosomal recessive disorder characterized by progressive sensorineural hearing loss, vestibular dysfunction, and retinitis pigmentosa. The disease gene was localized to 3q25 and recently was identified by positional cloning. In the present study, we have revised the structure of the USH3 gene, including a new translation start site, 5′ untranslated region, and a transcript encoding a 232–amino acid protein. The mature form of the protein is predicted to contain three transmembrane domains and 204 residues. We have found four new disease-causing mutations, including one that appears to be relatively common in the Ashkenazi Jewish population. We have also identified mouse (chromosome 3) and rat (chromosome 2) orthologues, as well as two human paralogues on chromosomes 4 and 10. PMID:12145752
A functional genomics screen in planarians reveals regulators of whole-brain regeneration.
Roberts-Galbraith, Rachel H; Brubacher, John L; Newmark, Phillip A
2016-09-09
Planarians regenerate all body parts after injury, including the central nervous system (CNS). We capitalized on this distinctive trait and completed a gene expression-guided functional screen to identify factors that regulate diverse aspects of neural regeneration in Schmidtea mediterranea . Our screen revealed molecules that influence neural cell fates, support the formation of a major connective hub, and promote reestablishment of chemosensory behavior. We also identified genes that encode signaling molecules with roles in head regeneration, including some that are produced in a previously uncharacterized parenchymal population of cells. Finally, we explored genes downregulated during planarian regeneration and characterized, for the first time, glial cells in the planarian CNS that respond to injury by repressing several transcripts. Collectively, our studies revealed diverse molecules and cell types that underlie an animal's ability to regenerate its brain.
2012-01-01
High-dimensional gene expression data provide a rich source of information because they capture the expression level of genes in dynamic states that reflect the biological functioning of a cell. For this reason, such data are suitable to reveal systems related properties inside a cell, e.g., in order to elucidate molecular mechanisms of complex diseases like breast or prostate cancer. However, this is not only strongly dependent on the sample size and the correlation structure of a data set, but also on the statistical hypotheses tested. Many different approaches have been developed over the years to analyze gene expression data to (I) identify changes in single genes, (II) identify changes in gene sets or pathways, and (III) identify changes in the correlation structure in pathways. In this paper, we review statistical methods for all three types of approaches, including subtypes, in the context of cancer data and provide links to software implementations and tools and address also the general problem of multiple hypotheses testing. Further, we provide recommendations for the selection of such analysis methods. Reviewers This article was reviewed by Arcady Mushegian, Byung-Soo Kim and Joel Bader. PMID:23227854
Copper homeostasis gene discovery in Drosophila melanogaster.
Norgate, Melanie; Southon, Adam; Zou, Sige; Zhan, Ming; Sun, Yu; Batterham, Phil; Camakaris, James
2007-06-01
Recent studies have shown a high level of conservation between Drosophila melanogaster and mammalian copper homeostasis mechanisms. These studies have also demonstrated the efficiency with which this species can be used to characterize novel genes, at both the cellular and whole organism level. As a versatile and inexpensive model organism, Drosophila is also particularly useful for gene discovery applications and thus has the potential to be extremely useful in identifying novel copper homeostasis genes and putative disease genes. In order to assess the suitability of Drosophila for this purpose, three screening approaches have been investigated. These include an analysis of the global transcriptional response to copper in both adult flies and an embryonic cell line using DNA microarray analysis. Two mutagenesis-based screens were also utilized. Several candidate copper homeostasis genes have been identified through this work. In addition, the results of each screen were carefully analyzed to identify any factors influencing efficiency and sensitivity. These are discussed here with the aim of maximizing the efficiency of future screens and the most suitable approaches are outlined. Building on this information, there is great potential for the further use of Drosophila for copper homeostasis gene discovery.
DNA methylation biomarkers for head and neck squamous cell carcinoma.
Zhou, Chongchang; Ye, Meng; Ni, Shumin; Li, Qun; Ye, Dong; Li, Jinyun; Shen, Zhishen; Deng, Hongxia
2018-06-21
DNA methylation plays an important role in the etiology and pathogenesis of head and neck squamous cell carcinoma (HNSCC). The current study aimed to identify aberrantly methylated-differentially expressed genes (DEGs) by a comprehensive bioinformatics analysis. In addition, we screened for DEGs affected by DNA methylation modification and further investigated their prognostic values for HNSCC. We included microarray data of DNA methylation (GSE25093 and GSE33202) and gene expression (GSE23036 and GSE58911) from Gene Expression Omnibus. Aberrantly methylated-DEGs were analyzed with R software. The Cancer Genome Atlas (TCGA) RNA sequencing and DNA methylation (Illumina HumanMethylation450) databases were utilized for validation. In total, 27 aberrantly methylated genes accompanied by altered expression were identified. After confirmation by The Cancer Genome Atlas (TCGA) database, 2 hypermethylated-low-expression genes (FAM135B and ZNF610) and 2 hypomethylated-high-expression genes (HOXA9 and DCC) were identified. A receiver operating characteristic (ROC) curve confirmed the diagnostic value of these four methylated genes for HNSCC. Multivariate Cox proportional hazards analysis showed that FAM135B methylation was a favorable independent prognostic biomarker for overall survival of HNSCC patients.
Characterizing the stress/defense transcriptome of Arabidopsis
Mahalingam, Ramamurthy; Gomez-Buitrago, AnaMaria; Eckardt, Nancy; Shah, Nigam; Guevara-Garcia, Angel; Day, Philip; Raina, Ramesh; Fedoroff, Nina V
2003-01-01
Background To understand the gene networks that underlie plant stress and defense responses, it is necessary to identify and characterize the genes that respond both initially and as the physiological response to the stress or pathogen develops. We used PCR-based suppression subtractive hybridization to identify Arabidopsis genes that are differentially expressed in response to ozone, bacterial and oomycete pathogens and the signaling molecules salicylic acid (SA) and jasmonic acid. Results We identified a total of 1,058 differentially expressed genes from eight stress cDNA libraries. Digital northern analysis revealed that 55% of the stress-inducible genes are rarely transcribed in unstressed plants and 17% of them were not previously represented in Arabidopsis expressed sequence tag databases. More than two-thirds of the genes in the stress cDNA collection have not been identified in previous studies as stress/defense response genes. Several stress-responsive cis-elements showed a statistically significant over-representation in the promoters of the genes in the stress cDNA collection. These include W- and G-boxes, the SA-inducible element, the abscisic acid response element and the TGA motif. Conclusions The stress cDNA collection comprises a broad repertoire of stress-responsive genes encoding proteins that are involved in both the initial and subsequent stages of the physiological response to abiotic stress and pathogens. This set of stress-, pathogen- and hormone-modulated genes is an important resource for understanding the genetic interactions underlying stress signaling and responses and may contribute to the characterization of the stress transcriptome through the construction of standardized specialized arrays. PMID:12620105
He, Hao; Zhang, Lei; Li, Jian; Wang, Yu-Ping; Zhang, Ji-Gang; Shen, Jie; Guo, Yan-Fang
2014-01-01
Context: To date, few systems genetics studies in the bone field have been performed. We designed our study from a systems-level perspective by integrating genome-wide association studies (GWASs), human protein-protein interaction (PPI) network, and gene expression to identify gene modules contributing to osteoporosis risk. Methods: First we searched for modules significantly enriched with bone mineral density (BMD)-associated genes in human PPI network by using 2 large meta-analysis GWAS datasets through a dense module search algorithm. One included 7 individual GWAS samples (Meta7). The other was from the Genetic Factors for Osteoporosis Consortium (GEFOS2). One was assigned as a discovery dataset and the other as an evaluation dataset, and vice versa. Results: In total, 42 modules and 129 modules were identified significantly in both Meta7 and GEFOS2 datasets for femoral neck and spine BMD, respectively. There were 3340 modules identified for hip BMD only in Meta7. As candidate modules, they were assessed for the biological relevance to BMD by gene set enrichment analysis in 2 expression profiles generated from circulating monocytes in subjects with low versus high BMD values. Interestingly, there were 2 modules significantly enriched in monocytes from the low BMD group in both gene expression datasets (nominal P value <.05). Two modules had 16 nonredundant genes. Functional enrichment analysis revealed that both modules were enriched for genes involved in Wnt receptor signaling and osteoblast differentiation. Conclusion: We highlighted 2 modules and novel genes playing important roles in the regulation of bone mass, providing important clues for therapeutic approaches for osteoporosis. PMID:25119315
Huang, Qianqian; Huang, Xiao; Deng, Juan; Liu, Hegang; Liu, Yanwen; Yu, Kun; Huang, Bisheng
2016-01-01
The rhizome of Atractylodes lancea is extensively used in the practice of Traditional Chinese Medicine because of its broad pharmacological activities. This study was designed to characterize the transcriptome profiling of the rhizome and leaf of Atractylodes lancea in an attempt to uncover the molecular mechanisms regulating rhizome formation and growth. Over 270 million clean reads were assembled into 92,366 unigenes, 58% of which are homologous with sequences in public protein databases (NR, Swiss-Prot, GO, and KEGG). Analysis of expression levels showed that genes involved in photosynthesis, stress response, and translation were the most abundant transcripts in the leaf, while transcripts involved in stress response, transcription regulation, translation, and metabolism were dominant in the rhizome. Tissue-specific gene analysis identified distinct gene families active in the leaf and rhizome. Differential gene expression analysis revealed a clear difference in gene expression pattern, identifying 1518 up-regulated genes and 3464 down-regulated genes in the rhizome compared with the leaf, including a series of genes related to signal transduction, primary and secondary metabolism. Transcription factor (TF) analysis identified 42 TF families, with 67 and 60 TFs up-regulated in the rhizome and leaf, respectively. A total of 104 unigenes were identified as candidates for regulating rhizome formation and development. These data offer an overview of the gene expression pattern of the rhizome and leaf and provide essential information for future studies on the molecular mechanisms of controlling rhizome formation and growth. The extensive transcriptome data generated in this study will be a valuable resource for further functional genomics studies of A. lancea. PMID:27066021
Postmortem brain abnormalities of the glutamate neurotransmitter system in autism.
Purcell, A E; Jeon, O H; Zimmerman, A W; Blue, M E; Pevsner, J
2001-11-13
Studies examining the brains of individuals with autism have identified anatomic and pathologic changes in regions such as the cerebellum and hippocampus. Little, if anything, is known, however, about the molecules that are involved in the pathogenesis of this disorder. To identify genes with abnormal expression levels in the cerebella of subjects with autism. Brain samples from a total of 10 individuals with autism and 23 matched controls were collected, mainly from the cerebellum. Two cDNA microarray technologies were used to identify genes that were significantly up- or downregulated in autism. The abnormal mRNA or protein levels of several genes identified by microarray analysis were investigated using PCR with reverse transcription and Western blotting. alpha-Amino-3-hydroxy-5-methyl-4-isoxazoleproprionic acid (AMPA)- and NMDA-type glutamate receptor densities were examined with receptor autoradiography in the cerebellum, caudate-putamen, and prefrontal cortex. The mRNA levels of several genes were significantly increased in autism, including excitatory amino acid transporter 1 and glutamate receptor AMPA 1, two members of the glutamate system. Abnormalities in the protein or mRNA levels of several additional molecules in the glutamate system were identified on further analysis, including glutamate receptor binding proteins. AMPA-type glutamate receptor density was decreased in the cerebellum of individuals with autism (p < 0.05). Subjects with autism may have specific abnormalities in the AMPA-type glutamate receptors and glutamate transporters in the cerebellum. These abnormalities may be directly involved in the pathogenesis of the disorder.
St-Amand, Jonny; Yoshioka, Mayumi; Tanaka, Keitaro; Nishida, Yuichiro
2012-01-01
To identify preferentially expressed genes in the central endocrine organs of the hypothalamus and pituitary gland, we generated transcriptome-wide mRNA profiles of the hypothalamus, pituitary gland, and parietal cortex in male mice (12–15 weeks old) using serial analysis of gene expression (SAGE). Total counts of SAGE tags for the hypothalamus, pituitary gland, and parietal cortex were 165824, 126688, and 161045 tags, respectively. This represented 59244, 45151, and 55131 distinct tags, respectively. Comparison of these mRNA profiles revealed that 22 mRNA species, including three potential novel transcripts, were preferentially expressed in the hypothalamus. In addition to well-known hypothalamic transcripts, such as hypocretin, several genes involved in hormone function, intracellular transduction, metabolism, protein transport, steroidogenesis, extracellular matrix, and brain disease were identified as preferentially expressed hypothalamic transcripts. In the pituitary gland, 106 mRNA species, including 60 potential novel transcripts, were preferentially expressed. In addition to well-known pituitary genes, such as growth hormone and thyroid stimulating hormone beta, a number of genes classified to function in transport, amino acid metabolism, intracellular transduction, cell adhesion, disulfide bond formation, stress response, transcription, protein synthesis, and turnover, cell differentiation, the cell cycle, and in the cytoskeleton and extracellular matrix were also preferentially expressed. In conclusion, the current study identified not only well-known hypothalamic and pituitary transcripts but also a number of new candidates likely to be involved in endocrine homeostatic systems regulated by the hypothalamus and pituitary gland. PMID:22649398
St-Amand, Jonny; Yoshioka, Mayumi; Tanaka, Keitaro; Nishida, Yuichiro
2011-01-01
To identify preferentially expressed genes in the central endocrine organs of the hypothalamus and pituitary gland, we generated transcriptome-wide mRNA profiles of the hypothalamus, pituitary gland, and parietal cortex in male mice (12-15 weeks old) using serial analysis of gene expression (SAGE). Total counts of SAGE tags for the hypothalamus, pituitary gland, and parietal cortex were 165824, 126688, and 161045 tags, respectively. This represented 59244, 45151, and 55131 distinct tags, respectively. Comparison of these mRNA profiles revealed that 22 mRNA species, including three potential novel transcripts, were preferentially expressed in the hypothalamus. In addition to well-known hypothalamic transcripts, such as hypocretin, several genes involved in hormone function, intracellular transduction, metabolism, protein transport, steroidogenesis, extracellular matrix, and brain disease were identified as preferentially expressed hypothalamic transcripts. In the pituitary gland, 106 mRNA species, including 60 potential novel transcripts, were preferentially expressed. In addition to well-known pituitary genes, such as growth hormone and thyroid stimulating hormone beta, a number of genes classified to function in transport, amino acid metabolism, intracellular transduction, cell adhesion, disulfide bond formation, stress response, transcription, protein synthesis, and turnover, cell differentiation, the cell cycle, and in the cytoskeleton and extracellular matrix were also preferentially expressed. In conclusion, the current study identified not only well-known hypothalamic and pituitary transcripts but also a number of new candidates likely to be involved in endocrine homeostatic systems regulated by the hypothalamus and pituitary gland.
Chang, Jiun C; Sebastian, Aimy; Murugesh, Deepa K; Hatsell, Sarah; Economides, Aris N; Christiansen, Blaine A; Loots, Gabriela G
2017-03-01
Joint injury causes post-traumatic osteoarthritis (PTOA). About ∼50% of patients rupturing their anterior cruciate ligament (ACL) will develop PTOA within 1-2 decades of the injury, yet the mechanisms responsible for the development of PTOA after joint injury are not well understood. In this study, we examined whole joint gene expression by RNA sequencing (RNAseq) at 1 day, 1-, 6-, and 12 weeks post injury, in a non-invasive tibial compression (TC) overload mouse model of PTOA that mimics ACL rupture in humans. We identified 1446 genes differentially regulated between injured and contralateral joints. This includes known regulators of osteoarthritis such as MMP3, FN1, and COMP, and several new genes including Suco, Sorcs2, and Medag. We also identified 18 long noncoding RNAs that are differentially expressed in the injured joints. By comparing our data to gene expression data generated using the surgical destabilization of the medial meniscus (DMM) PTOA model, we identified several common genes and shared mechanisms. Our study highlights several differences between these two models and suggests that the TC model may be a more rapidly progressing model of PTOA. This study provides the first account of gene expression changes associated with PTOA development and progression in a TC model. © 2016 The Authors. Journal of Orthopaedic Research Published by Wiley Periodicals, Inc. J Orthop Res 35:474-485, 2017. © 2016 The Authors. Journal of Orthopaedic Research Published by Wiley Periodicals, Inc.
Chang, Jiun C.; Sebastian, Aimy; Murugesh, Deepa K.; Hatsell, Sarah; Economides, Aris N.; Christiansen, Blaine A.
2016-01-01
ABSTRACT Joint injury causes post‐traumatic osteoarthritis (PTOA). About ∼50% of patients rupturing their anterior cruciate ligament (ACL) will develop PTOA within 1–2 decades of the injury, yet the mechanisms responsible for the development of PTOA after joint injury are not well understood. In this study, we examined whole joint gene expression by RNA sequencing (RNAseq) at 1 day, 1‐, 6‐, and 12 weeks post injury, in a non‐invasive tibial compression (TC) overload mouse model of PTOA that mimics ACL rupture in humans. We identified 1446 genes differentially regulated between injured and contralateral joints. This includes known regulators of osteoarthritis such as MMP3, FN1, and COMP, and several new genes including Suco, Sorcs2, and Medag. We also identified 18 long noncoding RNAs that are differentially expressed in the injured joints. By comparing our data to gene expression data generated using the surgical destabilization of the medial meniscus (DMM) PTOA model, we identified several common genes and shared mechanisms. Our study highlights several differences between these two models and suggests that the TC model may be a more rapidly progressing model of PTOA. This study provides the first account of gene expression changes associated with PTOA development and progression in a TC model. © 2016 The Authors. Journal of Orthopaedic Research Published by Wiley Periodicals, Inc. J Orthop Res 35:474–485, 2017. PMID:27088242
McGrath, Ken C.; Dombrecht, Bruno; Manners, John M.; Schenk, Peer M.; Edgar, Cameron I.; Maclean, Donald J.; Scheible, Wolf-Rüdiger; Udvardi, Michael K.; Kazan, Kemal
2005-01-01
To identify transcription factors (TFs) involved in jasmonate (JA) signaling and plant defense, we screened 1,534 Arabidopsis (Arabidopsis thaliana) TFs by real-time quantitative reverse transcription-PCR for their altered transcript at 6 h following either methyl JA treatment or inoculation with the incompatible pathogen Alternaria brassicicola. We identified 134 TFs that showed a significant change in expression, including many APETALA2/ethylene response factor (AP2/ERF), MYB, WRKY, and NAC TF genes with unknown functions. Twenty TF genes were induced by both the pathogen and methyl JA and these included 10 members of the AP2/ERF TF family, primarily from the B1a and B3 subclusters. Functional analysis of the B1a TF AtERF4 revealed that AtERF4 acts as a novel negative regulator of JA-responsive defense gene expression and resistance to the necrotrophic fungal pathogen Fusarium oxysporum and antagonizes JA inhibition of root elongation. In contrast, functional analysis of the B3 TF AtERF2 showed that AtERF2 is a positive regulator of JA-responsive defense genes and resistance to F. oxysporum and enhances JA inhibition of root elongation. Our results suggest that plants coordinately express multiple repressor- and activator-type AP2/ERFs during pathogen challenge to modulate defense gene expression and disease resistance. PMID:16183832
CREBBP mutations in relapsed acute lymphoblastic leukaemia
Mullighan, Charles G.; Zhang, Jinghui; Kasper, Lawryn H.; Lerach, Stephanie; Payne-Turner, Debbie; Phillips, Letha A.; Heatley, Sue L.; Holmfeldt, Linda; Collins-Underwood, J. Racquel; Ma, Jing; Buetow, Kenneth H.; Pui, Ching-Hon; Baker, Sharyn D.; Brindle, Paul K.; Downing, James R.
2010-01-01
Relapsed acute lymphoblastic leukaemia (ALL) is a leading cause of death due to disease in young people, but the biologic determinants of treatment failure remain poorly understood. Recent genome-wide profiling of structural DNA alterations in ALL have identified multiple submicroscopic somatic mutations targeting key cellular pathways1,2, and have demonstrated substantial evolution in genetic alterations from diagnosis to relapse3. However, detailed analysis of sequence mutations in ALL has not been performed. To identify novel mutations in relapsed ALL, we resequenced 300 genes in matched diagnosis and relapse samples from 23 patients with ALL. This identified 52 somatic non-synonymous mutations in 32 genes, many of which were novel, including the transcriptional coactivators CREBBP and NCOR1, the transcription factors ERG, SPI1, TCF4 and TCF7L2, components of the Ras signalling pathway, histone genes, genes involved in histone modification (CREBBP and CTCF), and genes previously shown to be targets of recurring DNA copy number alteration in ALL. Analysis of an extended cohort of 71 diagnosis-relapse cases and 270 acute leukaemia cases that did not relapse found that 18.3% of relapse cases had sequence or deletion mutations of CREBBP, which encodes the transcriptional coactivator and histone acetyltransferase (HAT) CREB-binding protein (CBP)4. The mutations were either present at diagnosis or acquired at relapse, and resulted in truncated alleles or deleterious substitutions in conserved residues of the HAT domain. Functionally, the mutations impaired histone acetylation and transcriptional regulation of CREBBP targets, including glucocorticoid responsive genes. Several mutations acquired at relapse were detected in subclones at diagnosis, suggesting that the mutations may confer resistance to therapy. These results extend the landscape of genetic alterations in leukaemia, and identify mutations targeting transcriptional and epigenetic regulation as a mechanism of resistance in ALL. PMID:21390130
Utility of Genetic Testing in Elite Volleyball Players with Aortic Root Dilation.
Herrick, Nicole; Davis, Christopher; Vargas, Lisa; Dietz, Hal; Grossfeld, Paul
2017-07-01
Basketball and volleyball attract individuals with a characteristic biophysical profile, mimicking features of Marfan syndrome. Consequently, identification of these abnormalities can be lifesaving. To determine how physical examination, echocardiography, and genetic screening can identify elite volleyball players with a previously undiagnosed aortopathy. We have performed cardiac screening on 90 US Volleyball National Team members and identified four individuals with dilated sinuses of Valsalva. This case series reports on three individuals who underwent a comprehensive genetics evaluation, including gene sequencing. Cardiac screening combined with genetic testing can identify previously undiagnosed tall athletes with an aortopathy, in the absence of noncardiac findings of a connective tissue disorder. Subject 1 had a revised Ghent systems (RGS) score of 2 and a normal aortopathy gene panel. Subject 2 had a RGS score of 1 and genetic testing revealed a de novo disease causing mutation in the gene encoding fibrillin-1 (FBN1). Subject 3 had an RGS score of 4.0 and had a normal aortopathy gene panel. Despite variable clinical features of Marfan syndrome, dilated sinuses of Valsalva were found in 4.9% of the athletes. A disease-causing mutation in the FBN1 gene was identified in subject 2, who had the lowest RGS but the largest aortic root measurement. Subjects 1 and 3, with the highest RGS, had a normal aortopathy gene panel. Our findings provide further evidence suggesting that a cardiac evaluation, including a screening echocardiogram, should be performed on all elite tall adult athletes independent of other physical findings. Genetic testing should be considered for athletes with dilated sinuses of Valsalva (male, >4.2 cm; female, >3.4 cm), regardless of other extracardiac findings.
Maran, Sathiya; Lee, Yeong Yeh; Xu, Shuhua; Rajab, Nur-Shafawati; Hasan, Norhazrini; Syed Abdul Aziz, Syed Hassan; Majid, Noorizan Abdul; Zilfalil, Bin Alwi
2013-01-01
AIM: To identify genes associated with gastric precancerous lesions in Helicobacter pylori (H. pylori)-susceptible ethnic Malays. METHODS: Twenty-three Malay subjects with H. pylori infection and gastric precancerous lesions identified during endoscopy were included as “cases”. Thirty-seven Malay subjects who were H. pylori negative and had no precancerous lesions were included as “controls”. Venous blood was collected for genotyping with Affymetrix 50K Xba1 kit. Genotypes with call rates < 90% for autosomal single nucleotide polymorphisms (SNPs) were excluded. For each precancerous lesion, associated SNPs were identified from Manhattan plots, and only SNPs with a χ2 P value < 0.05 and Hardy Weinberg Equilibrium P value > 0.5 was considered as significant markers. RESULTS: Of the 23 H. pylori-positive subjects recruited, one sample was excluded from further analysis due to a low genotyping call rate. Of the 22 H. pylori-positive samples, atrophic gastritis only was present in 50.0%, complete intestinal metaplasia was present in 18.25%, both incomplete intestinal metaplasia and dysplasia was present in 22.7%, and dysplasia only was present in 9.1%. SNPs rs9315542 (UFM1 gene), rs6878265 (THBS4 gene), rs1042194 (CYP2C19 gene) and rs10505799 (MGST1 gene) were significantly associated with atrophic gastritis, complete intestinal metaplasia, incomplete metaplasia with foci of dysplasia and dysplasia, respectively. Allele frequencies in “cases” vs “controls” for rs9315542, rs6878265, rs1042194 and rs10505799 were 0.4 vs 0.06, 0.6 vs 0.01, 0.6 vs 0.01 and 0.5 vs 0.02, respectively. CONCLUSION: Genetic variants possibly related to gastric precancerous lesions in ethnic Malays susceptible to H. pylori infection were identified for testing in subsequent trials. PMID:23801863
Tan, Chee K.; Carey, Alison J.; Cui, Xiangqin; Webb, Richard I.; Ipe, Deepak; Crowley, Michael; Cripps, Allan W.; Benjamin, William H.; Ulett, Kimberly B.; Schembri, Mark A.
2012-01-01
The most common causes of urinary tract infections (UTIs) are Gram-negative pathogens such as Escherichia coli; however, Gram-positive organisms, including Streptococcus agalactiae, or group B streptococcus (GBS), also cause UTI. In GBS infection, UTI progresses to cystitis once the bacteria colonize the bladder, but the host responses triggered in the bladder immediately following infection are largely unknown. Here, we used genome-wide expression profiling to map the bladder transcriptome of GBS UTI in mice infected transurethrally with uropathogenic GBS that was cultured from a 35-year-old women with cystitis. RNA from bladders was applied to Affymetrix Gene-1.0ST microarrays; quantitative reverse transcriptase PCR (qRT-PCR) was used to analyze selected gene responses identified in array data sets. A surprisingly small significant-gene list of 172 genes was identified at 24 h; this compared to 2,507 genes identified in a side-by-side comparison with uropathogenic E. coli (UPEC). No genes exhibited significantly altered expression at 2 h in GBS-infected mice according to arrays despite high bladder bacterial loads at this early time point. The absence of a marked early host response to GBS juxtaposed with broad-based bladder responses activated by UPEC at 2 h. Bioinformatics analyses, including integrative system-level network mapping, revealed multiple activated biological pathways in the GBS bladder transcriptome that regulate leukocyte activation, inflammation, apoptosis, and cytokine-chemokine biosynthesis. These findings define a novel, minimalistic type of bladder host response triggered by GBS UTI, which comprises collective antimicrobial pathways that differ dramatically from those activated by UPEC. Overall, this study emphasizes the unique nature of bladder immune activation mechanisms triggered by distinct uropathogens. PMID:22733575
Al-Hebshi, Nezar Noor; Li, Shiyong; Nasher, Akram Thabet; El-Setouhy, Maged; Alsanosi, Rashad; Blancato, Jan; Loffredo, Christopher
2016-07-15
The study sought to identify genetic aberrations driving oral squamous cell carcinoma (OSCC) development among users of shammah, an Arabian preparation of smokeless tobacco. Twenty archival OSCC samples, 15 of which with a history of shammah exposure, were whole-exome sequenced at an average depth of 127×. Somatic mutations were identified using a novel, matched controls-independent filtration algorithm. CODEX and Exomedepth coupled with a novel, Database of Genomic Variant-based filter were employed to call somatic gene-copy number variations. Significantly mutated genes were identified with Oncodrive FM and the Youn and Simon's method. Candidate driver genes were nominated based on Gene Set Enrichment Analysis. The observed mutational spectrum was similar to that reported by the TCGA project. In addition to confirming known genes of OSCC (TP53, CDKNA2, CASP8, PIK3CA, HRAS, FAT1, TP63, CCND1 and FADD) the analysis identified several candidate novel driver events including mutations of NOTCH3, CSMD3, CRB1, CLTCL1, OSMR and TRPM2, amplification of the proto-oncogenes FOSL1, RELA, TRAF6, MDM2, FRS2 and BAG1, and deletion of the recently described tumor suppressor SMARCC1. Analysis also revealed significantly altered pathways not previously implicated in OSCC including Oncostatin-M signalling pathway, AP-1 and C-MYB transcription networks and endocytosis. There was a trend for higher number of mutations, amplifications and driver events in samples with history of shammah exposure particularly those that tested EBV positive, suggesting an interaction between tobacco exposure and EBV. The work provides further evidence for the genetic heterogeneity of oral cancer and suggests shammah-associated OSCC is characterized by extensive amplification of oncogenes. © 2016 UICC.
Ferreira Filho, Jaire Alves; Horta, Maria Augusta Crivelente; Beloti, Lilian Luzia; Dos Santos, Clelton Aparecido; de Souza, Anete Pereira
2017-10-12
Trichoderma harzianum is used in biotechnology applications due to its ability to produce powerful enzymes for the conversion of lignocellulosic substrates into soluble sugars. Active enzymes involved in carbohydrate metabolism are defined as carbohydrate-active enzymes (CAZymes), and the most abundant family in the CAZy database is the glycoside hydrolases. The enzymes of this family play a fundamental role in the decomposition of plant biomass. In this study, the CAZymes of T. harzianum were identified and classified using bioinformatic approaches after which the expression profiles of all annotated CAZymes were assessed via RNA-Seq, and a phylogenetic analysis was performed. A total of 430 CAZymes (3.7% of the total proteins for this organism) were annotated in T. harzianum, including 259 glycoside hydrolases (GHs), 101 glycosyl transferases (GTs), 6 polysaccharide lyases (PLs), 22 carbohydrate esterases (CEs), 42 auxiliary activities (AAs) and 46 carbohydrate-binding modules (CBMs). Among the identified T. harzianum CAZymes, 47% were predicted to harbor a signal peptide sequence and were therefore classified as secreted proteins. The GH families were the CAZyme class with the greatest number of expressed genes, including GH18 (23 genes), GH3 (17 genes), GH16 (16 genes), GH2 (13 genes) and GH5 (12 genes). A phylogenetic analysis of the proteins in the AA9/GH61, CE5 and GH55 families showed high functional variation among the proteins. Identifying the main proteins used by T. harzianum for biomass degradation can ensure new advances in the biofuel production field. Herein, we annotated and characterized the expression levels of all of the CAZymes from T. harzianum, which may contribute to future studies focusing on the functional and structural characterization of the identified proteins.
2014-01-01
Background Copper is essential for the survival of aerobic organisms. If copper is not properly regulated in the body however, it can be extremely cytotoxic and genetic mutations that compromise copper homeostasis result in severe clinical phenotypes. Understanding how cells maintain optimal copper levels is therefore highly relevant to human health. Results We found that addition of copper (Cu) to culture medium leads to increased respiratory growth of yeast, a phenotype which we then systematically and quantitatively measured in 5050 homozygous diploid deletion strains. Cu’s positive effect on respiratory growth was quantitatively reduced in deletion strains representing 73 different genes, the function of which identify increased iron uptake as a cause of the increase in growth rate. Conversely, these effects were enhanced in strains representing 93 genes. Many of these strains exhibited respiratory defects that were specifically rescued by supplementing the growth medium with Cu. Among the genes identified are known and direct regulators of copper homeostasis, genes required to maintain low vacuolar pH, and genes where evidence supporting a functional link with Cu has been heretofore lacking. Roughly half of the genes are conserved in man, and several of these are associated with Mendelian disorders, including the Cu-imbalance syndromes Menkes and Wilson’s disease. We additionally demonstrate that pharmacological agents, including the approved drug disulfiram, can rescue Cu-deficiencies of both environmental and genetic origin. Conclusions A functional screen in yeast has expanded the list of genes required for Cu-dependent fitness, revealing a complex cellular system with implications for human health. Respiratory fitness defects arising from perturbations in this system can be corrected with pharmacological agents that increase intracellular copper concentrations. PMID:24708151
Camps, Carme; Petousi, Nayia; Bento, Celeste; Cario, Holger; Copley, Richard R.; McMullin, Mary Frances; van Wijk, Richard; Ratcliffe, Peter J.; Robbins, Peter A.; Taylor, Jenny C.
2016-01-01
Erythrocytosis is a rare disorder characterized by increased red cell mass and elevated hemoglobin concentration and hematocrit. Several genetic variants have been identified as causes for erythrocytosis in genes belonging to different pathways including oxygen sensing, erythropoiesis and oxygen transport. However, despite clinical investigation and screening for these mutations, the cause of disease cannot be found in a considerable number of patients, who are classified as having idiopathic erythrocytosis. In this study, we developed a targeted next-generation sequencing panel encompassing the exonic regions of 21 genes from relevant pathways (~79 Kb) and sequenced 125 patients with idiopathic erythrocytosis. The panel effectively screened 97% of coding regions of these genes, with an average coverage of 450×. It identified 51 different rare variants, all leading to alterations of protein sequence, with 57 out of 125 cases (45.6%) having at least one of these variants. Ten of these were known erythrocytosis-causing variants, which had been missed following existing diagnostic algorithms. Twenty-two were novel variants in erythrocytosis-associated genes (EGLN1, EPAS1, VHL, BPGM, JAK2, SH2B3) and in novel genes included in the panel (e.g. EPO, EGLN2, HIF3A, OS9), some with a high likelihood of functionality, for which future segregation, functional and replication studies will be useful to provide further evidence for causality. The rest were classified as polymorphisms. Overall, these results demonstrate the benefits of using a gene panel rather than existing methods in which focused genetic screening is performed depending on biochemical measurements: the gene panel improves diagnostic accuracy and provides the opportunity for discovery of novel variants. PMID:27651169
Camps, Carme; Petousi, Nayia; Bento, Celeste; Cario, Holger; Copley, Richard R; McMullin, Mary Frances; van Wijk, Richard; Ratcliffe, Peter J; Robbins, Peter A; Taylor, Jenny C
2016-11-01
Erythrocytosis is a rare disorder characterized by increased red cell mass and elevated hemoglobin concentration and hematocrit. Several genetic variants have been identified as causes for erythrocytosis in genes belonging to different pathways including oxygen sensing, erythropoiesis and oxygen transport. However, despite clinical investigation and screening for these mutations, the cause of disease cannot be found in a considerable number of patients, who are classified as having idiopathic erythrocytosis. In this study, we developed a targeted next-generation sequencing panel encompassing the exonic regions of 21 genes from relevant pathways (~79 Kb) and sequenced 125 patients with idiopathic erythrocytosis. The panel effectively screened 97% of coding regions of these genes, with an average coverage of 450×. It identified 51 different rare variants, all leading to alterations of protein sequence, with 57 out of 125 cases (45.6%) having at least one of these variants. Ten of these were known erythrocytosis-causing variants, which had been missed following existing diagnostic algorithms. Twenty-two were novel variants in erythrocytosis-associated genes (EGLN1, EPAS1, VHL, BPGM, JAK2, SH2B3) and in novel genes included in the panel (e.g. EPO, EGLN2, HIF3A, OS9), some with a high likelihood of functionality, for which future segregation, functional and replication studies will be useful to provide further evidence for causality. The rest were classified as polymorphisms. Overall, these results demonstrate the benefits of using a gene panel rather than existing methods in which focused genetic screening is performed depending on biochemical measurements: the gene panel improves diagnostic accuracy and provides the opportunity for discovery of novel variants. Copyright© Ferrata Storti Foundation.
Gravity-regulated gene expression in Arabidopsis thaliana
NASA Astrophysics Data System (ADS)
Sederoff, Heike; Brown, Christopher S.; Heber, Steffen; Kajla, Jyoti D.; Kumar, Sandeep; Lomax, Terri L.; Wheeler, Benjamin; Yalamanchili, Roopa
Plant growth and development is regulated by changes in environmental signals. Plants sense environmental changes and respond to them by modifying gene expression programs to ad-just cell growth, differentiation, and metabolism. Functional expression of genes comprises many different processes including transcription, translation, post-transcriptional and post-translational modifications, as well as the degradation of RNA and proteins. Recently, it was discovered that small RNAs (sRNA, 18-24 nucleotides long), which are heritable and systemic, are key elements in regulating gene expression in response to biotic and abiotic changes. Sev-eral different classes of sRNAs have been identified that are part of a non-cell autonomous and phloem-mobile network of regulators affecting transcript stability, translational kinetics, and DNA methylation patterns responsible for heritable transcriptional silencing (epigenetics). Our research has focused on gene expression changes in response to gravistimulation of Arabidopsis roots. Using high-throughput technologies including microarrays and 454 sequencing, we iden-tified rapid changes in transcript abundance of genes as well as differential expression of small RNA in Arabidopsis root apices after minutes of reorientation. Some of the differentially regu-lated transcripts are encoded by genes that are important for the bending response. Functional mutants of those genes respond faster to reorientation than the respective wild type plants, indicating that these proteins are repressors of differential cell elongation. We compared the gravity responsive sRNAs to the changes in transcript abundances of their putative targets and identified several potential miRNA: target pairs. Currently, we are using mutant and transgenic Arabidopsis plants to characterize the function of those miRNAs and their putative targets in gravitropic and phototropic responses in Arabidopsis.
Li, Jiang; Yoshikawa, Akane; Brennan, Mark D; Ramsey, Timothy L; Meltzer, Herbert Y
2018-02-01
Biomarkers which predict response to atypical antipsychotic drugs (AAPDs) increases their benefit/risk ratio. We sought to identify common variants in genes which predict response to lurasidone, an AAPD, by associating genome-wide association study (GWAS) data and changes (Δ) in Positive And Negative Syndrome Scale (PANSS) scores from two 6-week randomized, placebo-controlled trials of lurasidone in schizophrenia (SCZ) patients. We also included SCZ risk SNPs identified by the Psychiatric Genomics Consortium using a polygenic risk analysis. The top genomic loci, with uncorrected p<10 -4 , include: 1) synaptic adhesion (PTPRD, LRRC4C, NRXN1, ILIRAPL1, SLITRK1) and scaffolding (MAGI1, MAGI2, NBEA) genes, both essential for synaptic function; 2) other synaptic plasticity-related genes (NRG1/3 and KALRN); 3) the neuron-specific RNA splicing regulator, RBFOX1; and 4) ion channel genes, e.g. KCNA10, KCNAB1, KCNK9 and CACNA2D3). Some genes predicted response for patients with both European and African Ancestries. We replicated some SNPs reported to predict response to other atypical APDs in other GWAS. Although none of the biomarkers reached genome-wide significance, many of the genes and associated pathways have previously been linked to SCZ. Two polygenic modeling approaches, GCTA-GREML and PLINK-Polygenic Risk Score, demonstrated that some risk genes related to neurodevelopment, synaptic biology, immune response, and histones, also contributed to prediction of response. The top hits predicting response to lurasidone did not predict improvement with placebo. This is the first evidence from clinical trials that SCZ risk SNPs are related to clinical response to an AAPD. These results need to be replicated in an independent sample. Copyright © 2017. Published by Elsevier B.V.
Screening the molecular targets of ovarian cancer based on bioinformatics analysis.
Du, Lei; Qian, Xiaolei; Dai, Chenyang; Wang, Lihua; Huang, Ding; Wang, Shuying; Shen, Xiaowei
2015-01-01
Ovarian cancer (OC) is the most lethal gynecologic malignancy. This study aims to explore the molecular mechanisms of OC and identify potential molecular targets for OC treatment. Microarray gene expression data (GSE14407) including 12 normal ovarian surface epithelia samples and 12 OC epithelia samples were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) between 2 kinds of ovarian tissue were identified by using limma package in R language (|log2 fold change| gt;1 and false discovery rate [FDR] lt;0.05). Protein-protein interactions (PPIs) and known OC-related genes were screened from COXPRESdb and GenBank database, respectively. Furthermore, PPI network of top 10 upregulated DEGs and top 10 downregulated DEGs was constructed and visualized through Cytoscape software. Finally, for the genes involved in PPI network, functional enrichment analysis was performed by using DAVID (FDR lt;0.05). In total, 1136 DEGs were identified, including 544 downregulated and 592 upregulated DEGs. Then, PPI network was constructed, and DEGs CDKN2A, MUC1, OGN, ZIC1, SOX17, and TFAP2A interacted with known OC-related genes CDK4, EGFR/JUN, SRC, CLI1, CTNNB1, and TP53, respectively. Moreover, functions about oxygen transport and embryonic development were enriched by the genes involved in the network of downregulated DEGs. We propose that 4 DEGs (OGN, ZIC1, SOX17, and TFAP2A) and 2 functions (oxygen transport and embryonic development) might play a role in the development of OC. These 4 DEGs and known OC-related genes might serve as therapeutic targets for OC. Further studies are required to validate these predictions.
Cao, Chuanwang; Wang, Zhiying; Niu, Changying; Desneux, Nicolas; Gao, Xiwu
2013-01-01
Phenol is a major pollutant in aquatic ecosystems due to its chemical stability, water solubility and environmental mobility. To date, little is known about the molecular modifications of invertebrates under phenol stress. In the present study, we used Solexa sequencing technology to investigate the transcriptome and differentially expressed genes (DEGs) of midges (Chironomus kiinensis) in response to phenol stress. A total of 51,518,972 and 51,150,832 clean reads in the phenol-treated and control libraries, respectively, were obtained and assembled into 51,014 non-redundant (Nr) consensus sequences. A total of 6,032 unigenes were classified by Gene Ontology (GO), and 18,366 unigenes were categorized into 238 Kyoto Encyclopedia of Genes and Genomes (KEGG) categories. These genes included representatives from almost all functional categories. A total of 10,724 differentially expressed genes (P value <0.05) were detected in a comparative analysis of the expression profiles between phenol-treated and control C. kiinensis including 8,390 upregulated and 2,334 downregulated genes. The expression levels of 20 differentially expressed genes were confirmed by real-time RT-PCR, and the trends in gene expression that were observed matched the Solexa expression profiles, although the magnitude of the variations was different. Through pathway enrichment analysis, significantly enriched pathways were identified for the DEGs, including metabolic pathways, aryl hydrocarbon receptor (AhR), pancreatic secretion and neuroactive ligand-receptor interaction pathways, which may be associated with the phenol responses of C. kiinensis. Using Solexa sequencing technology, we identified several groups of key candidate genes as well as important biological pathways involved in the molecular modifications of chironomids under phenol stress. PMID:23527048
Meta-analysis of shared genetic architecture across ten pediatric autoimmune diseases
Li, Yun R; Li, Jin; Zhao, Sihai D; Bradfield, Jonathan P; Mentch, Frank D; Maggadottir, S Melkorka; Hou, Cuiping; Abrams, Debra J; Chang, Diana; Gao, Feng; Guo, Yiran; Wei, Zhi; Connolly, John J; Cardinale, Christopher J; Bakay, Marina; Glessner, Joseph T; Li, Dong; Kao, Charlly; Thomas, Kelly A; Qiu, Haijun; Chiavacci, Rosetta M; Kim, Cecilia E; Wang, Fengxiang; Snyder, James; Richie, Marylyn D; Flatø, Berit; Førre, Øystein; Denson, Lee A; Thompson, Susan D; Becker, Mara L; Guthery, Stephen L; Latiano, Anna; Perez, Elena; Resnick, Elena; Russell, Richard K; Wilson, David C; Silverberg, Mark S; Annese, Vito; Lie, Benedicte A; Punaro, Marilynn; Dubinsky, Marla C; Monos, Dimitri S; Strisciuglio, Caterina; Staiano, Annamaria; Miele, Erasmo; Kugathasan, Subra; Ellis, Justine A; Munro, Jane E; Sullivan, Kathleen E; Wise, Carol A; Chapel, Helen; Cunningham-Rundles, Charlotte; Grant, Struan F A; Orange, Jordan S; Sleiman, Patrick M A; Behrens, Edward M; Griffiths, Anne M; Satsangi, Jack; Finkel, Terri H; Keinan, Alon; Prak, Eline T Luning; Polychronakos, Constantin; Baldassano, Robert N; Li, Hongzhe; Keating, Brendan J; Hakonarson, Hakon
2016-01-01
Genome-wide association studies (GWASs) have identified hundreds of susceptibility genes, including shared associations across clinically distinct autoimmune diseases. We performed an inverse χ2 meta-analysis across ten pediatric-age-of-onset autoimmune diseases (pAIDs) in a case-control study including more than 6,035 cases and 10,718 shared population-based controls. We identified 27 genome-wide significant loci associated with one or more pAIDs, mapping to in silico–replicated autoimmune-associated genes (including IL2RA) and new candidate loci with established immunoregulatory functions such as ADGRL2, TENM3, ANKRD30A, ADCY7 and CD40LG. The pAID-associated single-nucleotide polymorphisms (SNPs) were functionally enriched for deoxyribonuclease (DNase)-hypersensitivity sites, expression quantitative trait loci (eQTLs), microRNA (miRNA)-binding sites and coding variants. We also identified biologically correlated, pAID-associated candidate gene sets on the basis of immune cell expression profiling and found evidence of genetic sharing. Network and protein-interaction analyses demonstrated converging roles for the signaling pathways of type 1, 2 and 17 helper T cells (TH1, TH2 and TH17), JAK-STAT, interferon and interleukin in multiple autoimmune diseases. PMID:26301688
Hernandez-Valladares, Maria; Rihet, Pascal; Iraqi, Fuad A
2014-01-01
There is growing evidence for human genetic factors controlling the outcome of malaria infection, while molecular basis of this genetic control is still poorly understood. Case-control and family-based studies have been carried out to identify genes underlying host susceptibility to malarial infection. Parasitemia and mild malaria have been genetically linked to human chromosomes 5q31-q33 and 6p21.3, and several immune genes located within those regions have been associated with malaria-related phenotypes. Association and linkage studies of resistance to malaria are not easy to carry out in human populations, because of the difficulty in surveying a significant number of families. Murine models have proven to be an excellent genetic tool for studying host response to malaria; their use allowed mapping 14 resistance loci, eight of them controlling parasitic levels and six controlling cerebral malaria. Once quantitative trait loci or genes have been identified, the human ortholog may then be identified. Comparative mapping studies showed that a couple of human and mouse might share similar genetically controlled mechanisms of resistance. In this way, char8, which controls parasitemia, was mapped on chromosome 11; char8 corresponds to human chromosome 5q31-q33 and contains immune genes, such as Il3, Il4, Il5, Il12b, Il13, Irf1, and Csf2. Nevertheless, part of the genetic factors controlling malaria traits might differ in both hosts because of specific host-pathogen interactions. Finally, novel genetic tools including animal models were recently developed and will offer new opportunities for identifying genetic factors underlying host phenotypic response to malaria, which will help in better therapeutic strategies including vaccine and drug development.
Johnson, S R; Leo, P J; McInerney-Leo, A M; Anderson, L K; Marshall, M; McGown, I; Newell, F; Brown, M A; Conwell, L S; Harris, M; Duncan, E L
2018-06-01
To assess the utility of whole-exome sequencing (WES) for mutation detection in maturity-onset diabetes of the young (MODY) and congenital hyperinsulinism (CHI). MODY and CHI are the two commonest monogenic disorders of glucose-regulated insulin secretion in childhood, with 13 causative genes known for MODY and 10 causative genes identified for CHI. The large number of potential genes makes comprehensive screening using traditional methods expensive and time-consuming. Ten subjects with MODY and five with CHI with known mutations underwent WES using two different exome capture kits (Nimblegen SeqCap EZ Human v3.0 Exome Enrichment Kit, Nextera Rapid Capture Exome Kit). Analysis was blinded to previously identified mutations, and included assessment for large deletions. The target capture of five exome capture technologies was also analyzed using sequencing data from >2800 unrelated samples. Four of five MODY mutations were identified using Nimblegen (including a large deletion in HNF1B). Although targeted, one mutation (in INS) had insufficient coverage for detection. Eleven of eleven mutations (six MODY, five CHI) were identified using Nextera Rapid (including the previously missed mutation). On reconciliation, all mutations concorded with previous data and no additional variants in MODY genes were detected. There were marked differences in the performance of the capture technologies. WES can be useful for screening for MODY/CHI mutations, detecting both point mutations and large deletions. However, capture technologies require careful selection. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Gene and miRNA expression profiles in autism spectrum disorders.
Ghahramani Seno, Mohammad M; Hu, Pingzhao; Gwadry, Fuad G; Pinto, Dalila; Marshall, Christian R; Casallo, Guillermo; Scherer, Stephen W
2011-03-22
Accumulating data indicate that there is significant genetic heterogeneity underlying the etiology in individuals diagnosed with autism spectrum disorder (ASD). Some rare and highly-penetrant gene variants and copy number variation (CNV) regions including NLGN3, NLGN4, NRXN1, SHANK2, SHANK3, PTCHD1, 1q21.1, maternally-inherited duplication of 15q11-q13, 16p11.2, amongst others, have been identified to be involved in ASD. Genome-wide association studies have identified other apparently low risk loci and in some other cases, ASD arises as a co-morbid phenotype with other medical genetic conditions (e.g. fragile X). The progress studying the genetics of ASD has largely been accomplished using genomic analyses of germline-derived DNA. Here, we used gene and miRNA expression profiling using cell-line derived total RNA to evaluate possible transcripts and networks of molecules involved in ASD. Our analysis identified several novel dysregulated genes and miRNAs in ASD compared with controls, including HEY1, SOX9, miR-486 and miR-181b. All of these are involved in nervous system development and function and some others, for example, are involved in NOTCH signaling networks (e.g. HEY1). Further, we found significant enrichment in molecules associated with neurological disorders such as Rett syndrome and those associated with nervous system development and function including long-term potentiation. Our data will provide a valuable resource for discovery purposes and for comparison to other gene expression-based, genome-wide DNA studies and other functional data. Copyright © 2010 Elsevier B.V. All rights reserved.
Egan, Jan B.; Barrett, Michael T.; Champion, Mia D.; Middha, Sumit; Lenkiewicz, Elizabeth; Evers, Lisa; Francis, Princy; Schmidt, Jessica; Shi, Chang-Xin; Van Wier, Scott; Badar, Sandra; Ahmann, Gregory; Kortuem, K. Martin; Boczek, Nicole J.; Fonseca, Rafael; Craig, David W.; Carpten, John D.; Borad, Mitesh J.; Stewart, A. Keith
2014-01-01
Liposarcoma is the most common soft tissue sarcoma, but little is known about the genomic basis of this disease. Given the low cell content of this tumor type, we utilized flow cytometry to isolate the diploid normal and aneuploid tumor populations from a well-differentiated liposarcoma prior to array comparative genomic hybridization and whole genome sequencing. This work revealed massive highly focal amplifications throughout the aneuploid tumor genome including MDM2, a gene that has previously been found to be amplified in well-differentiated liposarcoma. Structural analysis revealed massive rearrangement of chromosome 12 and 11 gene fusions, some of which may be part of double minute chromosomes commonly present in well-differentiated liposarcoma. We identified a hotspot of genomic instability localized to a region of chromosome 12 that includes a highly conserved, putative L1 retrotransposon element, LOC100507498 which resides within a gene cluster (NAV3, SYT1, PAWR) where 6 of the 11 fusion events occurred. Interestingly, a potential gene fusion was also identified in amplified DDR2, which is a potential therapeutic target of kinase inhibitors such as dastinib, that are not routinely used in the treatment of patients with liposarcoma. Furthermore, 7 somatic, damaging single nucleotide variants have also been identified, including D125N in the PTPRQ protein. In conclusion, this work is the first to report the entire genome of a well-differentiated liposarcoma with novel chromosomal rearrangements associated with amplification of therapeutically targetable genes such as MDM2 and DDR2. PMID:24505276
Ultsch, Alfred; Kringel, Dario; Kalso, Eija; Mogil, Jeffrey S; Lötsch, Jörn
2016-12-01
The increasing availability of "big data" enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 535 genes identified empirically as relevant to pain with the knowledge about the functions of thousands of genes. Starting from an accepted description of chronic pain as displaying systemic features described by the terms "learning" and "neuronal plasticity," a functional genomics analysis proposed that among the functions of the 535 "pain genes," the biological processes "learning or memory" (P = 8.6 × 10) and "nervous system development" (P = 2.4 × 10) are statistically significantly overrepresented as compared with the annotations to these processes expected by chance. After establishing that the hypothesized biological processes were among important functional genomics features of pain, a subset of n = 34 pain genes were found to be annotated with both Gene Ontology terms. Published empirical evidence supporting their involvement in chronic pain was identified for almost all these genes, including 1 gene identified in March 2016 as being involved in pain. By contrast, such evidence was virtually absent in a randomly selected set of 34 other human genes. Hence, the present computational functional genomics-based method can be used for candidate gene selection, providing an alternative to established methods.
The ethylene response pathway in Arabidopsis
NASA Technical Reports Server (NTRS)
Kieber, J. J.; Evans, M. L. (Principal Investigator)
1997-01-01
The simple gas ethylene influences a diverse array of plant growth and developmental processes including germination, senescence, cell elongation, and fruit ripening. This review focuses on recent molecular genetic studies, principally in Arabidopsis, in which components of the ethylene response pathway have been identified. The isolation and characterization of two of these genes has revealed that ethylene sensing involves a protein kinase cascade. One of these genes encodes a protein with similarity to the ubiquitous Raf family of Ser/Thr protein kinases. A second gene shows similarity to the prokaryotic two-component histidine kinases and most likely encodes an ethylene receptor. Additional elements involved in ethylene signaling have only been identified genetically. The characterization of these genes and mutants will be discussed.
Tbx2/3 is an essential mediator within the Brachyury gene network during Ciona notochord development
José-Edwards, Diana S.; Oda-Ishii, Izumi; Nibu, Yutaka; Di Gregorio, Anna
2013-01-01
T-box genes are potent regulators of mesoderm development in many metazoans. In chordate embryos, the T-box transcription factor Brachyury (Bra) is required for specification and differentiation of the notochord. In some chordates, including the ascidian Ciona, members of the Tbx2 subfamily of T-box genes are also expressed in this tissue; however, their regulatory relationships with Bra and their contributions to the development of the notochord remain uncharacterized. We determined that the notochord expression of Ciona Tbx2/3 (Ci-Tbx2/3) requires Ci-Bra, and identified a Ci-Tbx2/3 notochord CRM that necessitates multiple Ci-Bra binding sites for its activity. Expression of mutant forms of Ci-Tbx2/3 in the developing notochord revealed a role for this transcription factor primarily in convergent extension. Through microarray screens, we uncovered numerous Ci-Tbx2/3 targets, some of which overlap with known Ci-Bra-downstream notochord genes. Among the Ci-Tbx2/3 notochord targets are evolutionarily conserved genes, including caspases, lineage-specific genes, such as Noto4, and newly identified genes, such as MLKL. This work sheds light on a large section of the notochord regulatory circuitry controlled by T-box factors, and reveals new components of the complement of genes required for the proper formation of this structure. PMID:23674602
José-Edwards, Diana S; Oda-Ishii, Izumi; Nibu, Yutaka; Di Gregorio, Anna
2013-06-01
T-box genes are potent regulators of mesoderm development in many metazoans. In chordate embryos, the T-box transcription factor Brachyury (Bra) is required for specification and differentiation of the notochord. In some chordates, including the ascidian Ciona, members of the Tbx2 subfamily of T-box genes are also expressed in this tissue; however, their regulatory relationships with Bra and their contributions to the development of the notochord remain uncharacterized. We determined that the notochord expression of Ciona Tbx2/3 (Ci-Tbx2/3) requires Ci-Bra, and identified a Ci-Tbx2/3 notochord CRM that necessitates multiple Ci-Bra binding sites for its activity. Expression of mutant forms of Ci-Tbx2/3 in the developing notochord revealed a role for this transcription factor primarily in convergent extension. Through microarray screens, we uncovered numerous Ci-Tbx2/3 targets, some of which overlap with known Ci-Bra-downstream notochord genes. Among the Ci-Tbx2/3 notochord targets are evolutionarily conserved genes, including caspases, lineage-specific genes, such as Noto4, and newly identified genes, such as MLKL. This work sheds light on a large section of the notochord regulatory circuitry controlled by T-box factors, and reveals new components of the complement of genes required for the proper formation of this structure.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl; Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht; Institute for Risk Assessment Sciences
2013-10-01
The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol andmore » saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.« less
Fear, Justin M; Arbeitman, Michelle N; Salomon, Matthew P; Dalton, Justin E; Tower, John; Nuzhdin, Sergey V; McIntyre, Lauren M
2015-09-04
The Drosophila sex determination hierarchy is a classic example of a transcriptional regulatory hierarchy, with sex-specific isoforms regulating morphology and behavior. We use a structural equation modeling approach, leveraging natural genetic variation from two studies on Drosophila female head tissues--DSPR collection (596 F1-hybrids from crosses between DSPR sub-populations) and CEGS population (75 F1-hybrids from crosses between DGRP/Winters lines to a reference strain w1118)--to expand understanding of the sex hierarchy gene regulatory network (GRN). This approach is completely generalizable to any natural population, including humans. We expanded the sex hierarchy GRN adding novel links among genes, including a link from fruitless (fru) to Sex-lethal (Sxl) identified in both populations. This link is further supported by the presence of fru binding sites in the Sxl locus. 754 candidate genes were added to the pathway, including the splicing factors male-specific lethal 2 and Rm62 as downstream targets of Sxl which are well-supported links in males. Independent studies of doublesex and transformer mutants support many additions, including evidence for a link between the sex hierarchy and metabolism, via Insulin-like receptor. The genes added in the CEGS population were enriched for genes with sex-biased splicing and components of the spliceosome. A common goal of molecular biologists is to expand understanding about regulatory interactions among genes. Using natural alleles we can not only identify novel relationships, but using supervised approaches can order genes into a regulatory hierarchy. Combining these results with independent large effect mutation studies, allows clear candidates for detailed molecular follow-up to emerge.
Dobson, Adam J.; Chaston, John M.; Newell, Peter D.; Donahue, Leanne; Hermann, Sara L.; Sannino, David R.; Westmiller, Stephanie; Wong, Adam C.-N.; Clark, Andrew G.; Lazzaro, Brian P.; Douglas, Angela E.
2015-01-01
Animals bear communities of gut microorganisms with substantial effects on animal nutrition, but the host genetic basis of these effects is unknown. Here, we use Drosophila to demonstrate substantial among-genotype variation in the effects of eliminating the gut microbiota on five host nutritional indices (weight, and protein, lipid, glucose and glycogen contents); this includes variation in both the magnitude and direction of microbiota-dependent effects. Genome-wide associations to identify the genetic basis of the microbiota-dependent variation reveal polymorphisms in largely non-overlapping sets of genes associated with variation in the nutritional traits, including strong representation of conserved genes functioning in signaling. Key genes identified by the GWA study are validated by loss-of-function mutations that altered microbiota-dependent nutritional effects. We conclude that the microbiota interacts with the animal at multiple points in the signaling and regulatory networks that determine animal nutrition. These interactions with the microbiota are likely conserved across animals, including humans. PMID:25692519
Page, Robert B; Monaghan, James R; Samuels, Amy K; Smith, Jeramiah J; Beachy, Christopher K; Voss, S Randal
2007-02-01
Ambystomatid salamanders offer several advantages for endocrine disruption research, including genomic and bioinformatics resources, an accessible laboratory model (Ambystoma mexicanum), and natural lineages that are broadly distributed among North American habitats. We used microarray analysis to measure the relative abundance of transcripts isolated from A. mexicanum epidermis (skin) after exogenous application of thyroid hormone (TH). Only one gene had a >2-fold change in transcript abundance after 2 days of TH treatment. However, hundreds of genes showed significantly different transcript levels at days 12 and 28 in comparison to day 0. A list of 123 TH-responsive genes was identified using statistical, BLAST, and fold level criteria. Cluster analysis identified two groups of genes with similar transcription patterns: up-regulated versus down-regulated. Most notably, several keratins exhibited dramatic (1000 fold) increases or decreases in transcript abundance. Keratin gene expression changes coincided with morphological remodeling of epithelial tissues. This suggests that keratin loci can be developed as sensitive biomarkers to assay temporal disruptions of larval-to-adult gene expression programs. Our study has identified the first collection of loci that are regulated during TH-induced metamorphosis in a salamander, thus setting the stage for future investigations of TH disruption in the Mexican axolotl and other salamanders of the genus Ambystoma.
Identification of somatic mutations in non-small cell lung carcinomas using whole-exome sequencing
Liu, Pengyuan; Morrison, Carl; Wang, Liang; Xiong, Donghai; Vedell, Peter; Cui, Peng; Hua, Xing; Ding, Feng; Lu, Yan; James, Michael; Ebben, John D.; Xu, Haiming; Adjei, Alex A.; Head, Karen; Andrae, Jaime W.; Tschannen, Michael R.; Jacob, Howard; Pan, Jing; Zhang, Qi; Van den Bergh, Francoise; Xiao, Haijie; Lo, Ken C.; Patel, Jigar; Richmond, Todd; Watt, Mary-Anne; Albert, Thomas; Selzer, Rebecca; Anderson, Marshall; Wang, Jiang; Wang, Yian; Starnes, Sandra; Yang, Ping; You, Ming
2012-01-01
Lung cancer is the leading cause of cancer-related death, with non-small cell lung cancer (NSCLC) being the predominant form of the disease. Most lung cancer is caused by the accumulation of genomic alterations due to tobacco exposure. To uncover its mutational landscape, we performed whole-exome sequencing in 31 NSCLCs and their matched normal tissue samples. We identified both common and unique mutation spectra and pathway activation in lung adenocarcinomas and squamous cell carcinomas, two major histologies in NSCLC. In addition to identifying previously known lung cancer genes (TP53, KRAS, EGFR, CDKN2A and RB1), the analysis revealed many genes not previously implicated in this malignancy. Notably, a novel gene CSMD3 was identified as the second most frequently mutated gene (next to TP53) in lung cancer. We further demonstrated that loss of CSMD3 results in increased proliferation of airway epithelial cells. The study provides unprecedented insights into mutational processes, cellular pathways and gene networks associated with lung cancer. Of potential immediate clinical relevance, several highly mutated genes identified in our study are promising druggable targets in cancer therapy including ALK, CTNNA3, DCC, MLL3, PCDHIIX, PIK3C2B, PIK3CG and ROCK2. PMID:22510280
Xu, Jiajia; Bräutigam, Andrea; Weber, Andreas P. M.; Zhu, Xin-Guang
2016-01-01
Identification of potential cis-regulatory motifs controlling the development of C4 photosynthesis is a major focus of current research. In this study, we used time-series RNA-seq data collected from etiolated maize and rice leaf tissues sampled during a de-etiolation process to systematically characterize the expression patterns of C4-related genes and to further identify potential cis elements in five different genomic regions (i.e. promoter, 5′UTR, 3′UTR, intron, and coding sequence) of C4 orthologous genes. The results demonstrate that although most of the C4 genes show similar expression patterns, a number of them, including chloroplast dicarboxylate transporter 1, aspartate aminotransferase, and triose phosphate transporter, show shifted expression patterns compared with their C3 counterparts. A number of conserved short DNA motifs between maize C4 genes and their rice orthologous genes were identified not only in the promoter, 5′UTR, 3′UTR, and coding sequences, but also in the introns of core C4 genes. We also identified cis-regulatory motifs that exist in maize C4 genes and also in genes showing similar expression patterns as maize C4 genes but that do not exist in rice C3 orthologs, suggesting a possible recruitment of pre-existing cis-elements from genes unrelated to C4 photosynthesis into C4 photosynthesis genes during C4 evolution. PMID:27436282
Identification of a novel Gig2 gene family specific to non-amniote vertebrates.
Zhang, Yi-Bing; Liu, Ting-Kai; Jiang, Jun; Shi, Jun; Liu, Ying; Li, Shun; Gui, Jian-Fang
2013-01-01
Gig2 (grass carp reovirus (GCRV)-induced gene 2) is first identified as a novel fish interferon (IFN)-stimulated gene (ISG). Overexpression of a zebrafish Gig2 gene can protect cultured fish cells from virus infection. In the present study, we identify a novel gene family that is comprised of genes homologous to the previously characterized Gig2. EST/GSS search and in silico cloning identify 190 Gig2 homologous genes in 51 vertebrate species ranged from lampreys to amphibians. Further large-scale search of vertebrate and invertebrate genome databases indicate that Gig2 gene family is specific to non-amniotes including lampreys, sharks/rays, ray-finned fishes and amphibians. Phylogenetic analysis and synteny analysis reveal lineage-specific expansion of Gig2 gene family and also provide valuable evidence for the fish-specific genome duplication (FSGD) hypothesis. Although Gig2 family proteins exhibit no significant sequence similarity to any known proteins, a typical Gig2 protein appears to consist of two conserved parts: an N-terminus that bears very low homology to the catalytic domains of poly(ADP-ribose) polymerases (PARPs), and a novel C-terminal domain that is unique to this gene family. Expression profiling of zebrafish Gig2 family genes shows that some duplicate pairs have diverged in function via acquisition of novel spatial and/or temporal expression under stresses. The specificity of this gene family to non-amniotes might contribute to a large extent to distinct physiology in non-amniote vertebrates.
Characterisation of the subtelomeric regions of Giardia lamblia genome isolate WBC6.
Prabhu, Anjali; Morrison, Hilary G; Martinez, Charles R; Adam, Rodney D
2007-04-01
Giardia trophozoites are polyploid and have five chromosomes. The chromosome homologues demonstrate considerable size heterogeneity due to variation in the subtelomeric regions. We used clones from the genome project with telomeric sequence at one end to identify six subtelomeric regions in addition to previously identified subtelomeric regions, to study the telomeric arrangement of the chromosomes. The subtelomeric regions included two retroposons, one retroposon pseudogene, and two vsp genes, in addition to the previously identified subtelomeric regions that include ribosomal DNA repeats. The presence of vsp genes in a subtelomeric region suggests that telomeric rearrangements may contribute to the generation of vsp diversity. These studies of the subtelomeric regions of Giardia may contribute to our understanding of the factors that maintain stability, while allowing diversity in chromosome structure.
Voss, Joachim G.; Dobra, Adrian; Morse, Caryn; Kovacs, Joseph A.; Danner, Robert L.; Munson, Peter J.; Logan, Carolea; Rangel, Zoila; Adelsberger, Joseph W.; McLaughlin, Mary; Adams, Larry D.; Raju, Raghavan; Dalakas, Marinos C.
2016-01-01
Purpose Human immunodeficiency virus (HIV)–related fatigue (HRF) is multicausal and potentially related to mitochondrial dysfunction caused by antiretroviral therapy with nucleoside reverse transcriptase inhibitors (NRTIs). Methodology The authors compared gene expression profiles of CD14+ cells of low versus high fatigued, NRTI-treated HIV patients to healthy controls (n = 5/group). The authors identified 32 genes predictive of low versus high fatigue and 33 genes predictive of healthy versus HIV infection. The authors constructed genetic networks to further elucidate the possible biological pathways in which these genes are involved. Relevance for nursing practice Genes including the actin cytoskeletal regulatory proteins Prokineticin 2 and Cofilin 2 along with mitochondrial inner membrane proteins are involved in multiple pathways and were predictors of fatigue status. Previously identified inflammatory and signaling genes were predictive of HIV status, clearly confirming our results and suggesting a possible further connection between mitochondrial function and HIV. Isolated CD14+ cells are easily accessible cells that could be used for further study of the connection between fatigue and mitochondrial function of HIV patients. Implication for Practice The findings from this pilot study take us one step closer to identifying biomarker targets for fatigue status and mitochondrial dysfunction. Specific biomarkers will be pertinent to the development of methodologies to diagnosis, monitor, and treat fatigue and mitochondrial dysfunction. PMID:23324479
Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods.
Wang, Liming; Zhu, L; Luan, R; Wang, L; Fu, J; Wang, X; Sui, L
2016-10-10
Dilated cardiomyopathy (DCM) is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs) were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs) and microRNAs (miRNAs) of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT) were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family). Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1), potential TFs, as well as potential miRNAs, might be involved in DCM.
Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods
Wang, Liming; Zhu, L.; Luan, R.; Wang, L.; Fu, J.; Wang, X.; Sui, L.
2016-01-01
Dilated cardiomyopathy (DCM) is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs) were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs) and microRNAs (miRNAs) of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT) were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family). Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1), potential TFs, as well as potential miRNAs, might be involved in DCM. PMID:27737314
Cell Wall Composition and Candidate Biosynthesis Gene Expression During Rice Development
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lin, Fan; Manisseri, Chithra; Fagerström, Alexandra
Cell walls of grasses, including cereal crops and biofuel grasses, comprise the majority of plant biomass and intimately influence plant growth, development and physiology. However, the functions of many cell wall synthesis genes, and the relationships among and the functions of cell wall components remain obscure. To better understand the patterns of cell wall accumulation and identify genes that act in grass cell wall biosynthesis, we characterized 30 samples from aerial organs of rice (Oryza sativa cv. Kitaake) at 10 developmental time points, 3-100 d post-germination. Within these samples, we measured 15 cell wall chemical components, enzymatic digestibility and 18more » cell wall polysaccharide epitopes/ligands. We also used quantitative reverse transcription-PCR to measure expression of 50 glycosyltransferases, 15 acyltransferases and eight phenylpropanoid genes, many of which had previously been identified as being highly expressed in rice. Most cell wall components vary significantly during development, and correlations among them support current understanding of cell walls. We identified 92 significant correlations between cell wall components and gene expression and establish nine strong hypotheses for genes that synthesize xylans, mixed linkage glucan and pectin components. This work provides an extensive analysis of cell wall composition throughout rice development, identifies genes likely to synthesize grass cell walls, and provides a framework for development of genetically improved grasses for use in lignocellulosic biofuel production and agriculture.« less
Peng, W-F; Xu, S-S; Ren, X; Lv, F-H; Xie, X-L; Zhao, Y-X; Zhang, M; Shen, Z-Q; Ren, Y-L; Gao, L; Shen, M; Kantanen, J; Li, M-H
2017-10-01
Genome-wide association studies (GWASs) have been widely applied in livestock to identify genes associated with traits of economic interest. Here, we conducted the first GWAS of the supernumerary nipple phenotype in Wadi sheep, a native Chinese sheep breed, based on Ovine Infinium HD SNP BeadChip genotypes in a total of 144 ewes (75 cases with four teats, including two normal and two supernumerary teats, and 69 control cases with two teats). We detected 63 significant SNPs at the chromosome-wise threshold. Additionally, one candidate region (chr1: 170.723-170.734 Mb) was identified by haplotype-based association tests, with one SNP (rs413490006) surrounding functional genes BBX and CD47 on chromosome 1 being commonly identified as significant by the two mentioned analyses. Moreover, Gene Ontology enrichment for the significant SNPs identified by the GWAS analysis was functionally clustered into the categories of receptor activity and synaptic membrane. In addition, pathway mapping revealed four promising pathways (Wnt, oxytocin, MAPK and axon guidance) involved in the development of the supernumerary nipple phenotype. Our results provide novel and important insights into the genetic mechanisms underlying the phenotype of supernumerary nipples in mammals, including humans. These findings may be useful for future breeding and genetics in sheep and other livestock. © 2017 Stichting International Foundation for Animal Genetics.
Weerakkody, Ruwan A; Vandrovcova, Jana; Kanonidou, Christina; Mueller, Michael; Gampawar, Piyush; Ibrahim, Yousef; Norsworthy, Penny; Biggs, Jennifer; Abdullah, Abdulshakur; Ross, David; Black, Holly A; Ferguson, David; Cheshire, Nicholas J; Kazkaz, Hanadi; Grahame, Rodney; Ghali, Neeti; Vandersteen, Anthony; Pope, F Michael; Aitman, Timothy J
2016-11-01
Ehlers-Danlos syndrome (EDS) comprises a group of overlapping hereditary disorders of connective tissue with significant morbidity and mortality, including major vascular complications. We sought to identify the diagnostic utility of a next-generation sequencing (NGS) panel in a mixed EDS cohort. We developed and applied PCR-based NGS assays for targeted, unbiased sequencing of 12 collagen and aortopathy genes to a cohort of 177 unrelated EDS patients. Variants were scored blind to previous genetic testing and then compared with results of previous Sanger sequencing. Twenty-eight pathogenic variants in COL5A1/2, COL3A1, FBN1, and COL1A1 and four likely pathogenic variants in COL1A1, TGFBR1/2, and SMAD3 were identified by the NGS assays. These included all previously detected single-nucleotide and other short pathogenic variants in these genes, and seven newly detected pathogenic or likely pathogenic variants leading to clinically significant diagnostic revisions. Twenty-two variants of uncertain significance were identified, seven of which were in aortopathy genes and required clinical follow-up. Unbiased NGS-based sequencing made new molecular diagnoses outside the expected EDS genotype-phenotype relationship and identified previously undetected clinically actionable variants in aortopathy susceptibility genes. These data may be of value in guiding future clinical pathways for genetic diagnosis in EDS.Genet Med 18 11, 1119-1127.
Exome sequencing in amyotrophic lateral sclerosis identifies risk genes and pathways.
Cirulli, Elizabeth T; Lasseigne, Brittany N; Petrovski, Slavé; Sapp, Peter C; Dion, Patrick A; Leblond, Claire S; Couthouis, Julien; Lu, Yi-Fan; Wang, Quanli; Krueger, Brian J; Ren, Zhong; Keebler, Jonathan; Han, Yujun; Levy, Shawn E; Boone, Braden E; Wimbish, Jack R; Waite, Lindsay L; Jones, Angela L; Carulli, John P; Day-Williams, Aaron G; Staropoli, John F; Xin, Winnie W; Chesi, Alessandra; Raphael, Alya R; McKenna-Yasek, Diane; Cady, Janet; Vianney de Jong, J M B; Kenna, Kevin P; Smith, Bradley N; Topp, Simon; Miller, Jack; Gkazi, Athina; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan; Silani, Vincenzo; Ticozzi, Nicola; Shaw, Christopher E; Baloh, Robert H; Appel, Stanley; Simpson, Ericka; Lagier-Tourenne, Clotilde; Pulst, Stefan M; Gibson, Summer; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Grossman, Murray; Shneider, Neil A; Chung, Wendy K; Ravits, John M; Glass, Jonathan D; Sims, Katherine B; Van Deerlin, Vivianna M; Maniatis, Tom; Hayes, Sebastian D; Ordureau, Alban; Swarup, Sharan; Landers, John; Baas, Frank; Allen, Andrew S; Bedlack, Richard S; Harper, J Wade; Gitler, Aaron D; Rouleau, Guy A; Brown, Robert; Harms, Matthew B; Cooper, Gregory M; Harris, Tim; Myers, Richard M; Goldstein, David B
2015-03-27
Amyotrophic lateral sclerosis (ALS) is a devastating neurological disease with no effective treatment. We report the results of a moderate-scale sequencing study aimed at increasing the number of genes known to contribute to predisposition for ALS. We performed whole-exome sequencing of 2869 ALS patients and 6405 controls. Several known ALS genes were found to be associated, and TBK1 (the gene encoding TANK-binding kinase 1) was identified as an ALS gene. TBK1 is known to bind to and phosphorylate a number of proteins involved in innate immunity and autophagy, including optineurin (OPTN) and p62 (SQSTM1/sequestosome), both of which have also been implicated in ALS. These observations reveal a key role of the autophagic pathway in ALS and suggest specific targets for therapeutic intervention. Copyright © 2015, American Association for the Advancement of Science.
Elucidate the Mechanism of Telomere Maintenance in STAG2 Mutated Tumor Cells
2017-12-01
recent analysis identified the cohesin subunit STAG2 as one of twelve genes mutated in four or more tumor types including melanoma, pancreatic...conferences, seminars, study groups , and individual study. Include participation in conferences, workshops, and seminars not listed under major...only 12 genes found to be significantly mutated in four or more cancer types (18). Approximately 85% of STAG2 mutations are truncating and often result
Integration of QTL and bioinformatic tools to identify candidate genes for triglycerides in mice[S
Leduc, Magalie S.; Hageman, Rachael S.; Verdugo, Ricardo A.; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A.; Paigen, Beverly
2011-01-01
To identify genetic loci influencing lipid levels, we performed quantitative trait loci (QTL) analysis between inbred mouse strains MRL/MpJ and SM/J, measuring triglyceride levels at 8 weeks of age in F2 mice fed a chow diet. We identified one significant QTL on chromosome (Chr) 15 and three suggestive QTL on Chrs 2, 7, and 17. We also carried out microarray analysis on the livers of parental strains of 282 F2 mice and used these data to find cis-regulated expression QTL. We then narrowed the list of candidate genes under significant QTL using a “toolbox” of bioinformatic resources, including haplotype analysis; parental strain comparison for gene expression differences and nonsynonymous coding single nucleotide polymorphisms (SNP); cis-regulated eQTL in livers of F2 mice; correlation between gene expression and phenotype; and conditioning of expression on the phenotype. We suggest Slc25a7 as a candidate gene for the Chr 7 QTL and, based on expression differences, five genes (Polr3 h, Cyp2d22, Cyp2d26, Tspo, and Ttll12) as candidate genes for Chr 15 QTL. This study shows how bioinformatics can be used effectively to reduce candidate gene lists for QTL related to complex traits. PMID:21622629
Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network.
Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin
2016-05-05
Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer.
Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network
Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin
2016-01-01
Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer. PMID:27149165
A Multiomics Approach to Identify Genes Associated with Childhood Asthma Risk and Morbidity.
Forno, Erick; Wang, Ting; Yan, Qi; Brehm, John; Acosta-Perez, Edna; Colon-Semidey, Angel; Alvarez, Maria; Boutaoui, Nadia; Cloutier, Michelle M; Alcorn, John F; Canino, Glorisa; Chen, Wei; Celedón, Juan C
2017-10-01
Childhood asthma is a complex disease. In this study, we aim to identify genes associated with childhood asthma through a multiomics "vertical" approach that integrates multiple analytical steps using linear and logistic regression models. In a case-control study of childhood asthma in Puerto Ricans (n = 1,127), we used adjusted linear or logistic regression models to evaluate associations between several analytical steps of omics data, including genome-wide (GW) genotype data, GW methylation, GW expression profiling, cytokine levels, asthma-intermediate phenotypes, and asthma status. At each point, only the top genes/single-nucleotide polymorphisms/probes/cytokines were carried forward for subsequent analysis. In step 1, asthma modified the gene expression-protein level association for 1,645 genes; pathway analysis showed an enrichment of these genes in the cytokine signaling system (n = 269 genes). In steps 2-3, expression levels of 40 genes were associated with intermediate phenotypes (asthma onset age, forced expiratory volume in 1 second, exacerbations, eosinophil counts, and skin test reactivity); of those, methylation of seven genes was also associated with asthma. Of these seven candidate genes, IL5RA was also significant in analytical steps 4-8. We then measured plasma IL-5 receptor α levels, which were associated with asthma age of onset and moderate-severe exacerbations. In addition, in silico database analysis showed that several of our identified IL5RA single-nucleotide polymorphisms are associated with transcription factors related to asthma and atopy. This approach integrates several analytical steps and is able to identify biologically relevant asthma-related genes, such as IL5RA. It differs from other methods that rely on complex statistical models with various assumptions.
Di Gregorio, E; Riberi, E; Belligni, E F; Biamino, E; Spielmann, M; Ala, U; Calcia, A; Bagnasco, I; Carli, D; Gai, G; Giordano, M; Guala, A; Keller, R; Mandrile, G; Arduino, C; Maffè, A; Naretto, V G; Sirchia, F; Sorasio, L; Ungari, S; Zonta, A; Zacchetti, G; Talarico, F; Pappi, P; Cavalieri, S; Giorgio, E; Mancini, C; Ferrero, M; Brussino, A; Savin, E; Gandione, M; Pelle, A; Giachino, D F; De Marchi, M; Restagno, G; Provero, P; Cirillo Silengo, M; Grosso, E; Buxbaum, J D; Pasini, B; De Rubeis, S; Brusco, A; Ferrero, G B
2017-10-01
Array-comparative genomic hybridization (array-CGH) is a widely used technique to detect copy number variants (CNVs) associated with developmental delay/intellectual disability (DD/ID). Identification of genomic disorders in DD/ID. We performed a comprehensive array-CGH investigation of 1,015 consecutive cases with DD/ID and combined literature mining, genetic evidence, evolutionary constraint scores, and functional information in order to assess the pathogenicity of the CNVs. We identified non-benign CNVs in 29% of patients. Amongst the pathogenic variants (11%), detected with a yield consistent with the literature, we found rare genomic disorders and CNVs spanning known disease genes. We further identified and discussed 51 cases with likely pathogenic CNVs spanning novel candidate genes, including genes encoding synaptic components and/or proteins involved in corticogenesis. Additionally, we identified two deletions spanning potential Topological Associated Domain (TAD) boundaries probably affecting the regulatory landscape. We show how phenotypic and genetic analyses of array-CGH data allow unraveling complex cases, identifying rare disease genes, and revealing unexpected position effects. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Long, Qi; Xu, Jianpeng; Osunkoya, Adeboye O; Sannigrahi, Soma; Johnson, Brent A; Zhou, Wei; Gillespie, Theresa; Park, Jong Y; Nam, Robert K; Sugar, Linda; Stanimirovic, Aleksandra; Seth, Arun K; Petros, John A; Moreno, Carlos S
2014-06-15
Prostate cancer remains the second leading cause of cancer death in American men and there is an unmet need for biomarkers to identify patients with aggressive disease. In an effort to identify biomarkers of recurrence, we performed global RNA sequencing on 106 formalin-fixed, paraffin-embedded prostatectomy samples from 100 patients at three independent sites, defining a 24-gene signature panel. The 24 genes in this panel function in cell-cycle progression, angiogenesis, hypoxia, apoptosis, PI3K signaling, steroid metabolism, translation, chromatin modification, and transcription. Sixteen genes have been associated with cancer, with five specifically associated with prostate cancer (BTG2, IGFBP3, SIRT1, MXI1, and FDPS). Validation was performed on an independent publicly available dataset of 140 patients, where the new signature panel outperformed markers published previously in terms of predicting biochemical recurrence. Our work also identified differences in gene expression between Gleason pattern 4 + 3 and 3 + 4 tumors, including several genes involved in the epithelial-to-mesenchymal transition and developmental pathways. Overall, this study defines a novel biomarker panel that has the potential to improve the clinical management of prostate cancer. ©2014 American Association for Cancer Research.
Toro, León; Pinilla, Laura; Avignone-Rossa, Claudio; Ríos-Estepa, Rigoberto
2018-05-01
In this work, we expanded and updated a genome-scale metabolic model of Streptomyces clavuligerus. The model includes 1021 genes and 1494 biochemical reactions; genome-reaction information was curated and new features related to clavam metabolism and to the biomass synthesis equation were incorporated. The model was validated using experimental data from the literature and simulations were performed to predict cellular growth and clavulanic acid biosynthesis. Flux balance analysis (FBA) showed that limiting concentrations of phosphate and an excess of ammonia accumulation are unfavorable for growth and clavulanic acid biosynthesis. The evaluation of different objective functions for FBA showed that maximization of ATP yields the best predictions for cellular behavior in continuous cultures, while the maximization of growth rate provides better predictions for batch cultures. Through gene essentiality analysis, 130 essential genes were found using a limited in silico media, while 100 essential genes were identified in amino acid-supplemented media. Finally, a strain design was carried out to identify candidate genes to be overexpressed or knocked out so as to maximize antibiotic biosynthesis. Interestingly, potential metabolic engineering targets, identified in this study, have not been tested experimentally.
Genomic structure and expression of STM2, the chromosome 1 familial Alzheimer disease gene.
Levy-Lahad, E; Poorkaj, P; Wang, K; Fu, Y H; Oshima, J; Mulligan, J; Schellenberg, G D
1996-06-01
Mutations in the gene STM2 result in autosomal dominant familial Alzheimer disease. To screen for mutations and to identify regulatory elements for this gene, the genomic DNA sequence and intron-exon structure were determined. Twelve exons including 10 coding exons were identified in a genomic region spanning 23,737 bp. The first 2 exons encode the 5'-untranslated region. Expression analysis of STM2 indicates that two transcripts of 2.4 and 2.8 kb are found in skeletal muscle, pancreas, and heart. In addition, a splice variant of the 2.4-kb transcript was identified that is the result of the use of an alternative splice acceptor site located in exon 10. The use of this site results in a transcript lacking a single glutamate. The promotor for this gene and the alternatively spliced exons leading to the 2.8-kb form of the gene remain to be identified. Expression of STM2 was high in skeletal muscle and pancreas, with comparatively low levels observed in brain. This expression pattern is intriguing since in Alzheimer disease, pathology and degeneration are observed only in the central nervous system.