Science.gov

Sample records for gene fancc identified

  1. Genetic inactivation of the Fanconi anemia gene FANCC identified in the hepatocellular carcinoma cell line HuH-7 confers sensitivity towards DNA-interstrand crosslinking agents

    PubMed Central

    2010-01-01

    Background Inactivation of the Fanconi anemia (FA) pathway through defects in one of 13 FA genes occurs at low frequency in various solid cancer entities among the general population. As FA pathway inactivation confers a distinct hypersensitivity towards DNA interstrand-crosslinking (ICL)-agents, FA defects represent rational targets for individualized therapeutic strategies. Except for pancreatic cancer, however, the prevalence of FA defects in gastrointestinal (GI) tumors has not yet been systematically explored. Results A panel of GI cancer cell lines was screened for FA pathway inactivation applying FANCD2 monoubiquitination and FANCD2/RAD51 nuclear focus formation and a newly identified FA pathway-deficient cell line was functionally characterized. The hepatocellular carcinoma (HCC) line HuH-7 was defective in FANCD2 monoubiquitination and FANCD2 nuclear focus formation but proficient in RAD51 focus formation. Gene complementation studies revealed that this proximal FA pathway inactivation was attributable to defective FANCC function in HuH-7 cells. Accordingly, a homozygous inactivating FANCC nonsense mutation (c.553C > T, p.R185X) was identified in HuH-7, resulting in partial transcriptional skipping of exon 6 and leading to the classic cellular FA hypersensitivity phenotype; HuH-7 cells exhibited a strongly reduced proliferation rate and a pronounced G2 cell cycle arrest at distinctly lower concentrations of ICL-agents than a panel of non-isogenic, FA pathway-proficient HCC cell lines. Upon retroviral transduction of HuH-7 cells with FANCC cDNA, FA pathway functions were restored and ICL-hypersensitivity abrogated. Analyses of 18 surgical HCC specimens yielded no further examples for genetic or epigenetic inactivation of FANCC, FANCF, or FANCG in HCC, suggesting a low prevalence of proximal FA pathway inactivation in this tumor type. Conclusions As the majority of HCC are chemoresistant, assessment of FA pathway function in HCC could identify small

  2. Alterations in candidate genes PHF2, FANCC, PTCH1 and XPA at chromosomal 9q22.3 region: Pathological significance in early- and late-onset breast carcinoma

    PubMed Central

    Sinha, Satyabrata; Singh, Ratnesh K; Alam, Neyaz; Roy, Anup; Roychoudhury, Susanta; Panda, Chinmay Kumar

    2008-01-01

    Introduction Younger women with breast carcinoma (BC) exhibits more aggressive pathologic features compared to older women; young age could be an independent predictor of adverse prognosis. To find any existing differences in the molecular pathogenesis of BC in both younger and older women, alterations at chromosomal (chr.) 9q22.32-22.33 region were studied owing to its association in wide variety of tumors. Present work focuses on comparative analysis of alterations of four candidate genes; PHF2, FANCC, PTCH1 and XPA located within 4.4 Mb region of the afore-said locus in two age groups of BC, as well as the interrelation and prognostic significance of alterations of these genes. Methods Deletion analysis of PHF2, FANCC, PTCH1 and XPA were examined in a subset of 47 early-onset (group-A: ≤ 40 years) and 59 late-onset (group-B: > 40 years) breast carcinomas using both microsatellite and exonic markers. Methylation Sensitive Restriction analysis (MSRA) was done to check for promoter methylation. Quantitative real-time polymerase chain reaction (Q-PCR) and immunohistochemisty (IHC) was done in some genes to see their relative mRNA and protein expressions respectively. Clinico-pathological correlation of different parameters as well as patient survival was calculated using different statistical softwares like EpiInfo 6.04b, SPSS 10.0 etc. Results Either age group exhibited high frequency of overall alterations in PHF2, FANCC and PTCH1 compared to XPA. Samples with alteration (deletion/methylation) in these genes showed reduced level of mRNA expression as seen by Q-PCR. Immunohistochemical analysis of FANCC and PTCH1 also supported this observation. Poor patient survival was noted in both age groups having alterations in FANCC. Similar result was also seen with PTCH1 and XPA alterations in group-A and PHF2 alterations in group-B. This reflected their roles as prognostic tools in the respective groups in which they were altered. Conclusion Overall alterations of PHF2

  3. FANCC, FANCE, and FANCD2 form a ternary complex essential to the integrity of the Fanconi anemia DNA damage response pathway.

    PubMed

    Gordon, Susan M; Alon, Noa; Buchwald, Manuel

    2005-10-28

    Fanconi anemia (FA) is a genetically heterogeneous disorder characterized by bone marrow failure, cancer predisposition, and increased cellular sensitivity to DNA-cross-linking agents. The products of seven of the nine identified FA genes participate in a protein complex required for monoubiquitination of the FANCD2 protein. Direct interaction of the FANCE protein with both fellow FA complex component FANCC and the downstream FANCD2 protein has been observed in the yeast two-hybrid system. Here, we demonstrate the ability of FANCE to mediate the interaction between FANCC and FANCD2 in the yeast three-hybrid system and confirm the FANCE-mediated association of FANCC with FANCD2 in human cells. A yeast two-hybrid system-based screen was devised to identify randomly mutagenized FANCE proteins capable of interaction with FANCC but not with FANCD2. Exogenous expression of these mutants in an FA-E cell line and subsequent evaluation of FANCD2 monoubiquitination and DNA cross-linker sensitivity indicated a critical role for the FANCE/FANCD2 interaction in maintaining FA pathway integrity. Three-hybrid experiments also demonstrated the ability of FANCE to mediate the interaction between FA core complex components FANCC and FANCF, indicating an additional role for FANCE in complex assembly. Thus, FANCE is shown to be a key mediator of protein interactions both in the architecture of the FA protein complex and in the connection of complex components to the putative downstream targets of complex activity.

  4. The risk for developing cancer in Israeli ATM, BLM, and FANCC heterozygous mutation carriers.

    PubMed

    Laitman, Yael; Boker-Keinan, Lital; Berkenstadt, Michal; Liphsitz, Irena; Weissglas-Volkov, Daphna; Ries-Levavi, Liat; Sarouk, Ifat; Pras, Elon; Friedman, Eitan

    2016-03-01

    Cancer risks in heterozygous mutation carriers of the ATM, BLM, and FANCC genes are controversial. To shed light on this issue, cancer rates were evaluated by cross referencing asymptomatic Israeli heterozygous mutation carriers in the ATM, BLM, and FANCC genes with cancer diagnoses registered at the Israeli National Cancer Registry (INCR). Comparison of observed to expected Standardized Incidence Rates (SIR) was performed. Overall, 474 individuals participated in the study: 378 females; 25 Arab and 31 Jewish ATM carriers, 152 BLM carriers, and 170 FANCC carriers (all Ashkenazim). Age range at genotyping was 19-53 years (mean + SD 30.6 + 5 years). In addition, 96 males were included; 5, 34, and 57 ATM, BLM, and FANCC mutation carriers, respectively. Over 5-16 years from genotyping (4721 person/years), 15 new cancers were diagnosed in mutation carriers: 5 breast, 4 cervical, 3 melanomas, and one each bone sarcoma, pancreatic, and colorectal cancer. No single cancer diagnosis was more prevalent then expected in all groups combined or per gene analyzed. Specifically breast cancer SIR was 0.02-0.77. We conclude that Israeli ATM, BLM, and FANCC heterozygous mutation carriers are not at an increased risk for developing cancer.

  5. Concomitant Inactivation of Foxo3a and Fancc or Fancd2 Reveals a Two-Tier Protection from Oxidative Stress-Induced Hydrocephalus

    PubMed Central

    Li, Xiaoli; Li, Liang; Li, Jie; Sipple, Jared; Schick, Jonathan; Mehta, Parinda A.; Davies, Stella M.; Dasgupta, Biplab; Waclaw, Ronald R.

    2014-01-01

    Abstract Aims: This study seeks at investigating the cause of hydrocephalus, and at identifying therapeutic targets for the prevention of hydrocephalus. Results: In this study, we show that inactivation of the Foxo3a gene in two mouse models of Fanconi anemia (FA) leads to the development of hydrocephalus in late embryonic stage and after birth. More than 50% of Foxo3a−/− Fancc−/− or Foxo3a−/− Fancd2−/− mice die during embryonic development or within 6 months of life as a result of hydrocephalus characterized by cranial distortion, dilation of the ventricular system, reduced thickness of the cerebral cortex, and disorganization of the ependymal cilia and subcommissural organ. Combined deficiency of Foxo3a and Fancc or Fancd2 not only impairs the self-renewal capacity but also markedly increases the apoptosis of neural stem and progenitor cells (NSPCs), leading to defective neurogenesis. Increased accumulation of reactive oxygen species (ROS) and subsequently de-regulated mitosis and ultimately apoptosis in the neural stem or progenitor cells is identified as one of the potential mechanisms of congenital obstructive hydrocephalus. Innovation: The work unravels a two-tier protective mechanism for preventing oxidative stress-induced hydrocephalus. Conclusion: The deletion of Foxo3a in FA mice increased the accumulation of ROS and subsequently de-regulated mitosis and ultimately apoptosis in the NSPCs, leading to hydrocephalus development. Antioxid. Redox Signal. 21, 1675–1692. PMID:24483844

  6. Loss of FANCC function is associated with failure to inhibit late firing replication origins after DNA cross-linking

    SciTech Connect

    Phelps, Randall A.; Gingras, Helene; Hockenbery, David M. . E-mail: dhockenb@fhcrc.org

    2007-07-01

    Fanconi anemia (FA) cells are abnormally sensitive to DNA cross-linking agents with increased levels of apoptosis and chromosomal instability. Defects in eight FA complementation groups inhibit monoubiquitination of FANCD2, and subsequent recruitment of FANCD2 to DNA damage and S-phase-associated nuclear foci. The specific functional defect in repair or response to DNA damage in FA cells remains unknown. Damage-resistant DNA synthesis is present 2.5-5 h after cross-linker treatment of FANCC, FANCA and FANCD2-deficient cells. Analysis of the size distribution of labeled DNA replication strands revealed that diepoxybutane treatment suppressed labeling of early but not late-firing replicons in FANCC-deficient cells. In contrast, normal responses to ionizing radiation were observed in FANCC-deficient cells. Absence of this late S-phase response in FANCC-deficient cells leads to activation of secondary checkpoint responses.

  7. Strategies to identify disease genes.

    PubMed

    Ashton, Gabrielle H S; McGrath, John A; South, Andrew P

    2002-04-01

    The correlation between genes and disease began in earnest in the early 1900s with the identification of Mendelian-like inheritance of "inborn errors of metabolism." Since then, the ever-broadening field of genetics has been established as one of the most important and groundbreaking branches of science and medicine to date. With the announcement of a "working draft" sequence of the human genome in 2001, the vast array of both genomic and expressed sequence information available in the public databases alone has meant that the concept of hunting for genes is evolving. Nowadays, researchers can substitute many labor-intensive hours in the lab for less time searching on the World Wide Web. Specialization within genetics has been continuously providing subsets of the genre such as genomics, pharmacogenetics, chemogenomics, gene therapy, proteomics and functional genomics, all of which are based on the fundamental starting block, the gene. This review aims to summarize both traditional and current strategies for identifying susceptibility and monogenetic disease genes and describes how these strategies have evolved in tune with the ever-expanding wealth of information now available at our fingertips.

  8. Identifying essential genes in Arabidopsis thaliana.

    PubMed

    Meinke, David; Muralla, Rosanna; Sweeney, Colleen; Dickerman, Allan

    2008-09-01

    Eight years after publication of the Arabidopsis genome sequence and two years before completing the first phase of an international effort to characterize the function of every Arabidopsis gene, plant biologists remain unable to provide a definitive answer to the following basic question: what is the minimal gene set required for normal growth and development? The purpose of this review is to summarize different strategies employed to identify essential genes in Arabidopsis, an important component of the minimal gene set in plants, to present an overview of the datasets and specific genes identified to date, and to discuss the prospects for future saturation of this important class of genes. The long-term goal of this collaborative effort is to facilitate basic research in plant biology and complement ongoing research with other model organisms.

  9. Stratified gene expression analysis identifies major amyotrophic lateral sclerosis genes.

    PubMed

    Jones, Ashley R; Troakes, Claire; King, Andrew; Sahni, Vibhu; De Jong, Simone; Bossers, Koen; Papouli, Efterpi; Mirza, Muddassar; Al-Sarraj, Safa; Shaw, Christopher E; Shaw, Pamela J; Kirby, Janine; Veldink, Jan H; Macklis, Jeffrey D; Powell, John F; Al-Chalabi, Ammar

    2015-05-01

    Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease of motor neurons resulting in progressive paralysis. Gene expression studies of ALS only rarely identify the same gene pathways as gene association studies. We hypothesized that analyzing tissues by matching on degree of disease severity would identify different patterns of gene expression from a traditional case-control comparison. We analyzed gene expression changes in 4 postmortem central nervous system regions, stratified by severity of motor neuron loss. An overall comparison of cases (n = 6) and controls (n = 3) identified known ALS gene, SOX5, as showing differential expression (log2 fold change = 0.09, p = 5.5 × 10(-5)). Analyses stratified by disease severity identified expression changes in C9orf72 (p = 2.77 × 10(-3)), MATR3 (p = 3.46 × 10(-3)), and VEGFA (p = 8.21 × 10(-4)), all implicated in ALS through genetic studies, and changes in other genes in pathways involving RNA processing and immune response. These findings suggest that analysis of gene expression stratified by disease severity can identify major ALS genes and may be more efficient than traditional case-control comparison.

  10. Phenoscape: Identifying Candidate Genes for Evolutionary Phenotypes

    PubMed Central

    Edmunds, Richard C.; Su, Baofeng; Balhoff, James P.; Eames, B. Frank; Dahdul, Wasila M.; Lapp, Hilmar; Lundberg, John G.; Vision, Todd J.; Dunham, Rex A.; Mabee, Paula M.; Westerfield, Monte

    2016-01-01

    Phenotypes resulting from mutations in genetic model organisms can help reveal candidate genes for evolutionarily important phenotypic changes in related taxa. Although testing candidate gene hypotheses experimentally in nonmodel organisms is typically difficult, ontology-driven information systems can help generate testable hypotheses about developmental processes in experimentally tractable organisms. Here, we tested candidate gene hypotheses suggested by expert use of the Phenoscape Knowledgebase, specifically looking for genes that are candidates responsible for evolutionarily interesting phenotypes in the ostariophysan fishes that bear resemblance to mutant phenotypes in zebrafish. For this, we searched ZFIN for genetic perturbations that result in either loss of basihyal element or loss of scales phenotypes, because these are the ancestral phenotypes observed in catfishes (Siluriformes). We tested the identified candidate genes by examining their endogenous expression patterns in the channel catfish, Ictalurus punctatus. The experimental results were consistent with the hypotheses that these features evolved through disruption in developmental pathways at, or upstream of, brpf1 and eda/edar for the ancestral losses of basihyal element and scales, respectively. These results demonstrate that ontological annotations of the phenotypic effects of genetic alterations in model organisms, when aggregated within a knowledgebase, can be used effectively to generate testable, and useful, hypotheses about evolutionary changes in morphology. PMID:26500251

  11. Comprehensive Analysis of Pathogenic Deletion Variants in Fanconi Anemia Genes

    PubMed Central

    Flynn, Elizabeth K.; Kamat, Aparna; Lach, Francis P.; Donovan, Frank X.; Kimble, Danielle C.; Narisu, Narisu; Sanborn, Erica; Boulad, Farid; Davies, Stella M.; Gillio, Alfred P.; Harris, Richard E.; MacMillan, Margaret L.; Wagner, John E.; Smogorzewska, Agata; Auerbach, Arleen D.; Ostrander, Elaine A.; Chandrasekharappa, Settara C.

    2014-01-01

    Fanconi anemia (FA) is a rare recessive disease resulting from mutations in one of at least 16 different genes. Mutation types and phenotypic manifestations of FA are highly heterogeneous and influence the clinical management of the disease. We analyzed 202 FA families for large deletions, using high-resolution Comparative Genome Hybridization arrays (arrayCGH), Single Nucleotide Polymorphism arrays (SNParrays) and DNA sequencing. We found pathogenic deletions in 88 FANCA, seven FANCC, two FANCD2, and one FANCB families. We find 35% of FA families carry large deletions, accounting for 18% of all FA pathogenic variants. Cloning and sequencing across the deletion breakpoints revealed that 52 FANCA deletion ends, and one FANCC deletion end extended beyond the gene boundaries, potentially affecting neighboring genes with phenotypic consequences. Seventy-five percent of the FANCA deletions are Alu-Alu mediated, predominantly by AluY elements, and appear to be caused by Non-Allelic Homologous Recombination. Individual Alu hotspots were identified. Defining the haplotypes of four FANCA deletions shared by multiple families revealed that three share a common ancestry. Knowing the exact molecular changes that lead to the disease may be critical for a better understanding of the FA phenotype, and to gain insight into the mechanisms driving these pathogenic deletion variants. PMID:25168418

  12. Identifying genes of gene regulatory networks using formal concept analysis.

    PubMed

    Gebert, Jutta; Motameny, Susanne; Faigle, Ulrich; Forst, Christian V; Schrader, Rainer

    2008-03-01

    In order to understand the behavior of a gene regulatory network, it is essential to know the genes that belong to it. Identifying the correct members (e.g., in order to build a model) is a difficult task even for small subnetworks. Usually only few members of a network are known and one needs to guess the missing members based on experience or informed speculation. It is beneficial if one can additionally rely on experimental data to support this guess. In this work we present a new method based on formal concept analysis to detect unknown members of a gene regulatory network from gene expression time series data. We show that formal concept analysis is able to find a list of candidate genes for inclusion into a partially known basic network. This list can then be reduced by a statistical analysis so that the resulting genes interact strongly with the basic network and therefore should be included when modeling the network. The method has been applied to the DNA repair system of Mycobacterium tuberculosis. In this application, our method produces comparable results to an already existing method of component selection while it is applicable to a broader range of problems.

  13. Virus induced gene silencing of Arabidopsis gene homologues in wheat identify genes conferring improved drought tolerance

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In a non-model staple crop like wheat, functional validation of potential drought stress responsive genes identified in Arabidopsis could provide gene targets for wheat breeding. Virus induced gene silencing (VIGS) of genes of interest can overcome the inherent problems of polyploidy and limited tra...

  14. NIH Researchers Identify OCD Risk Gene

    MedlinePlus

    ... gene (SERT), site of action for the selective serotonin reuptake inhibitors (SSRIs) that are today's mainstay medications for OCD, other anxiety disorders and depression. "Improved knowledge of SERT's role in OCD raises ...

  15. Cancer genomics identifies disrupted epigenetic genes.

    PubMed

    Simó-Riudalbas, Laia; Esteller, Manel

    2014-06-01

    Latest advances in genome technologies have greatly advanced the discovery of epigenetic genes altered in cancer. The initial single candidate gene approaches have been coupled with newly developed epigenomic platforms to hasten the convergence of scientific discoveries and translational applications. Here, we present an overview of the evolution of cancer epigenomics and an updated catalog of disruptions in epigenetic pathways, whose misregulation can culminate in cancer. The creation of these basic mutational catalogs in cell lines and primary tumors will provide us with enough knowledge to move diagnostics and therapy from the laboratory bench to the bedside.

  16. Identifying driver genes in cancer by triangulating gene expression, gene location, and survival data.

    PubMed

    Rouam, Sigrid; Miller, Lance D; Karuturi, R Krishna Murthy

    2014-01-01

    Driver genes are directly responsible for oncogenesis and identifying them is essential in order to fully understand the mechanisms of cancer. However, it is difficult to delineate them from the larger pool of genes that are deregulated in cancer (ie, passenger genes). In order to address this problem, we developed an approach called TRIAngulating Gene Expression (TRIAGE through clinico-genomic intersects). Here, we present a refinement of this approach incorporating a new scoring methodology to identify putative driver genes that are deregulated in cancer. TRIAGE triangulates - or integrates - three levels of information: gene expression, gene location, and patient survival. First, TRIAGE identifies regions of deregulated expression (ie, expression footprints) by deriving a newly established measure called the Local Singular Value Decomposition (LSVD) score for each locus. Driver genes are then distinguished from passenger genes using dual survival analyses. Incorporating measurements of gene expression and weighting them according to the LSVD weight of each tumor, these analyses are performed using the genes located in significant expression footprints. Here, we first use simulated data to characterize the newly established LSVD score. We then present the results of our application of this refined version of TRIAGE to gene expression data from five cancer types. This refined version of TRIAGE not only allowed us to identify known prominent driver genes, such as MMP1, IL8, and COL1A2, but it also led us to identify several novel ones. These results illustrate that TRIAGE complements existing tools, allows for the identification of genes that drive cancer and could perhaps elucidate potential future targets of novel anticancer therapeutics.

  17. Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors.

    PubMed

    Bii, Victor M; Trobridge, Grant D

    2016-10-25

    Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types.

  18. Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors

    PubMed Central

    Bii, Victor M.; Trobridge, Grant D.

    2016-01-01

    Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types. PMID:27792127

  19. A predictive approach to identify genes differentially expressed

    NASA Astrophysics Data System (ADS)

    Saraiva, Erlandson F.; Louzada, Francisco; Milan, Luís A.; Meira, Silvana; Cobre, Juliana

    2012-10-01

    The main objective of gene expression data analysis is to identify genes that present significant changes in expression levels between a treatment and a control biological condition. In this paper, we propose a Bayesian approach to identify genes differentially expressed calculating credibility intervals from predictive densities which are constructed using sampled mean treatment effect from all genes in study excluding the treatment effect of genes previously identified with statistical evidence for difference. We compare our Bayesian approach with the standard ones based on the use of the t-test and modified t-tests via a simulation study, using small sample sizes which are common in gene expression data analysis. Results obtained indicate that the proposed approach performs better than standard ones, especially for cases with mean differences and increases in treatment variance in relation to control variance. We also apply the methodologies to a publicly available data set on Escherichia coli bacteria.

  20. Identifying gene-environment and gene-gene interactions using a progressive penalization approach.

    PubMed

    Zhu, Ruoqing; Zhao, Hongyu; Ma, Shuangge

    2014-05-01

    In genomic studies, identifying important gene-environment and gene-gene interactions is a challenging problem. In this study, we adopt the statistical modeling approach, where interactions are represented by product terms in regression models. For the identification of important interactions, we adopt penalization, which has been used in many genomic studies. Straightforward application of penalization does not respect the "main effect, interaction" hierarchical structure. A few recently proposed methods respect this structure by applying constrained penalization. However, they demand very complicated computational algorithms and can only accommodate a small number of genomic measurements. We propose a computationally fast penalization method that can identify important gene-environment and gene-gene interactions and respect a strong hierarchical structure. The method takes a stagewise approach and progressively expands its optimization domain to account for possible hierarchical interactions. It is applicable to multiple data types and models. A coordinate descent method is utilized to produce the entire regularized solution path. Simulation study demonstrates the superior performance of the proposed method. We analyze a lung cancer prognosis study with gene expression measurements and identify important gene-environment interactions.

  1. Identifying potential cancer driver genes by genomic data integration

    NASA Astrophysics Data System (ADS)

    Chen, Yong; Hao, Jingjing; Jiang, Wei; He, Tong; Zhang, Xuegong; Jiang, Tao; Jiang, Rui

    2013-12-01

    Cancer is a genomic disease associated with a plethora of gene mutations resulting in a loss of control over vital cellular functions. Among these mutated genes, driver genes are defined as being causally linked to oncogenesis, while passenger genes are thought to be irrelevant for cancer development. With increasing numbers of large-scale genomic datasets available, integrating these genomic data to identify driver genes from aberration regions of cancer genomes becomes an important goal of cancer genome analysis and investigations into mechanisms responsible for cancer development. A computational method, MAXDRIVER, is proposed here to identify potential driver genes on the basis of copy number aberration (CNA) regions of cancer genomes, by integrating publicly available human genomic data. MAXDRIVER employs several optimization strategies to construct a heterogeneous network, by means of combining a fused gene functional similarity network, gene-disease associations and a disease phenotypic similarity network. MAXDRIVER was validated to effectively recall known associations among genes and cancers. Previously identified as well as novel driver genes were detected by scanning CNAs of breast cancer, melanoma and liver carcinoma. Three predicted driver genes (CDKN2A, AKT1, RNF139) were found common in these three cancers by comparative analysis.

  2. Identifying gene networks underlying the neurobiology of ethanol and alcoholism.

    PubMed

    Wolen, Aaron R; Miles, Michael F

    2012-01-01

    For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.

  3. Rice transcriptome analysis to identify possible herbicide quinclorac detoxification genes

    PubMed Central

    Xu, Wenying; Di, Chao; Zhou, Shaoxia; Liu, Jia; Li, Li; Liu, Fengxia; Yang, Xinling; Ling, Yun; Su, Zhen

    2015-01-01

    Quinclorac is a highly selective auxin-type herbicide and is widely used in the effective control of barnyard grass in paddy rice fields, improving the world's rice yield. The herbicide mode of action of quinclorac has been proposed, and hormone interactions affecting quinclorac signaling has been identified. Because of widespread use, quinclorac may be transported outside rice fields with the drainage waters, leading to soil and water pollution and other environmental health problems. In this study, we used 57K Affymetrix rice whole-genome array to identify quinclorac signaling response genes to study the molecular mechanisms of action and detoxification of quinclorac in rice plants. Overall, 637 probe sets were identified with differential expression levels under either 6 or 24 h of quinclorac treatment. Auxin-related genes such as GH3 and OsIAAs responded to quinclorac treatment. Gene Ontology analysis showed that genes of detoxification-related family genes were significantly enriched, including cytochrome P450, GST, UGT, and ABC and drug transporter genes. Moreover, real-time RT-PCR analysis showed that top candidate genes of P450 families such as CYP81, CYP709C, and CYP72A were universally induced by different herbicides. Some Arabidopsis genes of the same P450 family were up-regulated under quinclorac treatment. We conducted rice whole-genome GeneChip analysis and the first global identification of quinclorac response genes. This work may provide potential markers for detoxification of quinclorac and biomonitors of environmental chemical pollution. PMID:26483837

  4. GENE EXPRESSION PROFILING TO IDENTIFY BIOMARKERS OF REPRODUCTIVE TOXICITY

    EPA Science Inventory

    SOT 2005 SESSION ABSTRACT

    GENE EXPRESSION PROFILING TO IDENTIFY BIOMARKERS OF REPRODUCTIVE TOXICITY

    David J. Dix. National Health and Environmental Effects Research Laboratory, Office of Research and Development, US Environmental Protection Agency, Research Triangle...

  5. Identifying a gene expression signature of cluster headache in blood

    PubMed Central

    Eising, Else; Pelzer, Nadine; Vijfhuizen, Lisanne S.; Vries, Boukje de; Ferrari, Michel D.; ‘t Hoen, Peter A. C.; Terwindt, Gisela M.; van den Maagdenberg, Arn M. J. M.

    2017-01-01

    Cluster headache is a relatively rare headache disorder, typically characterized by multiple daily, short-lasting attacks of excruciating, unilateral (peri-)orbital or temporal pain associated with autonomic symptoms and restlessness. To better understand the pathophysiology of cluster headache, we used RNA sequencing to identify differentially expressed genes and pathways in whole blood of patients with episodic (n = 19) or chronic (n = 20) cluster headache in comparison with headache-free controls (n = 20). Gene expression data were analysed by gene and by module of co-expressed genes with particular attention to previously implicated disease pathways including hypocretin dysregulation. Only moderate gene expression differences were identified and no associations were found with previously reported pathogenic mechanisms. At the level of functional gene sets, associations were observed for genes involved in several brain-related mechanisms such as GABA receptor function and voltage-gated channels. In addition, genes and modules of co-expressed genes showed a role for intracellular signalling cascades, mitochondria and inflammation. Although larger study samples may be required to identify the full range of involved pathways, these results indicate a role for mitochondria, intracellular signalling and inflammation in cluster headache. PMID:28074859

  6. A Gene Recommender Algorithm to Identify Coexpressed Genes in C. elegans

    PubMed Central

    Owen, Art B.; Stuart, Josh; Mach, Kathy; Villeneuve, Anne M.; Kim, Stuart

    2003-01-01

    One of the most important uses of whole-genome expression data is for the discovery of new genes with similar function to a given list of genes (the query) already known to have closely related function. We have developed an algorithm, called the gene recommender, that ranks genes according to how strongly they correlate with a set of query genes in those experiments for which the query genes are most strongly coregulated. We used the gene recommender to find other genes coexpressed with several sets of query genes, including genes known to function in the retinoblastoma complex. Genetic experiments confirmed that one gene (JC8.6) identified by the gene recommender acts with lin-35 Rb to regulate vulval cell fates, and that another gene (wrm-1) acts antagonistically. We find that the gene recommender returns lists of genes with better precision, for fixed levels of recall, than lists generated using the C. elegans expression topomap. PMID:12902378

  7. ENU mutagenesis in mice identifies candidate genes for hypogonadism.

    PubMed

    Weiss, Jeffrey; Hurley, Lisa A; Harris, Rebecca M; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A; Moran, Jennifer L; Beier, David R; Mason, Christopher; Jameson, J Larry

    2012-06-01

    Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low-density genome-wide SNP arrays. Ten of the 15 lines were pursued further using higher-resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism, candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility.

  8. Analysis of gene expression profile identifies potential biomarkers for atherosclerosis

    PubMed Central

    Liu, Luran; Liu, Yan; Liu, Chang; Zhang, Zhuobo; Du, Yaojun; Zhao, Hao

    2016-01-01

    The present study aimed to identify potential biomarkers for atherosclerosis via analysis of gene expression profiles. The microarray dataset no. GSE20129 was downloaded from the Gene Expression Omnibus database. A total of 118 samples from the peripheral blood of female patients was used, including 47 atherosclerotic and 71 non-atherosclerotic patients. The differentially expressed genes (DEGs) in the atherosclerosis samples were identified using the Limma package. Gene ontology term and Kyoto Encyclopedia of Genes and Genomes pathway analyses for DEGs were performed using the Database for Annotation, Visualization and Integrated Discovery tool. The recursive feature elimination (RFE) algorithm was applied for feature selection via iterative classification, and support vector machine classifier was used for the validation of prediction accuracy. A total of 430 DEGs in the atherosclerosis samples were identified, including 149 up- and 281 downregulated genes. Subsequently, the RFE algorithm was used to identify 11 biomarkers, whose receiver operating characteristic curves had an area under curve of 0.92, indicating that the identified 11 biomarkers were representative. The present study indicated that APH1B, JAM3, FBLN2, CSAD and PSTPIP2 may have important roles in the progression of atherosclerosis in females and may be potential biomarkers for early diagnosis and prognosis as well as treatment targets for this disease. PMID:27573188

  9. Functional epigenetic approach identifies frequently methylated genes in Ewing sarcoma.

    PubMed

    Alholle, Abdullah; Brini, Anna T; Gharanei, Seley; Vaiyapuri, Sumathi; Arrigoni, Elena; Dallol, Ashraf; Gentle, Dean; Kishida, Takeshi; Hiruma, Toru; Avigad, Smadar; Grimer, Robert; Maher, Eamonn R; Latif, Farida

    2013-11-01

    Using a candidate gene approach we recently identified frequent methylation of the RASSF2 gene associated with poor overall survival in Ewing sarcoma (ES). To identify effective biomarkers in ES on a genome-wide scale, we used a functionally proven epigenetic approach, in which gene expression was induced in ES cell lines by treatment with a demethylating agent followed by hybridization onto high density gene expression microarrays. After following a strict selection criterion, 34 genes were selected for expression and methylation analysis in ES cell lines and primary ES. Eight genes (CTHRC1, DNAJA4, ECHDC2, NEFH, NPTX2, PHF11, RARRES2, TSGA14) showed methylation frequencies of>20% in ES tumors (range 24-71%), these genes were expressed in human bone marrow derived mesenchymal stem cells (hBMSC) and hypermethylation was associated with transcriptional silencing. Methylation of NPTX2 or PHF11 was associated with poorer prognosis in ES. In addition, six of the above genes also showed methylation frequency of>20% (range 36-50%) in osteosarcomas. Identification of these genes may provide insights into bone cancer tumorigenesis and development of epigenetic biomarkers for prognosis and detection of these rare tumor types.

  10. Identifying genes related with rheumatoid arthritis via system biology analysis.

    PubMed

    Liu, Tao; Lin, Xinmei; Yu, Hongjian

    2015-10-15

    Rheumatoid arthritis (RA) is a chronic, inflammatory joint disease that mainly attacks synovial joints. However, the underlying systematic relationship among different genes and biological processes involved in the pathogenesis are still unclear. By analyzing and comparing the transcriptional profiles from RA, OA (osteoarthritis) patients as well as ND (normal donors) with bioinformatics methods, we tend to uncover the potential molecular networks and critical genes which play important roles in RA and OA development. Initially, hierarchical clustering was performed to classify the overall transcriptional profiles. Differentially expressed genes (DEGs) between ND and RA and OA patients were identified. Furthermore, PPI networks were constructed, functional modules were extracted, and functional annotation was also applied. Our functional analysis identifies 22 biological processes and 2 KEGG pathways enriched in the commonly-regulated gene set. However, we found that number of set of genes differentially expressed genes only between RA and ND reaches up to 244, indicating this gene set may specifically accounts for processing to disease of RA. Additionally, 142 biological processes and 19 KEGG pathways are over-represented by these 244 genes. Meanwhile, although another 21 genes were differentially expressed only in OA and ND, no biological process nor pathway is over-represented by them.

  11. Identifying gene regulatory network rewiring using latent differential graphical models

    PubMed Central

    Tian, Dechao; Gu, Quanquan; Ma, Jian

    2016-01-01

    Gene regulatory networks (GRNs) are highly dynamic among different tissue types. Identifying tissue-specific gene regulation is critically important to understand gene function in a particular cellular context. Graphical models have been used to estimate GRN from gene expression data to distinguish direct interactions from indirect associations. However, most existing methods estimate GRN for a specific cell/tissue type or in a tissue-naive way, or do not specifically focus on network rewiring between different tissues. Here, we describe a new method called Latent Differential Graphical Model (LDGM). The motivation of our method is to estimate the differential network between two tissue types directly without inferring the network for individual tissues, which has the advantage of utilizing much smaller sample size to achieve reliable differential network estimation. Our simulation results demonstrated that LDGM consistently outperforms other Gaussian graphical model based methods. We further evaluated LDGM by applying to the brain and blood gene expression data from the GTEx consortium. We also applied LDGM to identify network rewiring between cancer subtypes using the TCGA breast cancer samples. Our results suggest that LDGM is an effective method to infer differential network using high-throughput gene expression data to identify GRN dynamics among different cellular conditions. PMID:27378774

  12. The genetics of alcoholism: identifying specific genes through family studies.

    PubMed

    Edenberg, Howard J; Foroud, Tatiana

    2006-09-01

    Alcoholism is a complex disorder with both genetic and environmental risk factors. Studies in humans have begun to elucidate the genetic underpinnings of the risk for alcoholism. Here we briefly review strategies for identifying individual genes in which variations affect the risk for alcoholism and related phenotypes, in the context of one large study that has successfully identified such genes. The Collaborative Study on the Genetics of Alcoholism (COGA) is a family-based study that has collected detailed phenotypic data on individuals in families with multiple alcoholic members. A genome-wide linkage approach led to the identification of chromosomal regions containing genes that influenced alcoholism risk and related phenotypes. Subsequently, single nucleotide polymorphisms (SNPs) were genotyped in positional candidate genes located within the linked chromosomal regions, and analyzed for association with these phenotypes. Using this sequential approach, COGA has detected association with GABRA2, CHRM2 and ADH4; these associations have all been replicated by other researchers. COGA has detected association to additional genes including GABRG3, TAS2R16, SNCA, OPRK1 and PDYN, results that are awaiting confirmation. These successes demonstrate that genes contributing to the risk for alcoholism can be reliably identified using human subjects.

  13. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature.

    PubMed

    Ye, Ning; Yin, Hengfu; Liu, Jingjing; Dai, Xiaogang; Yin, Tongming

    2015-01-01

    The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.

  14. Identifying Mendelian disease genes with the Variant Effect Scoring Tool

    PubMed Central

    2013-01-01

    Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~ 45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. VEST is

  15. Using RNA interference to identify genes required for RNA interference

    PubMed Central

    Dudley, Nathaniel R.; Labbé, Jean-Claude; Goldstein, Bob

    2002-01-01

    RNA interference (RNAi) is a phenomenon in which double-stranded RNA (dsRNA) silences endogenous gene expression. By injecting pools of dsRNAs into Caenorhabditis elegans, we identified a dsRNA that acts as a potent suppressor of the RNAi mechanism. We have used coinjection of dsRNAs to identify four additional candidates for genes involved in the RNAi mechanism in C. elegans. Three of the genes are C. elegans mes genes, some of which encode homologs of the Drosophila chromatin-binding Polycomb-group proteins. We have used loss-of-function mutants to confirm a role for mes-3, -4, and -6 in RNAi. Interestingly, introducing very low levels of dsRNA can bypass a requirement for these genes in RNAi. The finding that genes predicted to encode proteins that associate with chromatin are involved in RNAi in C. elegans raises the possibility that chromatin may play a role in RNAi in animals, as it does in plants. PMID:11904378

  16. GENE EXPRESSION PROFILING TO IDENTIFY MECHANISMS OF MALE REPRODUCTIVE TOXICITY

    EPA Science Inventory

    Gene Expression Profiling to Identify Mechanisms of Male Reproductive Toxicity
    David J. Dix
    National Health and Environmental Effects Research Laboratory, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27711, USA.
    Ab...

  17. Candidate olfaction genes identified within the Helicoverpa armigera Antennal Transcriptome.

    PubMed

    Liu, Yang; Gu, Shaohua; Zhang, Yongjun; Guo, Yuyuan; Wang, Guirong

    2012-01-01

    Antennal olfaction is extremely important for insect survival, mediating key behaviors such as host preference, mate choice, and oviposition site selection. Multiple antennal proteins are involved in olfactory signal transduction pathways. Of these, odorant receptors (ORs) and ionotropic receptors (IRs) confer specificity on olfactory sensory neuron responses. In this study, we identified the olfactory gene repertoire of the economically important agricultural pest moth, Helicoverpa armigera, by assembling the adult male and female antennal transcriptomes. Within the male and female antennal transcriptomes we identified a total of 47 OR candidate genes containing 6 pheromone receptor candidates. Additionally, 12 IR genes as well as 26 odorant-binding proteins and 12 chemosensory proteins were annotated. Our results allow a systematic functional analysis across much of conventional ORs repertoire and newly reported IRs mediating the key olfaction-mediated behaviors of H. armigera.

  18. A genomics approach identifies senescence-specific gene expression regulation

    PubMed Central

    Lackner, Daniel H; Hayashi, Makoto T; Cesare, Anthony J; Karlseder, Jan

    2014-01-01

    Replicative senescence is a fundamental tumor-suppressive mechanism triggered by telomere erosion that results in a permanent cell cycle arrest. To understand the impact of telomere shortening on gene expression, we analyzed the transcriptome of diploid human fibroblasts as they progressed toward and entered into senescence. We distinguished novel transcription regulation due to replicative senescence by comparing senescence-specific expression profiles to profiles from cells arrested by DNA damage or serum starvation. Only a small specific subset of genes was identified that was truly senescence-regulated and changes in gene expression were exacerbated from presenescent to senescent cells. The majority of gene expression regulation in replicative senescence was shown to occur due to telomere shortening, as exogenous telomerase activity reverted most of these changes. PMID:24863242

  19. A genomics approach identifies senescence-specific gene expression regulation.

    PubMed

    Lackner, Daniel H; Hayashi, Makoto T; Cesare, Anthony J; Karlseder, Jan

    2014-10-01

    Replicative senescence is a fundamental tumor-suppressive mechanism triggered by telomere erosion that results in a permanent cell cycle arrest. To understand the impact of telomere shortening on gene expression, we analyzed the transcriptome of diploid human fibroblasts as they progressed toward and entered into senescence. We distinguished novel transcription regulation due to replicative senescence by comparing senescence-specific expression profiles to profiles from cells arrested by DNA damage or serum starvation. Only a small specific subset of genes was identified that was truly senescence-regulated and changes in gene expression were exacerbated from presenescent to senescent cells. The majority of gene expression regulation in replicative senescence was shown to occur due to telomere shortening, as exogenous telomerase activity reverted most of these changes.

  20. Variation in Arabidopsis flooding responses identifies numerous putative "tolerance genes".

    PubMed

    Vashisht, Divya; van Veen, Hans; Akman, Melis; Sasidharan, Rashmi

    2016-11-01

    Plant survival in flooded environments requires a combinatory response to multiple stress conditions such as limited light availability, reduced gas exchange and nutrient uptake. The ability to fine-tune the molecular response at the transcriptional and/or post-transcriptional level that can eventually lead to metabolic and anatomical adjustments are the underlying requirements to confer tolerance. Previously, we compared the transcriptomic adjustment of submergence tolerant, intolerant accessions and identified a core conserved and genotype-specific response to flooding stress, identifying numerous 'putative' tolerance genes. Here, we performed genome wide association analyses on 81 natural Arabidopsis accessions that identified 30 additional SNP markers associated with flooding tolerance. We argue that, given the many genes associated with flooding tolerance in Arabidopsis, improving resistance to submergence requires numerous genetic changes.

  1. Gene expression profiling in bladder cancer identifies potential therapeutic targets

    PubMed Central

    Hussain, Syed A.; Palmer, Daniel H.; Syn, Wing-Kin; Sacco, Joseph J.; Greensmith, Richard M.D.; Elmetwali, Taha; Aachi, Vijay; Lloyd, Bryony H.; Jithesh, Puthen V.; Arrand, John; Barton, Darren; Ansari, Jawaher; Sibson, D. Ross; James, Nicholas D.

    2017-01-01

    Despite advances in management, bladder cancer remains a major cause of cancer related complications. Characterisation of gene expression patterns in bladder cancer allows the identification of pathways involved in its pathogenesis, and may stimulate the development of novel therapies targeting these pathways. Between 2004 and 2005, cystoscopic bladder biopsies were obtained from 19 patients and 11 controls. These were subjected to whole transcript-based microarray analysis. Unsupervised hierarchical clustering was used to identify samples with similar expression profiles. Hypergeometric analysis was used to identify canonical pathways and curated networks having statistically significant enrichment of differentially expressed genes. Osteopontin (OPN) expression was validated by immunohistochemistry. Hierarchical clustering defined signatures, which differentiated between cancer and healthy tissue, muscle-invasive or non-muscle invasive cancer and healthy tissue, grade 1 and grade 3. Pathways associated with cell cycle and proliferation were markedly upregulated in muscle-invasive and grade 3 cancers. Genes associated with the classical complement pathway were downregulated in non-muscle invasive cancer. Osteopontin was markedly overexpressed in invasive cancer compared to healthy tissue. The present study contributes to a growing body of work on gene expression signatures in bladder cancer. The data support an important role for osteopontin in bladder cancer, and identify several pathways worthy of further investigation. PMID:28259975

  2. Phage cluster relationships identified through single gene analysis

    PubMed Central

    2013-01-01

    Background Phylogenetic comparison of bacteriophages requires whole genome approaches such as dotplot analysis, genome pairwise maps, and gene content analysis. Currently mycobacteriophages, a highly studied phage group, are categorized into related clusters based on the comparative analysis of whole genome sequences. With the recent explosion of phage isolation, a simple method for phage cluster prediction would facilitate analysis of crude or complex samples without whole genome isolation and sequencing. The hypothesis of this study was that mycobacteriophage-cluster prediction is possible using comparison of a single, ubiquitous, semi-conserved gene. Tape Measure Protein (TMP) was selected to test the hypothesis because it is typically the longest gene in mycobacteriophage genomes and because regions within the TMP gene are conserved. Results A single gene, TMP, identified the known Mycobacteriophage clusters and subclusters using a Gepard dotplot comparison or a phylogenetic tree constructed from global alignment and maximum likelihood comparisons. Gepard analysis of 247 mycobacteriophage TMP sequences appropriately recovered 98.8% of the subcluster assignments that were made by whole-genome comparison. Subcluster-specific primers within TMP allow for PCR determination of the mycobacteriophage subcluster from DNA samples. Using the single-gene comparison approach for siphovirus coliphages, phage groupings by TMP comparison reflected relationships observed in a whole genome dotplot comparison and confirm the potential utility of this approach to another widely studied group of phages. Conclusions TMP sequence comparison and PCR results support the hypothesis that a single gene can be used for distinguishing phage cluster and subcluster assignments. TMP single-gene analysis can quickly and accurately aid in mycobacteriophage classification. PMID:23777341

  3. DAWN: a framework to identify autism genes and subnetworks using gene expression and genetics

    PubMed Central

    2014-01-01

    Background De novo loss-of-function (dnLoF) mutations are found twofold more often in autism spectrum disorder (ASD) probands than their unaffected siblings. Multiple independent dnLoF mutations in the same gene implicate the gene in risk and hence provide a systematic, albeit arduous, path forward for ASD genetics. It is likely that using additional non-genetic data will enhance the ability to identify ASD genes. Methods To accelerate the search for ASD genes, we developed a novel algorithm, DAWN, to model two kinds of data: rare variations from exome sequencing and gene co-expression in the mid-fetal prefrontal and motor-somatosensory neocortex, a critical nexus for risk. The algorithm casts the ensemble data as a hidden Markov random field in which the graph structure is determined by gene co-expression and it combines these interrelationships with node-specific observations, namely gene identity, expression, genetic data and the estimated effect on risk. Results Using currently available genetic data and a specific developmental time period for gene co-expression, DAWN identified 127 genes that plausibly affect risk, and a set of likely ASD subnetworks. Validation experiments making use of published targeted resequencing results demonstrate its efficacy in reliably predicting ASD genes. DAWN also successfully predicts known ASD genes, not included in the genetic data used to create the model. Conclusions Validation studies demonstrate that DAWN is effective in predicting ASD genes and subnetworks by leveraging genetic and gene expression data. The findings reported here implicate neurite extension and neuronal arborization as risks for ASD. Using DAWN on emerging ASD sequence data and gene expression data from other brain regions and tissues would likely identify novel ASD genes. DAWN can also be used for other complex disorders to identify genes and subnetworks in those disorders. PMID:24602502

  4. Genes Necessary for Bacterial Magnetite Biomineralization Identified by Transposon Mutagenesis

    NASA Astrophysics Data System (ADS)

    Nash, C. Z.; Komeili, A.; Newman, D. K.; Kirschvink, J. L.

    2004-12-01

    Magnetic bacteria synthesize nanoscale crystals of magnetite in intracellular, membrane-bounded organelles (magnetosomes). These crystals are preserved in the fossil record at least as far back as the late Neoproterozoic and have been tentatively identified in much older rocks (1). This fossil record may provide deep time calibration points for molecular evolution studies once the genes involved in biologically controlled magnetic mineralization (BCMM) are known. Further, a genetic and biochemical understanding of BCMM will give insight into the depositional environment and biogeochemical cycles in which magnetic bacteria play a role. The BCMM process is not well understood, though proteins have been identified from the magnetosome membrane and genetic manipulation and biochemical characterization of these proteins are underway. Most of the proteins currently thought to be involved are encoded within the mam cluster, a large cluster of genes whose products localize to the magnetosome membrane and are conserved among magnetic bacteria (2). In an effort to identify all of the genes necessary for bacterial BCMM, we undertook a transposon mutagenesis of Magnetospirillum magneticum AMB-1. Non-magnetic mutants (MNMs) were identified by growth in liquid culture followed by a magnetic assay. The insertion site of the transposon was identified two ways. First MNMs were screened with a PCR assay to determine if the transposon had inserted into the mam cluster. Second, the transposon was rescued from the mutant DNA and cloned for sequencing. The majority insertion sites are located within the mam cluster. Insertion sites also occur in operons which have not previously been suspected to be involved in magnetite biomineralization. None of the insertion sites have occurred within genes reported from previous transposon mutagenesis studies of AMB-1 (3, 4). Two of the non-mam cluster insertion sites occur in operons containing genes conserved particularly between MS-1 and MC-1. We

  5. Identifying genes required for respiratory growth of fission yeast

    PubMed Central

    2016-01-01

    We have used both auxotroph and prototroph versions of the latest deletion-mutant library to identify genes required for respiratory growth on solid glycerol medium in fission yeast. This data set complements and enhances our recent study on functional and regulatory aspects of energy metabolism by providing additional proteins that are involved in respiration. Most proteins identified in this mutant screen have not been implicated in respiration in budding yeast. We also provide a protocol to generate a prototrophic mutant library, and data on technical and biological reproducibility of colony-based high-throughput screens. PMID:27918601

  6. Axon Regeneration Genes Identified by RNAi Screening in C. elegans

    PubMed Central

    Nix, Paola; Hammarlund, Marc; Hauth, Linda; Lachnit, Martina; Jorgensen, Erik M.

    2014-01-01

    Axons of the mammalian CNS lose the ability to regenerate soon after development due to both an inhibitory CNS environment and the loss of cell-intrinsic factors necessary for regeneration. The complex molecular events required for robust regeneration of mature neurons are not fully understood, particularly in vivo. To identify genes affecting axon regeneration in Caenorhabditis elegans, we performed both an RNAi-based screen for defective motor axon regeneration in unc-70/β-spectrin mutants and a candidate gene screen. From these screens, we identified at least 50 conserved genes with growth-promoting or growth-inhibiting functions. Through our analysis of mutants, we shed new light on certain aspects of regeneration, including the role of β-spectrin and membrane dynamics, the antagonistic activity of MAP kinase signaling pathways, and the role of stress in promoting axon regeneration. Many gene candidates had not previously been associated with axon regeneration and implicate new pathways of interest for therapeutic intervention. PMID:24403161

  7. Sleeping Beauty Mouse Models Identify Candidate Genes Involved in Gliomagenesis

    PubMed Central

    Vyazunova, Irina; Maklakova, Vilena I.; Berman, Samuel; De, Ishani; Steffen, Megan D.; Hong, Won; Lincoln, Hayley; Morrissy, A. Sorana; Taylor, Michael D.; Akagi, Keiko; Brennan, Cameron W.; Rodriguez, Fausto J.; Collier, Lara S.

    2014-01-01

    Genomic studies of human high-grade gliomas have discovered known and candidate tumor drivers. Studies in both cell culture and mouse models have complemented these approaches and have identified additional genes and processes important for gliomagenesis. Previously, we found that mobilization of Sleeping Beauty transposons in mice ubiquitously throughout the body from the Rosa26 locus led to gliomagenesis with low penetrance. Here we report the characterization of mice in which transposons are mobilized in the Glial Fibrillary Acidic Protein (GFAP) compartment. Glioma formation in these mice did not occur on an otherwise wild-type genetic background, but rare gliomas were observed when mobilization occurred in a p19Arf heterozygous background. Through cloning insertions from additional gliomas generated by transposon mobilization in the Rosa26 compartment, several candidate glioma genes were identified. Comparisons to genetic, epigenetic and mRNA expression data from human gliomas implicates several of these genes as tumor suppressor genes and oncogenes in human glioblastoma. PMID:25423036

  8. Methods for identifying an essential gene in a prokaryotic microorganism

    DOEpatents

    Shizuya, Hiroaki

    2006-01-31

    Methods are provided for the rapid identification of essential or conditionally essential DNA segments in any species of haploid cell (one copy chromosome per cell) that is capable of being transformed by artificial means and is capable of undergoing DNA recombination. This system offers an enhanced means of identifying essential function genes in diploid pathogens, such as gram-negative and gram-positive bacteria.

  9. Analysis of gene order conservation in eukaryotes identifies transcriptionally and functionally linked genes.

    PubMed

    Dávila López, Marcela; Martínez Guerra, Juan José; Samuelsson, Tore

    2010-05-14

    The order of genes in eukaryotes is not entirely random. Studies of gene order conservation are important to understand genome evolution and to reveal mechanisms why certain neighboring genes are more difficult to separate during evolution. Here, genome-wide gene order information was compiled for 64 species, representing a wide variety of eukaryotic phyla. This information is presented in a browser where gene order may be displayed and compared between species. Factors related to non-random gene order in eukaryotes were examined by considering pairs of neighboring genes. The evolutionary conservation of gene pairs was studied with respect to relative transcriptional direction, intergenic distance and functional relationship as inferred by gene ontology. The results show that among gene pairs that are conserved the divergently and co-directionally transcribed genes are much more common than those that are convergently transcribed. Furthermore, highly conserved pairs, in particular those of fungi, are characterized by a short intergenic distance. Finally, gene pairs of metazoa and fungi that are evolutionary conserved and that are divergently transcribed are much more likely to be related by function as compared to poorly conserved gene pairs. One example is the ribosomal protein gene pair L13/S16, which is unusual as it occurs both in fungi and alveolates. A specific functional relationship between these two proteins is also suggested by the fact that they are part of the same operon in both eubacteria and archaea. In conclusion, factors associated with non-random gene order in eukaryotes include relative gene orientation, intergenic distance and functional relationships. It seems likely that certain pairs of genes are conserved because the genes involved have a transcriptional and/or functional relationship. The results also indicate that studies of gene order conservation aid in identifying genes that are related in terms of transcriptional control.

  10. Analysis of Gene Order Conservation in Eukaryotes Identifies Transcriptionally and Functionally Linked Genes

    PubMed Central

    Dávila López, Marcela; Martínez Guerra, Juan José; Samuelsson, Tore

    2010-01-01

    The order of genes in eukaryotes is not entirely random. Studies of gene order conservation are important to understand genome evolution and to reveal mechanisms why certain neighboring genes are more difficult to separate during evolution. Here, genome-wide gene order information was compiled for 64 species, representing a wide variety of eukaryotic phyla. This information is presented in a browser where gene order may be displayed and compared between species. Factors related to non-random gene order in eukaryotes were examined by considering pairs of neighboring genes. The evolutionary conservation of gene pairs was studied with respect to relative transcriptional direction, intergenic distance and functional relationship as inferred by gene ontology. The results show that among gene pairs that are conserved the divergently and co-directionally transcribed genes are much more common than those that are convergently transcribed. Furthermore, highly conserved pairs, in particular those of fungi, are characterized by a short intergenic distance. Finally, gene pairs of metazoa and fungi that are evolutionary conserved and that are divergently transcribed are much more likely to be related by function as compared to poorly conserved gene pairs. One example is the ribosomal protein gene pair L13/S16, which is unusual as it occurs both in fungi and alveolates. A specific functional relationship between these two proteins is also suggested by the fact that they are part of the same operon in both eubacteria and archaea. In conclusion, factors associated with non-random gene order in eukaryotes include relative gene orientation, intergenic distance and functional relationships. It seems likely that certain pairs of genes are conserved because the genes involved have a transcriptional and/or functional relationship. The results also indicate that studies of gene order conservation aid in identifying genes that are related in terms of transcriptional control. PMID:20498846

  11. Using Drosophila melanogaster to identify chemotherapy toxicity genes.

    PubMed

    King, Elizabeth G; Kislukhin, Galina; Walters, Kelli N; Long, Anthony D

    2014-09-01

    The severity of the toxic side effects of chemotherapy shows a great deal of interindividual variability, and much of this variation is likely genetically based. Simple DNA tests predictive of toxic side effects could revolutionize the way chemotherapy is carried out. Due to the challenges in identifying polymorphisms that affect toxicity in humans, we use Drosophila fecundity following oral exposure to carboplatin, gemcitabine and mitomycin C as a model system to identify naturally occurring DNA variants predictive of toxicity. We use the Drosophila Synthetic Population Resource (DSPR), a panel of recombinant inbred lines derived from a multiparent advanced intercross, to map quantitative trait loci affecting chemotoxicity. We identify two QTL each for carboplatin and gemcitabine toxicity and none for mitomycin. One QTL is associated with fly orthologs of a priori human carboplatin candidate genes ABCC2 and MSH2, and a second QTL is associated with fly orthologs of human gemcitabine candidate genes RRM2 and RRM2B. The third, a carboplatin QTL, is associated with a posteriori human orthologs from solute carrier family 7A, INPP4A&B, and NALCN. The fourth, a gemcitabine QTL that also affects methotrexate toxicity, is associated with human ortholog GPx4. Mapped QTL each explain a significant fraction of variation in toxicity, yet individual SNPs and transposable elements in the candidate gene regions fail to singly explain QTL peaks. Furthermore, estimates of founder haplotype effects are consistent with genes harboring several segregating functional alleles. We find little evidence for nonsynonymous SNPs explaining mapped QTL; thus it seems likely that standing variation in toxicity is due to regulatory alleles.

  12. Efficient Strategy to Identify Gene-Gene Interactions and Its Application to Type 2 Diabetes

    PubMed Central

    Li, Donghe

    2016-01-01

    Over the past decade, the detection of gene-gene interactions has become more and more popular in the field of genome-wide association studies (GWASs). The goal of the GWAS is to identify genetic susceptibility to complex diseases by assaying and analyzing hundreds of thousands of single-nucleotide polymorphisms. However, such tests are computationally demanding and methodologically challenging. Recently, a simple but powerful method, named “BOolean Operation-based Screening and Testing” (BOOST), was proposed for genome-wide gene-gene interaction analyses. BOOST was designed with a Boolean representation of genotype data and is approximately equivalent to the log-linear model. It is extremely fast, and genome-wide gene-gene interaction analyses can be completed within a few hours. However, BOOST can not adjust for covariate effects, and its type-1 error control is not correct. Thus, we considered two-step approaches for gene-gene interaction analyses. First, we selected gene-gene interactions with BOOST and applied logistic regression with covariate adjustments to select gene-gene interactions. We applied the two-step approach to type 2 diabetes (T2D) in the Korea Association Resource (KARE) cohort and identified some promising pairs of single-nucleotide polymorphisms associated with T2D. PMID:28154506

  13. Meta-analysis of gene expression data identifies causal genes for prostate cancer.

    PubMed

    Wang, Xiang-Yang; Hao, Jian-Wei; Zhou, Rui-Jin; Zhang, Xiang-Sheng; Yan, Tian-Zhong; Ding, De-Gang; Shan, Lei

    2013-01-01

    Prostate cancer is a leading cause of death in male populations across the globe. With the advent of gene expression arrays, many microarray studies have been conducted in prostate cancer, but the results have varied across different studies. To better understand the genetic and biologic mechanisms of prostate cancer, we conducted a meta-analysis of two studies on prostate cancer. Eight key genes were identified to be differentially expressed with progression. After gene co-expression analysis based on data from the GEO database, we obtained a co- expressed gene list which included 725 genes. Gene Ontology analysis revealed that these genes are involved in actin filament-based processes, locomotion and cell morphogenesis. Further analysis of the gene list should provide important clues for developing new prognostic markers and therapeutic targets.

  14. Identifying Francisella tularensis genes required for growth in host cells.

    PubMed

    Brunton, J; Steele, S; Miller, C; Lovullo, E; Taft-Benz, S; Kawula, T

    2015-08-01

    Francisella tularensis is a highly virulent Gram-negative intracellular pathogen capable of infecting a vast diversity of hosts, ranging from amoebae to humans. A hallmark of F. tularensis virulence is its ability to quickly grow to high densities within a diverse set of host cells, including, but not limited to, macrophages and epithelial cells. We developed a luminescence reporter system to facilitate a large-scale transposon mutagenesis screen to identify genes required for growth in macrophage and epithelial cell lines. We screened 7,454 individual mutants, 269 of which exhibited reduced intracellular growth. Transposon insertions in the 269 growth-defective strains mapped to 68 different genes. FTT_0924, a gene of unknown function but highly conserved among Francisella species, was identified in this screen to be defective for intracellular growth within both macrophage and epithelial cell lines. FTT_0924 was required for full Schu S4 virulence in a murine pulmonary infection model. The ΔFTT_0924 mutant bacterial membrane is permeable when replicating in hypotonic solution and within macrophages, resulting in strongly reduced viability. The permeability and reduced viability were rescued when the mutant was grown in a hypertonic solution, indicating that FTT_0924 is required for resisting osmotic stress. The ΔFTT_0924 mutant was also significantly more sensitive to β-lactam antibiotics than Schu S4. Taken together, the data strongly suggest that FTT_0924 is required for maintaining peptidoglycan integrity and virulence.

  15. A recellularized human colon model identifies cancer driver genes

    PubMed Central

    Chen, Huanhuan Joyce; Wei, Zhubo; Sun, Jian; Bhattacharya, Asmita; Savage, David J; Serda, Rita; Mackeyev, Yuri; Curley, Steven A.; Bu, Pengcheng; Wang, Lihua; Chen, Shuibing; Cohen-Gould, Leona; Huang, Emina; Shen, Xiling; Lipkin, Steven M.; Copeland, Neal G.; Jenkins, Nancy A.; Shuler, Michael L.

    2016-01-01

    Refined cancer models are needed to bridge the gap between cell-line, animal and clinical research. Here we describe the engineering of an organotypic colon cancer model by recellularization of a native human matrix that contains cell-populated mucosa and an intact muscularis mucosa layer. This ex vivo system recapitulates the pathophysiological progression from APC-mutant neoplasia to submucosal invasive tumor. We used it to perform a Sleeping Beauty transposon mutagenesis screen to identify genes that cooperate with mutant APC in driving invasive neoplasia. 38 candidate invasion driver genes were identified, 17 of which have been previously implicated in colorectal cancer progression, including TCF7L2, TWIST2, MSH2, DCC and EPHB1/2. Six invasion driver genes that to our knowledge have not been previously described were validated in vitro using cell proliferation, migration and invasion assays, and ex vivo using recellularized human colon. These results demonstrate the utility of our organoid model for studying cancer biology. PMID:27398792

  16. Identifying Francisella tularensis Genes Required for Growth in Host Cells

    PubMed Central

    Brunton, J.; Steele, S.; Miller, C.; Lovullo, E.; Taft-Benz, S.

    2015-01-01

    Francisella tularensis is a highly virulent Gram-negative intracellular pathogen capable of infecting a vast diversity of hosts, ranging from amoebae to humans. A hallmark of F. tularensis virulence is its ability to quickly grow to high densities within a diverse set of host cells, including, but not limited to, macrophages and epithelial cells. We developed a luminescence reporter system to facilitate a large-scale transposon mutagenesis screen to identify genes required for growth in macrophage and epithelial cell lines. We screened 7,454 individual mutants, 269 of which exhibited reduced intracellular growth. Transposon insertions in the 269 growth-defective strains mapped to 68 different genes. FTT_0924, a gene of unknown function but highly conserved among Francisella species, was identified in this screen to be defective for intracellular growth within both macrophage and epithelial cell lines. FTT_0924 was required for full Schu S4 virulence in a murine pulmonary infection model. The ΔFTT_0924 mutant bacterial membrane is permeable when replicating in hypotonic solution and within macrophages, resulting in strongly reduced viability. The permeability and reduced viability were rescued when the mutant was grown in a hypertonic solution, indicating that FTT_0924 is required for resisting osmotic stress. The ΔFTT_0924 mutant was also significantly more sensitive to β-lactam antibiotics than Schu S4. Taken together, the data strongly suggest that FTT_0924 is required for maintaining peptidoglycan integrity and virulence. PMID:25987704

  17. Anaerobically expressed Escherichia coli genes identified by operon fusion techniques.

    PubMed Central

    Choe, M; Reznikoff, W S

    1991-01-01

    Genes that are expressed under anaerobic conditions were identified by operon fusion techniques with a hybrid bacteriophage of lambda and Mu, lambda placMu53, which creates transcriptional fusions to lacZY. Cells were screened for anaerobic expression on XG medium. Nine strains were selected, and the insertion point of the hybrid phage in each strain was mapped on the Escherichia coli chromosome linkage map. The anaerobic and aerobic expression levels of these genes were measured by beta-galactosidase assays in different medium conditions and in the presence of three regulatory mutations (fnr, narL, and rpoN). The anaerobically expressed genes (aeg) located at minute 99 (aeg-99) and 75 (aeg-75) appeared to be partially regulated by fnr, and aeg-93 is tightly regulated by fnr. aeg-60 requires a functional rpoN gene for its anaerobic expression. aeg-46.5 is repressed by narL. aeg-65A and aeg-65C are partially controlled by fnr but only in media containing nitrate or fumarate. aeg-47.5 and aeg-48.5 were found to be anaerobically induced only in rich media. The effects of a narL mutation on aeg-46.5 expression were observed in all medium conditions regardless of the presence or absence of nitrate. This suggests that narL has a regulatory function in the absence of exogenously added nitrate. PMID:1917846

  18. A Metastatic Mouse Model Identifies Genes That Regulate Neuroblastoma Metastasis.

    PubMed

    Seong, Bo Kyung A; Fathers, Kelly E; Hallett, Robin; Yung, Christina K; Stein, Lincoln D; Mouaaz, Samar; Kee, Lynn; Hawkins, Cynthia E; Irwin, Meredith S; Kaplan, David R

    2017-02-01

    Metastatic relapse is the major cause of death in pediatric neuroblastoma, where there remains a lack of therapies to target this stage of disease. To understand the molecular mechanisms mediating neuroblastoma metastasis, we developed a mouse model using intracardiac injection and in vivo selection to isolate malignant cell subpopulations with a higher propensity for metastasis to bone and the central nervous system. Gene expression profiling revealed primary and metastatic cells as two distinct cell populations defined by differential expression of 412 genes and of multiple pathways, including CADM1, SPHK1, and YAP/TAZ, whose expression independently predicted survival. In the metastatic subpopulations, a gene signature was defined (MET-75) that predicted survival of neuroblastoma patients with metastatic disease. Mechanistic investigations demonstrated causal roles for CADM1, SPHK1, and YAP/TAZ in mediating metastatic phenotypes in vitro and in vivo Notably, pharmacologic targeting of SPHK1 or YAP/TAZ was sufficient to inhibit neuroblastoma metastasis in vivo Overall, we identify gene expression signatures and candidate therapeutics that could improve the treatment of metastatic neuroblastoma. Cancer Res; 77(3); 696-706. ©2017 AACR.

  19. Identifying Neighborhoods of Coordinated Gene Expression and Metabolite Profiles

    PubMed Central

    Hancock, Timothy; Wicker, Nicolas; Takigawa, Ichigaku; Mamitsuka, Hiroshi

    2012-01-01

    In this paper we investigate how metabolic network structure affects any coordination between transcript and metabolite profiles. To achieve this goal we conduct two complementary analyses focused on the metabolic response to stress. First, we investigate the general size of any relationship between metabolic network gene expression and metabolite profiles. We find that strongly correlated transcript-metabolite profiles are sustained over surprisingly long network distances away from any target metabolite. Secondly, we employ a novel pathway mining method to investigate the structure of this transcript-metabolite relationship. The objective of this method is to identify a minimum set of metabolites which are the target of significantly correlated gene expression pathways. The results reveal that in general, a global regulation signature targeting a small number of metabolites is responsible for a large scale metabolic response. However, our method also reveals pathway specific effects that can degrade this global regulation signature and complicates the observed coordination between transcript-metabolite profiles. PMID:22355360

  20. Gastric Cancer Associated Genes Identified by an Integrative Analysis of Gene Expression Data

    PubMed Central

    Jiang, Bing; Li, Shuwen; Jiang, Zhi

    2017-01-01

    Gastric cancer is one of the most severe complex diseases with high morbidity and mortality in the world. The molecular mechanisms and risk factors for this disease are still not clear since the cancer heterogeneity caused by different genetic and environmental factors. With more and more expression data accumulated nowadays, we can perform integrative analysis for these data to understand the complexity of gastric cancer and to identify consensus players for the heterogeneous cancer. In the present work, we screened the published gene expression data and analyzed them with integrative tool, combined with pathway and gene ontology enrichment investigation. We identified several consensus differentially expressed genes and these genes were further confirmed with literature mining; at last, two genes, that is, immunoglobulin J chain and C-X-C motif chemokine ligand 17, were screened as novel gastric cancer associated genes. Experimental validation is proposed to further confirm this finding. PMID:28232943

  1. Novel radiation response genes identified in gene-trapped MCF10A mammary epithelial cells.

    PubMed

    Malone, Jennifer; Ullrich, Robert

    2007-02-01

    We have used a gene-trapping strategy to screen human mammary epithelial cells for radiation response genes. Relative mRNA expression levels of five candidate genes in MCF10A cells were analyzed, both with and without exposure to radiation. In all five cases, the trapped genes were significantly down-regulated after radiation treatment. Sequence analysis of the fusion transcripts identified the trapped genes: (1) the human androgen receptor, (2) the uncharacterized DREV1 gene, which has known homology to DNA methyltransferases, (3) the human creatine kinase gene, (4) the human eukaryotic translation elongation factor 1 beta 2, and (5) the human ribosomal protein L27. All five genes were down-regulated significantly after treatment with varying doses of ionizing radiation (0.10 to 4.0 Gy) and at varying times (2-30 h after treatment). The genes were also analyzed in human fibroblast and lymphoblastoid cell lines to determine whether the radiation response being observed was cell-type specific. The results verified that the observed radiation response was not a cell-type-specific phenomenon, suggesting that the genes play essential roles in the radiation damage control pathways. This study demonstrates the potential of the gene-trap approach for the identification and functional analysis of novel radiation response genes.

  2. Identifying the genes of unconventional high temperature superconductors.

    PubMed

    Hu, Jiangping

    We elucidate a recently emergent framework in unifying the two families of high temperature (high [Formula: see text]) superconductors, cuprates and iron-based superconductors. The unification suggests that the latter is simply the counterpart of the former to realize robust extended s-wave pairing symmetries in a square lattice. The unification identifies that the key ingredients (gene) of high [Formula: see text] superconductors is a quasi two dimensional electronic environment in which the d-orbitals of cations that participate in strong in-plane couplings to the p-orbitals of anions are isolated near Fermi energy. With this gene, the superexchange magnetic interactions mediated by anions could maximize their contributions to superconductivity. Creating the gene requires special arrangements between local electronic structures and crystal lattice structures. The speciality explains why high [Formula: see text] superconductors are so rare. An explicit prediction is made to realize high [Formula: see text] superconductivity in Co/Ni-based materials with a quasi two dimensional hexagonal lattice structure formed by trigonal bipyramidal complexes.

  3. New mutations identified in the ocular albinism type 1 gene.

    PubMed

    Roma, Cristin; Ferrante, Paola; Guardiola, Ombretta; Ballabio, Andrea; Zollo, Massimo

    2007-11-01

    As the most common form of ocular albinism, ocular albinism type I (OA1) is an X-linked disorder that has an estimated prevalence of about 1:50,000. We searched for mutations through the human genome sequence draft by direct sequencing on eighteen patients with OA1, both within the coding region and in a thousand base pairs upstream of its start site. Here, we have identified eight new mutations located in the coding region of the gene. Two independent mutations, both located in the most carboxyterminal protein regions, were further characterized by immunofluorescence confocal microscopy, thus showing an impairment in their subcellular distribution into the lysosomal compartment of Cos-7A cells. The mutations found can result in protein misfolding, thus underlining the importance of the structure-function relationships of the protein as a major pathogenic mechanism in ocular albinism. Seven individuals out of eighteen (38.9%) with a clinical diagnosis of ocular albinism showed mutations, thus underlining the discrepancies between the clinical phenotype features and their genotype correlations. We postulate that mutations that have not yet been identified are potentially located in non-coding conserved regions or regulatory sequences of the OA1 gene.

  4. Utilizing Gene Tree Variation to Identify Candidate Effector Genes in Zymoseptoria tritici.

    PubMed

    McDonald, Megan C; McGinness, Lachlan; Hane, James K; Williams, Angela H; Milgate, Andrew; Solomon, Peter S

    2016-04-07

    Zymoseptoria tritici is a host-specific, necrotrophic pathogen of wheat. Infection by Z. tritici is characterized by its extended latent period, which typically lasts 2 wks, and is followed by extensive host cell death, and rapid proliferation of fungal biomass. This work characterizes the level of genomic variation in 13 isolates, for which we have measured virulence on 11 wheat cultivars with differential resistance genes. Between the reference isolate, IPO323, and the 13 Australian isolates we identified over 800,000 single nucleotide polymorphisms, of which ∼10% had an effect on the coding regions of the genome. Furthermore, we identified over 1700 probable presence/absence polymorphisms in genes across the Australian isolates using de novo assembly. Finally, we developed a gene tree sorting method that quickly identifies groups of isolates within a single gene alignment whose sequence haplotypes correspond with virulence scores on a single wheat cultivar. Using this method, we have identified < 100 candidate effector genes whose gene sequence correlates with virulence toward a wheat cultivar carrying a major resistance gene.

  5. GeneValidator: identify problems with protein-coding gene predictions

    PubMed Central

    Drăgan, Monica-Andreea; Moghul, Ismail; Priyam, Anurag; Bustos, Claudio; Wurm, Yannick

    2016-01-01

    Summary: Genomes of emerging model organisms are now being sequenced at very low cost. However, obtaining accurate gene predictions remains challenging: even the best gene prediction algorithms make substantial errors and can jeopardize subsequent analyses. Therefore, many predicted genes must be time-consumingly visually inspected and manually curated. We developed GeneValidator (GV) to automatically identify problematic gene predictions and to aid manual curation. For each gene, GV performs multiple analyses based on comparisons to gene sequences from large databases. The resulting report identifies problematic gene predictions and includes extensive statistics and graphs for each prediction to guide manual curation efforts. GV thus accelerates and enhances the work of biocurators and researchers who need accurate gene predictions from newly sequenced genomes. Availability and implementation: GV can be used through a web interface or in the command-line. GV is open-source (AGPL), available at https://wurmlab.github.io/tools/genevalidator. Contact: y.wurm@qmul.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26787666

  6. Blood pressure loci identified with a gene-centric array.

    PubMed

    Johnson, Toby; Gaunt, Tom R; Newhouse, Stephen J; Padmanabhan, Sandosh; Tomaszewski, Maciej; Kumari, Meena; Morris, Richard W; Tzoulaki, Ioanna; O'Brien, Eoin T; Poulter, Neil R; Sever, Peter; Shields, Denis C; Thom, Simon; Wannamethee, Sasiwarang G; Whincup, Peter H; Brown, Morris J; Connell, John M; Dobson, Richard J; Howard, Philip J; Mein, Charles A; Onipinla, Abiodun; Shaw-Hawkins, Sue; Zhang, Yun; Davey Smith, George; Day, Ian N M; Lawlor, Debbie A; Goodall, Alison H; Fowkes, F Gerald; Abecasis, Gonçalo R; Elliott, Paul; Gateva, Vesela; Braund, Peter S; Burton, Paul R; Nelson, Christopher P; Tobin, Martin D; van der Harst, Pim; Glorioso, Nicola; Neuvrith, Hani; Salvi, Erika; Staessen, Jan A; Stucchi, Andrea; Devos, Nabila; Jeunemaitre, Xavier; Plouin, Pierre-François; Tichet, Jean; Juhanson, Peeter; Org, Elin; Putku, Margus; Sõber, Siim; Veldre, Gudrun; Viigimaa, Margus; Levinsson, Anna; Rosengren, Annika; Thelle, Dag S; Hastie, Claire E; Hedner, Thomas; Lee, Wai K; Melander, Olle; Wahlstrand, Björn; Hardy, Rebecca; Wong, Andrew; Cooper, Jackie A; Palmen, Jutta; Chen, Li; Stewart, Alexandre F R; Wells, George A; Westra, Harm-Jan; Wolfs, Marcel G M; Clarke, Robert; Franzosi, Maria Grazia; Goel, Anuj; Hamsten, Anders; Lathrop, Mark; Peden, John F; Seedorf, Udo; Watkins, Hugh; Ouwehand, Willem H; Sambrook, Jennifer; Stephens, Jonathan; Casas, Juan-Pablo; Drenos, Fotios; Holmes, Michael V; Kivimaki, Mika; Shah, Sonia; Shah, Tina; Talmud, Philippa J; Whittaker, John; Wallace, Chris; Delles, Christian; Laan, Maris; Kuh, Diana; Humphries, Steve E; Nyberg, Fredrik; Cusi, Daniele; Roberts, Robert; Newton-Cheh, Christopher; Franke, Lude; Stanton, Alice V; Dominiczak, Anna F; Farrall, Martin; Hingorani, Aroon D; Samani, Nilesh J; Caulfield, Mark J; Munroe, Patricia B

    2011-12-09

    Raised blood pressure (BP) is a major risk factor for cardiovascular disease. Previous studies have identified 47 distinct genetic variants robustly associated with BP, but collectively these explain only a few percent of the heritability for BP phenotypes. To find additional BP loci, we used a bespoke gene-centric array to genotype an independent discovery sample of 25,118 individuals that combined hypertensive case-control and general population samples. We followed up four SNPs associated with BP at our p < 8.56 × 10(-7) study-specific significance threshold and six suggestively associated SNPs in a further 59,349 individuals. We identified and replicated a SNP at LSP1/TNNT3, a SNP at MTHFR-NPPB independent (r(2) = 0.33) of previous reports, and replicated SNPs at AGT and ATP2B1 reported previously. An analysis of combined discovery and follow-up data identified SNPs significantly associated with BP at p < 8.56 × 10(-7) at four further loci (NPR3, HFE, NOS3, and SOX6). The high number of discoveries made with modest genotyping effort can be attributed to using a large-scale yet targeted genotyping array and to the development of a weighting scheme that maximized power when meta-analyzing results from samples ascertained with extreme phenotypes, in combination with results from nonascertained or population samples. Chromatin immunoprecipitation and transcript expression data highlight potential gene regulatory mechanisms at the MTHFR and NOS3 loci. These results provide candidates for further study to help dissect mechanisms affecting BP and highlight the utility of studying SNPs and samples that are independent of those studied previously even when the sample size is smaller than that in previous studies.

  7. Reconstructability analysis as a tool for identifying gene-gene interactions in studies of human diseases.

    PubMed

    Shervais, Stephen; Kramer, Patricia L; Westaway, Shawn K; Cox, Nancy J; Zwick, Martin

    2010-01-01

    There are a number of common human diseases for which the genetic component may include an epistatic interaction of multiple genes. Detecting these interactions with standard statistical tools is difficult because there may be an interaction effect, but minimal or no main effect. Reconstructability analysis (RA) uses Shannon's information theory to detect relationships between variables in categorical datasets. We applied RA to simulated data for five different models of gene-gene interaction, and find that even with heritability levels as low as 0.008, and with the inclusion of 50 non-associated genes in the dataset, we can identify the interacting gene pairs with an accuracy of > or =80%. We applied RA to a real dataset of type 2 non-insulin-dependent diabetes (NIDDM) cases and controls, and closely approximated the results of more conventional single SNP disease association studies. In addition, we replicated prior evidence for epistatic interactions between SNPs on chromosomes 2 and 15.

  8. BCIP: a gene-centered platform for identifying potential regulatory genes in breast cancer

    PubMed Central

    Wu, Jiaqi; Hu, Shuofeng; Chen, Yaowen; Li, Zongcheng; Zhang, Jian; Yuan, Hanyu; Shi, Qiang; Shao, Ningsheng; Ying, Xiaomin

    2017-01-01

    Breast cancer is a disease with high heterogeneity. Many issues on tumorigenesis and progression are still elusive. It is critical to identify genes that play important roles in the progression of tumors, especially for tumors with poor prognosis such as basal-like breast cancer and tumors in very young women. To facilitate the identification of potential regulatory or driver genes, we present the Breast Cancer Integrative Platform (BCIP, http://omics.bmi.ac.cn/bcancer/). BCIP maintains multi-omics data selected with strict quality control and processed with uniform normalization methods, including gene expression profiles from 9,005 tumor and 376 normal tissue samples, copy number variation information from 3,035 tumor samples, microRNA-target interactions, co-expressed genes, KEGG pathways, and mammary tissue-specific gene functional networks. This platform provides a user-friendly interface integrating comprehensive and flexible analysis tools on differential gene expression, copy number variation, and survival analysis. The prominent characteristic of BCIP is that users can perform analysis by customizing subgroups with single or combined clinical features, including subtypes, histological grades, pathologic stages, metastasis status, lymph node status, ER/PR/HER2 status, TP53 mutation status, menopause status, age, tumor size, therapy responses, and prognosis. BCIP will help to identify regulatory or driver genes and candidate biomarkers for further research in breast cancer. PMID:28327601

  9. Gene-based rare allele analysis identified a risk gene of Alzheimer's disease.

    PubMed

    Kim, Jong Hun; Song, Pamela; Lim, Hyunsun; Lee, Jae-Hyung; Lee, Jun Hong; Park, Sun Ah

    2014-01-01

    Alzheimer's disease (AD) has a strong propensity to run in families. However, the known risk genes excluding APOE are not clinically useful. In various complex diseases, gene studies have targeted rare alleles for unsolved heritability. Our study aims to elucidate previously unknown risk genes for AD by targeting rare alleles. We used data from five publicly available genetic studies from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and the database of Genotypes and Phenotypes (dbGaP). A total of 4,171 cases and 9,358 controls were included. The genotype information of rare alleles was imputed using 1,000 genomes. We performed gene-based analysis of rare alleles (minor allele frequency≤3%). The genome-wide significance level was defined as meta P<1.8×10(-6) (0.05/number of genes in human genome = 0.05/28,517). ZNF628, which is located at chromosome 19q13.42, showed a genome-wide significant association with AD. The association of ZNF628 with AD was not dependent on APOE ε4. APOE and TREM2 were also significantly associated with AD, although not at genome-wide significance levels. Other genes identified by targeting common alleles could not be replicated in our gene-based rare allele analysis. We identified that rare variants in ZNF628 are associated with AD. The protein encoded by ZNF628 is known as a transcription factor. Furthermore, the associations of APOE and TREM2 with AD were highly significant, even in gene-based rare allele analysis, which implies that further deep sequencing of these genes is required in AD heritability studies.

  10. Functional gene group analysis identifies synaptic gene groups as risk factor for schizophrenia

    PubMed Central

    Lips, E S; Cornelisse, L N; Toonen, R F; Min, J L; Hultman, C M; Holmans, P A; O'Donovan, M C; Purcell, S M; Smit, A B; Verhage, M; Sullivan, P F; Visscher, P M; Posthuma, D

    2012-01-01

    Schizophrenia is a highly heritable disorder with a polygenic pattern of inheritance and a population prevalence of ∼1%. Previous studies have implicated synaptic dysfunction in schizophrenia. We tested the accumulated association of genetic variants in expert-curated synaptic gene groups with schizophrenia in 4673 cases and 4965 healthy controls, using functional gene group analysis. Identifying groups of genes with similar cellular function rather than genes in isolation may have clinical implications for finding additional drug targets. We found that a group of 1026 synaptic genes was significantly associated with the risk of schizophrenia (P=7.6 × 10−11) and more strongly associated than 100 randomly drawn, matched control groups of genetic variants (P<0.01). Subsequent analysis of synaptic subgroups suggested that the strongest association signals are derived from three synaptic gene groups: intracellular signal transduction (P=2.0 × 10−4), excitability (P=9.0 × 10−4) and cell adhesion and trans-synaptic signaling (P=2.4 × 10−3). These results are consistent with a role of synaptic dysfunction in schizophrenia and imply that impaired intracellular signal transduction in synapses, synaptic excitability and cell adhesion and trans-synaptic signaling play a role in the pathology of schizophrenia. PMID:21931320

  11. Functional gene group analysis identifies synaptic gene groups as risk factor for schizophrenia.

    PubMed

    Lips, E S; Cornelisse, L N; Toonen, R F; Min, J L; Hultman, C M; Holmans, P A; O'Donovan, M C; Purcell, S M; Smit, A B; Verhage, M; Sullivan, P F; Visscher, P M; Posthuma, D

    2012-10-01

    Schizophrenia is a highly heritable disorder with a polygenic pattern of inheritance and a population prevalence of ~1%. Previous studies have implicated synaptic dysfunction in schizophrenia. We tested the accumulated association of genetic variants in expert-curated synaptic gene groups with schizophrenia in 4673 cases and 4965 healthy controls, using functional gene group analysis. Identifying groups of genes with similar cellular function rather than genes in isolation may have clinical implications for finding additional drug targets. We found that a group of 1026 synaptic genes was significantly associated with the risk of schizophrenia (P=7.6 × 10(-11)) and more strongly associated than 100 randomly drawn, matched control groups of genetic variants (P<0.01). Subsequent analysis of synaptic subgroups suggested that the strongest association signals are derived from three synaptic gene groups: intracellular signal transduction (P=2.0 × 10(-4)), excitability (P=9.0 × 10(-4)) and cell adhesion and trans-synaptic signaling (P=2.4 × 10(-3)). These results are consistent with a role of synaptic dysfunction in schizophrenia and imply that impaired intracellular signal transduction in synapses, synaptic excitability and cell adhesion and trans-synaptic signaling play a role in the pathology of schizophrenia.

  12. Identifying nonspecific SAGE tags by context of gene expression.

    PubMed

    Ge, Xijin; Wang, San Ming

    2008-01-01

    Many serial analysis of gene expression (SAGE) tags can be matched to multiple genes, leading to difficulty in SAGE data interpretation and analysis. As only a subset of genes in the human genome are transcribed in a certain type of tissue/cell, we used microarray expression data from different tissue types to define contexts of gene expression and to annotate SAGE tags collected from the same or similar tissue sources. To predict the original transcript contributing a nonspecific SAGE tag collected from a particular tissue, we ranked the corresponding genes by their expression levels determined by microarray. We developed a tissue-specific SAGE tag annotation database based on microarray data collected from 73 normal human tissues and 18 cancer tissues and cell lines. The database can be queried online at: http://www.basic.northwestern.edu/SAGE/. The accuracy of this database was confirmed by experimental data.

  13. An in vivo screening system to identify tumorigenic genes.

    PubMed

    Ihara, T; Hosokawa, Y; Kumazawa, K; Ishikawa, K; Fujimoto, J; Yamamoto, M; Muramkami, T; Goshima, N; Ito, E; Watanabe, S; Semba, K

    2017-04-06

    Screening for oncogenes has mostly been performed by in vitro transformation assays. However, some oncogenes might not exhibit their transforming activities in vitro unless putative essential factors from in vivo microenvironments are adequately supplied. Here, we have developed an in vivo screening system that evaluates the tumorigenicity of target genes. This system uses a retroviral high-efficiency gene transfer technique, a large collection of human cDNA clones corresponding to ~70% of human genes and a luciferase-expressing immortalized mouse mammary epithelial cell line (NMuMG-luc). From 845 genes that were highly expressed in human breast cancer cell lines, we focused on 205 genes encoding membrane proteins and/or kinases as that had the greater possibility of being oncogenes or drug targets. The 205 genes were divided into five subgroups, each containing 34-43 genes, and then introduced them into NMuMG-luc cells. These cells were subcutaneously injected into nude mice and monitored for tumor development by in vivo imaging. Tumors were observed in three subgroups. Using DNA microarray analyses and individual tumorigenic assays, we found that three genes, ADORA2B, PRKACB and LPAR3, were tumorigenic. ADORA2B and LPAR3 encode G-protein-coupled receptors and PRKACB encodes a protein kinase A catalytic subunit. Cells overexpressing ADORA2B, LPAR3 or PRKACB did not show transforming phenotypes in vitro, suggesting that transformation by these genes requires in vivo microenvironments. In addition, several clinical data sets, including one for breast cancer, showed that the expression of these genes correlated with lower overall survival rate.

  14. Gene expression in human hippocampus from cocaine abusers identifies genes which regulate extracellular matrix remodeling.

    PubMed

    Mash, Deborah C; ffrench-Mullen, Jarlath; Adi, Nikhil; Qin, Yujing; Buck, Andrew; Pablo, John

    2007-11-14

    The chronic effects of cocaine abuse on brain structure and function are blamed for the inability of most addicts to remain abstinent. Part of the difficulty in preventing relapse is the persisting memory of the intense euphoria or cocaine "rush". Most abused drugs and alcohol induce neuroplastic changes in brain pathways subserving emotion and cognition. Such changes may account for the consolidation and structural reconfiguration of synaptic connections with exposure to cocaine. Adaptive hippocampal plasticity could be related to specific patterns of gene expression with chronic cocaine abuse. Here, we compare gene expression profiles in the human hippocampus from cocaine addicts and age-matched drug-free control subjects. Cocaine abusers had 151 gene transcripts upregulated, while 91 gene transcripts were downregulated. Topping the list of cocaine-regulated transcripts was RECK in the human hippocampus (FC = 2.0; p<0.05). RECK is a membrane-anchored MMP inhibitor that is implicated in the coordinated regulation of extracellular matrix integrity and angiogenesis. In keeping with elevated RECK expression, active MMP9 protein levels were decreased in the hippocampus from cocaine abusers. Pathway analysis identified other genes regulated by cocaine that code for proteins involved in the remodeling of the cytomatrix and synaptic connections and the inhibition of blood vessel proliferation (PCDH8, LAMB1, ITGB6, CTGF and EphB4). The observed microarray phenotype in the human hippocampus identified RECK and other region-specific genes that may promote long-lasting structural changes with repeated cocaine abuse. Extracellular matrix remodeling in the hippocampus may be a persisting effect of chronic abuse that contributes to the compulsive and relapsing nature of cocaine addiction.

  15. Dissecting the gene network of dietary restriction to identify evolutionarily conserved pathways and new functional genes.

    PubMed

    Wuttke, Daniel; Connor, Richard; Vora, Chintan; Craig, Thomas; Li, Yang; Wood, Shona; Vasieva, Olga; Shmookler Reis, Robert; Tang, Fusheng; de Magalhães, João Pedro

    2012-01-01

    Dietary restriction (DR), limiting nutrient intake from diet without causing malnutrition, delays the aging process and extends lifespan in multiple organisms. The conserved life-extending effect of DR suggests the involvement of fundamental mechanisms, although these remain a subject of debate. To help decipher the life-extending mechanisms of DR, we first compiled a list of genes that if genetically altered disrupt or prevent the life-extending effects of DR. We called these DR-essential genes and identified more than 100 in model organisms such as yeast, worms, flies, and mice. In order for other researchers to benefit from this first curated list of genes essential for DR, we established an online database called GenDR (http://genomics.senescence.info/diet/). To dissect the interactions of DR-essential genes and discover the underlying lifespan-extending mechanisms, we then used a variety of network and systems biology approaches to analyze the gene network of DR. We show that DR-essential genes are more conserved at the molecular level and have more molecular interactions than expected by chance. Furthermore, we employed a guilt-by-association method to predict novel DR-essential genes. In budding yeast, we predicted nine genes related to vacuolar functions; we show experimentally that mutations deleting eight of those genes prevent the life-extending effects of DR. Three of these mutants (OPT2, FRE6, and RCR2) had extended lifespan under ad libitum, indicating that the lack of further longevity under DR is not caused by a general compromise of fitness. These results demonstrate how network analyses of DR using GenDR can be used to make phenotypically relevant predictions. Moreover, gene-regulatory circuits reveal that the DR-induced transcriptional signature in yeast involves nutrient-sensing, stress responses and meiotic transcription factors. Finally, comparing the influence of gene expression changes during DR on the interactomes of multiple organisms led

  16. Epidermal growth factor gene is a newly identified candidate gene for gout

    PubMed Central

    Han, Lin; Cao, Chunwei; Jia, Zhaotong; Liu, Shiguo; Liu, Zhen; Xin, Ruosai; Wang, Can; Li, Xinde; Ren, Wei; Wang, Xuefeng; Li, Changgui

    2016-01-01

    Chromosome 4q25 has been identified as a genomic region associated with gout. However, the associations of gout with the genes in this region have not yet been confirmed. Here, we performed two-stage analysis to determine whether variations in candidate genes in the 4q25 region are associated with gout in a male Chinese Han population. We first evaluated 96 tag single nucleotide polymorphisms (SNPs) in eight inflammatory/immune pathway- or glucose/lipid metabolism-related genes in the 4q25 region in 480 male gout patients and 480 controls. The SNP rs12504538, located in the elongation of very-long-chain-fatty-acid-like family member 6 gene (Elovl6), was found to be associated with gout susceptibility (Padjusted = 0.00595). In the second stage of analysis, we performed fine mapping analysis of 93 tag SNPs in Elovl6 and in the epidermal growth factor gene (EGF) and its flanking regions in 1017 male patients gout and 1897 healthy male controls. We observed a significant association between the T allele of EGF rs2298999 and gout (odds ratio = 0.77, 95% confidence interval = 0.67–0.88, Padjusted = 6.42 × 10−3). These results provide the first evidence for an association between the EGF rs2298999 C/T polymorphism and gout. Our findings should be validated in additional populations. PMID:27506295

  17. Systems Approaches to Identifying Gene Regulatory Networks in Plants

    PubMed Central

    Long, Terri A.; Brady, Siobhan M.; Benfey, Philip N.

    2009-01-01

    Complex gene regulatory networks are composed of genes, noncoding RNAs, proteins, metabolites, and signaling components. The availability of genome-wide mutagenesis libraries; large-scale transcriptome, proteome, and metabalome data sets; and new high-throughput methods that uncover protein interactions underscores the need for mathematical modeling techniques that better enable scientists to synthesize these large amounts of information and to understand the properties of these biological systems. Systems biology approaches can allow researchers to move beyond a reductionist approach and to both integrate and comprehend the interactions of multiple components within these systems. Descriptive and mathematical models for gene regulatory networks can reveal emergent properties of these plant systems. This review highlights methods that researchers are using to obtain large-scale data sets, and examples of gene regulatory networks modeled with these data. Emergent properties revealed by the use of these network models and perspectives on the future of systems biology are discussed. PMID:18616425

  18. Virus-induced gene silencing of Arabidopsis thaliana gene homologues in wheat identifies genes conferring improved drought tolerance

    PubMed Central

    Lapitan, Nora

    2013-01-01

    In a non-model staple crop like wheat (Triticum aestivumI L.), functional validation of potential drought stress responsive genes identified in Arabidopsis could provide gene targets for breeding. Virus-induced gene silencing (VIGS) of genes of interest can overcome the inherent problems of polyploidy and limited transformation potential that hamper functional validation studies in wheat. In this study, three potential candidate genes shown to be involved in abiotic stress response pathways in Arabidopsis thaliana were selected for VIGS experiments in wheat. These include Era1 (enhanced response to abscisic acid), Cyp707a (ABA 8’-hydroxylase), and Sal1 (inositol polyphosphate 1-phosphatase). Gene homologues for these three genes were identified in wheat and cloned in the viral vector barley stripe mosaic virus (BSMV) in the antisense direction, followed by rub inoculation of BSMV viral RNA transcripts onto wheat plants. Quantitative real-time PCR showed that VIGS-treated wheat plants had significant reductions in target gene transcripts. When VIGS-treated plants generated for Era1 and Sal1 were subjected to limiting water conditions, they showed increased relative water content, improved water use efficiency, reduced gas exchange, and better vigour compared to water-stressed control plants inoculated with RNA from the empty viral vector (BSMV0). In comparison, the Cyp707a-silenced plants showed no improvement over BSMV0-inoculated plants under limited water condition. These results indicate that Era1 and Sal1 play important roles in conferring drought tolerance in wheat. Other traits affected by Era1 silencing were also studied. Delayed seed germination in Era1-silenced plants suggests this gene may be a useful target for developing resistance to pre-harvest sprouting. PMID:23364940

  19. Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis

    PubMed Central

    Méheust, Raphaël; Zelzion, Ehud; Bhattacharya, Debashish; Lopez, Philippe; Bapteste, Eric

    2016-01-01

    The integration of foreign genetic information is central to the evolution of eukaryotes, as has been demonstrated for the origin of the Calvin cycle and of the heme and carotenoid biosynthesis pathways in algae and plants. For photosynthetic lineages, this coordination involved three genomes of divergent phylogenetic origins (the nucleus, plastid, and mitochondrion). Major hurdles overcome by the ancestor of these lineages were harnessing the oxygen-evolving organelle, optimizing the use of light, and stabilizing the partnership between the plastid endosymbiont and host through retargeting of proteins to the nascent organelle. Here we used protein similarity networks that can disentangle reticulate gene histories to explore how these significant challenges were met. We discovered a previously hidden component of algal and plant nuclear genomes that originated from the plastid endosymbiont: symbiogenetic genes (S genes). These composite proteins, exclusive to photosynthetic eukaryotes, encode a cyanobacterium-derived domain fused to one of cyanobacterial or another prokaryotic origin and have emerged multiple, independent times during evolution. Transcriptome data demonstrate the existence and expression of S genes across a wide swath of algae and plants, and functional data indicate their involvement in tolerance to oxidative stress, phototropism, and adaptation to nitrogen limitation. Our research demonstrates the “recycling” of genetic information by photosynthetic eukaryotes to generate novel composite genes, many of which function in plastid maintenance. PMID:26976593

  20. Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis.

    PubMed

    Méheust, Raphaël; Zelzion, Ehud; Bhattacharya, Debashish; Lopez, Philippe; Bapteste, Eric

    2016-03-29

    The integration of foreign genetic information is central to the evolution of eukaryotes, as has been demonstrated for the origin of the Calvin cycle and of the heme and carotenoid biosynthesis pathways in algae and plants. For photosynthetic lineages, this coordination involved three genomes of divergent phylogenetic origins (the nucleus, plastid, and mitochondrion). Major hurdles overcome by the ancestor of these lineages were harnessing the oxygen-evolving organelle, optimizing the use of light, and stabilizing the partnership between the plastid endosymbiont and host through retargeting of proteins to the nascent organelle. Here we used protein similarity networks that can disentangle reticulate gene histories to explore how these significant challenges were met. We discovered a previously hidden component of algal and plant nuclear genomes that originated from the plastid endosymbiont: symbiogenetic genes (S genes). These composite proteins, exclusive to photosynthetic eukaryotes, encode a cyanobacterium-derived domain fused to one of cyanobacterial or another prokaryotic origin and have emerged multiple, independent times during evolution. Transcriptome data demonstrate the existence and expression of S genes across a wide swath of algae and plants, and functional data indicate their involvement in tolerance to oxidative stress, phototropism, and adaptation to nitrogen limitation. Our research demonstrates the "recycling" of genetic information by photosynthetic eukaryotes to generate novel composite genes, many of which function in plastid maintenance.

  1. Exploiting natural variation to identify insect-resistance genes.

    PubMed

    Broekgaarden, Colette; Snoeren, Tjeerd A L; Dicke, Marcel; Vosman, Ben

    2011-10-01

    Herbivorous insects are widespread and often serious constraints to crop production. The use of insect-resistant crops is a very effective way to control insect pests in agriculture, and the development of such crops can be greatly enhanced by knowledge on plant resistance mechanisms and the genes involved. Plants have evolved diverse ways to cope with insect attack that has resulted in natural variation for resistance towards herbivorous insects. Studying the molecular genetics and transcriptional background of this variation has facilitated the identification of resistance genes and processes that lead to resistance against insects. With the development of new technologies, molecular studies are not restricted to model plants anymore. This review addresses the need to exploit natural variation in resistance towards insects to increase our knowledge on resistance mechanisms and the genes involved. We will discuss how this knowledge can be exploited in breeding programmes to provide sustainable crop protection against insect pests. Additionally, we discuss the current status of genetic research on insect-resistance genes. We conclude that insect-resistance mechanisms are still unclear at the molecular level and that exploiting natural variation with novel technologies will contribute greatly to the development of insect-resistant crop varieties.

  2. Gene expression profiling identifies genes predictive of oral squamous cell carcinoma.

    PubMed

    Chen, Chu; Méndez, Eduardo; Houck, John; Fan, Wenhong; Lohavanichbutr, Pawadee; Doody, Dave; Yueh, Bevan; Futran, Neal D; Upton, Melissa; Farwell, D Gregory; Schwartz, Stephen M; Zhao, Lue Ping

    2008-08-01

    Oral squamous cell carcinoma (OSCC) is associated with substantial mortality and morbidity. To identify potential biomarkers for the early detection of invasive OSCC, we compared the gene expressions of incident primary OSCC, oral dysplasia, and clinically normal oral tissue from surgical patients without head and neck cancer or preneoplastic oral lesions (controls), using Affymetrix U133 2.0 Plus arrays. We identified 131 differentially expressed probe sets using a training set of 119 OSCC patients and 35 controls. Forward and stepwise logistic regression analyses identified 10 successive combinations of genes which expression differentiated OSCC from controls. The best model included LAMC2, encoding laminin-gamma2 chain, and COL4A1, encoding collagen, type IV alpha1 chain. Subsequent modeling without these two markers showed that COL1A1, encoding collagen, type I alpha1 chain, and PADI1, encoding peptidyl arginine deiminase, type 1, could also distinguish OSCC from controls. We validated these two models using an internal independent testing set of 48 invasive OSCC and 10 controls and an external testing set of 42 head and neck squamous cell carcinoma cases and 14 controls (GEO GSE6791), with sensitivity and specificity above 95%. These two models were also able to distinguish dysplasia (n = 17) from control (n = 35) tissue. Differential expression of these four genes was confirmed by quantitative reverse transcription-PCR. If confirmed in larger studies, the proposed models may hold promise for monitoring local recurrence at surgical margins and the development of second primary oral cancer in patients with OSCC.

  3. Transcriptome Sequencing Identified Genes and Gene Ontologies Associated with Early Freezing Tolerance in Maize

    PubMed Central

    Li, Zhao; Hu, Guanghui; Liu, Xiangfeng; Zhou, Yao; Li, Yu; Zhang, Xu; Yuan, Xiaohui; Zhang, Qian; Yang, Deguang; Wang, Tianyu; Zhang, Zhiwu

    2016-01-01

    Originating in a tropical climate, maize has faced great challenges as cultivation has expanded to the majority of the world's temperate zones. In these zones, frost and cold temperatures are major factors that prevent maize from reaching its full yield potential. Among 30 elite maize inbred lines adapted to northern China, we identified two lines of extreme, but opposite, freezing tolerance levels—highly tolerant and highly sensitive. During the seedling stage of these two lines, we used RNA-seq to measure changes in maize whole genome transcriptome before and after freezing treatment. In total, 19,794 genes were expressed, of which 4550 exhibited differential expression due to either treatment (before or after freezing) or line type (tolerant or sensitive). Of the 4550 differently expressed genes, 948 exhibited differential expression due to treatment within line or lines under freezing condition. Analysis of gene ontology found that these 948 genes were significantly enriched for binding functions (DNA binding, ATP binding, and metal ion binding), protein kinase activity, and peptidase activity. Based on their enrichment, literature support, and significant levels of differential expression, 30 of these 948 genes were selected for quantitative real-time PCR (qRT-PCR) validation. The validation confirmed our RNA-Seq-based findings, with squared correlation coefficients of 80% and 50% in the tolerance and sensitive lines, respectively. This study provided valuable resources for further studies to enhance understanding of the molecular mechanisms underlying maize early freezing response and enable targeted breeding strategies for developing varieties with superior frost resistance to achieve yield potential. PMID:27774095

  4. Identifying mechanistic indicators of childhood asthma from blood gene expression

    EPA Science Inventory

    Asthmatic individuals have been identified as a susceptible subpopulation for air pollutants. However, asthma represents a syndrome with multiple probable etiologies, and the identification of these asthma endotypes is critical to accurately define the most susceptible subpopula...

  5. DNA methylation profiling identifies CG methylation clusters in Arabidopsis genes.

    PubMed

    Tran, Robert K; Henikoff, Jorja G; Zilberman, Daniel; Ditt, Renata F; Jacobsen, Steven E; Henikoff, Steven

    2005-01-26

    Cytosine DNA methylation in vertebrates is widespread, but methylation in plants is found almost exclusively at transposable elements and repetitive DNA. Within regions of methylation, methylcytosines are typically found in CG, CNG, and asymmetric contexts. CG sites are maintained by a plant homolog of mammalian Dnmt1 acting on hemi-methylated DNA after replication. Methylation of CNG and asymmetric sites appears to be maintained at each cell cycle by other mechanisms. We report a new type of DNA methylation in Arabidopsis, dense CG methylation clusters found at scattered sites throughout the genome. These clusters lack non-CG methylation and are preferentially found in genes, although they are relatively deficient toward the 5' end. CG methylation clusters are present in lines derived from different accessions and in mutants that eliminate de novo methylation, indicating that CG methylation clusters are stably maintained at specific sites. Because 5-methylcytosine is mutagenic, the appearance of CG methylation clusters over evolutionary time predicts a genome-wide deficiency of CG dinucleotides and an excess of C(A/T)G trinucleotides within transcribed regions. This is exactly what we find, implying that CG methylation clusters have contributed profoundly to plant gene evolution. We suggest that CG methylation clusters silence cryptic promoters that arise sporadically within transcription units.

  6. Candidate genes for limiting cholestatic intestinal injury identified by gene expression profiling

    PubMed Central

    Alaish, Samuel M; Timmons, Jennifer; Smith, Alexis; Buzza, Marguerite S; Murphy, Ebony; Zhao, Aiping; Sun, Yezhou; Turner, Douglas J; Shea-Donahue, Terez; Antalis, Toni M; Cross, Alan; Dorsey, Susan G

    2013-01-01

    The lack of bile flow from the liver into the intestine can have devastating complications including hepatic failure, sepsis, and even death. This pathologic condition known as cholestasis can result from etiologies as diverse as total parenteral nutrition (TPN), hepatitis, and pancreatic cancer. The intestinal injury associated with cholestasis has been shown to result in decreased intestinal resistance, increased bacterial translocation, and increased endotoxemia. Anecdotal clinical evidence suggests a genetic predisposition to exaggerated injury. Recent animal research on two different strains of inbred mice demonstrating different rates of bacterial translocation with different mortality rates supports this premise. In this study, a microarray analysis of intestinal tissue following common bile duct ligation (CBDL) performed under general anesthesia on these same two strains of inbred mice was done with the goal of identifying the potential molecular mechanistic pathways responsible. Over 500 genes were increased more than 2.0-fold following CBDL. The most promising candidate genes included major urinary proteins (MUPs), serine protease-1-inhibitor (Serpina1a), and lipocalin-2 (LCN-2). Quantitative polymerase chain reaction (qPCR) validated the microarray results for these candidate genes. In an in vitro experiment using differentiated intestinal epithelial cells, inhibition of MUP-1 by siRNA resulted in increased intestinal epithelial cell permeability. Diverse novel mechanisms involving the growth hormone pathway, the acute phase response, and the innate immune response are thus potential avenues for limiting cholestatic intestinal injury. Changes in gene expression were at times found to be not only due to the CBDL but also due to the murine strain. Should further studies in cholestatic patients demonstrate interindividual variability similar to what we have shown in mice, then a “personalized medicine” approach to cholestatic patients may become

  7. Systematic analysis of microarray datasets to identify Parkinson's disease-associated pathways and genes

    PubMed Central

    Feng, Yinling; Wang, Xuefeng

    2017-01-01

    In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co-expression networks and clinical information was adopted, using weighted gene co-expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co-pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution-based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD-associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis. PMID:28098893

  8. Linkage, Association, and Gene-Expression Analyses Identify CNTNAP2 as an Autism-Susceptibility Gene

    PubMed Central

    Alarcón, Maricela; Abrahams, Brett S.; Stone, Jennifer L.; Duvall, Jacqueline A.; Perederiy, Julia V.; Bomar, Jamee M.; Sebat, Jonathan; Wigler, Michael; Martin, Christa L.; Ledbetter, David H.; Nelson, Stanley F.; Cantor, Rita M.; Geschwind, Daniel H.

    2008-01-01

    Autism is a genetically complex neurodevelopmental syndrome in which language deficits are a core feature. We describe results from two complimentary approaches used to identify risk variants on chromosome 7 that likely contribute to the etiology of autism. A two-stage association study tested 2758 SNPs across a 10 Mb 7q35 language-related autism QTL in AGRE (Autism Genetic Resource Exchange) trios1,2 and found significant association with Contactin Associated Protein-Like 2 (CNTNAP2), a strong a priori candidate. Male-only containing families were identified as primarily responsible for this association signal, consistent with the strong male affection bias in ASD and other language-based disorders. Gene-expression analyses in developing human brain further identified CNTNAP2 as enriched in circuits important for language development. Together, these results provide convergent evidence for involvement of CNTNAP2, a Neurexin family member, in autism, and demonstrate a connection between genetic risk for autism and specific brain structures. PMID:18179893

  9. Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development.

    PubMed

    Takeda, Haruna; Rust, Alistair G; Ward, Jerrold M; Yew, Christopher Chin Kuan; Jenkins, Nancy A; Copeland, Neal G

    2016-04-05

    Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4(+/-) mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC.

  10. Sleeping Beauty transposon mutagenesis identifies genes that cooperate with mutant Smad4 in gastric cancer development

    PubMed Central

    Takeda, Haruna; Rust, Alistair G.; Ward, Jerrold M.; Yew, Christopher Chin Kuan; Jenkins, Nancy A.; Copeland, Neal G.

    2016-01-01

    Mutations in SMAD4 predispose to the development of gastrointestinal cancer, which is the third leading cause of cancer-related deaths. To identify genes driving gastric cancer (GC) development, we performed a Sleeping Beauty (SB) transposon mutagenesis screen in the stomach of Smad4+/− mutant mice. This screen identified 59 candidate GC trunk drivers and a much larger number of candidate GC progression genes. Strikingly, 22 SB-identified trunk drivers are known or candidate cancer genes, whereas four SB-identified trunk drivers, including PTEN, SMAD4, RNF43, and NF1, are known human GC trunk drivers. Similar to human GC, pathway analyses identified WNT, TGF-β, and PI3K-PTEN signaling, ubiquitin-mediated proteolysis, adherens junctions, and RNA degradation in addition to genes involved in chromatin modification and organization as highly deregulated pathways in GC. Comparative oncogenomic filtering of the complete list of SB-identified genes showed that they are highly enriched for genes mutated in human GC and identified many candidate human GC genes. Finally, by comparing our complete list of SB-identified genes against the list of mutated genes identified in five large-scale human GC sequencing studies, we identified LDL receptor-related protein 1B (LRP1B) as a previously unidentified human candidate GC tumor suppressor gene. In LRP1B, 129 mutations were found in 462 human GC samples sequenced, and LRP1B is one of the top 10 most deleted genes identified in a panel of 3,312 human cancers. SB mutagenesis has, thus, helped to catalog the cooperative molecular mechanisms driving SMAD4-induced GC growth and discover genes with potential clinical importance in human GC. PMID:27006499

  11. Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.

    PubMed

    Allman, Elizabeth S; Degnan, James H; Rhodes, John A

    2011-06-01

    Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals-each with many genes-splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. This multispecies coalescent model provides a framework for phylogeneticists to infer species trees from gene trees using maximum likelihood or Bayesian approaches. Because the coalescent models a branching process over time, all trees are typically assumed to be rooted in this setting. Often, however, gene trees inferred by traditional phylogenetic methods are unrooted. We investigate probabilities of unrooted gene trees under the multispecies coalescent model. We show that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths). The location of the root on the species tree is not identifiable in this situation. However, for 5 or more species with one gene sampled per species, we show that the distribution of unrooted gene tree topologies identifies the rooted species tree topology and all its internal branch lengths. The length of any pendant branch leading to a leaf of the species tree is also identifiable for any species from which more than one gene is sampled.

  12. Effective Boolean dynamics analysis to identify functionally important genes in large-scale signaling networks.

    PubMed

    Trinh, Hung-Cuong; Kwon, Yung-Keun

    2015-11-01

    Efficiently identifying functionally important genes in order to understand the minimal requirements of normal cellular development is challenging. To this end, a variety of structural measures have been proposed and their effectiveness has been investigated in recent literature; however, few studies have shown the effectiveness of dynamics-based measures. This led us to investigate a dynamic measure to identify functionally important genes, and the effectiveness of which was verified through application on two large-scale human signaling networks. We specifically consider Boolean sensitivity-based dynamics against an update-rule perturbation (BSU) as a dynamic measure. Through investigations on two large-scale human signaling networks, we found that genes with relatively high BSU values show slower evolutionary rate and higher proportions of essential genes and drug targets than other genes. Gene-ontology analysis showed clear differences between the former and latter groups of genes. Furthermore, we compare the identification accuracies of essential genes and drug targets via BSU and five well-known structural measures. Although BSU did not always show the best performance, it effectively identified the putative set of genes, which is significantly different from the results obtained via the structural measures. Most interestingly, BSU showed the highest synergy effect in identifying the functionally important genes in conjunction with other measures. Our results imply that Boolean-sensitive dynamics can be used as a measure to effectively identify functionally important genes in signaling networks.

  13. LGscore: A method to identify disease-related genes using biological literature and Google data.

    PubMed

    Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun

    2015-04-01

    Since the genome project in 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods.

  14. An Orthologous Epigenetic Gene Expression Signature Derived from Differentiating Embryonic Stem Cells Identifies Regulators of Cardiogenesis.

    PubMed

    Busser, Brian W; Lin, Yongshun; Yang, Yanqin; Zhu, Jun; Chen, Guokai; Michelson, Alan M

    2015-01-01

    Here we used predictive gene expression signatures within a multi-species framework to identify the genes that underlie cardiac cell fate decisions in differentiating embryonic stem cells. We show that the overlapping orthologous mouse and human genes are the most accurate candidate cardiogenic genes as these genes identified the most conserved developmental pathways that characterize the cardiac lineage. An RNAi-based screen of the candidate genes in Drosophila uncovered numerous novel cardiogenic genes. shRNA knockdown combined with transcriptome profiling of the newly-identified transcription factors zinc finger protein 503 and zinc finger E-box binding homeobox 2 and the well-known cardiac regulatory factor NK2 homeobox 5 revealed that zinc finger E-box binding homeobox 2 activates terminal differentiation genes required for cardiomyocyte structure and function whereas zinc finger protein 503 and NK2 homeobox 5 are required for specification of the cardiac lineage. We further demonstrated that an essential role of NK2 homeobox 5 and zinc finger protein 503 in specification of the cardiac lineage is the repression of gene expression programs characteristic of alternative cell fates. Collectively, these results show that orthologous gene expression signatures can be used to identify conserved cardiogenic pathways.

  15. Identifying differentially expressed genes in cancer patients using a non-parameter Ising model.

    PubMed

    Li, Xumeng; Feltus, Frank A; Sun, Xiaoqian; Wang, James Z; Luo, Feng

    2011-10-01

    Identification of genes and pathways involved in diseases and physiological conditions is a major task in systems biology. In this study, we developed a novel non-parameter Ising model to integrate protein-protein interaction network and microarray data for identifying differentially expressed (DE) genes. We also proposed a simulated annealing algorithm to find the optimal configuration of the Ising model. The Ising model was applied to two breast cancer microarray data sets. The results showed that more cancer-related DE sub-networks and genes were identified by the Ising model than those by the Markov random field model. Furthermore, cross-validation experiments showed that DE genes identified by Ising model can improve classification performance compared with DE genes identified by Markov random field model.

  16. Identifying the optimal gene and gene set in hepatocellular carcinoma based on differential expression and differential co-expression algorithm.

    PubMed

    Dong, Li-Yang; Zhou, Wei-Zhong; Ni, Jun-Wei; Xiang, Wei; Hu, Wen-Hao; Yu, Chang; Li, Hai-Yan

    2017-02-01

    The objective of this study was to identify the optimal gene and gene set for hepatocellular carcinoma (HCC) utilizing differential expression and differential co-expression (DEDC) algorithm. The DEDC algorithm consisted of four parts: calculating differential expression (DE) by absolute t-value in t-statistics; computing differential co-expression (DC) based on Z-test; determining optimal thresholds on the basis of Chi-squared (χ2) maximization and the corresponding gene was the optimal gene; and evaluating functional relevance of genes categorized into different partitions to determine the optimal gene set with highest mean minimum functional information (FI) gain (Δ*G). The optimal thresholds divided genes into four partitions, high DE and high DC (HDE-HDC), high DE and low DC (HDE-LDC), low DE and high DC (LDE‑HDC), and low DE and low DC (LDE-LDC). In addition, the optimal gene was validated by conducting reverse transcription-polymerase chain reaction (RT-PCR) assay. The optimal threshold for DC and DE were 1.032 and 1.911, respectively. Using the optimal gene, the genes were divided into four partitions including: HDE-HDC (2,053 genes), HED-LDC (2,822 genes), LDE-HDC (2,622 genes), and LDE-LDC (6,169 genes). The optimal gene was microtubule‑associated protein RP/EB family member 1 (MAPRE1), and RT-PCR assay validated the significant difference between the HCC and normal state. The optimal gene set was nucleoside metabolic process (GO\\GO:0009116) with Δ*G = 18.681 and 24 HDE-HDC partitions in total. In conclusion, we successfully investigated the optimal gene, MAPRE1, and gene set, nucleoside metabolic process, which may be potential biomarkers for targeted therapy and provide significant insight for revealing the pathological mechanism underlying HCC.

  17. Analysis of gene expression in the nervous system identifies key genes and novel candidates for health and disease.

    PubMed

    Carpanini, Sarah M; Wishart, Thomas M; Gillingwater, Thomas H; Manson, Jean C; Summers, Kim M

    2017-04-01

    The incidence of neurodegenerative diseases in the developed world has risen over the last century, concomitant with an increase in average human lifespan. A major challenge is therefore to identify genes that control neuronal health and viability with a view to enhancing neuronal health during ageing and reducing the burden of neurodegeneration. Analysis of gene expression data has recently been used to infer gene functions for a range of tissues from co-expression networks. We have now applied this approach to transcriptomic datasets from the mammalian nervous system available in the public domain. We have defined the genes critical for influencing neuronal health and disease in different neurological cell types and brain regions. The functional contribution of genes in each co-expression cluster was validated using human disease and knockout mouse phenotypes, pathways and gene ontology term annotation. Additionally a number of poorly annotated genes were implicated by this approach in nervous system function. Exploiting gene expression data available in the public domain allowed us to validate key nervous system genes and, importantly, to identify additional genes with minimal functional annotation but with the same expression pattern. These genes are thus novel candidates for a role in neurological health and disease and could now be further investigated to confirm their function and regulation during ageing and neurodegeneration.

  18. Gene Expression Signature-Based Screening Identifies New Broadly Effective Influenza A Antivirals

    PubMed Central

    Josset, Laurence; Textoris, Julien; Loriod, Béatrice; Ferraris, Olivier; Moules, Vincent; Lina, Bruno; N'Guyen, Catherine; Diaz, Jean-Jacques; Rosa-Calatrava, Manuel

    2010-01-01

    Classical antiviral therapies target viral proteins and are consequently subject to resistance. To counteract this limitation, alternative strategies have been developed that target cellular factors. We hypothesized that such an approach could also be useful to identify broad-spectrum antivirals. The influenza A virus was used as a model for its viral diversity and because of the need to develop therapies against unpredictable viruses as recently underlined by the H1N1 pandemic. We proposed to identify a gene-expression signature associated with infection by different influenza A virus subtypes which would allow the identification of potential antiviral drugs with a broad anti-influenza spectrum of activity. We analyzed the cellular gene expression response to infection with five different human and avian influenza A virus strains and identified 300 genes as differentially expressed between infected and non-infected samples. The most 20 dysregulated genes were used to screen the connectivity map, a database of drug-associated gene expression profiles. Candidate antivirals were then identified by their inverse correlation to the query signature. We hypothesized that such molecules would induce an unfavorable cellular environment for influenza virus replication. Eight potential antivirals including ribavirin were identified and their effects were tested in vitro on five influenza A strains. Six of the molecules inhibited influenza viral growth. The new pandemic H1N1 virus, which was not used to define the gene expression signature of infection, was inhibited by five out of the eight identified molecules, demonstrating that this strategy could contribute to identifying new broad anti-influenza agents acting on cellular gene expression. The identified infection signature genes, the expression of which are modified upon infection, could encode cellular proteins involved in the viral life cycle. This is the first study showing that gene expression-based screening can be

  19. Utility and Limitations of Using Gene Expression Data to Identify Functional Associations

    PubMed Central

    Peng, Cheng; Shiu, Shin-Han

    2016-01-01

    Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets. PMID:27935950

  20. Systems Biology Approach to Identify Gene Network Signatures for Colorectal Cancer

    PubMed Central

    Sonachalam, Madhankumar; Shen, Jeffrey; Huang, Hui; Wu, Xiaogang

    2012-01-01

    In this work, we integrated prior knowledge from gene signatures and protein interactions with gene set enrichment analysis (GSEA), and gene/protein network modeling together to identify gene network signatures from gene expression microarray data. We demonstrated how to apply this approach into discovering gene network signatures for colorectal cancer (CRC) from microarray datasets. First, we used GSEA to analyze the microarray data through enriching differential genes in different CRC-related gene sets from two publicly available up-to-date gene set databases – Molecular Signatures Database (MSigDB) and Gene Signatures Database (GeneSigDB). Second, we compared the enriched gene sets through enrichment score, false-discovery rate, and nominal p-value. Third, we constructed an integrated protein–protein interaction (PPI) network through connecting these enriched genes by high-quality interactions from a human annotated and predicted protein interaction database, with a confidence score labeled for each interaction. Finally, we mapped differential gene expressions onto the constructed network to build a comprehensive network model containing visualized transcriptome and proteome data. The results show that although MSigDB has more CRC-relevant gene sets than GeneSigDB, the integrated PPI network connecting the enriched genes from both MSigDB and GeneSigDB can provide a more complete view for discovering gene network signatures. We also found several important sub-network signatures for CRC, such as TP53 sub-network, PCNA sub-network, and IL8 sub-network, corresponding to apoptosis, DNA repair, and immune response, respectively. PMID:22629282

  1. Transposon insertional mutagenesis in mice identifies human breast cancer susceptibility genes and signatures for stratification

    PubMed Central

    Chen, Liming; Jenjaroenpun, Piroon; Pillai, Andrea Mun Ching; Ivshina, Anna V.; Ow, Ghim Siong; Efthimios, Motakis; Zhiqun, Tang; Lee, Song-Choon; Rogers, Keith; Ward, Jerrold M.; Mori, Seiichi; Adams, David J.; Jenkins, Nancy A.; Copeland, Neal G.; Ban, Kenneth Hon-Kim; Kuznetsov, Vladimir A.; Thiery, Jean Paul

    2017-01-01

    Robust prognostic gene signatures and therapeutic targets are difficult to derive from expression profiling because of the significant heterogeneity within breast cancer (BC) subtypes. Here, we performed forward genetic screening in mice using Sleeping Beauty transposon mutagenesis to identify candidate BC driver genes in an unbiased manner, using a stabilized N-terminal truncated β-catenin gene as a sensitizer. We identified 134 mouse susceptibility genes from 129 common insertion sites within 34 mammary tumors. Of these, 126 genes were orthologous to protein-coding genes in the human genome (hereafter, human BC susceptibility genes, hBCSGs), 70% of which are previously reported cancer-associated genes, and ∼16% are known BC suppressor genes. Network analysis revealed a gene hub consisting of E1A binding protein P300 (EP300), CD44 molecule (CD44), neurofibromin (NF1) and phosphatase and tensin homolog (PTEN), which are linked to a significant number of mutated hBCSGs. From our survival prediction analysis of the expression of human BC genes in 2,333 BC cases, we isolated a six-gene-pair classifier that stratifies BC patients with high confidence into prognostically distinct low-, moderate-, and high-risk subgroups. Furthermore, we proposed prognostic classifiers identifying three basal and three claudin-low tumor subgroups. Intriguingly, our hBCSGs are mostly unrelated to cell cycle/mitosis genes and are distinct from the prognostic signatures currently used for stratifying BC patients. Our findings illustrate the strength and validity of integrating functional mutagenesis screens in mice with human cancer transcriptomic data to identify highly prognostic BC subtyping biomarkers. PMID:28251929

  2. Transposon insertional mutagenesis in mice identifies human breast cancer susceptibility genes and signatures for stratification.

    PubMed

    Chen, Liming; Jenjaroenpun, Piroon; Pillai, Andrea Mun Ching; Ivshina, Anna V; Ow, Ghim Siong; Efthimios, Motakis; Zhiqun, Tang; Tan, Tuan Zea; Lee, Song-Choon; Rogers, Keith; Ward, Jerrold M; Mori, Seiichi; Adams, David J; Jenkins, Nancy A; Copeland, Neal G; Ban, Kenneth Hon-Kim; Kuznetsov, Vladimir A; Thiery, Jean Paul

    2017-03-14

    Robust prognostic gene signatures and therapeutic targets are difficult to derive from expression profiling because of the significant heterogeneity within breast cancer (BC) subtypes. Here, we performed forward genetic screening in mice using Sleeping Beauty transposon mutagenesis to identify candidate BC driver genes in an unbiased manner, using a stabilized N-terminal truncated β-catenin gene as a sensitizer. We identified 134 mouse susceptibility genes from 129 common insertion sites within 34 mammary tumors. Of these, 126 genes were orthologous to protein-coding genes in the human genome (hereafter, human BC susceptibility genes, hBCSGs), 70% of which are previously reported cancer-associated genes, and ∼16% are known BC suppressor genes. Network analysis revealed a gene hub consisting of E1A binding protein P300 (EP300), CD44 molecule (CD44), neurofibromin (NF1) and phosphatase and tensin homolog (PTEN), which are linked to a significant number of mutated hBCSGs. From our survival prediction analysis of the expression of human BC genes in 2,333 BC cases, we isolated a six-gene-pair classifier that stratifies BC patients with high confidence into prognostically distinct low-, moderate-, and high-risk subgroups. Furthermore, we proposed prognostic classifiers identifying three basal and three claudin-low tumor subgroups. Intriguingly, our hBCSGs are mostly unrelated to cell cycle/mitosis genes and are distinct from the prognostic signatures currently used for stratifying BC patients. Our findings illustrate the strength and validity of integrating functional mutagenesis screens in mice with human cancer transcriptomic data to identify highly prognostic BC subtyping biomarkers.

  3. Using phylogenomic patterns and gene ontology to identify proteins of importance in plant evolution.

    PubMed

    Cibrián-Jaramillo, Angélica; De la Torre-Bárcena, Jose E; Lee, Ernest K; Katari, Manpreet S; Little, Damon P; Stevenson, Dennis W; Martienssen, Rob; Coruzzi, Gloria M; DeSalle, Rob

    2010-07-12

    We use measures of congruence on a combined expressed sequenced tag genome phylogeny to identify proteins that have potential significance in the evolution of seed plants. Relevant proteins are identified based on the direction of partitioned branch and hidden support on the hypothesis obtained on a 16-species tree, constructed from 2,557 concatenated orthologous genes. We provide a general method for detecting genes or groups of genes that may be under selection in directions that are in agreement with the phylogenetic pattern. Gene partitioning methods and estimates of the degree and direction of support of individual gene partitions to the overall data set are used. Using this approach, we correlate positive branch support of specific genes for key branches in the seed plant phylogeny. In addition to basic metabolic functions, such as photosynthesis or hormones, genes involved in posttranscriptional regulation by small RNAs were significantly overrepresented in key nodes of the phylogeny of seed plants. Two genes in our matrix are of critical importance as they are involved in RNA-dependent regulation, essential during embryo and leaf development. These are Argonaute and the RNA-dependent RNA polymerase 6 found to be overrepresented in the angiosperm clade. We use these genes as examples of our phylogenomics approach and show that identifying partitions or genes in this way provides a platform to explain some of the more interesting organismal differences among species, and in particular, in the evolution of plants.

  4. Identifying and Analyzing Novel Epilepsy-Related Genes Using Random Walk with Restart Algorithm

    PubMed Central

    Guo, Wei; Shang, Dong-Mei; Cao, Jing-Hui; Feng, Kaiyan; Wang, ShaoPeng

    2017-01-01

    As a pathological condition, epilepsy is caused by abnormal neuronal discharge in brain which will temporarily disrupt the cerebral functions. Epilepsy is a chronic disease which occurs in all ages and would seriously affect patients' personal lives. Thus, it is highly required to develop effective medicines or instruments to treat the disease. Identifying epilepsy-related genes is essential in order to understand and treat the disease because the corresponding proteins encoded by the epilepsy-related genes are candidates of the potential drug targets. In this study, a pioneering computational workflow was proposed to predict novel epilepsy-related genes using the random walk with restart (RWR) algorithm. As reported in the literature RWR algorithm often produces a number of false positive genes, and in this study a permutation test and functional association tests were implemented to filter the genes identified by RWR algorithm, which greatly reduce the number of suspected genes and result in only thirty-three novel epilepsy genes. Finally, these novel genes were analyzed based upon some recently published literatures. Our findings implicate that all novel genes were closely related to epilepsy. It is believed that the proposed workflow can also be applied to identify genes related to other diseases and deepen our understanding of the mechanisms of these diseases. PMID:28255556

  5. Genes identified by visible mutant phenotypes show increased bias toward one of two subgenomes of maize.

    PubMed

    Schnable, James C; Freeling, Michael

    2011-03-10

    Not all genes are created equal. Despite being supported by sequence conservation and expression data, knockout homozygotes of many genes show no visible effects, at least under laboratory conditions. We have identified a set of maize (Zea mays L.) genes which have been the subject of a disproportionate share of publications recorded at MaizeGDB. We manually anchored these "classical" maize genes to gene models in the B73 reference genome, and identified syntenic orthologs in other grass genomes. In addition to proofing the most recent version 2 maize gene models, we show that a subset of these genes, those that were identified by morphological phenotype prior to cloning, are retained at syntenic locations throughout the grasses at much higher levels than the average expressed maize gene, and are preferentially found on the maize1 subgenome even with a duplicate copy is still retained on the opposite subgenome. Maize1 is the subgenome that experienced less gene loss following the whole genome duplication in maize lineage 5-12 million years ago and genes located on this subgenome tend to be expressed at higher levels in modern maize. Links to the web based software that supported our syntenic analyses in the grasses should empower further research and support teaching involving the history of maize genetic research. Our findings exemplify the concept of "grasses as a single genetic system," where what is learned in one grass may be applied to another.

  6. A novel reverse-genetic approach (SIMF) identifies Mutator insertions in new Myb genes.

    PubMed

    Rabinowicz, P D; Grotewold, E

    2000-11-01

    We have developed a new strategy designated SIMF (Systematic Insertional Mutagenesis of Families), to identify DNA insertions in many members of a gene family simultaneously. This method requires only a short amino acid sequence conserved in all members of the family to make a degenerate oligonucleotide, and a sequence from the end of the DNA insertion. The SIMF strategy was successfully applied to the large maize R2R3 Myb family of regulatory genes, and Mutator insertions in several novel Myb genes were identified. Application of this technique to identify insertions in other large gene families could significantly decrease the effort involved in screening at the same time for insertions in all members of groups of genes that share a limited sequence identity.

  7. Identifying suitable reference genes for gene expression analysis in developing skeletal muscle in pigs

    PubMed Central

    Zhang, YuanYuan; Hua, Chaoju; Wang, Zishuai; Li, Kui

    2016-01-01

    The selection of suitable reference genes is crucial to accurately evaluate and normalize the relative expression level of target genes for gene function analysis. However, commonly used reference genes have variable expression levels in developing skeletal muscle. There are few reports that systematically evaluate the expression stability of reference genes across prenatal and postnatal developing skeletal muscle in mammals. Here, we used quantitative PCR to examine the expression levels of 15 candidate reference genes (ACTB, GAPDH, RNF7, RHOA, RPS18, RPL32, PPIA, H3F3, API5, B2M, AP1S1, DRAP1, TBP, WSB, and VAPB) in porcine skeletal muscle at 26 different developmental stages (15 prenatal and 11 postnatal periods). We evaluated gene expression stability using the computer algorithms geNorm, NormFinder, and BestKeeper. Our results indicated that GAPDH and ACTB had the greatest variability among the candidate genes across prenatal and postnatal stages of skeletal muscle development. RPS18, API5, and VAPB had stable expression levels in prenatal stages, whereas API5, RPS18, RPL32, and H3F3 had stable expression levels in postnatal stages. API5 and H3F3 expression levels had the greatest stability in all tested prenatal and postnatal stages, and were the most appropriate reference genes for gene expression normalization in developing skeletal muscle. Our data provide valuable information for gene expression analysis during different stages of skeletal muscle development in mammals. This information can provide a valuable guide for the analysis of human diseases. PMID:27994956

  8. Identifying suitable reference genes for gene expression analysis in developing skeletal muscle in pigs.

    PubMed

    Niu, Guanglin; Yang, Yalan; Zhang, YuanYuan; Hua, Chaoju; Wang, Zishuai; Tang, Zhonglin; Li, Kui

    2016-01-01

    The selection of suitable reference genes is crucial to accurately evaluate and normalize the relative expression level of target genes for gene function analysis. However, commonly used reference genes have variable expression levels in developing skeletal muscle. There are few reports that systematically evaluate the expression stability of reference genes across prenatal and postnatal developing skeletal muscle in mammals. Here, we used quantitative PCR to examine the expression levels of 15 candidate reference genes (ACTB, GAPDH, RNF7, RHOA, RPS18, RPL32, PPIA, H3F3, API5, B2M, AP1S1, DRAP1, TBP, WSB, and VAPB) in porcine skeletal muscle at 26 different developmental stages (15 prenatal and 11 postnatal periods). We evaluated gene expression stability using the computer algorithms geNorm, NormFinder, and BestKeeper. Our results indicated that GAPDH and ACTB had the greatest variability among the candidate genes across prenatal and postnatal stages of skeletal muscle development. RPS18, API5, and VAPB had stable expression levels in prenatal stages, whereas API5, RPS18, RPL32, and H3F3 had stable expression levels in postnatal stages. API5 and H3F3 expression levels had the greatest stability in all tested prenatal and postnatal stages, and were the most appropriate reference genes for gene expression normalization in developing skeletal muscle. Our data provide valuable information for gene expression analysis during different stages of skeletal muscle development in mammals. This information can provide a valuable guide for the analysis of human diseases.

  9. The compact Selaginella genome identifies changes in gene content associated with the evolution of vascular plants

    SciTech Connect

    Grigoriev, Igor V.; Banks, Jo Ann; Nishiyama, Tomoaki; Hasebe, Mitsuyasu; Bowman, John L.; Gribskov, Michael; dePamphilis, Claude; Albert, Victor A.; Aono, Naoki; Aoyama, Tsuyoshi; Ambrose, Barbara A.; Ashton, Neil W.; Axtell, Michael J.; Barker, Elizabeth; Barker, Michael S.; Bennetzen, Jeffrey L.; Bonawitz, Nicholas D.; Chapple, Clint; Cheng, Chaoyang; Correa, Luiz Gustavo Guedes; Dacre, Michael; DeBarry, Jeremy; Dreyer, Ingo; Elias, Marek; Engstrom, Eric M.; Estelle, Mark; Feng, Liang; Finet, Cedric; Floyd, Sandra K.; Frommer, Wolf B.; Fujita, Tomomichi; Gramzow, Lydia; Gutensohn, Michael; Harholt, Jesper; Hattori, Mitsuru; Heyl, Alexander; Hirai, Tadayoshi; Hiwatashi, Yuji; Ishikawa, Masaki; Iwata, Mineko; Karol, Kenneth G.; Koehler, Barbara; Kolukisaoglu, Uener; Kubo, Minoru; Kurata, Tetsuya; Lalonde, Sylvie; Li, Kejie; Li, Ying; Litt, Amy; Lyons, Eric; Manning, Gerard; Maruyama, Takeshi; Michael, Todd P.; Mikami, Koji; Miyazaki, Saori; Morinaga, Shin-ichi; Murata, Takashi; Mueller-Roeber, Bernd; Nelson, David R.; Obara, Mari; Oguri, Yasuko; Olmstead, Richard G.; Onodera, Naoko; Petersen, Bent Larsen; Pils, Birgit; Prigge, Michael; Rensing, Stefan A.; Riano-Pachon, Diego Mauricio; Roberts, Alison W.; Sato, Yoshikatsu; Scheller, Henrik Vibe; Schulz, Burkhard; Schulz, Christian; Shakirov, Eugene V.; Shibagaki, Nakako; Shinohara, Naoki; Shippen, Dorothy E.; Sorensen, Iben; Sotooka, Ryo; Sugimoto, Nagisa; Sugita, Mamoru; Sumikawa, Naomi; Tanurdzic, Milos; Theilsen, Gunter; Ulvskov, Peter; Wakazuki, Sachiko; Weng, Jing-Ke; Willats, William W.G.T.; Wipf, Daniel; Wolf, Paul G.; Yang, Lixing; Zimmer, Andreas D.; Zhu, Qihui; Mitros, Therese; Hellsten, Uffe; Loque, Dominique; Otillar, Robert; Salamov, Asaf; Schmutz, Jeremy; Shapiro, Harris; Lindquist, Erika; Lucas, Susan; Rokhsar, Daniel

    2011-04-28

    We report the genome sequence of the nonseed vascular plant, Selaginella moellendorffii, and by comparative genomics identify genes that likely played important roles in the early evolution of vascular plants and their subsequent evolution

  10. Genome-Wide RNAi Screens in C. elegans to Identify Genes Influencing Lifespan and Innate Immunity.

    PubMed

    Sinha, Amit; Rae, Robbie

    2016-01-01

    RNA interference is a rapid, inexpensive, and highly effective tool used to inhibit gene function. In C. elegans, whole genome screens have been used to identify genes involved with numerous traits including aging and innate immunity. RNAi in C. elegans can be carried out via feeding, soaking, or injection. Here we outline protocols used to maintain, grow, and carry out RNAi via feeding in C. elegans and determine whether the inhibited genes are essential for lifespan or innate immunity.

  11. Protein functional links in Trypanosoma brucei, identified by gene fusion analysis

    PubMed Central

    2011-01-01

    Background Domain or gene fusion analysis is a bioinformatics method for detecting gene fusions in one organism by comparing its genome to that of other organisms. The occurrence of gene fusions suggests that the two original genes that participated in the fusion are functionally linked, i.e. their gene products interact either as part of a multi-subunit protein complex, or in a metabolic pathway. Gene fusion analysis has been used to identify protein functional links in prokaryotes as well as in eukaryotic model organisms, such as yeast and Drosophila. Results In this study we have extended this approach to include a number of recently sequenced protists, four of which are pathogenic, to identify fusion linked proteins in Trypanosoma brucei, the causative agent of African sleeping sickness. We have also examined the evolution of the gene fusion events identified, to determine whether they can be attributed to fusion or fission, by looking at the conservation of the fused genes and of the individual component genes across the major eukaryotic and prokaryotic lineages. We find relatively limited occurrence of gene fusions/fissions within the protist lineages examined. Our results point to two trypanosome-specific gene fissions, which have recently been experimentally confirmed, one fusion involving proteins involved in the same metabolic pathway, as well as two novel putative functional links between fusion-linked protein pairs. Conclusions This is the first study of protein functional links in T. brucei identified by gene fusion analysis. We have used strict thresholds and only discuss results which are highly likely to be genuine and which either have already been or can be experimentally verified. We discuss the possible impact of the identification of these novel putative protein-protein interactions, to the development of new trypanosome therapeutic drugs. PMID:21729286

  12. Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.

    PubMed

    Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin

    2016-04-01

    Brucella spp. are facultative intracellular pathogens, that cause a contagious zoonotic disease, that can result in such outcomes as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. For deciphering the survival mechanism of Brucella spp. in vivo, 42 Brucella complete genomes from NCBI were analyzed for the pan-genome and core genome by identification of their composition and function of Brucella genomes. The results showed that the total 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44 % of the core genes were devoted to metabolism, which were mainly responsible for energy production and conversion (COG category C), and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35 % of the core genes were in positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conservation, and the energy and amino acid metabolism play a more important role in the process of growth and reproduction in Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of the intracellular survival in Brucella spp.

  13. Oscope identifies oscillatory genes in unsynchronized single-cell RNA-seq experiments.

    PubMed

    Leng, Ning; Chu, Li-Fang; Barry, Chris; Li, Yuan; Choi, Jeea; Li, Xiaomao; Jiang, Peng; Stewart, Ron M; Thomson, James A; Kendziorski, Christina

    2015-10-01

    Oscillatory gene expression is fundamental to development, but technologies for monitoring expression oscillations are limited. We have developed a statistical approach called Oscope to identify and characterize the transcriptional dynamics of oscillating genes in single-cell RNA-seq data from an unsynchronized cell population. Applying Oscope to a number of data sets, we demonstrated its utility and also identified a potential artifact in the Fluidigm C1 platform.

  14. Epigenetic Characterization of the Growth Hormone Gene Identifies SmcHD1 as a Regulator of Autosomal Gene Clusters

    PubMed Central

    Massah, Shabnam; Hollebakken, Robert; Labrecque, Mark P.; Kolybaba, Addie M.; Beischlag, Timothy V.; Prefontaine, Gratien G.

    2014-01-01

    Regulatory elements for the mouse growth hormone (GH) gene are located distally in a putative locus control region (LCR) in addition to key elements in the promoter proximal region. The role of promoter DNA methylation for GH gene regulation is not well understood. Pit-1 is a POU transcription factor required for normal pituitary development and obligatory for GH gene expression. In mammals, Pit-1 mutations eliminate GH production resulting in a dwarf phenotype. In this study, dwarf mice illustrated that Pit-1 function was obligatory for GH promoter hypomethylation. By monitoring promoter methylation levels during developmental GH expression we found that the GH promoter became hypomethylated coincident with gene expression. We identified a promoter differentially methylated region (DMR) that was used to characterize a methylation-dependent DNA binding activity. Upon DNA affinity purification using the DMR and nuclear extracts, we identified structural maintenance of chromosomes hinge domain containing -1 (SmcHD1). To better understand the role of SmcHD1 in genome-wide gene expression, we performed microarray analysis and compared changes in gene expression upon reduced levels of SmcHD1 in human cells. Knock-down of SmcHD1 in human embryonic kidney (HEK293) cells revealed a disproportionate number of up-regulated genes were located on the X-chromosome, but also suggested regulation of genes on non-sex chromosomes. Among those, we identified several genes located in the protocadherin β cluster. In addition, we found that imprinted genes in the H19/Igf2 cluster associated with Beckwith-Wiedemann and Silver-Russell syndromes (BWS & SRS) were dysregulated. For the first time using human cells, we showed that SmcHD1 is an important regulator of imprinted and clustered genes. PMID:24818964

  15. A Novel Prioritization Method in Identifying Recurrent Venous Thromboembolism-Related Genes

    PubMed Central

    Xie, Ruiqiang; Chen, Binbin; Huang, Hao; Li, Yiran; He, Yuehan; Lv, Junjie; He, Weiming; Chen, Lina

    2016-01-01

    Identifying the genes involved in venous thromboembolism (VTE) recurrence is important not only for understanding the pathogenesis but also for discovering the therapeutic targets. We proposed a novel prioritization method called Function-Interaction-Pearson (FIP) by creating gene-disease similarity scores to prioritize candidate genes underling VTE. The scores were calculated by integrating and optimizing three types of resources including gene expression, gene ontology and protein-protein interaction. As a result, 124 out of top 200 prioritized candidate genes had been confirmed in literature, among which there were 34 antithrombotic drug targets. Compared with two well-known gene prioritization tools Endeavour and ToppNet, FIP was shown to have better performance. The approach provides a valuable alternative for drug targets discovery and disease therapy. PMID:27050193

  16. Identifying Novel Transcriptional and Epigenetic Features of Nuclear Lamina-associated Genes.

    PubMed

    Wu, Feinan; Yao, Jie

    2017-12-01

    Because a large portion of the mammalian genome is associated with the nuclear lamina (NL), it is interesting to study how native genes resided there are transcribed and regulated. In this study, we report unique transcriptional and epigenetic features of nearly 3,500 NL-associated genes (NL genes). Promoter regions of active NL genes are often excluded from NL-association, suggesting that NL-promoter interactions may repress transcription. Active NL genes with higher RNA polymerase II (Pol II) recruitment levels tend to display Pol II promoter-proximal pausing, while Pol II recruitment and Pol II pausing are not correlated among non-NL genes. At the genome-wide scale, NL-association and H3K27me3 distinguishes two large gene classes with low transcriptional activities. Notably, NL-association is anti-correlated with both transcription and active histone mark levels among genes not significantly enriched with H3K9me3 or H3K27me3, suggesting that NL-association may represent a novel gene repression pathway. Interestingly, an NL gene subgroup is not significantly enriched with H3K9me3 or H3K27me3 and is transcribed at higher levels than the rest of NL genes. Furthermore, we identified distal enhancers associated with active NL genes and reported their epigenetic features.

  17. A genomic strategy for the functional validation of colorectal cancer genes identifies potential therapeutic targets.

    PubMed

    Grade, Marian; Hummon, Amanda B; Camps, Jordi; Emons, Georg; Spitzner, Melanie; Gaedcke, Jochen; Hoermann, Patrick; Ebner, Reinhard; Becker, Heinz; Difilippantonio, Michael J; Ghadimi, B Michael; Beissbarth, Tim; Caplen, Natasha J; Ried, Thomas

    2011-03-01

    Genes that are highly overexpressed in tumor cells can be required for tumor cell survival and have the potential to be selective therapeutic targets. In an attempt to identify such targets, we combined a functional genomics and a systems biology approach to assess the consequences of RNAi-mediated silencing of overexpressed genes that were selected from 140 gene expression profiles from colorectal cancers (CRCs) and matched normal mucosa. In order to identify credible models for in-depth functional analysis, we first confirmed the overexpression of these genes in 25 different CRC cell lines. We then identified five candidate genes that profoundly reduced the viability of CRC cell lines when silenced with either siRNAs or short-hairpin RNAs (shRNAs), i.e., HMGA1, TACSTD2, RRM2, RPS2 and NOL5A. These genes were further studied by systematic analysis of comprehensive gene expression profiles generated following siRNA-mediated silencing. Exploration of these RNAi-specific gene expression signatures allowed the identification of the functional space in which the five genes operate and showed enrichment for cancer-specific signaling pathways, some known to be involved in CRC. By comparing the expression of the RNAi signature genes with their respective expression levels in an independent set of primary rectal carcinomas, we could recapitulate these defined RNAi signatures, therefore, establishing the biological relevance of our observations. This strategy identified the signaling pathways that are affected by the prominent oncogenes HMGA1 and TACSTD2, established a yet unknown link between RRM2 and PLK1 and identified RPS2 and NOL5A as promising potential therapeutic targets in CRC.

  18. A cross-species bi-clustering approach to identifying conserved co-regulated genes

    PubMed Central

    Sun, Jiangwen; Jiang, Zongliang; Tian, Xiuchun; Bi, Jinbo

    2016-01-01

    Motivation: A growing number of studies have explored the process of pre-implantation embryonic development of multiple mammalian species. However, the conservation and variation among different species in their developmental programming are poorly defined due to the lack of effective computational methods for detecting co-regularized genes that are conserved across species. The most sophisticated method to date for identifying conserved co-regulated genes is a two-step approach. This approach first identifies gene clusters for each species by a cluster analysis of gene expression data, and subsequently computes the overlaps of clusters identified from different species to reveal common subgroups. This approach is ineffective to deal with the noise in the expression data introduced by the complicated procedures in quantifying gene expression. Furthermore, due to the sequential nature of the approach, the gene clusters identified in the first step may have little overlap among different species in the second step, thus difficult to detect conserved co-regulated genes. Results: We propose a cross-species bi-clustering approach which first denoises the gene expression data of each species into a data matrix. The rows of the data matrices of different species represent the same set of genes that are characterized by their expression patterns over the developmental stages of each species as columns. A novel bi-clustering method is then developed to cluster genes into subgroups by a joint sparse rank-one factorization of all the data matrices. This method decomposes a data matrix into a product of a column vector and a row vector where the column vector is a consistent indicator across the matrices (species) to identify the same gene cluster and the row vector specifies for each species the developmental stages that the clustered genes co-regulate. Efficient optimization algorithm has been developed with convergence analysis. This approach was first validated on

  19. A cross-species genetic analysis identifies candidate genes for mouse anxiety and human bipolar disorder

    PubMed Central

    Ashbrook, David G.; Williams, Robert W.; Lu, Lu; Hager, Reinmar

    2015-01-01

    Bipolar disorder (BD) is a significant neuropsychiatric disorder with a lifetime prevalence of ~1%. To identify genetic variants underlying BD genome-wide association studies (GWAS) have been carried out. While many variants of small effect associated with BD have been identified few have yet been confirmed, partly because of the low power of GWAS due to multiple comparisons being made. Complementary mapping studies using murine models have identified genetic variants for behavioral traits linked to BD, often with high power, but these identified regions often contain too many genes for clear identification of candidate genes. In the current study we have aligned human BD GWAS results and mouse linkage studies to help define and evaluate candidate genes linked to BD, seeking to use the power of the mouse mapping with the precision of GWAS. We use quantitative trait mapping for open field test and elevated zero maze data in the largest mammalian model system, the BXD recombinant inbred mouse population, to identify genomic regions associated with these BD-like phenotypes. We then investigate these regions in whole genome data from the Psychiatric Genomics Consortium's bipolar disorder GWAS to identify candidate genes associated with BD. Finally we establish the biological relevance and pathways of these genes in a comprehensive systems genetics analysis. We identify four genes associated with both mouse anxiety and human BD. While TNR is a novel candidate for BD, we can confirm previously suggested associations with CMYA5, MCTP1, and RXRG. A cross-species, systems genetics analysis shows that MCTP1, RXRG, and TNR coexpress with genes linked to psychiatric disorders and identify the striatum as a potential site of action. CMYA5, MCTP1, RXRG, and TNR are associated with mouse anxiety and human BD. We hypothesize that MCTP1, RXRG, and TNR influence intercellular signaling in the striatum. PMID:26190982

  20. Oligonucleotide microarray identifies genes differentially expressed during tumorigenesis of DMBA-induced pancreatic cancer in rats.

    PubMed

    Guo, Jun-Chao; Li, Jian; Yang, Ying-Chi; Zhou, Li; Zhang, Tai-Ping; Zhao, Yu-Pei

    2013-01-01

    The extremely dismal prognosis of pancreatic cancer (PC) is attributed, at least in part, to lack of early diagnosis. Therefore, identifying differentially expressed genes in multiple steps of tumorigenesis of PC is of great interest. In the present study, a 7,12-dimethylbenzanthraene (DMBA)-induced PC model was established in male Sprague-Dawley rats. The gene expression profile was screened using an oligonucleotide microarray, followed by real-time quantitative polymerase chain reaction (qRT-PCR) and immunohistochemical staining validation. A total of 661 differentially expressed genes were identified in stages of pancreatic carcinogenesis. According to GO classification, these genes were involved in multiple molecular pathways. Using two-way hierarchical clustering analysis, normal pancreas, acute and chronic pancreatitis, PanIN, early and advanced pancreatic cancer were completely discriminated. Furthermore, 11 upregulated and 142 downregulated genes (probes) were found by Mann-Kendall trend Monotone test, indicating homologous genes of rat and human. The qRT-PCR and immunohistochemistry analysis of CXCR7 and UBe2c, two of the identified genes, confirmed the microarray results. In human PC cell lines, knockdown of CXCR7 resulted in decreased migration and invasion. Collectively, our data identified several promising markers and therapeutic targets of PC based on a comprehensive screening and systemic validation.

  1. Gene-Based Genome-Wide Association Analysis in European and Asian Populations Identified Novel Genes for Rheumatoid Arthritis

    PubMed Central

    Zhu, Hong; Xia, Wei; Mo, Xing-Bo; Lin, Xiang; Qiu, Ying-Hua; Yi, Neng-Jun; Zhang, Yong-Hong; Deng, Fei-Yan; Lei, Shu-Feng

    2016-01-01

    Objective Rheumatoid arthritis (RA) is a complex autoimmune disease. Using a gene-based association research strategy, the present study aims to detect unknown susceptibility to RA and to address the ethnic differences in genetic susceptibility to RA between European and Asian populations. Methods Gene-based association analyses were performed with KGG 2.5 by using publicly available large RA datasets (14,361 RA cases and 43,923 controls of European subjects, 4,873 RA cases and 17,642 controls of Asian Subjects). For the newly identified RA-associated genes, gene set enrichment analyses and protein-protein interactions analyses were carried out with DAVID and STRING version 10.0, respectively. Differential expression verification was conducted using 4 GEO datasets. The expression levels of three selected ‘highly verified’ genes were measured by ELISA among our in-house RA cases and controls. Results A total of 221 RA-associated genes were newly identified by gene-based association study, including 71‘overlapped’, 76 ‘European-specific’ and 74 ‘Asian-specific’ genes. Among them, 105 genes had significant differential expressions between RA patients and health controls at least in one dataset, especially for 20 genes including 11 ‘overlapped’ (ABCF1, FLOT1, HLA-F, IER3, TUBB, ZKSCAN4, BTN3A3, HSP90AB1, CUTA, BRD2, HLA-DMA), 5 ‘European-specific’ (PHTF1, RPS18, BAK1, TNFRSF14, SUOX) and 4 ‘Asian-specific’ (RNASET2, HFE, BTN2A2, MAPK13) genes whose differential expressions were significant at least in three datasets. The protein expressions of two selected genes FLOT1 (P value = 1.70E-02) and HLA-DMA (P value = 4.70E-02) in plasma were significantly different in our in-house samples. Conclusion Our study identified 221 novel RA-associated genes and especially highlighted the importance of 20 candidate genes on RA. The results addressed ethnic genetic background differences for RA susceptibility between European and Asian populations and

  2. Identifying stably expressed genes from multiple RNA-Seq data sets

    PubMed Central

    Emerson, Sarah; Chang, Jeff H.; Di, Yanming

    2016-01-01

    We examined RNA-Seq data on 211 biological samples from 24 different Arabidopsis experiments carried out by different labs. We grouped the samples according to tissue types, and in each of the groups, we identified genes that are stably expressed across biological samples, treatment conditions, and experiments. We fit a Poisson log-linear mixed-effect model to the read counts for each gene and decomposed the total variance into between-sample, between-treatment and between-experiment variance components. Identifying stably expressed genes is useful for count normalization and differential expression analysis. The variance component analysis that we explore here is a first step towards understanding the sources and nature of the RNA-Seq count variation. When using a numerical measure to identify stably expressed genes, the outcome depends on multiple factors: the background sample set and the reference gene set used for count normalization, the technology used for measuring gene expression, and the specific numerical stability measure used. Since differential expression (DE) is measured by relative frequencies, we argue that DE is a relative concept. We advocate using an explicit reference gene set for count normalization to improve interpretability of DE results, and recommend using a common reference gene set when analyzing multiple RNA-Seq experiments to avoid potential inconsistent conclusions. PMID:28028467

  3. Transcriptional Analysis of Gli3 Mutants Identifies Wnt Target Genes in the Developing Hippocampus

    PubMed Central

    Hasenpusch-Theil, Kerstin; Magnani, Dario; Amaniti, Eleni-Maria; Han, Lin; Armstrong, Douglas

    2012-01-01

    Early development of the hippocampus, which is essential for spatial memory and learning, is controlled by secreted signaling molecules of the Wnt gene family and by Wnt/β-catenin signaling. Despite its importance, little is known, however, about Wnt-regulated genes during hippocampal development. Here, we used the Gli3 mutant mouse extra-toes (XtJ), in which Wnt gene expression in the forebrain is severely affected, as a tool in a microarray analyses to identify potential Wnt target genes. This approach revealed 53 candidate genes with restricted or graded expression patterns in the dorsomedial telencephalon. We identified conserved Tcf/Lef-binding sites in telencephalon-specific enhancers of several of these genes, including Dmrt3, Gli3, Nfia, and Wnt8b. Binding of Lef1 to these sites was confirmed using electrophoretic mobility shift assays. Mutations in these Tcf/Lef-binding sites disrupted or reduced enhancer activity in vivo. Moreover, ectopic activation of Wnt/β-catenin signaling in an ex vivo explant system led to increased telencephalic expression of these genes. Finally, conditional inactivation of Gli3 results in defective hippocampal growth. Collectively, these data strongly suggest that we have identified a set of direct Wnt target genes in the developing hippocampus and provide inside into the genetic hierarchy underlying Wnt-regulated hippocampal development. PMID:22235033

  4. Identifying reproducible cancer-associated highly expressed genes with important functional significances using multiple datasets

    PubMed Central

    Huang, Haiyan; Li, Xiangyu; Guo, You; Zhang, Yuncong; Deng, Xusheng; Chen, Lufei; Zhang, Jiahui; Guo, Zheng; Ao, Lu

    2016-01-01

    Identifying differentially expressed (DE) genes between cancer and normal tissues is of basic importance for studying cancer mechanisms. However, current methods, such as the commonly used Significance Analysis of Microarrays (SAM), are biased to genes with low expression levels. Recently, we proposed an algorithm, named the pairwise difference (PD) algorithm, to identify highly expressed DE genes based on reproducibility evaluation of top-ranked expression differences between paired technical replicates of cells under two experimental conditions. In this study, we extended the application of the algorithm to the identification of DE genes between two types of tissue samples (biological replicates) based on several independent datasets or sub-datasets of a dataset, by constructing multiple paired average gene expression profiles for the two types of samples. Using multiple datasets for lung and esophageal cancers, we demonstrated that PD could identify many DE genes highly expressed in both cancer and normal tissues that tended to be missed by the commonly used SAM. These highly expressed DE genes, including many housekeeping genes, were significantly enriched in many conservative pathways, such as ribosome, proteasome, phagosome and TNF signaling pathways with important functional significances in oncogenesis. PMID:27796338

  5. Genes associated with agronomic traits in non-heading Chinese cabbage identified by expression profiling

    PubMed Central

    2014-01-01

    Background The genomes of non-heading Chinese cabbage (Brassica rapa ssp. chinensis), heading Chinese cabbage (Brassica rapa ssp. pekinensis) and their close relative Arabidopsis thaliana have provided important resources for studying the evolution and genetic improvement of cruciferous plants. Natural growing conditions present these plants with a variety of physiological challenges for which they have a repertoire of genes that ensure adaptability and normal growth. We investigated the differential expressions of genes that control adaptability and development in plants growing in the natural environment to study underlying mechanisms of their expression. Results Using digital gene expression tag profiling, we constructed an expression profile to identify genes related to important agronomic traits under natural growing conditions. Among three non-heading Chinese cabbage cultivars, we found thousands of genes that exhibited significant differences in expression levels at five developmental stages. Through comparative analysis and previous reports, we identified several candidate genes associated with late flowering, cold tolerance, self-incompatibility, and leaf color. Two genes related to cold tolerance were verified using quantitative real-time PCR. Conclusions We identified a large number of genes associated with important agronomic traits of non-heading Chinese cabbage. This analysis will provide a wealth of resources for molecular-assisted breeding of cabbage. The raw data and detailed results of this analysis are available at the website http://nhccdata.njau.edu.cn. PMID:24655567

  6. The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

    PubMed

    Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

    2013-10-01

    The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens.

  7. Genes that affect brain structure and function identified by rare variant analyses of Mendelian neurologic disease

    PubMed Central

    Karaca, Ender; Harel, Tamar; Pehlivan, Davut; Jhangiani, Shalini N.; Gambin, Tomasz; Akdemir, Zeynep Coban; Gonzaga-Jauregui, Claudia; Erdin, Serkan; Bayram, Yavuz; Campbell, Ian M.; Hunter, Jill V.; Atik, Mehmed M.; Van Esch, Hilde; Yuan, Bo; Wiszniewski, Wojciech; Isikay, Sedat; Yesil, Gozde; Yuregir, Ozge O.; Bozdogan, Sevcan Tug; Aslan, Huseyin; Aydin, Hatip; Tos, Tulay; Aksoy, Ayse; De Vivo, Darryl C.; Jain, Preti; Geckinli, B. Bilge; Sezer, Ozlem; Gul, Davut; Durmaz, Burak; Cogulu, Ozgur; Ozkinay, Ferda; Topcu, Vehap; Candan, Sukru; Cebi, Alper Han; Ikbal, Mevlit; Gulec, Elif Yilmaz; Gezdirici, Alper; Koparir, Erkan; Ekici, Fatma; Coskun, Salih; Cicek, Salih; Karaer, Kadri; Koparir, Asuman; Duz, Mehmet Bugrahan; Kirat, Emre; Fenercioglu, Elif; Ulucan, Hakan; Seven, Mehmet; Guran, Tulay; Elcioglu, Nursel; Yildirim, Mahmut Selman; Aktas, Dilek; Alikaşifoğlu, Mehmet; Ture, Mehmet; Yakut, Tahsin; Overton, John D.; Yuksel, Adnan; Ozen, Mustafa; Muzny, Donna M.; Adams, David R.; Boerwinkle, Eric; Chung, Wendy K.; Gibbs, Richard A.; Lupski, James R

    2015-01-01

    Development of the human nervous system involves complex interactions between fundamental cellular processes and requires a multitude of genes, many of which remain to be associated with human disease. We applied whole exome sequencing to 128 mostly consanguineous families with neurogenetic disorders that often included brain malformations. Rare variant analyses for both single nucleotide variant (SNV) and copy number variant (CNV) alleles allowed for identification of 45 novel variants in 43 known disease genes, 41 candidate genes, and CNVs in 10 families, with an overall potential molecular cause identified in >85% of families studied. Among the candidate genes identified, we found PRUNE, VARS, and DHX37 in multiple families, and homozygous loss of function variants in AGBL2, SLC18A2, SMARCA1, UBQLN1, and CPLX1. Neuroimaging and in silico analysis of functional and expression proximity between candidate and known disease genes allowed for further understanding of genetic networks underlying specific types of brain malformations. PMID:26539891

  8. Utilization of digital differential display to identify differentially expressed genes related to rumen development.

    PubMed

    Kato, Daichi; Suzuki, Yutaka; Haga, Satoshi; So, KyoungHa; Yamauchi, Eri; Nakano, Miwa; Ishizaki, Hiroshi; Choi, Kichoon; Katoh, Kazuo; Roh, Sang-Gun

    2016-04-01

    This study aimed to identify the genes associated with the development of the rumen epithelium by screening for candidate genes by digital differential display (DDD) in silico. Using DDD in NCBI's UniGene database, expressed sequence tag (EST)-based gene expression profiles were analyzed in rumen, reticulum, omasum, abomasum and other tissues in cattle. One hundred and ten candidate genes with high expression in the rumen were derived from a library of all tissues. The expression levels of 11 genes in all candidate genes were analyzed in the rumen, reticulum, omasum and abomasum of nine Japanese Black male calves (5-week-old pre-weaning: n = 3; 15-week-old weaned calves: n = 6). Among the 11 genes, only 3-hydroxy-3-methylglutaryl-CoA synthase 2 (HMGCS2), aldo-keto reductase family 1, member C1-like (AKR1C1), and fatty acid binding protein 3 (FABP3) showed significant changes in the levels of gene expression in the rumen between the pre- and post-weaning of calves. These results indicate that DDD analysis in silico can be useful for screening candidate genes related to rumen development, and that the changes in expression levels of three genes in the rumen may have been caused by weaning, aging or both.

  9. CAsubtype: An R Package to Identify Gene Sets Predictive of Cancer Subtypes and Clinical Outcomes.

    PubMed

    Kong, Hualei; Tong, Pan; Zhao, Xiaodong; Sun, Jielin; Li, Hua

    2017-01-21

    In the past decade, molecular classification of cancer has gained high popularity owing to its high predictive power on clinical outcomes as compared with traditional methods commonly used in clinical practice. In particular, using gene expression profiles, recent studies have successfully identified a number of gene sets for the delineation of cancer subtypes that are associated with distinct prognosis. However, identification of such gene sets remains a laborious task due to the lack of tools with flexibility, integration and ease of use. To reduce the burden, we have developed an R package, CAsubtype, to efficiently identify gene sets predictive of cancer subtypes and clinical outcomes. By integrating more than 13,000 annotated gene sets, CAsubtype provides a comprehensive repertoire of candidates for new cancer subtype identification. For easy data access, CAsubtype further includes the gene expression and clinical data of more than 2000 cancer patients from TCGA. CAsubtype first employs principal component analysis to identify gene sets (from user-provided or package-integrated ones) with robust principal components representing significantly large variation between cancer samples. Based on these principal components, CAsubtype visualizes the sample distribution in low-dimensional space for better understanding of the distinction between samples and classifies samples into subgroups with prevalent clustering algorithms. Finally, CAsubtype performs survival analysis to compare the clinical outcomes between the identified subgroups, assessing their clinical value as potentially novel cancer subtypes. In conclusion, CAsubtype is a flexible and well-integrated tool in the R environment to identify gene sets for cancer subtype identification and clinical outcome prediction. Its simple R commands and comprehensive data sets enable efficient examination of the clinical value of any given gene set, thus facilitating hypothesis generating and testing in biological and

  10. Microarray analysis of hepatic gene expression identifies new genes involved in steatotic liver

    PubMed Central

    Guillén, Natalia; Navarro, María A.; Arnal, Carmen; Noone, Enda; Arbonés-Mainar, José M.; Acín, Sergio; Surra, Joaquín C.; Muniesa, Pedro; Roche, Helen M.; Osada, Jesús

    2009-01-01

    Trans-10, cis-12-conjugated linoleic acid (CLA)-enriched diets promote fatty liver in mice, while cis-9, trans-11-CLA ameliorates this effect, suggesting regulation of multiple genes. To test this hypothesis, apoE-deficient mice were fed a Western-type diet enriched with linoleic acid isomers, and their hepatic gene expression was analyzed with DNA microarrays. To provide an initial screening of candidate genes, only 12 with remarkably modified expression between both CLA isomers were considered and confirmed by quantitative RT-PCR. Additionally mRNA expression of 15 genes involved in lipid metabolism was also studied. Ten genes (Fsp27, Aqp4, Cd36, Ly6d, Scd1, Hsd3b5, Syt1, Cyp7b1, and Tff3) showed significant associations among their expressions and the degree of hepatic steatosis. Their involvement was also analyzed in other models of steatosis. In hyperhomocysteinemic mice lacking Cbs gene, only Fsp27, Cd36, Scd1, Syt1, and Hsd3b5 hepatic expressions were associated with steatosis. In apoE-deficient mice consuming olive-enriched diet displaying reduction of the fatty liver, only Fsp27 and Syt1 expressions were found associated. Using this strategy, we have shown that expression of these genes is highly associated with hepatic steatosis in a genetic disease such as Cbs deficiency and in two common situations such as Western diets containing CLA isomers or a Mediterranean-type diet. Conclusion: The results highlight new processes involved in lipid handling in liver and will help to understand the complex human pathology providing new proteins and new strategies to cope with hepatic steatosis. PMID:19258494

  11. Variability of Gene Expression Identifies Transcriptional Regulators of Early Human Embryonic Development

    PubMed Central

    Hasegawa, Yu; Taylor, Deanne; Ovchinnikov, Dmitry A.; Wolvetang, Ernst J.; de Torrenté, Laurence; Mar, Jessica C.

    2015-01-01

    An analysis of gene expression variability can provide an insightful window into how regulatory control is distributed across the transcriptome. In a single cell analysis, the inter-cellular variability of gene expression measures the consistency of transcript copy numbers observed between cells in the same population. Application of these ideas to the study of early human embryonic development may reveal important insights into the transcriptional programs controlling this process, based on which components are most tightly regulated. Using a published single cell RNA-seq data set of human embryos collected at four-cell, eight-cell, morula and blastocyst stages, we identified genes with the most stable, invariant expression across all four developmental stages. Stably-expressed genes were found to be enriched for those sharing indispensable features, including essentiality, haploinsufficiency, and ubiquitous expression. The stable genes were less likely to be associated with loss-of-function variant genes or human recessive disease genes affected by a DNA copy number variant deletion, suggesting that stable genes have a functional impact on the regulation of some of the basic cellular processes. Genes with low expression variability at early stages of development are involved in regulation of DNA methylation, responses to hypoxia and telomerase activity, whereas by the blastocyst stage, low-variability genes are enriched for metabolic processes as well as telomerase signaling. Based on changes in expression variability, we identified a putative set of gene expression markers of morulae and blastocyst stages. Experimental validation of a blastocyst-expressed variability marker demonstrated that HDDC2 plays a role in the maintenance of pluripotency in human ES and iPS cells. Collectively our analyses identified new regulators involved in human embryonic development that would have otherwise been missed using methods that focus on assessment of the average expression

  12. Overexpression screens identify conserved dosage chromosome instability genes in yeast and human cancer

    PubMed Central

    Duffy, Supipi; Fam, Hok Khim; Wang, Yi Kan; Styles, Erin B.; Kim, Jung-Hyun; Ang, J. Sidney; Singh, Tejomayee; Larionov, Vladimir; Shah, Sohrab P.; Andrews, Brenda; Boerkoel, Cornelius F.; Hieter, Philip

    2016-01-01

    Somatic copy number amplification and gene overexpression are common features of many cancers. To determine the role of gene overexpression on chromosome instability (CIN), we performed genome-wide screens in the budding yeast for yeast genes that cause CIN when overexpressed, a phenotype we refer to as dosage CIN (dCIN), and identified 245 dCIN genes. This catalog of genes reveals human orthologs known to be recurrently overexpressed and/or amplified in tumors. We show that two genes, TDP1, a tyrosyl-DNA-phosphdiesterase, and TAF12, an RNA polymerase II TATA-box binding factor, cause CIN when overexpressed in human cells. Rhabdomyosarcoma lines with elevated human Tdp1 levels also exhibit CIN that can be partially rescued by siRNA-mediated knockdown of TDP1. Overexpression of dCIN genes represents a genetic vulnerability that could be leveraged for selective killing of cancer cells through targeting of an unlinked synthetic dosage lethal (SDL) partner. Using SDL screens in yeast, we identified a set of genes that when deleted specifically kill cells with high levels of Tdp1. One gene was the histone deacetylase RPD3, for which there are known inhibitors. Both HT1080 cells overexpressing hTDP1 and rhabdomyosarcoma cells with elevated levels of hTdp1 were more sensitive to histone deacetylase inhibitors valproic acid (VPA) and trichostatin A (TSA), recapitulating the SDL interaction in human cells and suggesting VPA and TSA as potential therapeutic agents for tumors with elevated levels of hTdp1. The catalog of dCIN genes presented here provides a candidate list to identify genes that cause CIN when overexpressed in cancer, which can then be leveraged through SDL to selectively target tumors. PMID:27551064

  13. Identifying Novel Candidate Genes Related to Apoptosis from a Protein-Protein Interaction Network

    PubMed Central

    Wang, Baoman; Yuan, Fei; Kong, Xiangyin; Hu, Lan-Dian; Cai, Yu-Dong

    2015-01-01

    Apoptosis is the process of programmed cell death (PCD) that occurs in multicellular organisms. This process of normal cell death is required to maintain the balance of homeostasis. In addition, some diseases, such as obesity, cancer, and neurodegenerative diseases, can be cured through apoptosis, which produces few side effects. An effective comprehension of the mechanisms underlying apoptosis will be helpful to prevent and treat some diseases. The identification of genes related to apoptosis is essential to uncover its underlying mechanisms. In this study, a computational method was proposed to identify novel candidate genes related to apoptosis. First, protein-protein interaction information was used to construct a weighted graph. Second, a shortest path algorithm was applied to the graph to search for new candidate genes. Finally, the obtained genes were filtered by a permutation test. As a result, 26 genes were obtained, and we discuss their likelihood of being novel apoptosis-related genes by collecting evidence from published literature. PMID:26543496

  14. A Screen for Genes Expressed in the Olfactory Organs of Drosophila melanogaster Identifies Genes Involved in Olfactory Behaviour

    PubMed Central

    Tunstall, Narelle E.; Herr, Anabel; de Bruyne, Marien; Warr, Coral G.

    2012-01-01

    Background For insects the sense of smell and associated olfactory-driven behaviours are essential for survival. Insects detect odorants with families of olfactory receptor proteins that are very different to those of mammals, and there are likely to be other unique genes and genetic pathways involved in the function and development of the insect olfactory system. Methodology/Principal Findings We have performed a genetic screen of a set of 505 Drosophila melanogaster gene trap insertion lines to identify novel genes expressed in the adult olfactory organs. We identified 16 lines with expression in the olfactory organs, many of which exhibited expression of the trapped genes in olfactory receptor neurons. Phenotypic analysis showed that six of the lines have decreased olfactory responses in a behavioural assay, and for one of these we showed that precise excision of the P element reverts the phenotype to wild type, confirming a role for the trapped gene in olfaction. To confirm the identity of the genes trapped in the lines we performed molecular analysis of some of the insertion sites. While for many lines the reported insertion sites were correct, we also demonstrated that for a number of lines the reported location of the element was incorrect, and in three lines there were in fact two pGT element insertions. Conclusions/Significance We identified 16 new genes expressed in the Drosophila olfactory organs, the majority in neurons, and for several of the gene trap lines demonstrated a defect in olfactory-driven behaviour. Further characterisation of these genes and their roles in olfactory system function and development will increase our understanding of how the insect olfactory system has evolved to perform the same essential function to that of mammals, but using very different molecular genetic mechanisms. PMID:22530061

  15. A combined analysis of microarray gene expression studies of the human prefrontal cortex identifies genes implicated in schizophrenia.

    PubMed

    Pérez-Santiago, Josué; Diez-Alarcia, Rebeca; Callado, Luis F; Zhang, Jin X; Chana, Gursharan; White, Cory H; Glatt, Stephen J; Tsuang, Ming T; Everall, Ian P; Meana, J Javier; Woelk, Christopher H

    2012-11-01

    Small cohort sizes and modest levels of gene expression changes in brain tissue have plagued the statistical approaches employed in microarray studies investigating the mechanism of schizophrenia. To combat these problems a combined analysis of six prior microarray studies was performed to facilitate the robust statistical analysis of gene expression data from the dorsolateral prefrontal cortex of 107 patients with schizophrenia and 118 healthy subjects. Multivariate permutation tests identified 144 genes that were differentially expressed between schizophrenia and control groups. Seventy of these genes were identified as differentially expressed in at least one component microarray study but none of these individual studies had the power to identify the remaining 74 genes, demonstrating the utility of a combined approach. Gene ontology terms and biological pathways that were significantly enriched for differentially expressed genes were related to neuronal cell-cell signaling, mesenchymal induction, and mitogen-activated protein kinase signaling, which have all previously been associated with the etiopathogenesis of schizophrenia. The differential expression of BAG3, C4B, EGR1, MT1X, NEUROD6, SST and S100A8 was confirmed by real-time quantitative PCR in an independent cohort using postmortem human prefrontal cortex samples. Comparison of gene expression between schizophrenic subjects with and without detectable levels of antipsychotics in their blood suggests that the modulation of MT1X and S100A8 may be the result of drug exposure. In conclusion, this combined analysis has resulted in a statistically robust identification of genes whose dysregulation may contribute to the mechanism of schizophrenia.

  16. Functional genomics screening utilizing mutant mouse embryonic stem cells identifies novel radiation-response genes.

    PubMed

    Loesch, Kimberly; Galaviz, Stacy; Hamoui, Zaher; Clanton, Ryan; Akabani, Gamal; Deveau, Michael; DeJesus, Michael; Ioerger, Thomas; Sacchettini, James C; Wallis, Deeann

    2015-01-01

    Elucidating the genetic determinants of radiation response is crucial to optimizing and individualizing radiotherapy for cancer patients. In order to identify genes that are involved in enhanced sensitivity or resistance to radiation, a library of stable mutant murine embryonic stem cells (ESCs), each with a defined mutation, was screened for cell viability and gene expression in response to radiation exposure. We focused on a cancer-relevant subset of over 500 mutant ESC lines. We identified 13 genes; 7 genes that have been previously implicated in radiation response and 6 other genes that have never been implicated in radiation response. After screening, proteomic analysis showed enrichment for genes involved in cellular component disassembly (e.g. Dstn and Pex14) and regulation of growth (e.g. Adnp2, Epc1, and Ing4). Overall, the best targets with the highest potential for sensitizing cancer cells to radiation were Dstn and Map2k6, and the best targets for enhancing resistance to radiation were Iqgap and Vcan. Hence, we provide compelling evidence that screening mutant ESCs is a powerful approach to identify genes that alter radiation response. Ultimately, this knowledge can be used to define genetic variants or therapeutic targets that will enhance clinical therapy.

  17. Haplotype Association Mapping Identifies a Candidate Gene Region in Mice Infected With Staphylococcus aureus.

    PubMed

    Johnson, Nicole V; Ahn, Sun Hee; Deshmukh, Hitesh; Levin, Mikhail K; Nelson, Charlotte L; Scott, William K; Allen, Andrew; Fowler, Vance G; Cowell, Lindsay G

    2012-06-01

    Exposure to Staphylococcus aureus has a variety of outcomes, from asymptomatic colonization to fatal infection. Strong evidence suggests that host genetics play an important role in susceptibility, but the specific host genetic factors involved are not known. The availability of genome-wide single nucleotide polymorphism (SNP) data for inbred Mus musculus strains means that haplotype association mapping can be used to identify candidate susceptibility genes. We applied haplotype association mapping to Perlegen SNP data and kidney bacterial counts from Staphylococcus aureus-infected mice from 13 inbred strains and detected an associated block on chromosome 7. Strong experimental evidence supports the result: a separate study demonstrated the presence of a susceptibility locus on chromosome 7 using consomic mice. The associated block contains no genes, but lies within the gene cluster of the 26-member extended kallikrein gene family, whose members have well-recognized roles in the generation of antimicrobial peptides and the regulation of inflammation. Efficient mixed-model association (EMMA) testing of all SNPs with two alleles and located within the gene cluster boundaries finds two significant associations: one of the three polymorphisms defining the associated block and one in the gene closest to the block, Klk1b11. In addition, we find that 7 of the 26 kallikrein genes are differentially expressed between susceptible and resistant mice, including the Klk1b11 gene. These genes represent a promising set of candidate genes influencing susceptibility to Staphylococcus aureus.

  18. Suppression subtractive hybridization and comparative expression analysis to identify developmentally regulated genes in filamentous fungi.

    PubMed

    Gesing, Stefan; Schindler, Daniel; Nowrousian, Minou

    2013-09-01

    Ascomycetes differentiate four major morphological types of fruiting bodies (apothecia, perithecia, pseudothecia and cleistothecia) that are derived from an ancestral fruiting body. Thus, fruiting body differentiation is most likely controlled by a set of common core genes. One way to identify such genes is to search for genes with evolutionary conserved expression patterns. Using suppression subtractive hybridization (SSH), we selected differentially expressed transcripts in Pyronema confluens (Pezizales) by comparing two cDNA libraries specific for sexual and for vegetative development, respectively. The expression patterns of selected genes from both libraries were verified by quantitative real time PCR. Expression of several corresponding homologous genes was found to be conserved in two members of the Sordariales (Sordaria macrospora and Neurospora crassa), a derived group of ascomycetes that is only distantly related to the Pezizales. Knockout studies with N. crassa orthologues of differentially regulated genes revealed a functional role during fruiting body development for the gene NCU05079, encoding a putative MFS peptide transporter. These data indicate conserved gene expression patterns and a functional role of the corresponding genes during fruiting body development; such genes are candidates of choice for further functional analysis.

  19. Identifying genes and gene networks involved in chromium metabolism and detoxification in Crambe abyssinica.

    PubMed

    Zulfiqar, Asma; Paulose, Bibin; Chhikara, Sudesh; Dhankher, Om Parkash

    2011-10-01

    Chromium pollution is a serious environmental problem with few cost-effective remediation strategies available. Crambe abyssinica (a member of Brassicaseae), a non-food, fast growing high biomass crop, is an ideal candidate for phytoremediation of heavy metals contaminated soils. The present study used a PCR-Select Suppression Subtraction Hybridization approach in C. abyssinica to isolate differentially expressed genes in response to Cr exposure. A total of 72 differentially expressed subtracted cDNAs were sequenced and found to represent 43 genes. The subtracted cDNAs suggest that Cr stress significantly affects pathways related to stress/defense, ion transporters, sulfur assimilation, cell signaling, protein degradation, photosynthesis and cell metabolism. The regulation of these genes in response to Cr exposure was further confirmed by semi-quantitative RT-PCR. Characterization of these differentially expressed genes may enable the engineering of non-food, high-biomass plants, including C. abyssinica, for phytoremediation of Cr-contaminated soils and sediments.

  20. Candidate Luminal B Breast Cancer Genes Identified by Genome, Gene Expression and DNA Methylation Profiling

    PubMed Central

    Addou-Klouche, Lynda; Finetti, Pascal; Saade, Marie-Rose; Manai, Marwa; Carbuccia, Nadine; Bekhouche, Ismahane; Letessier, Anne; Charafe-Jauffret, Emmanuelle; Jacquemier, Jocelyne; Spicuglia, Salvatore; de The, Hugues; Viens, Patrice; Bertucci, François; Birnbaum, Daniel; Chaffanet, Max

    2014-01-01

    Breast cancers (BCs) of the luminal B subtype are estrogen receptor-positive (ER+), highly proliferative, resistant to standard therapies and have a poor prognosis. To better understand this subtype we compared DNA copy number aberrations (CNAs), DNA promoter methylation, gene expression profiles, and somatic mutations in nine selected genes, in 32 luminal B tumors with those observed in 156 BCs of the other molecular subtypes. Frequent CNAs included 8p11-p12 and 11q13.1-q13.2 amplifications, 7q11.22-q34, 8q21.12-q24.23, 12p12.3-p13.1, 12q13.11-q24.11, 14q21.1-q23.1, 17q11.1-q25.1, 20q11.23-q13.33 gains and 6q14.1-q24.2, 9p21.3-p24,3, 9q21.2, 18p11.31-p11.32 losses. A total of 237 and 101 luminal B-specific candidate oncogenes and tumor suppressor genes (TSGs) presented a deregulated expression in relation with their CNAs, including 11 genes previously reported associated with endocrine resistance. Interestingly, 88% of the potential TSGs are located within chromosome arm 6q, and seven candidate oncogenes are potential therapeutic targets. A total of 100 candidate oncogenes were validated in a public series of 5,765 BCs and the overexpression of 67 of these was associated with poor survival in luminal tumors. Twenty-four genes presented a deregulated expression in relation with a high DNA methylation level. FOXO3, PIK3CA and TP53 were the most frequent mutated genes among the nine tested. In a meta-analysis of next-generation sequencing data in 875 BCs, KCNB2 mutations were associated with luminal B cases while candidate TSGs MDN1 (6q15) and UTRN (6q24), were mutated in this subtype. In conclusion, we have reported luminal B candidate genes that may play a role in the development and/or hormone resistance of this aggressive subtype. PMID:24416132

  1. Candidate luminal B breast cancer genes identified by genome, gene expression and DNA methylation profiling.

    PubMed

    Cornen, Stéphanie; Guille, Arnaud; Adélaïde, José; Addou-Klouche, Lynda; Finetti, Pascal; Saade, Marie-Rose; Manai, Marwa; Carbuccia, Nadine; Bekhouche, Ismahane; Letessier, Anne; Raynaud, Stéphane; Charafe-Jauffret, Emmanuelle; Jacquemier, Jocelyne; Spicuglia, Salvatore; de The, Hugues; Viens, Patrice; Bertucci, François; Birnbaum, Daniel; Chaffanet, Max

    2014-01-01

    Breast cancers (BCs) of the luminal B subtype are estrogen receptor-positive (ER+), highly proliferative, resistant to standard therapies and have a poor prognosis. To better understand this subtype we compared DNA copy number aberrations (CNAs), DNA promoter methylation, gene expression profiles, and somatic mutations in nine selected genes, in 32 luminal B tumors with those observed in 156 BCs of the other molecular subtypes. Frequent CNAs included 8p11-p12 and 11q13.1-q13.2 amplifications, 7q11.22-q34, 8q21.12-q24.23, 12p12.3-p13.1, 12q13.11-q24.11, 14q21.1-q23.1, 17q11.1-q25.1, 20q11.23-q13.33 gains and 6q14.1-q24.2, 9p21.3-p24,3, 9q21.2, 18p11.31-p11.32 losses. A total of 237 and 101 luminal B-specific candidate oncogenes and tumor suppressor genes (TSGs) presented a deregulated expression in relation with their CNAs, including 11 genes previously reported associated with endocrine resistance. Interestingly, 88% of the potential TSGs are located within chromosome arm 6q, and seven candidate oncogenes are potential therapeutic targets. A total of 100 candidate oncogenes were validated in a public series of 5,765 BCs and the overexpression of 67 of these was associated with poor survival in luminal tumors. Twenty-four genes presented a deregulated expression in relation with a high DNA methylation level. FOXO3, PIK3CA and TP53 were the most frequent mutated genes among the nine tested. In a meta-analysis of next-generation sequencing data in 875 BCs, KCNB2 mutations were associated with luminal B cases while candidate TSGs MDN1 (6q15) and UTRN (6q24), were mutated in this subtype. In conclusion, we have reported luminal B candidate genes that may play a role in the development and/or hormone resistance of this aggressive subtype.

  2. Integrating Epigenomic Elements and GWASs Identifies BDNF Gene Affecting Bone Mineral Density and Osteoporotic Fracture Risk

    PubMed Central

    Guo, Yan; Dong, Shan-Shan; Chen, Xiao-Feng; Jing, Ying-Aisha; Yang, Man; Yan, Han; Shen, Hui; Chen, Xiang-Ding; Tan, Li-Jun; Tian, Qing; Deng, Hong-Wen; Yang, Tie-Lin

    2016-01-01

    To identify susceptibility genes for osteoporosis, we conducted an integrative analysis that combined epigenomic elements and previous genome-wide association studies (GWASs) data, followed by validation at population and functional levels, which could identify common regulatory elements and predict new susceptibility genes that are biologically meaningful to osteoporosis. By this approach, we found a set of distinct epigenomic elements significantly enriched or depleted in the promoters of osteoporosis-associated genes, including 4 transcription factor binding sites, 27 histone marks, and 21 chromatin states segmentation types. Using these epigenomic marks, we performed reverse prediction analysis to prioritize the discovery of new candidate genes. Functional enrichment analysis of all the prioritized genes revealed several key osteoporosis related pathways, including Wnt signaling. Genes with high priority were further subjected to validation using available GWASs datasets. Three genes were significantly associated with spine bone mineral density, including BDNF, PDE4D, and SATB2, which all closely related to bone metabolism. The most significant gene BDNF was also associated with osteoporotic fractures. RNA interference revealed that BDNF knockdown can suppress osteoblast differentiation. Our results demonstrated that epigenomic data could be used to indicate common epigenomic marks to discover additional loci with biological functions for osteoporosis. PMID:27465306

  3. Ectopic Activation of Germline and Placental Genes Identifies Aggressive Metastasis-Prone Lung Cancers

    PubMed Central

    Rousseaux, Sophie; Debernardi, Alexandra; Jacquiau, Baptiste; Vitte, Anne-Laure; Vesin, Aurélien; Nagy-Mignotte, Hélène; Moro-Sibilot, Denis; Brichon, Pierre-Yves; Lantuejoul, Sylvie; Hainaut, Pierre; Laffaire, Julien; de Reyniès, Aurélien; Beer, David G.; Timsit, Jean-François; Brambilla, Christian; Brambilla, Elisabeth; Khochbin, Saadi

    2016-01-01

    Activation of normally silent tissue-specific genes and the resulting cell “identity crisis” are the unexplored consequences of malignant epigenetic reprogramming. We designed a strategy for investigating this reprogramming, which consisted of identifying a large number of tissue-restricted genes that are epigenetically silenced in normal somatic cells and then detecting their expression in cancer. This approach led to the demonstration that large-scale “off-context” gene activations systematically occur in a variety of cancer types. In our series of 293 lung tumors, we identified an ectopic gene expression signature associated with a subset of highly aggressive tumors, which predicted poor prognosis independently of the TNM (tumor size, node positivity, and metastasis) stage or histological subtype. The ability to isolate these tumors allowed us to reveal their common molecular features characterized by the acquisition of embryonic stem cell/germ cell gene expression profiles and the down-regulation of immune response genes. The methodical recognition of ectopic gene activations in cancer cells could serve as a basis for gene signature–guided tumor stratification, as well as for the discovery of oncogenic mechanisms, and expand the understanding of the biology of very aggressive tumors. PMID:23698379

  4. Microarray expression profiling identifies genes with altered expression in HDL-deficient mice

    SciTech Connect

    Callow, Matthew J.; Dudoit, Sandrine; Gong, Elaine L.; Speed, Terence P.; Rubin, Edward M.

    2000-05-05

    Based on the assumption that severe alterations in the expression of genes known to be involved in HDL metabolism may affect the expression of other genes we screened an array of over 5000 mouse expressed sequence tags (ESTs) for altered gene expression in the livers of two lines of mice with dramatic decreases in HDL plasma concentrations. Labeled cDNA from livers of apolipoprotein AI (apo AI) knockout mice, Scavenger Receptor BI (SR-BI) transgenic mice and control mice were co-hybridized to microarrays. Two-sample t-statistics were used to identify genes with altered expression levels in the knockout or transgenic mice compared with the control mice. In the SR-BI group we found 9 array elements representing at least 5 genes to be significantly altered on the basis of an adjusted p value of less than 0.05. In the apo AI knockout group 8 array elements representing 4 genes were altered compared with the control group (p < 0.05). Several of the genes identified in the SR-BI transgenic suggest altered sterol metabolism and oxidative processes. These studies illustrate the use of multiple-testing methods for the identification of genes with altered expression in replicated microarray experiments of apo AI knockout and SR-BI transgenic mice.

  5. Transcriptional Profile Analysis of RPGRORF15 Frameshift Mutation Identifies Novel Genes Associated with Retinal Degeneration

    PubMed Central

    Genini, Sem; Zangerl, Barbara; Slavik, Julianna; Acland, Gregory M.; Beltran, William A.

    2010-01-01

    Purpose. To identify genes and molecular mechanisms associated with photoreceptor degeneration in a canine model of XLRP caused by an RPGR exon ORF15 microdeletion. Methods. Expression profiles of mutant and normal retinas were compared by using canine retinal custom cDNA microarrays. qRT-PCR, Western blot analysis, and immunohistochemistry (IHC) were applied to selected genes, to confirm and expand the microarray results. Results. At 7 and 16 weeks, respectively, 56 and 18 transcripts were downregulated in the mutant retinas, but none were differentially expressed (DE) at both ages, suggesting the involvement of temporally distinct pathways. Downregulated genes included the known retina-relevant genes PAX6, CHML, and RDH11 at 7 weeks and CRX and SAG at 16 weeks. Genes directly or indirectly active in apoptotic processes were altered at 7 weeks (CAMK2G, NTRK2, PRKCB, RALA, RBBP6, RNF41, SMYD3, SPP1, and TUBB2C) and 16 weeks (SLC25A5 and NKAP). Furthermore, the DE genes at 7 weeks (ELOVL6, GLOD4, NDUFS4, and REEP1) and 16 weeks (SLC25A5 and TARS2) are related to mitochondrial functions. qRT-PCR of 18 genes confirmed the microarray results and showed DE of additional genes not on the array. Only GFAP was DE at 3 weeks of age. Western blot and IHC analyses also confirmed the high reliability of the transcriptomic data. Conclusions. Several DE genes were identified in mutant retinas. At 7 weeks, a combination of nonclassic anti- and proapoptosis genes appear to be involved in photoreceptor degeneration, whereas at both 7 and 16 weeks, the expression of mitochondria-related genes indicates that they may play a relevant role in the disease process. PMID:20574030

  6. De Novo Transcriptome Sequencing of Oryza officinalis Wall ex Watt to Identify Disease-Resistance Genes.

    PubMed

    He, Bin; Gu, Yinghong; Tao, Xiang; Cheng, Xiaojie; Wei, Changhe; Fu, Jian; Cheng, Zaiquan; Zhang, Yizheng

    2015-12-10

    Oryza officinalis Wall ex Watt is one of the most important wild relatives of cultivated rice and exhibits high resistance to many diseases. It has been used as a source of genes for introgression into cultivated rice. However, there are limited genomic resources and little genetic information publicly reported for this species. To better understand the pathways and factors involved in disease resistance and accelerating the process of rice breeding, we carried out a de novo transcriptome sequencing of O. officinalis. In this research, 137,229 contigs were obtained ranging from 200 to 19,214 bp with an N50 of 2331 bp through de novo assembly of leaves, stems and roots in O. officinalis using an Illumina HiSeq 2000 platform. Based on sequence similarity searches against a non-redundant protein database, a total of 88,249 contigs were annotated with gene descriptions and 75,589 transcripts were further assigned to GO terms. Candidate genes for plant-pathogen interaction and plant hormones regulation pathways involved in disease-resistance were identified. Further analyses of gene expression profiles showed that the majority of genes related to disease resistance were all expressed in the three tissues. In addition, there are two kinds of rice bacterial blight-resistant genes in O. officinalis, including two Xa1 genes and three Xa26 genes. All 2 Xa1 genes showed the highest expression level in stem, whereas one of Xa26 was expressed dominantly in leaf and other 2 Xa26 genes displayed low expression level in all three tissues. This transcriptomic database provides an opportunity for identifying the genes involved in disease-resistance and will provide a basis for studying functional genomics of O. officinalis and genetic improvement of cultivated rice in the future.

  7. A loss of function screen identifies nine new radiation susceptibility genes

    SciTech Connect

    Sudo, Hitomi; Tsuji, Atsushi B. Sugyo, Aya; Imai, Takashi; Saga, Tsuneo; Harada, Yoshi-nobu

    2007-12-21

    Genomic instability is considered a hallmark of carcinogenesis, and dysfunction of DNA repair and cell cycle regulation in response to DNA damage caused by ionizing radiation are thought to be important factors in the early stages of genomic instability. We performed cell-based functional screening using an RNA interference library targeting 200 genes in human cells. We identified three known and nine new radiation susceptibility genes, eight of which are linked directly or potentially with cell cycle progression. Cell cycle analysis on four of the genes not previously linked to cell cycle progression demonstrated that one, ZDHHC8, was associated with the G{sub 2}/M checkpoint in response to DNA damage. Further study of the 12 radiation susceptibility genes identified in this screen may help to elucidate the molecular mechanisms of cell cycle progression, DNA repair, cell death, cell growth and genomic instability, and to develop new radiation sensitizing agents for radiotherapy.

  8. Harnessing single cell sorting to identify cell division genes and regulators in bacteria.

    PubMed

    Burke, Catherine; Liu, Michael; Britton, Warwick; Triccas, James A; Thomas, Torsten; Smith, Adrian L; Allen, Steven; Salomon, Robert; Harry, Elizabeth

    2013-01-01

    Cell division is an essential cellular process that requires an array of known and unknown proteins for its spatial and temporal regulation. Here we develop a novel, high-throughput screening method for the identification of bacterial cell division genes and regulators. The method combines the over-expression of a shotgun genomic expression library to perturb the cell division process with high-throughput flow cytometry sorting to screen many thousands of clones. Using this approach, we recovered clones with a filamentous morphology for the model bacterium, Escherichia coli. Genetic analysis revealed that our screen identified both known cell division genes, and genes that have not previously been identified to be involved in cell division. This novel screening strategy is applicable to a wide range of organisms, including pathogenic bacteria, where cell division genes and regulators are attractive drug targets for antibiotic development.

  9. Harnessing Single Cell Sorting to Identify Cell Division Genes and Regulators in Bacteria

    PubMed Central

    Burke, Catherine; Liu, Michael; Britton, Warwick; Triccas, James A.; Thomas, Torsten; Smith, Adrian L.; Allen, Steven; Salomon, Robert; Harry, Elizabeth

    2013-01-01

    Cell division is an essential cellular process that requires an array of known and unknown proteins for its spatial and temporal regulation. Here we develop a novel, high-throughput screening method for the identification of bacterial cell division genes and regulators. The method combines the over-expression of a shotgun genomic expression library to perturb the cell division process with high-throughput flow cytometry sorting to screen many thousands of clones. Using this approach, we recovered clones with a filamentous morphology for the model bacterium, Escherichia coli. Genetic analysis revealed that our screen identified both known cell division genes, and genes that have not previously been identified to be involved in cell division. This novel screening strategy is applicable to a wide range of organisms, including pathogenic bacteria, where cell division genes and regulators are attractive drug targets for antibiotic development. PMID:23565292

  10. Transposon mutagenesis identifies genes that cooperate with mutant Pten in breast cancer progression

    PubMed Central

    Rangel, Roberto; Lee, Song-Choon; Hon-Kim Ban, Kenneth; Guzman-Rojas, Liliana; Mann, Michael B.; Newberg, Justin Y.; McNoe, Leslie A.; Selvanesan, Luxmanan; Ward, Jerrold M.; Rust, Alistair G.; Chin, Kuan-Yew; Black, Michael A.; Jenkins, Nancy A.; Copeland, Neal G.

    2016-01-01

    Triple-negative breast cancer (TNBC) has the worst prognosis of any breast cancer subtype. To better understand the genetic forces driving TNBC, we performed a transposon mutagenesis screen in a phosphatase and tensin homolog (Pten) mutant mice and identified 12 candidate trunk drivers and a much larger number of progression genes. Validation studies identified eight TNBC tumor suppressor genes, including the GATA-like transcriptional repressor TRPS1. Down-regulation of TRPS1 in TNBC cells promoted epithelial-to-mesenchymal transition (EMT) by deregulating multiple EMT pathway genes, in addition to increasing the expression of SERPINE1 and SERPINB2 and the subsequent migration, invasion, and metastasis of tumor cells. Transposon mutagenesis has thus provided a better understanding of the genetic forces driving TNBC and discovered genes with potential clinical importance in TNBC. PMID:27849608

  11. Analysis of global gene expression profiles to identify differentially expressed genes critical for embryo development in Brassica rapa.

    PubMed

    Zhang, Yu; Peng, Lifang; Wu, Ya; Shen, Yanyue; Wu, Xiaoming; Wang, Jianbo

    2014-11-01

    Embryo development represents a crucial developmental period in the life cycle of flowering plants. To gain insights into the genetic programs that control embryo development in Brassica rapa L., RNA sequencing technology was used to perform transcriptome profiling analysis of B. rapa developing embryos. The results generated 42,906,229 sequence reads aligned with 32,941 genes. In total, 27,760, 28,871, 28,384, and 25,653 genes were identified from embryos at globular, heart, early cotyledon, and mature developmental stages, respectively, and analysis between stages revealed a subset of stage-specific genes. We next investigated 9,884 differentially expressed genes with more than fivefold changes in expression and false discovery rate ≤ 0.001 from three adjacent-stage comparisons; 1,514, 3,831, and 6,633 genes were detected between globular and heart stage embryo libraries, heart stage and early cotyledon stage, and early cotyledon and mature stage, respectively. Large numbers of genes related to cellular process, metabolism process, response to stimulus, and biological process were expressed during the early and middle stages of embryo development. Fatty acid biosynthesis, biosynthesis of secondary metabolites, and photosynthesis-related genes were expressed predominantly in embryos at the middle stage. Genes for lipid metabolism and storage proteins were highly expressed in the middle and late stages of embryo development. We also identified 911 transcription factor genes that show differential expression across embryo developmental stages. These results increase our understanding of the complex molecular and cellular events during embryo development in B. rapa and provide a foundation for future studies on other oilseed crops.

  12. Comparative Genomics and an Insect Model Rapidly Identify Novel Virulence Genes of Burkholderia mallei

    DTIC Science & Technology

    2008-04-01

    2,500 shared “housekeeping” genes whose products share 60% amino acid sequence identity. Of greater interest are the spe - cies- or isolate-specific...FEMS Microbiol. Lett. 192:67–72. 42 . Liu, B., G. C . Koo, E. H. Yap, K. L. Chua, and Y. H. Gan. 2002. Model of differential susceptibility to mucosal...Microbiology. All Rights Reserved. Comparative Genomics and an Insect Model Rapidly Identify Novel Virulence Genes of Burkholderia mallei† Mark A

  13. Cluster Analysis of Tumor Suppressor Genes in Canine Leukocytes Identifies Activation State

    PubMed Central

    Daly, Julie-Anne; Mortlock, Sally-Anne; Taylor, Rosanne M.; Williamson, Peter

    2015-01-01

    Cells of the immune system undergo activation and subsequent proliferation in the normal course of an immune response. Infrequently, the molecular and cellular events that underlie the mechanisms of proliferation are dysregulated and may lead to oncogenesis, leading to tumor formation. The most common forms of immunological cancers are lymphomas, which in dogs account for 8%–20% of all cancers, affecting up to 1.2% of the dog population. Key genes involved in negatively regulating proliferation of lymphocytes include a group classified as tumor suppressor genes (TSGs). These genes are also known to be associated with progression of lymphoma in humans, mice, and dogs and are potential candidates for pathological grading and diagnosis. The aim of the present study was to analyze TSG profiles in stimulated leukocytes from dogs to identify genes that discriminate an activated phenotype. A total of 554 TSGs and three gene set collections were analyzed from microarray data. Cluster analysis of three subsets of genes discriminated between stimulated and unstimulated cells. These included 20 most upregulated and downregulated TSGs, TSG in hallmark gene sets significantly enriched in active cells, and a selection of candidate TSGs, p15 (CDKN2B), p18 (CDKN2C), p19 (CDKN1A), p21 (CDKN2A), p27 (CDKN1B), and p53 (TP53) in the third set. Analysis of two subsets suggested that these genes or a subset of these genes may be used as a specialized PCR set for additional analysis. PMID:27478369

  14. High-throughput screening of mouse gene knockouts identifies established and novel skeletal phenotypes

    PubMed Central

    Brommage, Robert; Liu, Jeff; Hansen, Gwenn M; Kirkpatrick, Laura L; Potter, David G; Sands, Arthur T; Zambrowicz, Brian; Powell, David R; Vogel, Peter

    2014-01-01

    Screening gene function in vivo is a powerful approach to discover novel drug targets. We present high-throughput screening (HTS) data for 3 762 distinct global gene knockout (KO) mouse lines with viable adult homozygous mice generated using either gene-trap or homologous recombination technologies. Bone mass was determined from DEXA scans of male and female mice at 14 weeks of age and by microCT analyses of bones from male mice at 16 weeks of age. Wild-type (WT) cagemates/littermates were examined for each gene KO. Lethality was observed in an additional 850 KO lines. Since primary HTS are susceptible to false positive findings, additional cohorts of mice from KO lines with intriguing HTS bone data were examined. Aging, ovariectomy, histomorphometry and bone strength studies were performed and possible non-skeletal phenotypes were explored. Together, these screens identified multiple genes affecting bone mass: 23 previously reported genes (Calcr, Cebpb, Crtap, Dcstamp, Dkk1, Duoxa2, Enpp1, Fgf23, Kiss1/Kiss1r, Kl (Klotho), Lrp5, Mstn, Neo1, Npr2, Ostm1, Postn, Sfrp4, Slc30a5, Slc39a13, Sost, Sumf1, Src, Wnt10b), five novel genes extensively characterized (Cldn18, Fam20c, Lrrk1, Sgpl1, Wnt16), five novel genes with preliminary characterization (Agpat2, Rassf5, Slc10a7, Slc26a7, Slc30a10) and three novel undisclosed genes coding for potential osteoporosis drug targets. PMID:26273529

  15. Transcriptome profiling identifies differentially expressed genes in postnatal developing pituitary gland of miniature pig.

    PubMed

    Shan, Lei; Wu, Qi; Li, Yuli; Shang, Haitao; Guo, Kenan; Wu, Jiayan; Wei, Hong; Zhao, Jianguo; Yu, Jun; Li, Meng-Hua

    2014-01-01

    In recent years, Tibetan pig and Bama pig are popularly used as animal models for medical researches. However, little genomic information is available for the two breeds, particularly regarding gene expression pattern at the whole-transcriptome level. In this study, we characterized the pituitary transcriptome profile along their postnatal developmental stages within and between the two breeds in order to illustrate the differential dynamics and functions of differentially expressed genes. We obtained a total of ∼300 million 80-bp paired-end reads, detected 15 715 previously annotated genes. Most of the genes (90.33%) were shared between the two breeds with the main functions in metabolic process. Four hormone genes (GH, PRL, LHB, and FSHB) were detected in all samples with extremely high levels of expression. Functional differences between the three developmental stages (infancy, puberty and adulthood) in each breed were dominantly presented by the gene expressions at the first stage. That is, Bama pig was over-represented in the genes involved in the cellular process, while Tibetan pig was over-represented in the genes represented by the reproductive process. The identified SNPs indicated that the divergence between the miniature pig breeds and the large pig (Duroc) were greater than that between the two miniature pig breeds. This study substantially expands our knowledge concerning the genes transcribed in the pig pituitary gland and provides an overview of pituitary transcriptome dynamics throughout the period of postnatal development.

  16. Transcriptome Profiling Identifies Differentially Expressed Genes in Postnatal Developing Pituitary Gland of Miniature Pig

    PubMed Central

    Shan, Lei; Wu, Qi; Li, Yuli; Shang, Haitao; Guo, Kenan; Wu, Jiayan; Wei, Hong; Zhao, Jianguo; Yu, Jun; Li, Meng-Hua

    2014-01-01

    In recent years, Tibetan pig and Bama pig are popularly used as animal models for medical researches. However, little genomic information is available for the two breeds, particularly regarding gene expression pattern at the whole-transcriptome level. In this study, we characterized the pituitary transcriptome profile along their postnatal developmental stages within and between the two breeds in order to illustrate the differential dynamics and functions of differentially expressed genes. We obtained a total of ∼300 million 80-bp paired-end reads, detected 15 715 previously annotated genes. Most of the genes (90.33%) were shared between the two breeds with the main functions in metabolic process. Four hormone genes (GH, PRL, LHB, and FSHB) were detected in all samples with extremely high levels of expression. Functional differences between the three developmental stages (infancy, puberty and adulthood) in each breed were dominantly presented by the gene expressions at the first stage. That is, Bama pig was over-represented in the genes involved in the cellular process, while Tibetan pig was over-represented in the genes represented by the reproductive process. The identified SNPs indicated that the divergence between the miniature pig breeds and the large pig (Duroc) were greater than that between the two miniature pig breeds. This study substantially expands our knowledge concerning the genes transcribed in the pig pituitary gland and provides an overview of pituitary transcriptome dynamics throughout the period of postnatal development. PMID:24282060

  17. Systematically identify key genes in inflammatory and non-inflammatory breast cancer.

    PubMed

    Chai, Fan; Liang, Yan; Zhang, Fan; Wang, Minghao; Zhong, Ling; Jiang, Jun

    2016-01-10

    Although the gene expression in breast tumor stroma, playing a critical role in determining inflammatory breast cancer (IBC) phenotype, has been proved to be significantly different between IBC and non-inflammatory breast cancer (non-IBC), more effort needs to systematically investigate the gene expression profiles between tumor epithelium and stroma and to efficiently uncover the potential molecular networks and critical genes for IBC and non-IBC. Here, we comprehensively analyzed and compared the transcriptional profiles from IBC and non-IBC patients using hierarchical clustering, protein-protein interaction (PPI) network, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database analyses, and identified PDGFRβ, SUMO1, COL1A1, FYN, CAV1, COL5A1 and MMP2 to be the key genes for breast cancer. Interestingly, PDGFRβ was found to be the hub gene in both IBC and non-IBC; SUMO1 and COL1A1 were respectively the key genes for IBC and non-IBC. These analysis results indicated that those key genes might play important role in IBC and non-IBC and provided some clues for future studies.

  18. A Comprehensive Gene Expression Meta-analysis Identifies Novel Immune Signatures in Rheumatoid Arthritis Patients

    PubMed Central

    Afroz, Sumbul; Giddaluru, Jeevan; Vishwakarma, Sandeep; Naz, Saima; Khan, Aleem Ahmed; Khan, Nooruddin

    2017-01-01

    Rheumatoid arthritis (RA), a symmetric polyarticular arthritis, has long been feared as one of the most disabling forms of arthritis. Identification of gene signatures associated with RA onset and progression would lead toward development of novel diagnostics and therapeutic interventions. This study was undertaken to identify unique gene signatures of RA patients through large-scale meta-profiling of a diverse collection of gene expression data sets. We carried out a meta-analysis of 8 publicly available RA patients’ (107 RA patients and 76 healthy controls) gene expression data sets and further validated a few meta-signatures in RA patients through quantitative real-time PCR (RT-qPCR). We identified a robust meta-profile comprising 33 differentially expressed genes, which were consistently and significantly expressed across all the data sets. Our meta-analysis unearthed upregulation of a few novel gene signatures including PLCG2, HLA-DOB, HLA-F, EIF4E2, and CYFIP2, which were validated in peripheral blood mononuclear cell samples of RA patients. Further, functional and pathway enrichment analysis reveals perturbation of several meta-genes involved in signaling pathways pertaining to inflammation, antigen presentation, hypoxia, and apoptosis during RA. Additionally, PLCG2 (phospholipase Cγ2) popped out as a novel meta-gene involved in most of the pathways relevant to RA including inflammasome activation, platelet aggregation, and activation, thereby suggesting PLCG2 as a potential therapeutic target for controlling excessive inflammation during RA. In conclusion, these findings highlight the utility of meta-analysis approach in identifying novel gene signatures that might provide mechanistic insights into disease onset, progression and possibly lead toward the development of better diagnostic and therapeutic interventions against RA. PMID:28210261

  19. A Comprehensive Gene Expression Meta-analysis Identifies Novel Immune Signatures in Rheumatoid Arthritis Patients.

    PubMed

    Afroz, Sumbul; Giddaluru, Jeevan; Vishwakarma, Sandeep; Naz, Saima; Khan, Aleem Ahmed; Khan, Nooruddin

    2017-01-01

    Rheumatoid arthritis (RA), a symmetric polyarticular arthritis, has long been feared as one of the most disabling forms of arthritis. Identification of gene signatures associated with RA onset and progression would lead toward development of novel diagnostics and therapeutic interventions. This study was undertaken to identify unique gene signatures of RA patients through large-scale meta-profiling of a diverse collection of gene expression data sets. We carried out a meta-analysis of 8 publicly available RA patients' (107 RA patients and 76 healthy controls) gene expression data sets and further validated a few meta-signatures in RA patients through quantitative real-time PCR (RT-qPCR). We identified a robust meta-profile comprising 33 differentially expressed genes, which were consistently and significantly expressed across all the data sets. Our meta-analysis unearthed upregulation of a few novel gene signatures including PLCG2, HLA-DOB, HLA-F, EIF4E2, and CYFIP2, which were validated in peripheral blood mononuclear cell samples of RA patients. Further, functional and pathway enrichment analysis reveals perturbation of several meta-genes involved in signaling pathways pertaining to inflammation, antigen presentation, hypoxia, and apoptosis during RA. Additionally, PLCG2 (phospholipase Cγ2) popped out as a novel meta-gene involved in most of the pathways relevant to RA including inflammasome activation, platelet aggregation, and activation, thereby suggesting PLCG2 as a potential therapeutic target for controlling excessive inflammation during RA. In conclusion, these findings highlight the utility of meta-analysis approach in identifying novel gene signatures that might provide mechanistic insights into disease onset, progression and possibly lead toward the development of better diagnostic and therapeutic interventions against RA.

  20. De novo transcriptome sequencing to identify the sex-determination genes in Hyriopsis schlegelii.

    PubMed

    Shi, Jianwu; Hong, Yijiang; Sheng, Junqing; Peng, Kou; Wang, Junhua

    2015-01-01

    This study presents the first analysis of expressed transcripts in the spermary and ovary of Hyriopsis schlegelii (H. schlegelii). A total of 132,055 unigenes were obtained and 31,781 of these genes were annotated. In addition, 19,511 upregulated and 25,911 downregulated unigenes were identified in the spermary. Ten sex-determination genes were selected and further analyzed by real-time PCR. In addition, mammalian genes reported to govern sex-determination pathways, including Sry, Dmrt1, Dmrt2, Sox9, GATA4, and WT1 in males and Wnt4, Rspo1, Foxl2, and β-catenin in females, were also identified in H. schlegelii. These results suggest that H. schlegelii and mammals use similar gene regulatory mechanisms to control sex determination. Moreover, genes associated with dosage compensation mechanisms, such as Msl1, Msl2, and Msl3, and hermaphrodite phenotypes, such as Tra-1, Tra-2α, Tra-2β, Fem1A, Fem1B, and Fem1C, were also identified in H. schlegelii. The identification of these genes indicates that diverse regulatory mechanisms regulate sexual polymorphism in H. schlegelii.

  1. Functional genomics identifies neural stem cell sub-type expression profiles and genes regulating neuroblast homeostasis

    PubMed Central

    Carney, Travis D.; Miller, Michael R.; Robinson, Kristin J.; Bayraktar, Omer A.; Osterhout, Jessica A.; Doe, Chris Q.

    2014-01-01

    The Drosophila larval central brain contains about 10,000 differentiated neurons and 200 scattered neural progenitors (neuroblasts), which can be further subdivided into ~95 type I neuroblasts and eight type II neuroblasts per brain lobe. Only type II neuroblasts generate self-renewing intermediate neural progenitors (INPs), and consequently each contributes more neurons to the brain, including much of the central complex. We characterized six different mutant genotypes that lead to expansion of neuroblast numbers; some preferentially expand type II or type I neuroblasts. Transcriptional profiling of larval brains from these mutant genotypes versus wild-type allowed us to identify small clusters of transcripts enriched in type II or type I neuroblasts, and we validated these clusters by gene expression analysis. Unexpectedly, only a few genes were found to be differentially expressed between type I/II neuroblasts, suggesting that these genes play a large role in establishing the different cell types. We also identified a large group of genes predicted to be expressed in all neuroblasts but not neurons. We performed a neuroblast-specific, RNAi-based functional screen and identified 84 genes that are required to maintain proper neuroblast numbers; all have conserved mammalian orthologs. These genes are excellent candidates for regulating neural progenitor self-renewal in Drosophila and mammals. PMID:22061480

  2. Copy number variation analysis identifies novel CAKUT candidate genes in children with a solitary functioning kidney

    PubMed Central

    Westland, Rik; Verbitsky, Miguel; Vukojevic, Katarina; Perry, Brittany J.; Fasel, David A.; Zwijnenburg, Petra J.G.; Bökenkamp, Arend; Gille, Johan J.P.; Saraga-Babic, Mirna; Ghiggeri, Gian Marco; D’Agati, Vivette D.; Schreuder, Michiel F.; Gharavi, Ali G.; van Wijk, Joanna A.E.; Sanna-Cherchi, Simone

    2016-01-01

    Copy number variations associate with different developmental phenotypes and represent a major cause of congenital anomalies of the kidney and urinary tract (CAKUT). Because rare pathogenic copy number variations are often large and contain multiple genes, identification of the underlying genetic drivers has proven to be difficult. Here we studied the role of rare copy number variations in 80 patients from the KIMONO-study cohort for which pathogenic mutations in three genes commonly implicated in CAKUT were excluded. In total, 13 known or novel genomic imbalances in 11 of 80 patients were absent or extremely rare in 23,362 population controls. To identify the most likely genetic drivers for the CAKUT phenotype underlying these rare copy number variations, we used a systematic in silico approach based on frequency in a large dataset of controls, annotation with publicly available databases for developmental diseases, tolerance and haploinsufficiency scores, and gene expression profile in the developing kidney and urinary tract. Five novel candidate genes for CAKUT were identified that showed specific expression in the human and mouse developing urinary tract. Among these genes, DLG1 and KIF12 are likely novel susceptibility genes for CAKUT in humans. Thus, there is a significant role of genomic imbalance in the determination of kidney developmental phenotypes. Additionally, we defined a systematic strategy to identify genetic drivers underlying rare copy number variations. PMID:26352300

  3. A genome-wide screen for identifying all regulators of a target gene

    PubMed Central

    Baptist, Guillaume; Pinel, Corinne; Ranquet, Caroline; Izard, Jérôme; Ropers, Delphine; de Jong, Hidde; Geiselmann, Johannes

    2013-01-01

    We have developed a new screening methodology for identifying all genes that control the expression of a target gene through genetic or metabolic interactions. The screen combines mutant libraries with luciferase reporter constructs, whose expression can be monitored in vivo and over time in different environmental conditions. We apply the method to identify the genes that control the expression of the gene acs, encoding the acetyl coenzyme A synthetase, in Escherichia coli. We confirm most of the known genetic regulators, including CRP–cAMP, IHF and components of the phosphotransferase system. In addition, we identify new regulatory interactions, many of which involve metabolic intermediates or metabolic sensing, such as the genes pgi, pfkA, sucB and lpdA, encoding enzymes in glycolysis and the TCA cycle. Some of these novel interactions were validated by quantitative reverse transcriptase-polymerase chain reaction. More generally, we observe that a large number of mutants directly or indirectly influence acs expression, an effect confirmed for a second promoter, sdhC. The method is applicable to any promoter fused to a luminescent reporter gene in combination with a deletion mutant library. PMID:23892289

  4. Integrative strategies to identify candidate genes in rodent models of human alcoholism.

    PubMed

    Treadwell, Julie A

    2006-01-01

    The search for genes underlying alcohol-related behaviours in rodent models of human alcoholism has been ongoing for many years with only limited success. Recently, new strategies that integrate several of the traditional approaches have provided new insights into the molecular mechanisms underlying ethanol's actions in the brain. We have used alcohol-preferring C57BL/6J (B6) and alcohol-avoiding DBA/2J (D2) genetic strains of mice in an integrative strategy combining high-throughput gene expression screening, genetic segregation analysis, and mapping to previously published quantitative trait loci to uncover candidate genes for the ethanol-preference phenotype. In our study, 2 genes, retinaldehyde binding protein 1 (Rlbp1) and syntaxin 12 (Stx12), were found to be strong candidates for ethanol preference. Such experimental approaches have the power and the potential to greatly speed up the laborious process of identifying candidate genes for the animal models of human alcoholism.

  5. A gene expression biomarker identifies in vitro and in vivo ERα modulators in a human gene expression compendium

    EPA Science Inventory

    We propose the use of gene expression profiling to complement the chemical characterization currently based on HTS assay data and present a case study relevant to the Endocrine Disruptor Screening Program. We have developed computational methods to identify estrogen receptor &alp...

  6. Identifying the source of unknown microcystin genes and predicting microcystin variants by comparing genes within uncultured cyanobacterial cells.

    PubMed

    Allender, Christopher J; LeCleir, Gary R; Rinta-Kanto, Johanna M; Small, Randall L; Satchwell, Michael F; Boyer, Gregory L; Wilhelm, Steven W

    2009-06-01

    While multiple phylogenetic markers have been used in the culture-independent study of microcystin-producing cyanobacteria, in only a few instances have multiple markers been studied within individual cells, and in all cases these studies have been conducted with cultured isolates. Here, we isolate and evaluate large DNA fragments (>6 kb) encompassing two genes involved in microcystin biosynthesis (mcyA2 and mcyB1) and use them to identify the source of gene fragments found in water samples. Further investigation of these gene loci from individual cyanobacterial cells allowed for improved analysis of the genetic diversity within microcystin producers as well as a method to predict microcystin variants for individuals. These efforts have also identified the source of the novel mcyA genotype previously termed Microcystis-like that is pervasive in the Laurentian Great Lakes and they predict the microcystin variant(s) that it produces.

  7. Integration of QTL and bioinformatic tools to identify candidate genes for triglycerides in mice.

    PubMed

    Leduc, Magalie S; Hageman, Rachael S; Verdugo, Ricardo A; Tsaih, Shirng-Wern; Walsh, Kenneth; Churchill, Gary A; Paigen, Beverly

    2011-09-01

    To identify genetic loci influencing lipid levels, we performed quantitative trait loci (QTL) analysis between inbred mouse strains MRL/MpJ and SM/J, measuring triglyceride levels at 8 weeks of age in F2 mice fed a chow diet. We identified one significant QTL on chromosome (Chr) 15 and three suggestive QTL on Chrs 2, 7, and 17. We also carried out microarray analysis on the livers of parental strains of 282 F2 mice and used these data to find cis-regulated expression QTL. We then narrowed the list of candidate genes under significant QTL using a "toolbox" of bioinformatic resources, including haplotype analysis; parental strain comparison for gene expression differences and nonsynonymous coding single nucleotide polymorphisms (SNP); cis-regulated eQTL in livers of F2 mice; correlation between gene expression and phenotype; and conditioning of expression on the phenotype. We suggest Slc25a7 as a candidate gene for the Chr 7 QTL and, based on expression differences, five genes (Polr3 h, Cyp2d22, Cyp2d26, Tspo, and Ttll12) as candidate genes for Chr 15 QTL. This study shows how bioinformatics can be used effectively to reduce candidate gene lists for QTL related to complex traits.

  8. Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network

    PubMed Central

    Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

    2016-01-01

    Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer. PMID:27149165

  9. Identifying novel genes and chemicals related to nasopharyngeal cancer in a heterogeneous network.

    PubMed

    Li, Zhandong; An, Lifeng; Li, Hao; Wang, ShaoPeng; Zhou, You; Yuan, Fei; Li, Lin

    2016-05-05

    Nasopharyngeal cancer or nasopharyngeal carcinoma (NPC) is the most common cancer originating in the nasopharynx. The factors that induce nasopharyngeal cancer are still not clear. Additional information about the chemicals or genes related to nasopharyngeal cancer will promote a better understanding of the pathogenesis of this cancer and the factors that induce it. Thus, a computational method NPC-RGCP was proposed in this study to identify the possible relevant chemicals and genes based on the presently known chemicals and genes related to nasopharyngeal cancer. To extensively utilize the functional associations between proteins and chemicals, a heterogeneous network was constructed based on interactions of proteins and chemicals. The NPC-RGCP included two stages: the searching stage and the screening stage. The former stage is for finding new possible genes and chemicals in the heterogeneous network, while the latter stage is for screening and removing false discoveries and selecting the core genes and chemicals. As a result, five putative genes, CXCR3, IRF1, CDK1, GSTP1, and CDH2, and seven putative chemicals, iron, propionic acid, dimethyl sulfoxide, isopropanol, erythrose 4-phosphate, β-D-Fructose 6-phosphate, and flavin adenine dinucleotide, were identified by NPC-RGCP. Extensive analyses provided confirmation that the putative genes and chemicals have significant associations with nasopharyngeal cancer.

  10. A Genome-Wide Regulatory Framework Identifies Maize Pericarp Color1 Controlled Genes[C][W

    PubMed Central

    Morohashi, Kengo; Casas, María Isabel; Ferreyra, Lorena Falcone; Mejía-Guerra, María Katherine; Pourcel, Lucille; Yilmaz, Alper; Feller, Antje; Carvalho, Bruna; Emiliani, Julia; Rodriguez, Eduardo; Pellegrinet, Silvina; McMullen, Michael; Casati, Paula; Grotewold, Erich

    2012-01-01

    Pericarp Color1 (P1) encodes an R2R3-MYB transcription factor responsible for the accumulation of insecticidal flavones in maize (Zea mays) silks and red phlobaphene pigments in pericarps and other floral tissues, which makes P1 an important visual marker. Using genome-wide expression analyses (RNA sequencing) in pericarps and silks of plants with contrasting P1 alleles combined with chromatin immunoprecipitation coupled with high-throughput sequencing, we show here that the regulatory functions of P1 are much broader than the activation of genes corresponding to enzymes in a branch of flavonoid biosynthesis. P1 modulates the expression of several thousand genes, and ∼1500 of them were identified as putative direct targets of P1. Among them, we identified F2H1, corresponding to a P450 enzyme that converts naringenin into 2-hydroxynaringenin, a key branch point in the P1-controlled pathway and the first step in the formation of insecticidal C-glycosyl flavones. Unexpectedly, the binding of P1 to gene regulatory regions can result in both gene activation and repression. Our results indicate that P1 is the major regulator for a set of genes involved in flavonoid biosynthesis and a minor modulator of the expression of a much larger gene set that includes genes involved in primary metabolism and production of other specialized compounds. PMID:22822204

  11. Comparison of inherently essential genes of Porphyromonas gingivalis identified in two transposon-sequencing libraries.

    PubMed

    Hutcherson, J A; Gogeneni, H; Yoder-Himes, D; Hendrickson, E L; Hackett, M; Whiteley, M; Lamont, R J; Scott, D A

    2016-08-01

    Porphyromonas gingivalis is a Gram-negative anaerobe and keystone periodontal pathogen. A mariner transposon insertion mutant library has recently been used to define 463 genes as putatively essential for the in vitro growth of P. gingivalis ATCC 33277 in planktonic culture (Library 1). We have independently generated a transposon insertion mutant library (Library 2) for the same P. gingivalis strain and herein compare genes that are putatively essential for in vitro growth in complex media, as defined by both libraries. In all, 281 genes (61%) identified by Library 1 were common to Library 2. Many of these common genes are involved in fundamentally important metabolic pathways, notably pyrimidine cycling as well as lipopolysaccharide, peptidoglycan, pantothenate and coenzyme A biosynthesis, and nicotinate and nicotinamide metabolism. Also in common are genes encoding heat-shock protein homologues, sigma factors, enzymes with proteolytic activity, and the majority of sec-related protein export genes. In addition to facilitating a better understanding of critical physiological processes, transposon-sequencing technology has the potential to identify novel strategies for the control of P. gingivalis infections. Those genes defined as essential by two independently generated TnSeq mutant libraries are likely to represent particularly attractive therapeutic targets.

  12. Transcriptome analysis identifies genes involved in ethanol response of Saccharomyces cerevisiae in Agave tequilana juice.

    PubMed

    Ramirez-Córdova, Jesús; Drnevich, Jenny; Madrigal-Pulido, Jaime Alberto; Arrizon, Javier; Allen, Kirk; Martínez-Velázquez, Moisés; Alvarez-Maya, Ikuri

    2012-08-01

    During ethanol fermentation, yeast cells are exposed to stress due to the accumulation of ethanol, cell growth is altered and the output of the target product is reduced. For Agave beverages, like tequila, no reports have been published on the global gene expression under ethanol stress. In this work, we used microarray analysis to identify Saccharomyces cerevisiae genes involved in the ethanol response. Gene expression of a tequila yeast strain of S. cerevisiae (AR5) was explored by comparing global gene expression with that of laboratory strain S288C, both after ethanol exposure. Additionally, we used two different culture conditions, cells grown in Agave tequilana juice as a natural fermentation media or grown in yeast-extract peptone dextrose as artificial media. Of the 6368 S. cerevisiae genes in the microarray, 657 genes were identified that had different expression responses to ethanol stress due to strain and/or media. A cluster of 28 genes was found over-expressed specifically in the AR5 tequila strain that could be involved in the adaptation to tequila yeast fermentation, 14 of which are unknown such as yor343c, ylr162w, ygr182c, ymr265c, yer053c-a or ydr415c. These could be the most suitable genes for transforming tequila yeast to increase ethanol tolerance in the tequila fermentation process. Other genes involved in response to stress (RFC4, TSA1, MLH1, PAU3, RAD53) or transport (CYB2, TIP20, QCR9) were expressed in the same cluster. Unknown genes could be good candidates for the development of recombinant yeasts with ethanol tolerance for use in industrial tequila fermentation.

  13. A Sleeping Beauty forward genetic screen identifies new genes and pathways driving osteosarcoma development and metastasis

    PubMed Central

    Moriarity, Branden S; Otto, George M; Rahrmann, Eric P; Rathe, Susan K; Wolf, Natalie K; Weg, Madison T; Manlove, Luke A; LaRue, Rebecca S; Temiz, Nuri A; Molyneux, Sam D; Choi, Kwangmin; Holly, Kevin J; Sarver, Aaron L; Scott, Milcah C; Forster, Colleen L; Modiano, Jaime F; Khanna, Chand; Hewitt, Stephen M; Khokha, Rama; Yang, Yi; Gorlick, Richard; Dyer, Michael A; Largaespada, David A

    2016-01-01

    Osteosarcomas are sarcomas of the bone, derived from osteoblasts or their precursors, with a high propensity to metastasize. Osteosarcoma is associated with massive genomic instability, making it problematic to identify driver genes using human tumors or prototypical mouse models, many of which involve loss of Trp53 function. To identify the genes driving osteosarcoma development and metastasis, we performed a Sleeping Beauty (SB) transposon-based forward genetic screen in mice with and without somatic loss of Trp53. Common insertion site (CIS) analysis of 119 primary tumors and 134 metastatic nodules identified 232 sites associated with osteosarcoma development and 43 sites associated with metastasis, respectively. Analysis of CIS-associated genes identified numerous known and new osteosarcoma-associated genes enriched in the ErbB, PI3K-AKT-mTOR and MAPK signaling pathways. Lastly, we identified several oncogenes involved in axon guidance, including Sema4d and Sema6d, which we functionally validated as oncogenes in human osteosarcoma. PMID:25961939

  14. Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture.

    PubMed

    González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R

    2016-01-01

    Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.

  15. Transcriptomic Analysis Using Olive Varieties and Breeding Progenies Identifies Candidate Genes Involved in Plant Architecture

    PubMed Central

    González-Plaza, Juan J.; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F.; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R.; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R.

    2016-01-01

    Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species. PMID:26973682

  16. Transcriptome analysis identifies genes with enriched expression in the mouse central Extended Amygdala

    PubMed Central

    Becker, Jérôme A. J.; Befort, Katia; Blad, Clara; Filliol, Dominique; Ghate, Aditee; Dembele, Doulaye; Thibault, Christelle; Koch, Muriel; Muller, Jean; Lardenois, Aurélie; Poch, Olivier; Kieffer, Brigitte L.

    2008-01-01

    The central Extended Amygdala (EAc) is an ensemble of highly interconnected limbic structures of the anterior brain, and forms a cellular continuum including the Bed Nucleus of the Stria Terminalis (BNST), the central nucleus of the Amygdala (CeA) and the Nucleus Accumbens shell (AcbSh). This neural network is a key site for interactions between brain reward and stress systems, and has been implicated in several aspects of drug abuse. In order to increase our understanding of EAc function at the molecular level, we undertook a genome-wide screen (Affymetrix) to identify genes whose expression is enriched in the EAc. We focused on the less-well known BNST-CeA areas of the EAc, and identified 121 genes that exhibit more than 2-fold higher expression level in the EAc compared to whole brain. Among these, forty-three genes have never been described to be expressed in the EAc. We mapped these genes throughout the brain, using non-radioactive in situ hybridization, and identified eight genes with a unique and distinct rostro-caudal expression pattern along AcbSh, BNST and CeA. Q-PCR analysis performed in brain and peripheral organ tissues indicated that, with the exception of one (Spata13), all these genes are predominantly expressed in brain. These genes encode signaling proteins (Adora2, GPR88, Arpp21 and Rem2), a transcription factor (Limh6) or proteins of unknown function (Rik130, Spata13 and Wfs1). The identification of genes with enriched expression expands our knowledge of EAc at a molecular level, and provides useful information to towards genetic manipulations within the EAc. PMID:18786617

  17. X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes.

    PubMed

    Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M

    2016-01-01

    X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4(-/-) mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases.

  18. Gene expression in bovine rumen epithelium during weaning identifies molecular regulators of rumen development and growth.

    PubMed

    Connor, Erin E; Baldwin, Ransom L; Li, Cong-jun; Li, Robert W; Chung, Hoyoung

    2013-03-01

    During weaning, epithelial cell function in the rumen transitions in response to conversion from a pre-ruminant to a true ruminant environment to ensure efficient nutrient absorption and metabolism. To identify gene networks affected by weaning in bovine rumen, Holstein bull calves were fed commercial milk replacer only (MRO) until 42 days of age, then were provided diets of either milk + orchardgrass hay (MH) or milk + grain-based calf starter (MG). Rumen epithelial RNA was extracted from calves sacrificed at four time points: day 14 (n = 3) and day 42 (n = 3) of age while fed the MRO diet and day 56 (n = 3/diet) and day 70 (n = 3/diet) while fed the MH and MG diets for transcript profiling by microarray hybridization. Five two-group comparisons were made using Permutation Analysis of Differential Expression® to identify differentially expressed genes over time and developmental stage between days 14 and 42 within the MRO diet, between day 42 on the MRO diet and day 56 on the MG or MH diets, and between the MG and MH diets at days 56 and 70. Ingenuity Pathway Analysis (IPA) of differentially expressed genes during weaning indicated the top 5 gene networks involving molecules participating in lipid metabolism, cell morphology and death, cellular growth and proliferation, molecular transport, and the cell cycle. Putative genes functioning in the establishment of the rumen microbial population and associated rumen epithelial inflammation during weaning were identified. Activation of transcription factor PPAR-α was identified by IPA software as an important regulator of molecular changes in rumen epithelium that function in papillary development and fatty acid oxidation during the transition from pre-rumination to rumination. Thus, molecular markers of rumen development and gene networks regulating differentiation and growth of rumen epithelium were identified for selecting targets and methods for improving and assessing rumen development and

  19. Identifying genes of agronomic importance in maize by screening microsatellites for evidence of selection during domestication.

    PubMed

    Vigouroux, Y; McMullen, M; Hittinger, C T; Houchins, K; Schulz, L; Kresovich, S; Matsuoka, Y; Doebley, J

    2002-07-23

    Crop species experienced strong selective pressure directed at genes controlling traits of agronomic importance during their domestication and subsequent episodes of selective breeding. Consequently, these genes are expected to exhibit the signature of selection. We screened 501 maize genes for the signature of selection using microsatellites or simple sequence repeats (SSRs). We applied the Ewens-Watterson test, which can reveal deviations from a neutral-equilibrium model, as well as two nonequilibrium tests that incorporate the domestication bottleneck. We investigated two classes of SSRs: those known to be polymorphic in maize (Class I) and those previously classified as monomorphic in maize (Class II). Fifteen SSRs exhibited some evidence for selection in maize and 10 showed evidence under stringent criteria. The genes containing nonneutral SSRs are candidates for agronomically important genes. Because demographic factors can bias our tests, further independent tests of these candidates are necessary. We applied such an additional test to one candidate, which encodes a MADS box transcriptional regulator, and confirmed that this gene experienced a selective sweep during maize domestication. Genomic scans for the signature of selection offer a means of identifying new genes of agronomic importance even when gene function and the phenotype of interest are unknown.

  20. New SigD-regulated genes identified in the rhizobacterium Bacillus amyloliquefaciens FZB42

    PubMed Central

    Li, Yu-Long; Mariappan, Aruljothi; Becker, Anke; Wu, Xiao-Qin; Borriss, Rainer

    2016-01-01

    ABSTRACT The alternative sigma factor D is known to be involved in at least three biological processes in Bacilli: flagellin synthesis, methyl-accepting chemotaxis and autolysin synthesis. Although many Bacillus genes have been identified as SigD regulon, the list may be not be complete. With microarray-based systemic screening, we found a set of genes downregulated in the sigD knockout mutant of the plant growth-promoting rhizobacterium B. amyloliquefaciens subsp. plantarum FZB42. Eight genes (appA, blsA, dhaS, spoVG, yqgA, RBAM_004640, RBAM_018080 and ytk) were further confirmed by quantitative PCR and/or northern blot to be controlled by SigD at the transcriptional level. These genes are hitherto not reported to be controlled by SigD. Among them, four genes are of unknown function and two genes (RBAM_004640 and RBAM_018080), absent in the model strain B. subtilis 168, are unique to B. amyloliquefaciens stains. The eight genes are involved in sporulation, biofilm formation, metabolite transport and several other functions. These findings extend our knowledge of the regulatory network governed by SigD in Bacillus and will further help to decipher the roles of the genes. PMID:27797724

  1. Comparative genomics between fly, mouse, and cattle identifies genes associated with sire conception rate.

    PubMed

    Li, G; Peñagaricano, F; Weigel, K A; Zhang, Y; Rosa, G; Khatib, H

    2012-10-01

    The decline in reproductive performance in cattle is of major concern to farmers and the dairy industry worldwide. Most fertility studies in cattle have focused on fertility of the cow, whereas the genetics of male fertility have not been thoroughly investigated. The present study hypothesizes that the high conservation of spermatogenesis genes from fly to human implies important roles of these genes in male fertility in cattle. To test this hypothesis, we performed an association analysis between highly conserved spermatogenesis genes and sire conception rate (SCR) in US Holsteins as a measure of bull fertility. Sequencing analysis revealed 24 single nucleotide polymorphisms (SNP) in 9 genes in the bull population using the pooled DNA sequencing approach. Five SNP previously identified in 5 genes from the POU1F1 pathway were also included in this study because they have shown significant associations with female and male fertility traits. Overall, 29 SNP located in 14 candidate genes were tested for association with sire conception rate in a population of 1,988 bulls. Three SNP located in MAP1B and 1 SNP in PPP1R11 showed significant associations with SCR. For the POU1F1 pathway, single gene analysis revealed significant associations of POU1F1 and STAT5A with SCR. Analysis of genotypic interactions between adjacent genes in the pathway revealed significant associations of STAT5A and UTMP genotypic combinations with SCR. The most significant spermatogenesis gene, MAP1B, was found to be associated with fertilization and blastocyst rates. Thus, the association of these genes with bull fertility testifies to the usefulness of the comparative genomics approach in selecting candidate male fertility genes.

  2. Use of eQTL Analysis for the Discovery of Target Genes Identified by GWAS

    DTIC Science & Technology

    2012-04-01

    prostate tissue-specific expression quantitative trait loci (eQTL) dataset; and 2) utilize this dataset to identify candidate genes for existing...set of 500 samples of normal prostate tissue sampled from men with PC. To date, we have pre-screened normal prostate tissue with the use of H&E...stained sections from 4000 men having a radical prostatectomy in order to identify those cases meeting our strict selection criteria for further

  3. Transcriptome Analysis Identifies Key Candidate Genes Mediating Purple Ovary Coloration in Asiatic Hybrid Lilies

    PubMed Central

    Xu, Leifeng; Yang, Panpan; Yuan, Suxia; Feng, Yayan; Xu, Hua; Cao, Yuwei; Ming, Jun

    2016-01-01

    Lily tepals have a short lifespan. Once the tepals senesce, the ornamental value of the flower is lost. Some cultivars have attractive purple ovaries and fruits which greatly enhance the ornamental value of Asiatic hybrid lilies. However, little is known about the molecular mechanisms of anthocyanin biosynthesis in Asiatic hybrid lily ovaries. To investigate the transcriptional network that governs purple ovary coloration in Asiatic hybrid lilies, we obtained transcriptome data from green ovaries (S1) and purple ovaries (S2) of Asiatic “Tiny Padhye”. Comparative transcriptome analysis revealed 4228 differentially expressed genes. Differential expression analysis revealed that ten unigenes including four CHS genes, one CHI gene, one F3H gene, one F3′H gene, one DFR gene, one UFGT gene, and one 3RT gene were significantly up-regulated in purple ovaries. One MYB gene, LhMYB12-Lat, was identified as a key transcription factor determining the distribution of anthocyanins in Asiatic hybrid lily ovaries. Further qPCR results showed unigenes related to anthocyanin biosynthesis were highly expressed in purple ovaries of three purple-ovaried Asiatic hybrid lilies at stages 2 and 3, while they showed an extremely low level of expression in ovaries of three green-ovaried Asiatic hybrid lilies during all developmental stages. In addition, shading treatment significantly decreased pigment accumulation by suppressing the expression of several unigenes related to anthocyanin biosynthesis in ovaries of Asiatic “Tiny Padhye”. Lastly, a total of 15,048 Simple Sequence Repeats (SSRs) were identified in 13,710 sequences, and primer pairs for SSRs were designed. The results could further our understanding of the molecular mechanisms of anthocyanin biosynthesis in Asiatic hybrid lily ovaries. PMID:27879624

  4. Integrative Analysis of Genomics and Transcriptome Data to Identify Potential Functional Genes of BMDs in Females.

    PubMed

    Chen, Yuan-Cheng; Guo, Yan-Fang; He, Hao; Lin, Xu; Wang, Xia-Fang; Zhou, Rou; Li, Wen-Ting; Pan, Dao-Yan; Shen, Jie; Deng, Hong-Wen

    2016-05-01

    Osteoporosis is known to be highly heritable. However, to date, the findings from more than 20 genome-wide association studies (GWASs) have explained less than 6% of genetic risks. Studies suggest that the missing heritability data may be because of joint effects among genes. To identify novel heritability for osteoporosis, we performed a system-level study on bone mineral density (BMD) by weighted gene coexpression network analysis (WGCNA), using the largest GWAS data set for BMD in the field, Genetic Factors for Osteoporosis Consortium (GEFOS-2), and a transcriptomic gene expression data set generated from transiliac bone biopsies in women. A weighted gene coexpression network was generated for 1574 genes with GWAS nominal evidence of association (p ≤ 0.05) based on dissimilarity measurement on the expression data. Twelve distinct gene modules were identified, and four modules showed nominally significant associations with BMD (p ≤ 0.05), but only one module, the yellow module, demonstrated a good correlation between module membership (MM) and gene significance (GS), suggesting that the yellow module serves an important biological role in bone regulation. Interestingly, through characterization of module content and topology, the yellow module was found to be significantly enriched with contractile fiber part (GO:044449), which is widely recognized as having a close relationship between muscle and bone. Furthermore, detailed submodule analyses of important candidate genes (HOMER1, SPTBN1) by all edges within the yellow module implied significant enrichment of functional connections between bone and cytoskeletal protein binding. Our study yielded novel information from system genetics analyses of GWAS data jointly with transcriptomic data. The findings highlighted a module and several genes in the model as playing important roles in the regulation of bone mass in females, which may yield novel insights into the genetic basis of osteoporosis. © 2016

  5. Low-copy piggyBac transposon mutagenesis in mice identifies genes driving melanoma.

    PubMed

    Ni, Thomas K; Landrette, Sean F; Bjornson, Robert D; Bosenberg, Marcus W; Xu, Tian

    2013-09-17

    Despite considerable efforts to sequence hypermutated cancers such as melanoma, distinguishing cancer-driving genes from thousands of recurrently mutated genes remains a significant challenge. To circumvent the problematic background mutation rates and identify new melanoma driver genes, we carried out a low-copy piggyBac transposon mutagenesis screen in mice. We induced eleven melanomas with mutation burdens that were 100-fold lower relative to human melanomas. Thirty-eight implicated genes, including two known drivers of human melanoma, were classified into three groups based on high, low, or background-level mutation frequencies in human melanomas, and we further explored the functional significance of genes in each group. For two genes overlooked by prevailing discovery methods, we found that loss of membrane associated guanylate kinase, WW and PDZ domain containing 2 and protein tyrosine phosphatase, receptor type, O cooperated with the v-raf murine sarcoma viral oncogene homolog B (BRAF) recurrent V600E mutation to promote cellular transformation. Moreover, for infrequently mutated genes often disregarded by current methods, we discovered recurrent mitogen-activated protein kinase kinase kinase 1 (Map3k1)-activating insertions in our screen, mirroring recurrent MAP3K1 up-regulation in human melanomas. Aberrant expression of Map3k1 enabled growth factor-autonomous proliferation and drove BRAF-independent ERK signaling, thus shedding light on alternative means of activating this prominent signaling pathway in melanoma. In summary, our study contributes several previously undescribed genes involved in melanoma and establishes an important proof-of-principle for the utility of the low-copy transposon mutagenesis approach for identifying cancer-driving genes, especially those masked by hypermutation.

  6. Identifying Significant Features in Cancer Methylation Data Using Gene Pathway Segmentation

    PubMed Central

    Hira, Zena M.; Gillies, Duncan F.

    2016-01-01

    In order to provide the most effective therapy for cancer, it is important to be able to diagnose whether a patient’s cancer will respond to a proposed treatment. Methylation profiling could contain information from which such predictions could be made. Currently, hypothesis testing is used to determine whether possible biomarkers for cancer progression produce statistically significant results. However, this approach requires the identification of individual genes, or sets of genes, as candidate hypotheses, and with the increasing size of modern microarrays, this task is becoming progressively harder. Exhaustive testing of small sets of genes is computationally infeasible, and so hypothesis generation depends either on the use of established biological knowledge or on heuristic methods. As an alternative machine learning, methods can be used to identify groups of genes that are acting together within sets of cancer data and associate their behaviors with cancer progression. These methods have the advantage of being multivariate and unbiased but unfortunately also rapidly become computationally infeasible as the number of gene probes and datasets increases. To address this problem, we have investigated a way of utilizing prior knowledge to segment microarray datasets in such a way that machine learning can be used to identify candidate sets of genes for hypothesis testing. A methylation dataset is divided into subsets, where each subset contains only the probes that relate to a known gene pathway. Each of these pathway subsets is used independently for classification. The classification method is AdaBoost with decision trees as weak classifiers. Since each pathway subset contains a relatively small number of gene probes, it is possible to train and test its classification accuracy quickly and determine whether it has valuable diagnostic information. Finally, genes from successful pathway subsets can be combined to create a classifier of high accuracy. PMID

  7. A Cross-Disorder Method to Identify Novel Candidate Genes for Developmental Brain Disorders

    PubMed Central

    Gonzalez-Mantilla, Andrea J.; Moreno-De-Luca, Andres; Ledbetter, David H.; Martin, Christa Lese

    2017-01-01

    IMPORTANCE Developmental brain disorders are a group of clinically and genetically heterogeneous disorders characterized by high heritability. Specific highly penetrant genetic causes can often be shared by a subset of individuals with different phenotypic features, and recent advances in genome sequencing have allowed the rapid and cost-effective identification of many of these pathogenic variants. OBJECTIVES To identify novel candidate genes for developmental brain disorders and provide additional evidence of previously implicated genes. DATA SOURCES The PubMed database was searched for studies published from March 28,2003, through May 7,2015, with large cohorts of individuals with developmental brain disorders. DATA EXTRACTION AND SYNTHESIS A tiered, multilevel data-integration approach was used, which intersects (1) whole-genome data from structural and sequence pathogenic loss-of-function (pLOF) variants, (2) phenotype data from 6 apparently distinct disorders (intellectual disability, autism, attention-deficit/hyperactivity disorder, schizophrenia, bipolar disorder, and epilepsy), and (3) additional data from largescale studies, smaller cohorts, and case reports focusing on specific candidate genes. All candidate genes were ranked into 4 tiers based on the strength of evidence as follows: tier 1, genes with 3 or more de novo pathogenic loss-of-function variants; tier 2, genes with 2 de novo pathogenic loss-of-function variants; tier 3, genes with 1 de novo pathogenic loss-of-function variant; and tier 4, genes with only inherited (or unknown inheritance) pathogenic loss-of-function variants. MAIN OUTCOMES AND MEASURES Development of a comprehensive knowledge base of candidate genes related to developmental brain disorders. Genes were prioritized based on the inheritance pattern and total number of pathogenic loss-of-function variants identified amongst unrelated individuals with any one of six developmental brain disorders. STUDY SELECTION A combination of

  8. Genetic‐Genomic Replication to Identify Candidate Mouse Atherosclerosis Modifier Genes

    PubMed Central

    Hsu, Jeffrey; Smith, Jonathan D.

    2013-01-01

    Objective Genetics plays a large role in atherosclerosis susceptibility in humans and mice. We attempted to confirm previously determined mouse atherosclerosis‐associated loci and use bioinformatics and transcriptomics to create a catalog of candidate atherosclerosis modifier genes at these loci. Methods and Results A strain intercross was performed between AKR and DBA/2 mice on the apoE−/− background generating 166 F2 progeny. Using the phenotype log10 of the aortic root lesion area, we identified 3 suggestive atherosclerosis quantitative trait loci (Ath QTLs). When combined with our prior strain intercross, we confirmed 3 significant Ath QTLs on chromosomes 2, 15, and 17, with combined logarithm of odds scores of 5.9, 5.3, and 5.6, respectively, which each met the genome‐wide 5% false discovery rate threshold. We identified all of the protein coding differences between these 2 mouse strains within the Ath QTL intervals. Microarray gene expression profiling was performed on macrophages and endothelial cells from this intercross to identify expression QTLs (eQTLs), the loci that are associated with variation in the expression levels of specific transcripts. Cross tissue eQTLs and macrophage eQTLs that replicated from a prior strain intercross were identified. These bioinformatic and eQTL analyses produced a comprehensive list of candidate genes that may be responsible for the Ath QTLs. Conclusions Replication studies for clinical traits as well as gene expression traits are worthwhile in identifying true versus false genetic associations. We have replicated 3 loci on mouse chromosomes 2, 15, and 17 that are associated with atherosclerosis. We have also identified protein coding differences and multiple replicated eQTLs, which may be useful in the identification of atherosclerosis modifier genes. PMID:23525445

  9. Gene expression analysis identifies new candidate genes associated with the development of black skin spots in Corriedale sheep.

    PubMed

    Peñagaricano, Francisco; Zorrilla, Pilar; Naya, Hugo; Robello, Carlos; Urioste, Jorge I

    2012-02-01

    The white coat colour of sheep is an important economic trait. For unknown reasons, some animals are born with, and others develop with time, black skin spots that can also produce pigmented fibres. The presence of pigmented fibres in the white wool significantly decreases the fibre quality. The aim of this work was to study gene expression in black spots (with and without pigmented fibres) and white skin by microarray techniques, in order to identify the possible genes involved in the development of this trait. Five unrelated Corriedale sheep were used and, for each animal, the three possible comparisons (three different hybridisations) between the three samples of interest were performed. Differential gene expression patterns were analysed using different t-test approaches. Most of the major genes with well-known roles in skin pigmentation, e.g. ASIP, MC1R and C-KIT, showed no significant difference in the gene expression between white skin and black spots. On the other hand, many of the differentially expressed genes (raw P-value < 0.005) detected in this study, e.g. C-FOS, KLF4 and UFC1, fulfil biological functions that are plausible to be involved in the formation of black spots. The gene expression of C-FOS and KLF4, transcription factors involved in the cellular response to external factors such as ultraviolet light, was validated by quantitative polymerase chain reaction (PCR). This exploratory study provides a list of candidate genes that could be associated with the development of black skin spots that should be studied in more detail. Characterisation of these genes will enable us to discern the molecular mechanisms involved in the development of this feature and, hence, increase our understanding of melanocyte biology and skin pigmentation. In sheep, understanding this phenomenon is a first step towards developing molecular tools to assist in the selection against the presence of pigmented fibres in white wool.

  10. Expression profiling identifies novel Hh/Gli-regulated genes in developing zebrafish embryos.

    PubMed

    Bergeron, Sadie A; Milla, Luis A; Villegas, Rosario; Shen, Meng-Chieh; Burgess, Shawn M; Allende, Miguel L; Karlstrom, Rolf O; Palma, Verónica

    2008-02-01

    The Hedgehog (Hh) signaling pathway plays critical instructional roles during embryonic development. Misregulation of Hh/Gli signaling is a major causative factor in human congenital disorders and in a variety of cancers. The zebrafish is a powerful genetic model for the study of Hh signaling during embryogenesis, as a large number of mutants that affect different components of the Hh/Gli signaling system have been identified. By performing global profiling of gene expression in different Hh/Gli gain- and loss-of-function scenarios we identified known (e.g., ptc1 and nkx2.2a) and novel Hh-regulated genes that are differentially expressed in embryos with altered Hh/Gli signaling function. By uncovering changes in tissue-specific gene expression, we revealed new embryological processes that are influenced by Hh signaling. We thus provide a comprehensive survey of Hh/Gli-regulated genes during embryogenesis and we identify new Hh-regulated genes that may be targets of misregulation during tumorigenesis.

  11. A misexpression screen identifies genes that can modulate RAS1 pathway signaling in Drosophila melanogaster.

    PubMed Central

    Huang, A M; Rubin, G M

    2000-01-01

    Differentiation of the R7 photoreceptor cell is dependent on the Sevenless receptor tyrosine kinase, which activates the RAS1/mitogen-activated protein kinase signaling cascade. Kinase suppressor of Ras (KSR) functions genetically downstream of RAS1 in this signal transduction cascade. Expression of dominant-negative KSR (KDN) in the developing eye blocks RAS pathway signaling, prevents R7 cell differentiation, and causes a rough eye phenotype. To identify genes that modulate RAS signaling, we screened for genes that alter RAS1/KSR signaling efficiency when misexpressed. In this screen, we recovered three known genes, Lk6, misshapen, and Akap200. We also identified seven previously undescribed genes; one encodes a novel rel domain member of the NFAT family, and six encode novel proteins. These genes may represent new components of the RAS pathway or components of other signaling pathways that can modulate signaling by RAS. We discuss the utility of gain-of-function screens in identifying new components of signaling pathways in Drosophila. PMID:11063696

  12. Integrated genomic analyses identify ERRFI1 and TACC3 as glioblastoma-targeted genes

    PubMed Central

    Payne, Cathy A.; Lampson, Benjamin; Chen, William C.; Liu, Jeff; Solomon, David; Waldman, Todd; Towers, Aaron J.; Gregory, Simon G.; McDonald, Kerrie L.; McLendon, Roger E.; Bigner, Darell D.; Yan, Hai

    2010-01-01

    The glioblastoma genome displays remarkable chromosomal aberrations, which harbor critical glioblastoma-specific genes contributing to several oncogenetic pathways. To identify glioblastoma-targeted genes, we completed a multifaceted genome-wide analysis to characterize the most significant aberrations of DNA content occurring in glioblastomas. We performed copy number analysis of 111 glioblastomas by Digital Karyotyping and Illumina BeadChip assays and validated our findings using data from the TCGA (The Cancer Genome Atlas) glioblastoma project. From this study, we identified recurrent focal copy number alterations in 1p36.23 and 4p16.3. Expression analyses of genes located in the two regions revealed genes which are dysregulated in glioblastomas. Specifically, we identify EGFR negative regulator, ERRFI1, within the minimal region of deletion in 1p36.23. In glioblastoma cells with a focal deletion of the ERRFI1 locus, restoration of ERRFI1 expression slowed cell migration. Furthermore, we demonstrate that TACC3, an Aurora-A kinase substrate, on 4p16.3, displays gain of copy number, is overexpressed in a glioma-grade-specific pattern, and correlates with Aurora kinase overexpression in glioblastomas. Our multifaceted genomic evaluation of glioblastoma establishes ERRFI1 as a potential candidate tumor suppressor gene and TACC3 as a potential oncogene, and provides insight on targets for oncogenic pathway-based therapy. PMID:21113414

  13. Challenges in identifying cancer genes by analysis of exome sequencing data

    PubMed Central

    Hofree, Matan; Carter, Hannah; Kreisberg, Jason F.; Bandyopadhyay, Sourav; Mischel, Paul S.; Friend, Stephen; Ideker, Trey

    2016-01-01

    Massively parallel sequencing has permitted an unprecedented examination of the cancer exome, leading to predictions that all genes important to cancer will soon be identified by genetic analysis of tumours. To examine this potential, here we evaluate the ability of state-of-the-art sequence analysis methods to specifically recover known cancer genes. While some cancer genes are identified by analysis of recurrence, spatial clustering or predicted impact of somatic mutations, many remain undetected due to lack of power to discriminate driver mutations from the background mutational load (13–60% recall of cancer genes impacted by somatic single-nucleotide variants, depending on the method). Cancer genes not detected by mutation recurrence also tend to be missed by all types of exome analysis. Nonetheless, these genes are implicated by other experiments such as functional genetic screens and expression profiling. These challenges are only partially addressed by increasing sample size and will likely hold even as greater numbers of tumours are analysed. PMID:27417679

  14. Identifying Context-Specific Transcription Factor Targets from Prior Knowledge and Gene Expression Data

    PubMed Central

    Fertig, Elana J; Favorov, Alexander V; Ochs, Michael F

    2013-01-01

    Numerous methodologies, assays, and databases presently provide candidate targets of transcription factors (TFs). However, TFs rarely regulate their targets universally. The context of activation of a TF can change the transcriptional response of targets. Direct multiple regulation typical to mammalian genes complicates direct inference of TF targets from gene expression data. We present a novel statistic that infers context-specific TF regulation based upon the CoGAPS algorithm, which infers overlapping gene expression patterns resulting from coregulation. Numerical experiments with simulated data showed that this statistic correctly inferred targets that are common to multiple TFs, except in cases where the signal from a TF is negligible relative to noise level and signal from other TFs. The statistic is robust to moderate levels of error in the simulated gene sets, identifying fewer false positives than false negatives. Significantly, the regulatory statistic refines the number of TF targets relevant to cell signaling in gastrointestinal stromal tumors (GIST) to genes consistent with the phosphorylation patterns of TFs identified in previous studies. As formulated, the proposed regulatory statistic has wide applicability to inferring set membership in integrated datasets. This statistic could be naturally extended to account for prior probabilities of set membership or to add candidate gene targets. PMID:23694699

  15. Identifying context-specific transcription factor targets from prior knowledge and gene expression data.

    PubMed

    Fertig, Elana J; Favorov, Alexander V; Ochs, Michael F

    2013-09-01

    Numerous methodologies, assays, and databases presently provide candidate targets of transcription factors (TFs). However, TFs rarely regulate their targets universally. The context of activation of a TF can change the transcriptional response of targets. Direct multiple regulation typical to mammalian genes complicates direct inference of TF targets from gene expression data. We present a novel statistic that infers context-specific TF regulation based upon the CoGAPS algorithm, which infers overlapping gene expression patterns resulting from coregulation. Numerical experiments with simulated data showed that this statistic correctly inferred targets that are common to multiple TFs, except in cases where the signal from a TF is negligible relative to noise level and signal from other TFs. The statistic is robust to moderate levels of error in the simulated gene sets, identifying fewer false positives than false negatives. Significantly, the regulatory statistic refines the number of TF targets relevant to cell signaling in gastrointestinal stromal tumors (GIST) to genes consistent with the phosphorylation patterns of TFs identified in previous studies. As formulated, the proposed regulatory statistic has wide applicability to inferring set membership in integrated datasets. This statistic could be naturally extended to account for prior probabilities of set membership or to add candidate gene targets.

  16. Integrated genomic analyses identify frequent gene fusion events and VHL inactivation in gastrointestinal stromal tumors

    PubMed Central

    Sun, Choong-Hyun; Park, Inho; Lee, Seungmook; Kwon, Jekeun; Do, Ingu; Hong, Min Eui; Van Vrancken, Michael; Lee, Jeeyun; Park, Joon Oh; Cho, Jeonghee; Kim, Kyoung-Mee; Sohn, Tae Sung

    2016-01-01

    Gastrointestinal stromal tumors (GISTs) are the most common mesenchymal tumors of the gastrointestinal tract. We sequenced nine exomes and transcriptomes, and two genomes of GISTs for integrated analyses. We detected 306 somatic variants in nine GISTs and recurrent protein-altering mutations in 29 genes. Transcriptome sequencing revealed 328 gene fusions, and the most frequently involved fusion events were associated with IGF2 fused to several partner genes including CCND1, FUS, and LASP1. We additionally identified three recurrent read-through fusion transcripts: POLA2-CDC42EP2, C8orf42-FBXO25, and STX16-NPEPL1. Notably, we found intragenic deletions in one of three exons of the VHL gene and increased mRNAs of VEGF, PDGF-β, and IGF-1/2 in 56% of GISTs, suggesting a mechanistic link between VHL inactivation and overexpression of hypoxia-inducible factor target genes in the absence of hypoxia. We also identified copy number gain and increased mRNA expression of AMACR, CRIM1, SKP2, and CACNA1E. Mapping of copy number and gene expression results to the KEGG pathways revealed activation of the JAK-STAT pathway in small intestinal GISTs and the MAPK pathway in wild-type GISTs. These observations will allow us to determine the genetic basis of GISTs and will facilitate further investigation to develop new therapeutic options. PMID:25987131

  17. Integrated genomic analyses identify frequent gene fusion events and VHL inactivation in gastrointestinal stromal tumors.

    PubMed

    Kang, Guhyun; Yun, Hongseok; Sun, Choong-Hyun; Park, Inho; Lee, Seungmook; Kwon, Jekeun; Do, Ingu; Hong, Min Eui; Van Vrancken, Michael; Lee, Jeeyun; Park, Joon Oh; Cho, Jeonghee; Kim, Kyoung-Mee; Sohn, Tae Sung

    2016-02-09

    Gastrointestinal stromal tumors (GISTs) are the most common mesenchymal tumors of the gastrointestinal tract. We sequenced nine exomes and transcriptomes, and two genomes of GISTs for integrated analyses. We detected 306 somatic variants in nine GISTs and recurrent protein-altering mutations in 29 genes. Transcriptome sequencing revealed 328 gene fusions, and the most frequently involved fusion events were associated with IGF2 fused to several partner genes including CCND1, FUS, and LASP1. We additionally identified three recurrent read-through fusion transcripts: POLA2-CDC42EP2, C8orf42-FBXO25, and STX16-NPEPL1. Notably, we found intragenic deletions in one of three exons of the VHL gene and increased mRNAs of VEGF, PDGF-β, and IGF-1/2 in 56% of GISTs, suggesting a mechanistic link between VHL inactivation and overexpression of hypoxia-inducible factor target genes in the absence of hypoxia. We also identified copy number gain and increased mRNA expression of AMACR, CRIM1, SKP2, and CACNA1E. Mapping of copy number and gene expression results to the KEGG pathways revealed activation of the JAK-STAT pathway in small intestinal GISTs and the MAPK pathway in wild-type GISTs. These observations will allow us to determine the genetic basis of GISTs and will facilitate further investigation to develop new therapeutic options.

  18. Genome-Wide Overexpression Screen Identifies Genes Able to Bypass p16-Mediated Senescence in Melanoma.

    PubMed

    Lee, Won Jae; Škalamera, Dubravka; Dahmer-Heath, Mareike; Shakhbazov, Konstanin; Ranall, Max V; Fox, Carly; Lambie, Duncan; Stevenson, Alexander J; Yaswen, Paul; Gonda, Thomas J; Gabrielli, Brian

    2017-03-01

    Malignant melanomas often arise from nevi, which result from initial oncogene-induced hyperproliferation of melanocytes that are maintained in a CDKN2A/p16-mediated senescent state. Thus, genes that can bypass this senescence barrier are likely to contribute to melanoma development. We have performed a gain-of-function screen of 17,030 lentivirally expressed human open reading frames (ORFs) in a melanoma cell line containing an inducible p16 construct to identify such genes. Genes known to bypass p16-induced senescence arrest, including the human papilloma virus 18 E7 gene ( HPV18E7), and genes such as the p16-binding CDK6 with expected functions, as well as panel of novel genes, were identified, including high-mobility group box (HMGB) proteins. A number of these were further validated in two other models of p16-induced senescence. Tissue immunohistochemistry demonstrated higher levels of CDK6 in primary melanomas compared with normal skin and nevi. Reduction of CDK6 levels drove melanoma cells expressing functional p16 into senescence, demonstrating its contribution to bypass senescence.

  19. Analysis of Pigeon (Columba) Ovary Transcriptomes to Identify Genes Involved in Blue Light Regulation.

    PubMed

    Wang, Ying; Ding, Jia-Tong; Yang, Hai-Ming; Yan, Zheng-Jie; Cao, Wei; Li, Yang-Bai

    2015-01-01

    Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp) were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species.

  20. A fast and high performance multiple data integration algorithm for identifying human disease genes

    PubMed Central

    2015-01-01

    Background Integrating multiple data sources is indispensable in improving disease gene identification. It is not only due to the fact that disease genes associated with similar genetic diseases tend to lie close with each other in various biological networks, but also due to the fact that gene-disease associations are complex. Although various algorithms have been proposed to identify disease genes, their prediction performances and the computational time still should be further improved. Results In this study, we propose a fast and high performance multiple data integration algorithm for identifying human disease genes. A posterior probability of each candidate gene associated with individual diseases is calculated by using a Bayesian analysis method and a binary logistic regression model. Two prior probability estimation strategies and two feature vector construction methods are developed to test the performance of the proposed algorithm. Conclusions The proposed algorithm is not only generated predictions with high AUC scores, but also runs very fast. When only a single PPI network is employed, the AUC score is 0.769 by using F2 as feature vectors. The average running time for each leave-one-out experiment is only around 1.5 seconds. When three biological networks are integrated, the AUC score using F3 as feature vectors increases to 0.830, and the average running time for each leave-one-out experiment takes only about 12.54 seconds. It is better than many existing algorithms. PMID:26399620

  1. A microarray model system identifies potential new target genes of the proto-oncogene HOX11.

    PubMed

    Hoffmann, Katrin; Dixon, Darcelle N; Greene, Wayne K; Ford, Jette; Taplin, Ross; Kees, Ursula R

    2004-12-01

    HOX11 is a homeobox gene originally identified at a chromosomal breakpoint in T-cell acute lymphoblastic leukemia (T-ALL). It is one of the most frequently deregulated genes in T-ALL, although the precise role of HOX11 in leukemogenesis as well as in normal development remains obscure. To gain more insight into the functional role of HOX11, we utilized a microarray model system to characterize the gene expression network that it directs. Using one of our T-ALL cell lines that had been stably transfected to express HOX11 and high-density oligonucleotide HG-U95A arrays, we identified a large number of differentially expressed genes in response to the enforced expression of HOX11. We focused on examining genes found to be up-regulated according to the microarray analysis and selected three putative target genes, NFKB2, SMARCD3, and NR4A3, for further investigation. We could not only confirm the up-regulation of NR4A3 by an independent method in all clones expressing HOX11, but luciferase reporter assays demonstrated that the effect that HOX11 exerted on the proximal promoter of NR4A3 was dependent on the presence of an intact homeodomain, providing support for the idea that HOX11 manifests its regulatory function via its action as a transcription factor.

  2. Evolutionary analysis of vision genes identifies potential drivers of visual differences between giraffe and okapi

    PubMed Central

    Agaba, Morris; Cavener, Douglas R.

    2017-01-01

    Background The capacity of visually oriented species to perceive and respond to visual signal is integral to their evolutionary success. Giraffes are closely related to okapi, but the two species have broad range of phenotypic differences including their visual capacities. Vision studies rank giraffe’s visual acuity higher than all other artiodactyls despite sharing similar vision ecological determinants with many of them. The extent to which the giraffe’s unique visual capacity and its difference with okapi is reflected by changes in their vision genes is not understood. Methods The recent availability of giraffe and okapi genomes provided opportunity to identify giraffe and okapi vision genes. Multiple strategies were employed to identify thirty-six candidate mammalian vision genes in giraffe and okapi genomes. Quantification of selection pressure was performed by a combination of branch-site tests of positive selection and clade models of selection divergence through comparing giraffe and okapi vision genes and orthologous sequences from other mammals. Results Signatures of selection were identified in key genes that could potentially underlie giraffe and okapi visual adaptations. Importantly, some genes that contribute to optical transparency of the eye and those that are critical in light signaling pathway were found to show signatures of adaptive evolution or selection divergence. Comparison between giraffe and other ruminants identifies significant selection divergence in CRYAA and OPN1LW. Significant selection divergence was identified in SAG while positive selection was detected in LUM when okapi is compared with ruminants and other mammals. Sequence analysis of OPN1LW showed that at least one of the sites known to affect spectral sensitivity of the red pigment is uniquely divergent between giraffe and other ruminants. Discussion By taking a systemic approach to gene function in vision, the results provide the first molecular clues associated with

  3. Salmonid microarrays identify intestinal genes that reliably monitor P deficiency in rainbow trout aquaculture.

    PubMed

    Kirchner, S; McDaniel, N K; Sugiura, S H; Soteropoulos, P; Tian, B; Fletcher, J W; Ferraris, R P

    2007-08-01

    Nutrient-responsive genes can identify important metabolic pathways and evaluate optimal dietary levels. Using a 16K Salmo salar microarray, we identified in rainbow trout (Oncorhynchus mykiss) 21 potential phosphorus (P)-responsive genes, mainly involved in immune response, proteolysis or transport, whose expression levels changed in the intestine after 5 days of feeding a low-P (LP) diet. Diet-induced changes in the expression levels of several genes in each fish were tightly correlated with changes in serum P, and the changes persisted for an additional 15 days after dietary P deficiency. We then evaluated these and previously identified P-responsive genes under simulated farm conditions, and monitored the intestinal gene expression from 6 h to 7 days after the trout were switched from a sufficient-P (SP) diet to a LP diet (SP-->LP), and from a LP diet to a SP diet (LP-->SP). After 7 days, mean serum P decreased 0.14 mM/day for SP-->LP and increased 0.10 mm/day for LP-->SP. The mRNA abundance of the metalloendopeptidase meprin 1alpha (MEP1alpha), the Na(+)-dependent phosphate co-transporter (NaPi2b,SLC34A2), the sulfotransferase SULT2beta1 and carbonic anhydrase XIII genes all increased after SP-->LP and decreased after LP-->SP, suggesting that adaptive expression is reversible and correlated with dietary P. The duration of change in gene expression in response to SP-->LP was generally shorter than that of LP-->SP, suggesting potentially different mechanisms of adaptation to deficiency as opposed to excess. Diet-induced changes in mRNA abundance of other genes were either transient or modest. We identified, by heterologous microarray hybridization, new genes sensitive to perturbations in dietary P, and then showed that these genes can reliably monitor P deficiency under field conditions. Simultaneous changes in the expression of these P biomarkers could predict either P deficiency (to prevent economic losses to the farmers) or P excess (to prevent inadvertent

  4. Gene Expression Profile of High IFN-γ Producers Stimulated with Leishmania braziliensis Identifies Genes Associated with Cutaneous Leishmaniasis

    PubMed Central

    Carneiro, Marcia W.; Fukutani, Kiyoshi F.; Andrade, Bruno B.; Curvelo, Rebecca P.; Cristal, Juqueline R.; Carvalho, Augusto M.; Barral, Aldina

    2016-01-01

    Background The initial response to Leishmania parasites is essential in determining disease development or resistance. In vitro, a divergent response to Leishmania, characterized by high or low IFN-γ production has been described as a potential tool to predict both vaccine response and disease susceptibility in vivo. Methods and findings We identified uninfected and healthy individuals that were shown to be either high- or low IFN-γ producers (HPs and LPs, respectively) following stimulation of peripheral blood cells with Leishmania braziliensis. Following stimulation, RNA was processed for gene expression analysis using immune gene arrays. Both HPs and LPs were shown to upregulate the expression of CXCL10, IFI27, IL6 and LTA. Genes expressed in HPs only (CCL7, IL8, IFI44L and IL1B) were associated with pathways related to IL17 and TREM 1 signaling. In LPs, uniquely expressed genes (for example IL9, IFI44, IFIT1 and IL2RA) were associated with pathways related to pattern recognition receptors and interferon signaling. We then investigated whether the unique gene expression profiles described here could be recapitulated in vivo, in individuals with active Cutaneous Leishmaniasis or with subclinical infection. Indeed, using a set of six genes (TLR2, JAK2, IFI27, IFIT1, IRF1 and IL6) modulated in HPs and LPs, we could successfully discriminate these two clinical groups. Finally, we demonstrate that these six genes are significantly overexpressed in CL lesions. Conclusion Upon interrogation of the peripheral response of naive individuals with diverging IFN-γ production to L. braziliensis, we identified differences in the innate response to the parasite that are recapitulated in vivo and that discriminate CL patients from individuals presenting a subclinical infection. PMID:27870860

  5. Candidate fire blight resistance genes in Malus identified with the use of genomic tools and approaches

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The goal of this research is to utilize current advances in Rosaceae genomics to identify DNA markers for use in marker-assisted selection of durable resistance to fire blight. Candidate fire blight resistance genes were selected and ranked based upon differential expression after inoculation with ...

  6. Systems Biology in Animal Breeding: Identifying relationships among markers, genes, and phenotypes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Breeding and Genetics Symposium titled “Systems Biology in Animal Breeding: Identifying relationships among markers, genes, and phenotypes” was held at the Joint Annual Meeting of the American Dairy Science Association and the American Society of Animal Science in Phoenix, AZ, July 15 to 19, 201...

  7. Gene from a novel plant virus satellite from grapevine identifies a viral satellite lineage

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We have identified the genome of a novel viral satellite in deep sequence analysis of double-stranded RNA from grapevine. The genome was 1,060 bases in length, and encoded two open reading frames. Neither frame was related to any known plant virus gene. But translation of the longer frame showed ...

  8. Multiple gene mutations identified in patients infected with influenza A (H7N9) virus

    PubMed Central

    Chen, Cuicui; Wang, Mingbang; Zhu, Zhaoqin; Qu, Jieming; Xi, Xiuhong; Tang, Xinjun; Lao, Xiangda; Seeley, Eric; Li, Tao; Fan, Xiaomei; Du, Chunling; Wang, Qin; Yang, Lin; Hu, Yunwen; Bai, Chunxue; Zhang, Zhiyong; Lu, Shuihua; Song, Yuanlin; Zhou, Wenhao

    2016-01-01

    Influenza A (H7N9) virus induced high mortality since 2013. It is important to elucidate the potential genetic variations that contribute to virus infection susceptibilities. In order to identify genetic mutations that might increase host susceptibility to infection, we performed exon sequencing and validated the SNPS by Sanger sequencing on 18 H7N9 patients. Blood samples were collected from 18 confirmed H7N9 patients. The genomic DNA was captured with the Agilent SureSelect Human All Exon kit, sequenced on the Illumina Hiseq 2000, and the resulting data processed and annotated with Genome analysis Tool. SNPs were verified by independent Sanger sequencing. The DAVID database and the DAPPLE database were used to do bioinformatics analysis. Through exon sequencing and Sanger sequencing, we identified 21 genes that were highly associated with H7N9 influenza infection. Protein-protein interaction analysis showed that direct interactions among genetic products were significantly higher than expected (p = 0.004), and DAVID analysis confirmed the defense-related functions of these genes. Gene mutation profiles of survived and non-survived patients were similar, suggesting some of genes identified in this study may be associated with H7N9 influenza susceptibility. Host specific genetic determinants of disease severity identified by this approach may provide new targets for the treatment of H7N9 influenza. PMID:27156515

  9. Tsukamurella pulmonis Bloodstream Infection Identified by secA1 Gene Sequencing

    PubMed Central

    Cano, María E.; García de la Fuente, Celia; Martínez-Martínez, Luis; López, Mónica; Fernández-Mazarrasa, Carlos

    2014-01-01

    Recurrent bloodstream infections caused by a Gram-positive bacterium affected an immunocompromised child. Tsukamurella pulmonis was the microorganism identified by secA1 gene sequencing. Antibiotic treatment in combination with removal of the subcutaneous port healed the patient. PMID:25520439

  10. Regularized Non-negative Matrix Factorization for Identifying Differential Genes and Clustering Samples: a Survey.

    PubMed

    Liu, Jin-Xing; Wang, Dong; Gao, Ying-Lian; Zheng, Chun-Hou; Xu, Yong; Yu, Jiguo

    2017-02-07

    Non-negative Matrix Factorization (NMF), a classical method for dimensionality reduction, has been applied in many fields. It is based on the idea that negative numbers are physically meaningless in various data-processing tasks. Apart from its contribution to conventional data analysis, the recent overwhelming interest in NMF is due to its newly discovered ability to solve challenging data mining and machine learning problems, especially in relation to gene expression data. This survey paper mainly focuses on research examining the application of NMF to identify differentially expressed genes and to cluster samples, and the main NMF models, properties, principles, and algorithms with its various generalizations, extensions, and modifications are summarized. The experimental results demonstrate the performance of the various NMF algorithms in identifying differentially expressed genes and clustering samples.

  11. Exome sequencing in amyotrophic lateral sclerosis identifies risk genes and pathways

    PubMed Central

    Cirulli, Elizabeth T.; Lasseigne, Brittany N.; Petrovski, Slavé; Sapp, Peter C.; Dion, Patrick A.; Leblond, Claire S.; Couthouis, Julien; Lu, Yi-Fan; Wang, Quanli; Krueger, Brian J.; Ren, Zhong; Keebler, Jonathan; Han, Yujun; Levy, Shawn E.; Boone, Braden E.; Wimbish, Jack R.; Waite, Lindsay L.; Jones, Angela L.; Carulli, John P.; Day-Williams, Aaron G.; Staropoli, John F.; Xin, Winnie W.; Chesi, Alessandra; Raphael, Alya R.; McKenna-Yasek, Diane; Cady, Janet; de Jong, J.M.B. Vianney; Kenna, Kevin P.; Smith, Bradley N.; Topp, Simon; Miller, Jack; Gkazi, Athina; Al-Chalabi, Ammar; van den Berg, Leonard H.; Veldink, Jan; Silani, Vincenzo; Ticozzi, Nicola; Shaw, Christopher E.; Baloh, Robert H.; Appel, Stanley; Simpson, Ericka; Lagier-Tourenne, Clotilde; Pulst, Stefan M.; Gibson, Summer; Trojanowski, John Q.; Elman, Lauren; McCluskey, Leo; Grossman, Murray; Shneider, Neil A.; Chung, Wendy K.; Ravits, John M.; Glass, Jonathan D.; Sims, Katherine B.; Van Deerlin, Vivianna M.; Maniatis, Tom; Hayes, Sebastian D.; Ordureau, Alban; Swarup, Sharan; Landers, John; Baas, Frank; Allen, Andrew S.; Bedlack, Richard S.; Harper, J. Wade; Gitler, Aaron D.; Rouleau, Guy A.; Brown, Robert; Harms, Matthew B.; Cooper, Gregory M.; Harris, Tim; Myers, Richard M.; Goldstein, David B.

    2015-01-01

    Amyotrophic lateral sclerosis (ALS) is a devastating neurological disease with no effective treatment. Here we report the results of a moderate-scale sequencing study aimed at identifying new genes contributing to predisposition for ALS. We performed whole exome sequencing of 2,874 ALS patients and compared them to 6,405 controls. Several known ALS genes were found to be associated, and the non-canonical IκB kinase family TANK-Binding Kinase 1 (TBK1) was identified as an ALS gene. TBK1 is known to bind to and phosphorylate a number of proteins involved in innate immunity and autophagy, including optineurin (OPTN) and p62 (SQSTM1/sequestosome), both of which have also been implicated in ALS. These observations reveal a key role of the autophagic pathway in ALS and suggest specific targets for therapeutic intervention. PMID:25700176

  12. Gene Expression Signature Analysis Identifies Vorinostat as a Candidate Therapy for Gastric Cancer

    PubMed Central

    Choi, Woonyoung; Park, Yun-Yong; Kim, KyoungHyun; Kim, Sang-Bae; Lee, Ju-Seog; Mills, Gordon B.; Cho, Jae Yong

    2011-01-01

    Background Gastric cancer continues to be one of the deadliest cancers in the world and therefore identification of new drugs targeting this type of cancer is thus of significant importance. The purpose of this study was to identify and validate a therapeutic agent which might improve the outcomes for gastric cancer patients in the future. Methodology/Principal Findings Using microarray technology, we generated a gene expression profile of human gastric cancer–specific genes from human gastric cancer tissue samples. We used this profile in the Broad Institute's Connectivity Map analysis to identify candidate therapeutic compounds for gastric cancer. We found the histone deacetylase inhibitor vorinostat as the lead compound and thus a potential therapeutic drug for gastric cancer. Vorinostat induced both apoptosis and autophagy in gastric cancer cell lines. Pharmacological and genetic inhibition of autophagy however, increased the therapeutic efficacy of vorinostat, indicating that a combination of vorinostat with autophagy inhibitors may therapeutically be more beneficial. Moreover, gene expression analysis of gastric cancer identified a collection of genes (ITGB5, TYMS, MYB, APOC1, CBX5, PLA2G2A, and KIF20A) whose expression was elevated in gastric tumor tissue and downregulated more than 2-fold by vorinostat treatment in gastric cancer cell lines. In contrast, SCGB2A1, TCN1, CFD, APLP1, and NQO1 manifested a reversed pattern. Conclusions/Significance We showed that analysis of gene expression signature may represent an emerging approach to discover therapeutic agents for gastric cancer, such as vorinostat. The observation of altered gene expression after vorinostat treatment may provide the clue to identify the molecular mechanism of vorinostat and those patients likely to benefit from vorinostat treatment. PMID:21931799

  13. Deep sequencing identifies viral and wasp genes with potential roles in replication of Microplitis demolitor Bracovirus.

    PubMed

    Burke, Gaelen R; Strand, Michael R

    2012-03-01

    Viruses in the genus Bracovirus (BV) (Polydnaviridae) are symbionts of parasitoid wasps that specifically replicate in the ovaries of females. Recent analysis of expressed sequence tags from two wasp species, Cotesia congregata and Chelonus inanitus, identified transcripts related to 24 different nudivirus genes. These results together with other data strongly indicate that BVs evolved from a nudivirus ancestor. However, it remains unclear whether BV-carrying wasps contain other nudivirus-like genes and what types of wasp genes may also be required for BV replication. Microplitis demolitor carries Microplitis demolitor bracovirus (MdBV). Here we characterized MdBV replication and performed massively parallel sequencing of M. demolitor ovary transcripts. Our results indicated that MdBV replication begins in stage 2 pupae and continues in adults. Analysis of prereplication- and active-replication-stage ovary RNAs yielded 22 Gb of sequence that assembled into 66,425 transcripts. This breadth of sampling indicated that a large percentage of genes in the M. demolitor genome were sequenced. A total of 41 nudivirus-like transcripts were identified, of which a majority were highly expressed during MdBV replication. Our results also identified a suite of wasp genes that were highly expressed during MdBV replication. Among these products were several transcripts with conserved roles in regulating locus-specific DNA amplification by eukaryotes. Overall, our data set together with prior results likely identify the majority of nudivirus-related genes that are transcriptionally functional during BV replication. Our results also suggest that amplification of proviral DNAs for packaging into BV virions may depend upon the replication machinery of wasps.

  14. A New Strategy to Identify and Annotate Human RPE-Specific Gene Expression

    PubMed Central

    Booij, Judith C.; ten Brink, Jacoline B.; Swagemakers, Sigrid M. A.; Verkerk, Annemieke J. M. H.; Essing, Anke H. W.; van der Spek, Peter J.; Bergen, Arthur A. B.

    2010-01-01

    Background To identify and functionally annotate cell type-specific gene expression in the human retinal pigment epithelium (RPE), a key tissue involved in age-related macular degeneration and retinitis pigmentosa. Methodology RPE, photoreceptor and choroidal cells were isolated from selected freshly frozen healthy human donor eyes using laser microdissection. RNA isolation, amplification and hybridization to 44 k microarrays was carried out according to Agilent specifications. Bioinformatics was carried out using Rosetta Resolver, David and Ingenuity software. Principal Findings Our previous 22 k analysis of the RPE transcriptome showed that the RPE has high levels of protein synthesis, strong energy demands, is exposed to high levels of oxidative stress and a variable degree of inflammation. We currently use a complementary new strategy aimed at the identification and functional annotation of RPE-specific expressed transcripts. This strategy takes advantage of the multilayered cellular structure of the retina and overcomes a number of limitations of previous studies. In triplicate, we compared the transcriptomes of RPE, photoreceptor and choroidal cells and we deduced RPE specific expression. We identified at least 114 entries with RPE-specific gene expression. Thirty-nine of these 114 genes also show high expression in the RPE, comparison with the literature showed that 85% of these 39 were previously identified to be expressed in the RPE. In the group of 114 RPE specific genes there was an overrepresentation of genes involved in (membrane) transport, vision and ophthalmic disease. More fundamentally, we found RPE-specific involvement in the RAR-activation, retinol metabolism and GABA receptor signaling pathways. Conclusions In this study we provide a further specification and understanding of the RPE transcriptome by identifying and analyzing genes that are specifically expressed in the RPE. PMID:20479888

  15. Transposon mutagenesis identifies genes and cellular processes driving epithelial-mesenchymal transition in hepatocellular carcinoma

    PubMed Central

    Kodama, Takahiro; Newberg, Justin Y.; Kodama, Michiko; Rangel, Roberto; Yoshihara, Kosuke; Tien, Jean C.; Parsons, Pamela H.; Wu, Hao; Finegold, Milton J.; Copeland, Neal G.; Jenkins, Nancy A.

    2016-01-01

    Epithelial-mesenchymal transition (EMT) is thought to contribute to metastasis and chemoresistance in patients with hepatocellular carcinoma (HCC), leading to their poor prognosis. The genes driving EMT in HCC are not yet fully understood, however. Here, we show that mobilization of Sleeping Beauty (SB) transposons in immortalized mouse hepatoblasts induces mesenchymal liver tumors on transplantation to nude mice. These tumors show significant down-regulation of epithelial markers, along with up-regulation of mesenchymal markers and EMT-related transcription factors (EMT-TFs). Sequencing of transposon insertion sites from tumors identified 233 candidate cancer genes (CCGs) that were enriched for genes and cellular processes driving EMT. Subsequent trunk driver analysis identified 23 CCGs that are predicted to function early in tumorigenesis and whose mutation or alteration in patients with HCC is correlated with poor patient survival. Validation of the top trunk drivers identified in the screen, including MET (MET proto-oncogene, receptor tyrosine kinase), GRB2-associated binding protein 1 (GAB1), HECT, UBA, and WWE domain containing 1 (HUWE1), lysine-specific demethylase 6A (KDM6A), and protein-tyrosine phosphatase, nonreceptor-type 12 (PTPN12), showed that deregulation of these genes activates an EMT program in human HCC cells that enhances tumor cell migration. Finally, deregulation of these genes in human HCC was found to confer sorafenib resistance through apoptotic tolerance and reduced proliferation, consistent with recent studies showing that EMT contributes to the chemoresistance of tumor cells. Our unique cell-based transposon mutagenesis screen appears to be an excellent resource for discovering genes involved in EMT in human HCC and potentially for identifying new drug targets. PMID:27247392

  16. Exome sequencing in amyotrophic lateral sclerosis identifies risk genes and pathways.

    PubMed

    Cirulli, Elizabeth T; Lasseigne, Brittany N; Petrovski, Slavé; Sapp, Peter C; Dion, Patrick A; Leblond, Claire S; Couthouis, Julien; Lu, Yi-Fan; Wang, Quanli; Krueger, Brian J; Ren, Zhong; Keebler, Jonathan; Han, Yujun; Levy, Shawn E; Boone, Braden E; Wimbish, Jack R; Waite, Lindsay L; Jones, Angela L; Carulli, John P; Day-Williams, Aaron G; Staropoli, John F; Xin, Winnie W; Chesi, Alessandra; Raphael, Alya R; McKenna-Yasek, Diane; Cady, Janet; Vianney de Jong, J M B; Kenna, Kevin P; Smith, Bradley N; Topp, Simon; Miller, Jack; Gkazi, Athina; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan; Silani, Vincenzo; Ticozzi, Nicola; Shaw, Christopher E; Baloh, Robert H; Appel, Stanley; Simpson, Ericka; Lagier-Tourenne, Clotilde; Pulst, Stefan M; Gibson, Summer; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Grossman, Murray; Shneider, Neil A; Chung, Wendy K; Ravits, John M; Glass, Jonathan D; Sims, Katherine B; Van Deerlin, Vivianna M; Maniatis, Tom; Hayes, Sebastian D; Ordureau, Alban; Swarup, Sharan; Landers, John; Baas, Frank; Allen, Andrew S; Bedlack, Richard S; Harper, J Wade; Gitler, Aaron D; Rouleau, Guy A; Brown, Robert; Harms, Matthew B; Cooper, Gregory M; Harris, Tim; Myers, Richard M; Goldstein, David B

    2015-03-27

    Amyotrophic lateral sclerosis (ALS) is a devastating neurological disease with no effective treatment. We report the results of a moderate-scale sequencing study aimed at increasing the number of genes known to contribute to predisposition for ALS. We performed whole-exome sequencing of 2869 ALS patients and 6405 controls. Several known ALS genes were found to be associated, and TBK1 (the gene encoding TANK-binding kinase 1) was identified as an ALS gene. TBK1 is known to bind to and phosphorylate a number of proteins involved in innate immunity and autophagy, including optineurin (OPTN) and p62 (SQSTM1/sequestosome), both of which have also been implicated in ALS. These observations reveal a key role of the autophagic pathway in ALS and suggest specific targets for therapeutic intervention.

  17. Gene trapping identifies a putative tumor suppressor and a new inducer of cell migration

    SciTech Connect

    Guardiola-Serrano, Francisca; Haendeler, Judith; Lukosz, Margarete; Sturm, Karsten; Melchner, Harald von; Altschmied, Joachim

    2008-11-28

    Tumor necrosis factor alpha (TNF{alpha}) is a pleiotropic cytokine involved in apoptotic cell death, cellular proliferation, differentiation, inflammation, and tumorigenesis. In tumors it is secreted by tumor associated macrophages and can have both pro- and anti-tumorigenic effects. To identify genes regulated by TNF{alpha}, we performed a gene trap screen in the mammary carcinoma cell line MCF-7 and recovered 64 unique, TNF{alpha}-induced gene trap integration sites. Among these were the genes coding for the zinc finger protein ZC3H10 and for the transcription factor grainyhead-like 3 (GRHL3). In line with the dual effects of TNF{alpha} on tumorigenesis, we found that ZC3H10 inhibits anchorage independent growth in soft agar suggesting a tumor suppressor function, whereas GRHL3 strongly stimulated the migration of endothelial cells which is consistent with an angiogenic, pro-tumorigenic function.

  18. Newly identified CSP41b gene localized in chloroplasts affects leaf color in rice.

    PubMed

    Mei, Jiasong; Li, Feifei; Liu, Xuri; Hu, Guocheng; Fu, Yaping; Liu, Wenzhen

    2017-03-01

    A rice mutant with light-green leaves was discovered from a transgenic line of Oryza sativa. The mutant has reduced chlorophyll content and abnormal chloroplast morphology throughout its life cycle. Genetic analysis revealed that a single nuclear-encoded recessive gene is responsible for the mutation, here designated as lgl1. To isolate the lgl1 gene, a high-resolution physical map of the chromosomal region around the lgl1 gene was made using a mapping population consisting of 1984 mutant individuals. The lgl1 gene was mapped in the 76.5kb region between marker YG4 and marker YG5 on chromosome 12. Sequence analysis revealed that there was a 39bp deletion within the fourth exon of the candidate gene Os12g0420200 (TIGR locus Os12g23180) encoding a chloroplast stem-loop-binding protein of 41kDa b (CSP41b). The lgl1 mutation was rescued by transformation with the wild type CSP41b gene. Accordingly, the CSP41b gene is identified as the LGL1 gene. CSP41b was transcribed in various tissues and was mainly expressed in leaves. Expression of CSP41b-GFP fusion protein indicated that CSP41b is localized in chloroplasts. The expression levels of some key genes involved in chlorophyll biosynthesis and photosynthesis, such as ChlD, ChlI, Hema1, Ygl1, POR, Cab1R, Cab2R, PsaA, and rbcL, was significantly changed in the lgl1 mutant. Our results demonstrate that CSP41b is a novel gene required for normal leaf color and chloroplast morphology in rice.

  19. Global Gene-Expression Analysis to Identify Differentially Expressed Genes Critical for the Heat Stress Response in Brassica rapa.

    PubMed

    Dong, Xiangshu; Yi, Hankuil; Lee, Jeongyeo; Nou, Ill-Sup; Han, Ching-Tack; Hur, Yoonkang

    2015-01-01

    Genome-wide dissection of the heat stress response (HSR) is necessary to overcome problems in crop production caused by global warming. To identify HSR genes, we profiled gene expression in two Chinese cabbage inbred lines with different thermotolerances, Chiifu and Kenshin. Many genes exhibited >2-fold changes in expression upon exposure to 0.5- 4 h at 45°C (high temperature, HT): 5.2% (2,142 genes) in Chiifu and 3.7% (1,535 genes) in Kenshin. The most enriched GO (Gene Ontology) items included 'response to heat', 'response to reactive oxygen species (ROS)', 'response to temperature stimulus', 'response to abiotic stimulus', and 'MAPKKK cascade'. In both lines, the genes most highly induced by HT encoded small heat shock proteins (Hsps) and heat shock factor (Hsf)-like proteins such as HsfB2A (Bra029292), whereas high-molecular weight Hsps were constitutively expressed. Other upstream HSR components were also up-regulated: ROS-scavenging genes like glutathione peroxidase 2 (BrGPX2, Bra022853), protein kinases, and phosphatases. Among heat stress (HS) marker genes in Arabidopsis, only exportin 1A (XPO1A) (Bra008580, Bra006382) can be applied to B. rapa for basal thermotolerance (BT) and short-term acquired thermotolerance (SAT) gene. CYP707A3 (Bra025083, Bra021965), which is involved in the dehydration response in Arabidopsis, was associated with membrane leakage in both lines following HS. Although many transcription factors (TF) genes, including DREB2A (Bra005852), were involved in HS tolerance in both lines, Bra024224 (MYB41) and Bra021735 (a bZIP/AIR1 [Anthocyanin-Impaired-Response-1]) were specific to Kenshin. Several candidate TFs involved in thermotolerance were confirmed as HSR genes by real-time PCR, and these assignments were further supported by promoter analysis. Although some of our findings are similar to those obtained using other plant species, clear differences in Brassica rapa reveal a distinct HSR in this species. Our data could also provide a

  20. Joint QTL mapping and gene expression analysis identify positional candidate genes influencing pork quality traits

    PubMed Central

    González-Prendes, Rayner; Quintanilla, Raquel; Cánovas, Angela; Manunza, Arianna; Figueiredo Cardoso, Tainã; Jordana, Jordi; Noguera, José Luis; Pena, Ramona N.; Amills, Marcel

    2017-01-01

    Meat quality traits have an increasing importance in the pig industry because of their strong impact on consumer acceptance. Herewith, we have combined phenotypic and microarray expression data to map loci with potential effects on five meat quality traits recorded in the longissimus dorsi (LD) and gluteus medius (GM) muscles of 350 Duroc pigs, i.e. pH at 24 hours post-mortem (pH24), electric conductivity (CE) and muscle redness (a*), lightness (L*) and yellowness (b*). We have found significant genome-wide associations for CE of LD on SSC4 (~104 Mb), SSC5 (~15 Mb) and SSC13 (~137 Mb), while several additional regions were significantly associated with meat quality traits at the chromosome-wide level. There was a low positional concordance between the associations found for LD and GM traits, a feature that reflects the existence of differences in the genetic determinism of meat quality phenotypes in these two muscles. The performance of an eQTL search for SNPs mapping to the regions associated with meat quality traits demonstrated that the GM a* SSC3 and pH24 SSC17 QTL display positional concordance with cis-eQTL regulating the expression of several genes with a potential role on muscle metabolism. PMID:28054563

  1. Yeast functional screen to identify genes conferring salt stress tolerance in Salicornia europaea

    PubMed Central

    Nakahara, Yoshiki; Sawabe, Shogo; Kainuma, Kenta; Katsuhara, Maki; Shibasaka, Mineo; Suzuki, Masanori; Yamamoto, Kosuke; Oguri, Suguru; Sakamoto, Hikaru

    2015-01-01

    Salinity is a critical environmental factor that adversely affects crop productivity. Halophytes have evolved various mechanisms to adapt to saline environments. Salicornia europaea L. is one of the most salt-tolerant plant species. It does not have special salt-secreting structures like a salt gland or salt bladder, and is therefore a good model for studying the common mechanisms underlying plant salt tolerance. To identify candidate genes encoding key proteins in the mediation of salt tolerance in S. europaea, we performed a functional screen of a cDNA library in yeast. The library was screened for genes that allowed the yeast to grow in the presence of 1.3 M NaCl. We obtained three full-length S. europaea genes that confer salt tolerance. The genes are predicted to encode (1) a novel protein highly homologous to thaumatin-like proteins, (2) a novel coiled-coil protein of unknown function, and (3) a novel short peptide of 32 residues. Exogenous application of a synthetic peptide corresponding to the 32 residues improved salt tolerance of Arabidopsis. The approach described in this report provides a rapid assay system for large-scale screening of S. europaea genes involved in salt stress tolerance and supports the identification of genes responsible for such mechanisms. These genes may be useful candidates for improving crop salt tolerance by genetic transformation. PMID:26579166

  2. A Special Local Clustering Algorithm for Identifying the Genes Associated With Alzheimer’s Disease

    PubMed Central

    Pang, Chao-Yang; Hu, Wei; Hu, Ben-Qiong; Shi, Ying; Vanderburg, Charles R.; Rogers, Jack T.

    2010-01-01

    Clustering is the grouping of similar objects into a class. Local clustering feature refers to the phenomenon whereby one group of data is separated from another, and the data from these different groups are clustered locally. A compact class is defined as one cluster in which all similar elements cluster tightly within the cluster. Herein, the essence of the local clustering feature, revealed by mathematical manipulation, results in a novel clustering algorithm termed as the special local clustering (SLC) algorithm that was used to process gene microarray data related to Alzheimer’s disease (AD). SLC algorithm was able to group together genes with similar expression patterns and identify significantly varied gene expression values as isolated points. If a gene belongs to a compact class in control data and appears as an isolated point in incipient, moderate and/or severe AD gene microarray data, this gene is possibly associated with AD. Application of a clustering algorithm in disease-associated gene identification such as in AD is rarely reported. PMID:20089478

  3. Transcriptome Analysis Identifies the Dysregulation of Ultraviolet Target Genes in Human Skin Cancers

    PubMed Central

    Shen, Yao; Kim, Arianna L.; Du, Rong; Liu, Liang

    2016-01-01

    Exposure to ultraviolet radiation (UVR) is a major risk factor for both melanoma and non-melanoma skin cancers. In addition to its mutagenic effect, UVR can also induce substantial transcriptional instability in skin cells affecting thousands of genes, including many cancer genes, suggesting that transcriptional instability may be another important etiological factor in skin photocarcinogenesis. In this study, we performed detailed transcriptomic profiling studies to characterize the kinetic changes in global gene expression in human keratinocytes exposed to different UVR conditions. We identified a subset of UV-responsive genes as UV signature genes (UVSGs) based on 1) conserved UV-responsiveness of this subset of genes among different keratinocyte lines; and 2) UV-induced persistent changes in their mRNA levels long after exposure. Interestingly, 11 of the UVSGs were shown to be critical to skin cancer cell proliferation and survival. Through computational Gene Set Enrichment Analysis, we demonstrated that a significant portion of the UVSGs were dysregulated in human skin squamous cell carcinomas, but not in other human malignancies. This highlights the potential and specificity of the UVSGs in clinical diagnosis of UV damage and stratification of skin cancer risk. PMID:27643989

  4. Mapping of Craniofacial Traits in Outbred Mice Identifies Major Developmental Genes Involved in Shape Determination

    PubMed Central

    Pallares, Luisa F.; Carbonetto, Peter; Gopalakrishnan, Shyam; Parker, Clarissa C.; Ackert-Bicknell, Cheryl L.; Palmer, Abraham A.; Tautz, Diethard

    2015-01-01

    The vertebrate cranium is a prime example of the high evolvability of complex traits. While evidence of genes and developmental pathways underlying craniofacial shape determination is accumulating, we are still far from understanding how such variation at the genetic level is translated into craniofacial shape variation. Here we used 3D geometric morphometrics to map genes involved in shape determination in a population of outbred mice (Carworth Farms White, or CFW). We defined shape traits via principal component analysis of 3D skull and mandible measurements. We mapped genetic loci associated with shape traits at ~80,000 candidate single nucleotide polymorphisms in ~700 male mice. We found that craniofacial shape and size are highly heritable, polygenic traits. Despite the polygenic nature of the traits, we identified 17 loci that explain variation in skull shape, and 8 loci associated with variation in mandible shape. Together, the associated variants account for 11.4% of skull and 4.4% of mandible shape variation, however, the total additive genetic variance associated with phenotypic variation was estimated in ~45%. Candidate genes within the associated loci have known roles in craniofacial development; this includes 6 transcription factors and several regulators of bone developmental pathways. One gene, Mn1, has an unusually large effect on shape variation in our study. A knockout of this gene was previously shown to affect negatively the development of membranous bones of the cranial skeleton, and evolutionary analysis shows that the gene has arisen at the base of the bony vertebrates (Eutelostomi), where the ossified head first appeared. Therefore, Mn1 emerges as a key gene for both skull formation and within-population shape variation. Our study shows that it is possible to identify important developmental genes through genome-wide mapping of high-dimensional shape features in an outbred population. PMID:26523602

  5. Comprehensively identifying and characterizing the missing gene sequences in human reference genome with integrated analytic approaches.

    PubMed

    Chen, Geng; Wang, Charles; Shi, Leming; Tong, Weida; Qu, Xiongfei; Chen, Jiwei; Yang, Jianmin; Shi, Caiping; Chen, Long; Zhou, Peiying; Lu, Bingxin; Shi, Tieliu

    2013-08-01

    The human reference genome is still incomplete and a number of gene sequences are missing from it. The approaches to uncover them, the reasons causing their absence and their functions are less explored. Here, we comprehensively identified and characterized the missing genes of human reference genome with RNA-Seq data from 16 different human tissues. By using a combined approach of genome-guided transcriptome reconstruction coupled with genome-wide comparison, we uncovered 3.78 and 2.37 Mb transcribed regions in the human genome assemblies of Celera and HuRef either missed from their homologous chromosomes of NCBI human reference genome build 37.2 or partially or entirely absent from the reference. We further identified a significant number of novel transcript contigs in each tissue from de novo transcriptome assembly that are unalignable to NCBI build 37.2 but can be aligned to at least one of the genomes from Celera, HuRef, chimpanzee, macaca or mouse. Our analyses indicate that the missing genes could result from genome misassembly, transposition, copy number variation, translocation and other structural variations. Moreover, our results further suggest that a large portion of these missing genes are conserved between human and other mammals, implying their important biological functions. Totally, 1,233 functional protein domains were detected in these missing genes. Collectively, our study not only provides approaches for uncovering the missing genes of a genome, but also proposes the potential reasons causing genes missed from the genome and highlights the importance of uncovering the missing genes of incomplete genomes.

  6. Using RNA sequencing for identifying gene imprinting and random monoallelic expression in human placenta

    PubMed Central

    Metsalu, Tauno; Viltrop, Triin; Tiirats, Airi; Rajashekar, Balaji; Reimann, Ene; Kõks, Sulev; Rull, Kristiina; Milani, Lili; Acharya, Ganesh; Basnet, Purusotam; Vilo, Jaak; Mägi, Reedik; Metspalu, Andres; Peters, Maire; Haller-Kikkatalo, Kadri; Salumets, Andres

    2014-01-01

    Given the possible critical importance of placental gene imprinting and random monoallelic expression on fetal and infant health, most of those genes must be identified, in order to understand the risks that the baby might meet during pregnancy and after birth. Therefore, the aim of the current study was to introduce a workflow and tools for analyzing imprinted and random monoallelic gene expression in human placenta, by applying whole-transcriptome (WT) RNA sequencing of placental tissue and genotyping of coding DNA variants in family trios. Ten family trios, each with a healthy spontaneous single-term pregnancy, were recruited. Total RNA was extracted for WT analysis, providing the full sequence information for the placental transcriptome. Parental and child blood DNA genotypes were analyzed by exome SNP genotyping microarrays, mapping the inheritance and estimating the abundance of parental expressed alleles. Imprinted genes showed consistent expression from either parental allele, as demonstrated by the SNP content of sequenced transcripts, while monoallelically expressed genes had random activity of parental alleles. We revealed 4 novel possible imprinted genes (LGALS8, LGALS14, PAPPA2 and SPTLC3) and confirmed the imprinting of 4 genes (AIM1, PEG10, RHOBTB3 and ZFAT-AS1) in human placenta. The major finding was the identification of 4 genes (ABP1, BCLAF1, IFI30 and ZFAT) with random allelic bias, expressing one of the parental alleles preferentially. The main functions of the imprinted and monoallelically expressed genes included: i) mediating cellular apoptosis and tissue development; ii) regulating inflammation and immune system; iii) facilitating metabolic processes; and iv) regulating cell cycle. PMID:25437054

  7. Expressed sequences tags of the anther smut fungus, Microbotryum violaceum, identify mating and pathogenicity genes

    PubMed Central

    Yockteng, Roxana; Marthey, Sylvain; Chiapello, Hélène; Gendrault, Annie; Hood, Michael E; Rodolphe, François; Devier, Benjamin; Wincker, Patrick; Dossat, Carole; Giraud, Tatiana

    2007-01-01

    Background The basidiomycete fungus Microbotryum violaceum is responsible for the anther-smut disease in many plants of the Caryophyllaceae family and is a model in genetics and evolutionary biology. Infection is initiated by dikaryotic hyphae produced after the conjugation of two haploid sporidia of opposite mating type. This study describes M. violaceum ESTs corresponding to nuclear genes expressed during conjugation and early hyphal production. Results A normalized cDNA library generated 24,128 sequences, which were assembled into 7,765 unique genes; 25.2% of them displayed significant similarity to annotated proteins from other organisms, 74.3% a weak similarity to the same set of known proteins, and 0.5% were orphans. We identified putative pheromone receptors and genes that in other fungi are involved in the mating process. We also identified many sequences similar to genes known to be involved in pathogenicity in other fungi. The M. violaceum EST database, MICROBASE, is available on the Web and provides access to the sequences, assembled contigs, annotations and programs to compare similarities against MICROBASE. Conclusion This study provides a basis for cloning the mating type locus, for further investigation of pathogenicity genes in the anther smut fungi, and for comparative genomics. PMID:17692127

  8. A novel DNA replication origin identified in the human heat shock protein 70 gene promoter.

    PubMed Central

    Taira, T; Iguchi-Ariga, S M; Ariga, H

    1994-01-01

    A general and sensitive method for the mapping of initiation sites of DNA replication in vivo, developed by Vassilev and Johnson, has revealed replication origins in the region of simian virus 40 ori, in the regions upstream from the human c-myc gene and downstream from the Chinese hamster dihydrofolate reductase gene, and in the enhancer region of the mouse immunoglobulin heavy-chain gene. Here we report that the region containing the promoter of the human heat shock protein 70 (hsp70) gene was identified as a DNA replication origin in HeLa cells by this method. Several segments of the region were cloned into pUC19 and examined for autonomously replicating sequence (ARS) activity. The plasmids carrying the segments replicated episomally and semiconservatively when transfected into HeLa cells. The segments of ARS activity contained the sequences previously identified as binding sequences for a c-myc protein complex (T. Taira, Y. Negishi, F. Kihara, S. M. M. Iguchi-Ariga, and H. Ariga, Biochem. Biophys. Acta 1130:166-174, 1992). Mutations introduced within the c-myc protein complex binding sequences abolished the ARS activity. Moreover, the ARS plasmids stably replicated at episomal state for a long time in established cell lines. The results suggest that the promoter region of the human hsp70 gene plays a role in DNA replication as well as in transcription. Images PMID:8065368

  9. Expression of DOF genes identifies early stages of vascular development in Arabidopsis leaves.

    PubMed

    Gardiner, Jason; Sherr, Ira; Scarpella, Enrico

    2010-01-01

    The sequence of events underlying the formation of vascular networks in the leaf has long fascinated developmental biologists. In Arabidopsis leaves, vascular-precursor procambial cells derive from the elongation of morphologically inconspicuous ground cells that selectively activate expression of the HD-ZIP III gene ATHB8. Inception of ATHB8 expression operationally defines acquisition of a typically irreversible preprocambial cell state that preludes to vein formation. A view of the constellation of genes whose expression is activated at preprocambial stages would therefore be particularly desirable; however, very few preprocambial gene expression profiles have been identified. Here, we show that expression of three genes encoding members of the DOF family of plant-specific transcription factors is activated at stages overlapping onset of ATHB8 expression. Expression of DOF genes is initiated in wide domains that become confined to sites of vein development. Congruence between DOF expression fields and zones of vein formation persists upon experimental manipulation of leaf vascular patterning, suggesting that DOF expression identifies consistently recurring steps in vein ontogeny. Our results contribute to defining preprocambial cell identity at the molecular level.

  10. Immunogenetic mechanisms leading to thyroid autoimmunity: recent advances in identifying susceptibility genes and regions.

    PubMed

    Brand, Oliver J; Gough, Stephen C L

    2011-12-01

    The autoimmune thyroid diseases (AITD) include Graves' disease (GD) and Hashimoto's thyroiditis (HT), which are characterised by a breakdown in immune tolerance to thyroid antigens. Unravelling the genetic architecture of AITD is vital to better understanding of AITD pathogenesis, required to advance therapeutic options in both disease management and prevention. The early whole-genome linkage and candidate gene association studies provided the first evidence that the HLA region and CTLA-4 represented AITD risk loci. Recent improvements in; high throughput genotyping technologies, collection of larger disease cohorts and cataloguing of genome-scale variation have facilitated genome-wide association studies and more thorough screening of candidate gene regions. This has allowed identification of many novel AITD risk genes and more detailed association mapping. The growing number of confirmed AITD susceptibility loci, implicates a number of putative disease mechanisms most of which are tightly linked with aspects of immune system function. The unprecedented advances in genetic study will allow future studies to identify further novel disease risk genes and to identify aetiological variants within specific gene regions, which will undoubtedly lead to a better understanding of AITD patho-physiology.

  11. Characterization of novel antibiotic resistance genes identified by functional metagenomics on soil samples.

    PubMed

    Torres-Cortés, Gloria; Millán, Vicenta; Ramírez-Saad, Hugo C; Nisa-Martínez, Rafael; Toro, Nicolás; Martínez-Abarca, Francisco

    2011-04-01

    The soil microbial community is highly complex and contains a high density of antibiotic-producing bacteria, making it a likely source of diverse antibiotic resistance determinants. We used functional metagenomics to search for antibiotic resistance genes in libraries generated from three different soil samples, containing 3.6 Gb of DNA in total. We identified 11 new antibiotic resistance genes: 3 conferring resistance to ampicillin, 2 to gentamicin, 2 to chloramphenicol and 4 to trimethoprim. One of the clones identified was a new trimethoprim resistance gene encoding a 26.8 kDa protein closely resembling unassigned reductases of the dihydrofolate reductase group. This protein, Tm8-3, conferred trimethoprim resistance in Escherichia coli and Sinorhizobium meliloti (γ- and α-proteobacteria respectively). We demonstrated that this gene encoded an enzyme with dihydrofolate reductase activity, with kinetic constants similar to other type I and II dihydrofolate reductases (K(m) of 8.9 µM for NADPH and 3.7 µM for dihydrofolate and IC(50) of 20 µM for trimethoprim). This is the first description of a new type of reductase conferring resistance to trimethoprim. Our results indicate that soil bacteria display a high level of genetic diversity and are a reservoir of antibiotic resistance genes, supporting the use of this approach for the discovery of novel enzymes with unexpected activities unpredictable from their amino acid sequences.

  12. Evaluating HapMap SNP data transferability in a large-scale genotyping project involving 175 cancer-associated genes.

    PubMed

    Ribas, Gloria; González-Neira, Anna; Salas, Antonio; Milne, Roger L; Vega, Ana; Carracedo, Begoña; González, Emilio; Barroso, Eva; Fernández, Lara P; Yankilevich, Patricio; Robledo, Mercedes; Carracedo, Angel; Benítez, Javier

    2006-02-01

    One of the many potential uses of the HapMap project is its application to the investigation of complex disease aetiology among a wide range of populations. This study aims to assess the transferability of HapMap SNP data to the Spanish population in the context of cancer research. We have carried out a genotyping study in Spanish subjects involving 175 candidate cancer genes using an indirect gene-based approach and compared results with those for HapMap CEU subjects. Allele frequencies were very consistent between the two samples, with a high positive correlation (R) of 0.91 (P<1x10(-6)). Linkage disequilibrium patterns and block structures across each gene were also very similar, with disequilibrium coefficient (r (2)) highly correlated (R=0.95, P<1x10(-6)). We found that of the 21 genes that contained at least one block larger than 60 kb, nine (ATM, ATR, BRCA1, ERCC6, FANCC, RAD17, RAD50, RAD54B and XRCC4) belonged to the GO category "DNA repair". Haplotype frequencies per gene were also highly correlated (mean R=0.93), as was haplotype diversity (R=0.91, P<1x10(-6)). "Yin yang" haplotypes were observed for 43% of the genes analysed and 18% of those were identical to the ancestral haplotype (identified in Chimpazee). Finally, the portability of tagSNPs identified in the HapMap CEU data using pairwise r (2) thresholds of 0.8 and 0.5 was assessed by applying these to the Spanish and current HapMap data for 66 genes. In general, the HapMap tagSNPs performed very well. Our results show generally high concordance with HapMap data in allele frequencies and haplotype distributions and confirm the applicability of HapMap SNP data to the study of complex diseases among the Spanish population.

  13. A strategy to identify genes associated with circulating solid tumor cell survival in peripheral blood.

    PubMed Central

    Fournier, M. V.; Carvalho, M. G.; Pardee, A. B.

    1999-01-01

    Efforts in metastasis research have centered on the phenotypic and genetic differences between primary site and metastatic site tumors. However, genes that may be used as molecular markers of metastasis in circulating tumor cells remain unidentified. Genes regulating the dissemination and survival of solid tumor cells in the blood, as well as their adaptation to new environments, could be candidates for unique metastatic tumor markers. Differential display (DD) was conducted to compare the blood of tumor-free individuals with the blood of patients with lung, breast, and colon cancers. Twenty-one up-expressed genes in the tumor patient blood samples but none in the tumor-free donor blood samples were identified. Nine of these samples were isolated, amplified, and directly sequenced. A gene AB-1 homologous to a Bcl-2 family member, which might function as an apoptosis inhibitor, was identified. The overexpression of an apoptosis inhibitor in blood from patients with metastatic tumors might be correlated with the capability of solid tumor cells to survive in peripheral blood. This is the first demonstration of the usefulness of comparing control and patient blood samples by DD to find novel potential genetic markers identifying metastasis in the blood. http://link.springer-ny. com/link/service/journals/00020/bibs/5n5p313.html Images Fig. 1 Fig. 2 Fig. 3 PMID:10390547

  14. Identifying putative promoter regions of Hermansky-Pudlak syndrome genes by means of phylogenetic footprinting

    PubMed Central

    Stanescu, H.; Wolfsberg, T.G.; Moreland, R.T.; Ayub, M.H.; Erickson, E.; Westbroek, W.; Huizing, M.; Gahl, W.A.; Helip-Wooley, A.

    2009-01-01

    Summary HPS is an autosomal recessive disorder characterized by oculocutaneous albinism and prolonged bleeding. Eight human genes are described resulting in the HPS subtypes 1–8. Certain HPS proteins combine to form Biogenesis of Lysosome-related Organelles Complexes (BLOCs), thought to function in the formation of intracellular vesicles such as melanosomes, platelet dense bodies, and lytic granules. Specifically, BLOC-2 contains the HPS3, HPS5 and HPS6 proteins. We used phylogenetic footprinting to identify conserved regions in the upstream sequences of HPS3, HPS5 and HPS6. These conserved regions were verified to have in vitro transcription activation activity using luciferase reporter assays. Transcription factor binding site analyses of the regions identified 52 putative sites shared by all three genes. When analysis was limited to the conserved footprints, seven binding sites were found shared among all three genes: Pax-5, AIRE, CACD, ZF5, Zic1, E2F and Churchill. The HPS3 conserved upstream region was sequenced in four patients with decreased fibroblast HPS3 RNA levels and only one HPS3 mutation in the coding exons and surrounding exon/intron boundaries; no mutation was found. These findings illustrate the power of phylogenetic footprinting for identifying potential regulatory regions in non-coding sequences and define the first putative promoter elements for any HPS genes. PMID:19523149

  15. Progressive retinal atrophy in Schapendoes dogs: mutation of the newly identified CCDC66 gene.

    PubMed

    Dekomien, Gabriele; Vollrath, Conni; Petrasch-Parwez, Elisabeth; Boevé, Michael H; Akkad, Denis A; Gerding, Wanda M; Epplen, Jörg T

    2010-05-01

    Canine generalized progressive retinal atrophy (gPRA) is characterized by continuous degeneration of photoreceptor cells leading to night blindness and progressive vision loss. Until now, mutations in 11 genes have been described that account for gPRA in dogs, mostly following an autosomal recessive inheritance mode. Here, we describe a gPRA locus comprising the newly identified gene coiled-coil domain containing 66 (CCDC66) on canine chromosome 20, as identified via linkage analysis in the Schapendoes breed. Mutation screening of the CCDC66 gene revealed a 1-bp insertion in exon 6 leading to a stop codon as the underlying cause of disease. The insertion is present in all affected dogs in the homozygous state as well as in all obligatory mutation carriers in the heterozygous state. The CCDC66 gene is evolutionarily conserved in different vertebrate species and exhibits a complex pattern of differential RNA splicing resulting in various isoforms in the retina. Immunohistochemically, CCDC66 protein is detected mainly in the inner segments of photoreceptors in mouse, dog, and man. The affected Schapendoes retina lacks CCDC66 protein. Thus this natural canine model for gPRA yields superior potential to understand functional implications of this newly identified protein including its physiology, and it opens new perspectives for analyzing different aspects of the general pathophysiology of gPRA.

  16. Expression profiling of uniparental mouse embryos is inefficient in identifying novel imprinted genes.

    PubMed

    Ruf, Nico; Dünzinger, Ulrich; Brinckmann, Anja; Haaf, Thomas; Nürnberg, Peter; Zechner, Ulrich

    2006-04-01

    Imprinted genes are expressed from only one allele in a parent-of-origin-specific manner. We here describe a systematic approach to identify novel imprinted genes using quantification of allele-specific expression by Pyrosequencing, a highly accurate method to detect allele-specific expression differences. Sixty-eight candidate imprinted transcripts mapping to known imprinted chromosomal regions were selected from a recent expression profiling study of uniparental mouse embryos and analyzed. Three novel imprinted transcripts encoding putative non-protein-coding RNAs were identified on the basis of parent-of-origin-specific monoallelic expression in E11.5 (C57BL/6 x Cast/Ei)F1 and informative (C57BL/6 x Cast/Ei) x C57BL/6 backcross embryos. In addition, four transcripts with preferential expression of a strain-specific allele were found. Intriguingly, a vast majority of the analyzed transcripts showed no imprinting-associated expression in F1 embryos. These data strengthen the view that a large fraction of nonimprinted genes is differentially expressed between parthenogenetic and androgenetic embryos and question the efficiency of expression profiling of uniparental embryos to identify novel imprinted genes.

  17. Comparative Transcriptome Analysis Identifies Putative Genes Involved in the Biosynthesis of Xanthanolides in Xanthium strumarium L.

    PubMed Central

    Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng

    2016-01-01

    Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides. PMID:27625674

  18. A novel approach identifies new differentially methylated regions (DMRs) associated with imprinted genes

    PubMed Central

    Choufani, Sanaa; Shapiro, Jonathan S.; Susiarjo, Martha; Butcher, Darci T.; Grafodatskaya, Daria; Lou, Youliang; Ferreira, Jose C.; Pinto, Dalila; Scherer, Stephen W.; Shaffer, Lisa G.; Coullin, Philippe; Caniggia, Isabella; Beyene, Joseph; Slim, Rima; Bartolomei, Marisa S.; Weksberg, Rosanna

    2011-01-01

    Imprinted genes are critical for normal human growth and neurodevelopment. They are characterized by differentially methylated regions (DMRs) of DNA that confer parent of origin-specific transcription. We developed a new strategy to identify imprinted gene-associated DMRs. Using genome-wide methylation profiling of sodium bisulfite modified DNA from normal human tissues of biparental origin, candidate DMRs were identified by selecting CpGs with methylation levels consistent with putative allelic differential methylation. In parallel, the methylation profiles of tissues of uniparental origin, i.e., paternally-derived androgenetic complete hydatidiform moles (AnCHMs), and maternally-derived mature cystic ovarian teratoma (MCT), were examined and then used to identify CpGs with parent of origin-specific DNA methylation. With this approach, we found known DMRs associated with imprinted genomic regions as well as new DMRs for known imprinted genes, NAP1L5 and ZNF597, and novel candidate imprinted genes. The paternally methylated DMR for one candidate, AXL, a receptor tyrosine kinase, was also validated in experiments with mouse embryos that demonstrated Axl was expressed preferentially from the maternal allele in a DNA methylation-dependent manner. PMID:21324877

  19. Integromic Analysis of Genetic Variation and Gene Expression Identifies Networks for Cardiovascular Disease Phenotypes

    PubMed Central

    Yao, Chen; Chen, Brian H.; Joehanes, Roby; Otlu, Burcak; Zhang, Xiaoling; Liu, Chunyu; Huan, Tianxiao; Tastan, Oznur; Cupples, L. Adrienne; Meigs, James B.; Fox, Caroline S.; Freedman, Jane E.; Courchesne, Paul; O’Donnell, Christopher J.; Munson, Peter J.; Keles, Sunduz; Levy, Daniel

    2015-01-01

    Background Cardiovascular disease (CVD) reflects a highly coordinated complex of traits. Although genome-wide association studies have reported numerous single nucleotide polymorphisms (SNPs) to be associated with CVD, the role of most of these variants in disease processes remains unknown. Methods and Results We built a CVD network using 1512 SNPs associated with 21 CVD traits in genome-wide association studies (at P≤5×10−8) and cross-linked different traits by virtue of their shared SNP associations. We then explored whole blood gene expression in relation to these SNPs in 5257 participants in the Framingham Heart Study. At a false discovery rate <0.05, we identified 370 cis-expression quantitative trait loci (eQTLs; SNPs associated with altered expression of nearby genes) and 44 trans-eQTLs (SNPs associated with altered expression of remote genes). The eQTL network revealed 13 CVD-related modules. Searching for association of eQTL genes with CVD risk factors (lipids, blood pressure, fasting blood glucose, and body mass index) in the same individuals, we found examples in which the expression of eQTL genes was significantly associated with these CVD phenotypes. In addition, mediation tests suggested that a subset of SNPs previously associated with CVD phenotypes in genome-wide association studies may exert their function by altering expression of eQTL genes (eg, LDLR and PCSK7), which in turn may promote interindividual variation in phenotypes. Conclusions Using a network approach to analyze CVD traits, we identified complex networks of SNP-phenotype and SNP-transcript connections. Integrating the CVD network with phenotypic data, we identified biological pathways that may provide insights into potential drug targets for treatment or prevention of CVD. PMID:25533967

  20. Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis.

    PubMed

    Skwark, Marcin J; Croucher, Nicholas J; Puranen, Santeri; Chewapreecha, Claire; Pesonen, Maiju; Xu, Ying Ying; Turner, Paul; Harris, Simon R; Beres, Stephen B; Musser, James M; Parkhill, Julian; Bentley, Stephen D; Aurell, Erik; Corander, Jukka

    2017-02-01

    Recent advances in the scale and diversity of population genomic datasets for bacteria now provide the potential for genome-wide patterns of co-evolution to be studied at the resolution of individual bases. Here we describe a new statistical method, genomeDCA, which uses recent advances in computational structural biology to identify the polymorphic loci under the strongest co-evolutionary pressures. We apply genomeDCA to two large population data sets representing the major human pathogens Streptococcus pneumoniae (pneumococcus) and Streptococcus pyogenes (group A Streptococcus). For pneumococcus we identified 5,199 putative epistatic interactions between 1,936 sites. Over three-quarters of the links were between sites within the pbp2x, pbp1a and pbp2b genes, the sequences of which are critical in determining non-susceptibility to beta-lactam antibiotics. A network-based analysis found these genes were also coupled to that encoding dihydrofolate reductase, changes to which underlie trimethoprim resistance. Distinct from these antibiotic resistance genes, a large network component of 384 protein coding sequences encompassed many genes critical in basic cellular functions, while another distinct component included genes associated with virulence. The group A Streptococcus (GAS) data set population represents a clonal population with relatively little genetic variation and a high level of linkage disequilibrium across the genome. Despite this, we were able to pinpoint two RNA pseudouridine synthases, which were each strongly linked to a separate set of loci across the chromosome, representing biologically plausible targets of co-selection. The population genomic analysis method applied here identifies statistically significantly co-evolving locus pairs, potentially arising from fitness selection interdependence reflecting underlying protein-protein interactions, or genes whose product activities contribute to the same phenotype. This discovery approach greatly

  1. Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis

    PubMed Central

    Pesonen, Maiju; Musser, James M.; Bentley, Stephen D.; Aurell, Erik; Corander, Jukka

    2017-01-01

    Recent advances in the scale and diversity of population genomic datasets for bacteria now provide the potential for genome-wide patterns of co-evolution to be studied at the resolution of individual bases. Here we describe a new statistical method, genomeDCA, which uses recent advances in computational structural biology to identify the polymorphic loci under the strongest co-evolutionary pressures. We apply genomeDCA to two large population data sets representing the major human pathogens Streptococcus pneumoniae (pneumococcus) and Streptococcus pyogenes (group A Streptococcus). For pneumococcus we identified 5,199 putative epistatic interactions between 1,936 sites. Over three-quarters of the links were between sites within the pbp2x, pbp1a and pbp2b genes, the sequences of which are critical in determining non-susceptibility to beta-lactam antibiotics. A network-based analysis found these genes were also coupled to that encoding dihydrofolate reductase, changes to which underlie trimethoprim resistance. Distinct from these antibiotic resistance genes, a large network component of 384 protein coding sequences encompassed many genes critical in basic cellular functions, while another distinct component included genes associated with virulence. The group A Streptococcus (GAS) data set population represents a clonal population with relatively little genetic variation and a high level of linkage disequilibrium across the genome. Despite this, we were able to pinpoint two RNA pseudouridine synthases, which were each strongly linked to a separate set of loci across the chromosome, representing biologically plausible targets of co-selection. The population genomic analysis method applied here identifies statistically significantly co-evolving locus pairs, potentially arising from fitness selection interdependence reflecting underlying protein-protein interactions, or genes whose product activities contribute to the same phenotype. This discovery approach greatly

  2. Identifying novel mycobacterial stress associated genes using a random mutagenesis screen in Mycobacterium smegmatis.

    PubMed

    Viswanathan, Gopinath; Joshi, Shrilaxmi V; Sridhar, Aditi; Dutta, Sayantanee; Raghunand, Tirumalai R

    2015-12-10

    Cell envelope associated components of Mycobacterium tuberculosis (M.tb) have been implicated in stress response, immune modulation and in vivo survival of the pathogen. Although many such factors have been identified, there is a large disparity between the number of genes predicted to be involved in functions linked to the envelope and those described in the literature. To identify and characterise novel stress related factors associated with the mycobacterial cell envelope, we isolated colony morphotype mutants of Mycobacterium smegmatis (M. smegmatis), based on the hypothesis that mutants with unusual colony morphology may have defects in the biosynthesis of cell envelope components. On testing their susceptibility to stress conditions relevant to M.tb physiology, multiple mutants were found to be sensitive to Isoniazid, Diamide and H2O2, indicative of altered permeability due to changes in cell envelope composition. Two mutants showed defects in biofilm formation implying possible roles for the target genes in antibiotic tolerance and/or virulence. These assays identified novel stress associated roles for several mycobacterial genes including sahH, tatB and aceE. Complementation analysis of selected mutants with the M. smegmatis genes and their M.tb homologues showed phenotypic restoration, validating their link to the observed phenotypes. A mutant carrying an insertion in fhaA encoding a forkhead associated domain containing protein, showed reduced survival in THP-1 macrophages, providing in vivo validation to this screen. Taken together, these results suggest that the M.tb homologues of a majority of the identified genes may play significant roles in the pathogenesis of tuberculosis.

  3. A cross-species transcriptomics approach to identify genes involved in leaf development

    PubMed Central

    Street, Nathaniel Robert; Sjödin, Andreas; Bylesjö, Max; Gustafsson, Petter; Trygg, Johan; Jansson, Stefan

    2008-01-01

    Background We have made use of publicly available gene expression data to identify transcription factors and transcriptional modules (regulons) associated with leaf development in Populus. Different tissue types were compared to identify genes informative in the discrimination of leaf and non-leaf tissues. Transcriptional modules within this set of genes were identified in a much wider set of microarray data collected from leaves in a number of developmental, biotic, abiotic and transgenic experiments. Results Transcription factors that were over represented in leaf EST libraries and that were useful for discriminating leaves from other tissues were identified, revealing that the C2C2-YABBY, CCAAT-HAP3 and 5, MYB, and ZF-HD families are particularly important in leaves. The expression of transcriptional modules and transcription factors was examined across a number of experiments to select those that were particularly active during the early stages of leaf development. Two transcription factors were found to collocate to previously published Quantitative Trait Loci (QTL) for leaf length. We also found that miRNA family 396 may be important in the control of leaf development, with three members of the family collocating with clusters of leaf development QTL. Conclusion This work provides a set of candidate genes involved in the control and processes of leaf development. This resource can be used for a wide variety of purposes such as informing the selection of candidate genes for association mapping or for the selection of targets for reverse genetics studies to further understanding of the genetic control of leaf size and shape. PMID:19061504

  4. Expression profiling identifies genes expressed early during lint fibre initiation in cotton.

    PubMed

    Wu, Yingru; Machado, Adriane C; White, Rosemary G; Llewellyn, Danny J; Dennis, Elizabeth S

    2006-01-01

    Cotton fibres are a subset of single epidermal cells that elongate from the seed coat to produce the long cellulose strands or lint used for spinning into yarn. To identify genes that might regulate lint fibre initiation, expression profiles of 0 days post-anthesis (dpa) whole ovules from six reduced fibre or fibreless mutants were compared with wild-type linted cotton using cDNA microarrays. Numerous clones were differentially expressed, but when only those genes that are normally expressed in the ovule outer integument (where fibres develop) were considered, just 13 different cDNA clones were down-regulated in some or all of the mutants. These included: a Myb transcription factor (GhMyb25) similar to the Antirrhinum Myb AmMIXTA, a putative homeodomain protein (related to Arabidopsis ATML1), a cyclin D gene, some previously identified fibre-expressed structural and metabolic genes, such as lipid transfer protein, alpha-expansin and sucrose synthase, as well as some unknown genes. Laser capture microdissection and reverse transcription-PCR were used to show that both the GhMyb25 and the homeodomain gene were predominantly ovule specific and were up-regulated on the day of anthesis in fibre initials relative to adjacent non-fibre ovule epidermal cells. Their spatial and temporal expression pattern therefore coincided with the time and location of fibre initiation. Constitutive overexpression of GhMyb25 in transgenic tobacco resulted in an increase in branched long-stalked leaf trichomes. The involvement of cell cycle genes prompted DNA content measurements that indicated that fibre initials, like leaf trichomes, undergo DNA endoreduplication. Cotton fibre initiation therefore has some parallels with leaf trichome development, although the detailed molecular mechanisms are clearly different.

  5. Gene expression profiling identifies distinct molecular subgroups of leiomyosarcoma with clinical relevance

    PubMed Central

    Lee, Yin-Fai; Roe, Toby; Mangham, D Chas; Fisher, Cyril; Grimer, Robert J; Judson, Ian

    2016-01-01

    Background: Soft tissue sarcomas are heterogeneous and a major complication in their management is that the existing classification scheme is not definitive and is still evolving. Leiomyosarcomas, a major histologic category of soft tissue sarcomas, are malignant tumours displaying smooth muscle differentiation. Although defined as a single group, they exhibit a wide range of clinical behaviour. We aimed to carry out molecular classification to identify new molecular subgroups with clinical relevance. Methods: We used gene expression profiling on 20 extra-uterine leiomyosarcomas and cross-study analyses for molecular classification of leiomyosarcomas. Clinical significance of the subgroupings was investigated. Results: We have identified two distinct molecular subgroups of leiomyosarcomas. One group was characterised by high expression of 26 genes that included many genes from the sub-classification gene cluster proposed by Nielsen et al. These sub-classification genes include genes that have importance structurally, as well as in cell signalling. Notably, we found a statistically significant association of the subgroupings with tumour grade. Further refinement led to a group of 15 genes that could recapitulate the tumour subgroupings in our data set and in a second independent sarcoma set. Remarkably, cross-study analyses suggested that these molecular subgroups could be found in four independent data sets, providing strong support for their existence. Conclusions: Our study strongly supported the existence of distinct leiomyosarcoma molecular subgroups, which have clinical association with tumour grade. Our findings will aid in advancing the classification of leiomyosarcomas and lead to more individualised and better management of the disease. PMID:27607470

  6. An in vivo screen identifies ependymoma oncogenes and tumor-suppressor genes.

    PubMed

    Mohankumar, Kumarasamypet M; Currle, David S; White, Elsie; Boulos, Nidal; Dapper, Jason; Eden, Christopher; Nimmervoll, Birgit; Thiruvenkatam, Radhika; Connelly, Michele; Kranenburg, Tanya A; Neale, Geoffrey; Olsen, Scott; Wang, Yong-Dong; Finkelstein, David; Wright, Karen; Gupta, Kirti; Ellison, David W; Thomas, Arzu Onar; Gilbertson, Richard J

    2015-08-01

    Cancers are characterized by non-random chromosome copy number alterations that presumably contain oncogenes and tumor-suppressor genes (TSGs). The affected loci are often large, making it difficult to pinpoint which genes are driving the cancer. Here we report a cross-species in vivo screen of 84 candidate oncogenes and 39 candidate TSGs, located within 28 recurrent chromosomal alterations in ependymoma. Through a series of mouse models, we validate eight new ependymoma oncogenes and ten new ependymoma TSGs that converge on a small number of cell functions, including vesicle trafficking, DNA modification and cholesterol biosynthesis, identifying these as potential new therapeutic targets.

  7. Back to the sea twice: identifying candidate plant genes for molecular evolution to marine life

    PubMed Central

    2011-01-01

    Background Seagrasses are a polyphyletic group of monocotyledonous angiosperms that have adapted to a completely submerged lifestyle in marine waters. Here, we exploit two collections of expressed sequence tags (ESTs) of two wide-spread and ecologically important seagrass species, the Mediterranean seagrass Posidonia oceanica (L.) Delile and the eelgrass Zostera marina L., which have independently evolved from aquatic ancestors. This replicated, yet independent evolutionary history facilitates the identification of traits that may have evolved in parallel and are possible instrumental candidates for adaptation to a marine habitat. Results In our study, we provide the first quantitative perspective on molecular adaptations in two seagrass species. By constructing orthologous gene clusters shared between two seagrasses (Z. marina and P. oceanica) and eight distantly related terrestrial angiosperm species, 51 genes could be identified with detection of positive selection along the seagrass branches of the phylogenetic tree. Characterization of these positively selected genes using KEGG pathways and the Gene Ontology uncovered that these genes are mostly involved in translation, metabolism, and photosynthesis. Conclusions These results provide first insights into which seagrass genes have diverged from their terrestrial counterparts via an initial aquatic stage characteristic of the order and to the derived fully-marine stage characteristic of seagrasses. We discuss how adaptive changes in these processes may have contributed to the evolution towards an aquatic and marine existence. PMID:21226908

  8. Identifying photoreceptors in blind eyes caused by RPE65 mutations: Prerequisite for human gene therapy success.

    PubMed

    Jacobson, Samuel G; Aleman, Tomas S; Cideciyan, Artur V; Sumaroka, Alexander; Schwartz, Sharon B; Windsor, Elizabeth A M; Traboulsi, Elias I; Heon, Elise; Pittler, Steven J; Milam, Ann H; Maguire, Albert M; Palczewski, Krzysztof; Stone, Edwin M; Bennett, Jean

    2005-04-26

    Mutations in RPE65, a gene essential to normal operation of the visual (retinoid) cycle, cause the childhood blindness known as Leber congenital amaurosis (LCA). Retinal gene therapy restores vision to blind canine and murine models of LCA. Gene therapy in blind humans with LCA from RPE65 mutations may also have potential for success but only if the retinal photoreceptor layer is intact, as in the early-disease stage-treated animals. Here, we use high-resolution in vivo microscopy to quantify photoreceptor layer thickness in the human disease to define the relationship of retinal structure to vision and determine the potential for gene therapy success. The normally cone photoreceptor-rich central retina and rod-rich regions were studied. Despite severely reduced cone vision, many RPE65-mutant retinas had near-normal central microstructure. Absent rod vision was associated with a detectable but thinned photoreceptor layer. We asked whether abnormally thinned RPE65-mutant retina with photoreceptor loss would respond to treatment. Gene therapy in Rpe65(-/-) mice at advanced-disease stages, a more faithful mimic of the humans we studied, showed success but only in animals with better-preserved photoreceptor structure. The results indicate that identifying and then targeting retinal locations with retained photoreceptors will be a prerequisite for successful gene therapy in humans with RPE65 mutations and in other retinal degenerative disorders now moving from proof-of-concept studies toward clinical trials.

  9. A CRISPR-Based Screen Identifies Genes Essential for West-Nile-Virus-Induced Cell Death.

    PubMed

    Ma, Hongming; Dang, Ying; Wu, Yonggan; Jia, Gengxiang; Anaya, Edgar; Zhang, Junli; Abraham, Sojan; Choi, Jang-Gi; Shi, Guojun; Qi, Ling; Manjunath, N; Wu, Haoquan

    2015-07-28

    West Nile virus (WNV) causes an acute neurological infection attended by massive neuronal cell death. However, the mechanism(s) behind the virus-induced cell death is poorly understood. Using a library containing 77,406 sgRNAs targeting 20,121 genes, we performed a genome-wide screen followed by a second screen with a sub-library. Among the genes identified, seven genes, EMC2, EMC3, SEL1L, DERL2, UBE2G2, UBE2J1, and HRD1, stood out as having the strongest phenotype, whose knockout conferred strong protection against WNV-induced cell death with two different WNV strains and in three cell lines. Interestingly, knockout of these genes did not block WNV replication. Thus, these appear to be essential genes that link WNV replication to downstream cell death pathway(s). In addition, the fact that all of these genes belong to the ER-associated protein degradation (ERAD) pathway suggests that this might be the primary driver of WNV-induced cell death.

  10. Quantitative analysis of bristle number in Drosophila mutants identifies genes involved in neural development

    NASA Technical Reports Server (NTRS)

    Norga, Koenraad K.; Gurganus, Marjorie C.; Dilda, Christy L.; Yamamoto, Akihiko; Lyman, Richard F.; Patel, Prajal H.; Rubin, Gerald M.; Hoskins, Roger A.; Mackay, Trudy F.; Bellen, Hugo J.

    2003-01-01

    BACKGROUND: The identification of the function of all genes that contribute to specific biological processes and complex traits is one of the major challenges in the postgenomic era. One approach is to employ forward genetic screens in genetically tractable model organisms. In Drosophila melanogaster, P element-mediated insertional mutagenesis is a versatile tool for the dissection of molecular pathways, and there is an ongoing effort to tag every gene with a P element insertion. However, the vast majority of P element insertion lines are viable and fertile as homozygotes and do not exhibit obvious phenotypic defects, perhaps because of the tendency for P elements to insert 5' of transcription units. Quantitative genetic analysis of subtle effects of P element mutations that have been induced in an isogenic background may be a highly efficient method for functional genome annotation. RESULTS: Here, we have tested the efficacy of this strategy by assessing the extent to which screening for quantitative effects of P elements on sensory bristle number can identify genes affecting neural development. We find that such quantitative screens uncover an unusually large number of genes that are known to function in neural development, as well as genes with yet uncharacterized effects on neural development, and novel loci. CONCLUSIONS: Our findings establish the use of quantitative trait analysis for functional genome annotation through forward genetics. Similar analyses of quantitative effects of P element insertions will facilitate our understanding of the genes affecting many other complex traits in Drosophila.

  11. Molecular profiling of experimental endometriosis identified gene expression patterns in common with human disease

    PubMed Central

    Flores, Idhaliz; Rivera, Elizabeth; Ruiz, Lynnette A.; Santiago, Olga I.; Vernon, Michael W.; Appleyard, Caroline B.

    2007-01-01

    OBJECTIVE To validate a rat model of endometriosis using cDNA microarrays by identifying common gene expression patterns beween experimental and natural disease. DESIGN Autotransplantation rat model. SETTING Medical school department. ANIMALS Female Sprague-Dawley rats. INTERVENTIONS Endometriosis was surgically-induced by suturing uterine horn implants next to the small intestine’s mesentery. Control rats received sutures with no implants. After 60 days, endometriotic implants and uterine horn were obtained. MAIN OUTCOME MEASURES Gene expression levels determined by cDNA microarrays and QRT-PCR. METHODS Cy5-labeled cDNA was synthesized from total RNA obtained from endometriotic implants. Cy3-labeled cDNA was synthesized using uterine RNA from a control rat. Gene expression levels were analyzed after hybridizing experimental and control labeled cDNA to PIQOR™ Toxicology Rat Microarrays (Miltenyi Biotec) containing 1,252 known genes. Cy5/Cy3 ratios were determined and genes with >2-fold higher or <0.5-fold lower expression levels were selected. Microarray results were validated by QRT-PCR. RESULTS We observed differential expression of genes previously shown to be upregulated in patients, including growth factors, inflammatory cytokines/receptors, tumor invasion/metastasis factors, adhesion molecules, and anti-apoptotic factors. CONCLUSIONS This study presents evidence in support of using this rat model to study the natural history of endometriosis and test novel therapeutics for this incurable disease. PMID:17478174

  12. Identifying genes related to choriogenesis in insect panoistic ovaries by Suppression Subtractive Hybridization

    PubMed Central

    Irles, Paula; Bellés, Xavier; Piulachs, M Dolors

    2009-01-01

    Background Insect ovarioles are classified into two categories: panoistic and meroistic, the later having apparently evolved from an ancestral panoistic type. Molecular data on oogenesis is practically restricted to meroistic ovaries. If we aim at studying the evolutionary transition from panoistic to meroistic, data on panoistic ovaries should be gathered. To this end, we planned the construction of a Suppression Subtractive Hybridization (SSH) library to identify genes involved in panoistic choriogenesis, using the cockroach Blattella germanica as model. Results We constructed a post-vitellogenic ovary library by SSH to isolate genes involved in choriogenesis in B. germanica. The tester library was prepared with an ovary pool from 6- to 7-day-old females, whereas the driver library was prepared with an ovary pool from 3- to 4-day-old females. From the SSH library, we obtained 258 high quality sequences which clustered into 34 unique sequences grouped in 19 contigs and 15 singlets. The sequences were compared against non-redundant NCBI databases using BLAST. We found that 44% of the unique sequences had homologous sequences in known genes of other organisms, whereas 56% had no significant similarity to any of the databases entries. A Gene Ontology analysis was carried out, classifying the 34 sequences into different functional categories. Seven of these gene sequences, representative of different categories and processes, were chosen to perform expression studies during the first gonadotrophic cycle by real-time PCR. Results showed that they were mainly expressed during post-vitellogenesis, which validates the SSH technique. In two of them corresponding to novel genes, we demonstrated that they are specifically expressed in the cytoplasm of follicular cells in basal oocytes at the time of choriogenesis. Conclusion The SSH approach has proven to be useful in identifying ovarian genes expressed after vitellogenesis in B. germanica. For most of the genes, functions

  13. Transcriptomic and genetic studies identify NFAT5 as a candidate gene for cocaine dependence

    PubMed Central

    Fernàndez-Castillo, N; Cabana-Domínguez, J; Soriano, J; Sànchez-Mora, C; Roncero, C; Grau-López, L; Ros-Cucurull, E; Daigre, C; van Donkelaar, M M J; Franke, B; Casas, M; Ribasés, M; Cormand, B

    2015-01-01

    Cocaine reward and reinforcing effects are mediated mainly by dopaminergic neurotransmission. In this study, we aimed at evaluating gene expression changes induced by acute cocaine exposure on SH-SY5Y-differentiated cells, which have been widely used as a dopaminergic neuronal model. Expression changes and a concomitant increase in neuronal activity were observed after a 5 μM cocaine exposure, whereas no changes in gene expression or in neuronal activity took place at 1 μM cocaine. Changes in gene expression were identified in a total of 756 genes, mainly related to regulation of transcription and gene expression, cell cycle, adhesion and cell projection, as well as mitogen-activeated protein kinase (MAPK), CREB, neurotrophin and neuregulin signaling pathways. Some genes displaying altered expression were subsequently targeted with predicted functional single-nucleotide polymorphisms (SNPs) in a case–control association study in a sample of 806 cocaine-dependent patients and 817 controls. This study highlighted associations between cocaine dependence and five SNPs predicted to alter microRNA binding at the 3′-untranslated region of the NFAT5 gene. The association of SNP rs1437134 with cocaine dependence survived the Bonferroni correction for multiple testing. A functional effect was confirmed for this variant by a luciferase reporter assay, with lower expression observed for the rs1437134G allele, which was more pronounced in the presence of hsa-miR-509. However, brain volumes in regions of relevance to addiction, as assessed with magnetic resonance imaging, did not correlate with NFAT5 variation. These results suggest that the NFAT5 gene, which is upregulated a few hours after cocaine exposure, may be involved in the genetic predisposition to cocaine dependence. PMID:26506053

  14. SVM-T-RFE: a novel gene selection algorithm for identifying metastasis-related genes in colorectal cancer using gene expression profiles.

    PubMed

    Li, Xiaobo; Peng, Sihua; Chen, Jian; Lü, Bingjian; Zhang, Honghe; Lai, Maode

    2012-03-09

    Although metastasis is the principal cause of death cause for colorectal cancer (CRC) patients, the molecular mechanisms underlying CRC metastasis are still not fully understood. In an attempt to identify metastasis-related genes in CRC, we obtained gene expression profiles of 55 early stage primary CRCs, 56 late stage primary CRCs, and 34 metastatic CRCs from the expression project in Oncology (http://www.intgen.org/expo/). We developed a novel gene selection algorithm (SVM-T-RFE), which extends support vector machine recursive feature elimination (SVM-RFE) algorithm by incorporating T-statistic. We achieved highest classification accuracy (100%) with smaller gene subsets (10 and 6, respectively), when classifying between early and late stage primary CRCs, as well as between metastatic CRCs and late stage primary CRCs. We also compared the performance of SVM-T-RFE and SVM-RFE gene selection algorithms on another large-scale CRC dataset and the five public microarray datasets. SVM-T-RFE bestowed SVM-RFE algorithm in identifying more differentially expressed genes, and achieving highest prediction accuracy using equal or smaller number of selected genes. A fraction of selected genes have been reported to be associated with CRC development or metastasis.

  15. Evaluation of voltage-dependent calcium channel γ gene families identified several novel potential susceptible genes to schizophrenia

    PubMed Central

    Guan, Fanglin; Zhang, Tianxiao; Liu, Xinshe; Han, Wei; Lin, Huali; Li, Lu; Chen, Gang; Li, Tao

    2016-01-01

    Voltage-gated L-type calcium channels (VLCC) are distributed widely throughout the brain. Among the genes involved in schizophrenia (SCZ), genes encoding VLCC subunits have attracted widespread attention. Among the four subunits comprising the VLCC (α − 1, α −2/δ, β, and γ), the γ subunit that comprises an eight-member protein family is the least well understood. In our study, to further investigate the risk susceptibility by the γ subunit gene family to SCZ, we conducted a large-scale association study in Han Chinese individuals. The SNP rs17645023 located in the intergenic region of CACNG4 and CACNG5 was identified to be significantly associated with SCZ (OR = 0.856, P = 5.43 × 10−5). Similar results were obtained in the meta-analysis with the current SCZ PGC data (OR = 0.8853). We also identified a two-SNP haplotype (rs10420331-rs11084307, P = 1.4 × 10−6) covering the intronic region of CACNG8 to be significantly associated with SCZ. Epistasis analyses were conducted, and significant statistical interaction (OR = 0.622, P = 2.93 × 10−6, Pperm < 0.001) was observed between rs192808 (CACNG6) and rs2048137 (CACNG5). Our results indicate that CACNG4, CACNG5, CACNG6 and CACNG8 may contribute to the risk of SCZ. The statistical epistasis identified between CACNG5 and CACNG6 suggests that there may be an underlying biological interaction between the two genes. PMID:27102562

  16. Gene from a novel plant virus satellite from grapevine identifies a viral satellite lineage.

    PubMed

    Al Rwahnih, Maher; Daubert, Steve; Sudarshana, Mysore R; Rowhani, Adib

    2013-08-01

    We have identified the genome of a novel viral satellite in deep sequence analysis of double-stranded RNA from grapevine. The genome was 1,060 bases in length, and encoded two open reading frames. Neither frame was related to any known plant virus gene. But translation of the longer frame showed a protein sequence similar to those of other plant virus satellites. Other than in commonalities they shared in this gene sequence, members of that group were extensively divergent. The reading frame in this gene from the novel satellite could be translationally coupled to an adjacent reading frame in the -1 register, through overlapping start/stop codons. These overlapping AUGA start/stop codons were adjacent to a sequence that could be folded into a pseudoknot structure. Field surveys with PCR probes specific for the novel satellite revealed its presence in 3% of the grapevines (n = 346) sampled.

  17. Transcriptome Analysis to Identify Cold-Responsive Genes in Amur Carp (Cyprinus carpio haematopterus)

    PubMed Central

    He, XuLing

    2015-01-01

    The adaptation of fish to low temperatures is the result of long-term evolution. Amur carp (Cyprinus carpio haematopterus) survives low temperatures (0-4°C) for six months per year. Therefore, we chose this fish as a model organism to study the mechanisms of cold-adaptive responses using high-throughput sequencing technology. This system provided an excellent model for exploring the relationship between evolutionary genomic changes and environmental adaptations. The Amur carp transcriptome was sequenced using the Illumina platform and was assembled into 163,121 cDNA contigs, with an average read length of 594 bp and an N50 length of 913 bp. A total of 162,339 coding sequences (CDSs) were identified and of 32,730 unique CDSs were annotated. Gene Ontology (GO), EuKaryotic Orthologous Groups (KOG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed to classify all CDSs into different functional categories. A large number of cold-responsive genes were detected in different tissues at different temperatures. A total of 9,427 microsatellites were identified and classified, with 1952 identifying in cold-responsive genes. Based on GO enrichment analysis of the cold-induced genes, “protein localization” and “protein transport” were the most highly represented biological processes. “Circadian rhythm,” “protein processing in endoplasmic reticulum,” “endocytosis,” “insulin signaling pathway,” and “lysosome” were the most highly enriched pathways for the genes induced by cold stress. Our data greatly contribute to the common carp (C. carpio) transcriptome resource, and the identification of cold-responsive genes in different tissues at different temperatures will aid in deciphering the genetic basis of ecological and environmental adaptations in this species. Based on our results, the Amur carp has evolved special strategies to survive low temperatures, and these strategies include the system-wide or tissue-specific induction

  18. Fine Mapping and Whole-Genome Resequencing Identify the Seed Coat Color Gene in Brassica rapa

    PubMed Central

    Guo, Shaomin; An, Fengyun; Du, Dezhi

    2016-01-01

    A yellow seed coat is a desirable agronomic trait in the seeds of oilseed-type Brassica crops. In this study, we identified a candidate gene for seed coat color in Dahuang, a landrace of Brassica rapa. A previous study of Dahuang mapped the seed coat color gene Brsc1 to a 2.8-Mb interval on chromosome A9 of B. rapa. In the present study, the density of the linkage map for Brsc1 was increased by adding simple sequence repeat (SSR) markers, and the candidate region for Brsc1 was narrowed to 1.04 Mb. In addition, whole-genome resequencing with bulked segregant analysis (BSA) was conducted to identify candidate intervals for Brsc1. A genome-wide comparison of SNP profiles was performed between yellow-seeded and brown-seeded bulk samples. SNP index analyses identified a major candidate interval on chromosome A9 (A09:18,255,838–18,934,000, 678 kb) containing a long overlap with the target region recovered from the fine mapping results. According to gene annotation, Bra028067 (BrTT1) is an important candidate gene for Brsc1 in the overlapping region. Quantitative reverse transcription (qRT)-PCR revealed that BrTT1 mainly functions in the seed. Point mutations and small deletions in BrTT1 were found between yellow- and brown-seeded Dahuang plants. Collectively, the expression and sequence analysis results provide preliminary evidence that BrTT1 is a candidate gene for the seed coat color trait in Dahuang. PMID:27829069

  19. Large-Scale Gene-Centric Meta-analysis across 32 Studies Identifies Multiple Lipid Loci

    PubMed Central

    Asselbergs, Folkert W.; Guo, Yiran; van Iperen, Erik P.A.; Sivapalaratnam, Suthesh; Tragante, Vinicius; Lanktree, Matthew B.; Lange, Leslie A.; Almoguera, Berta; Appelman, Yolande E.; Barnard, John; Baumert, Jens; Beitelshees, Amber L.; Bhangale, Tushar R.; Chen, Yii-Der Ida; Gaunt, Tom R.; Gong, Yan; Hopewell, Jemma C.; Johnson, Toby; Kleber, Marcus E.; Langaee, Taimour Y.; Li, Mingyao; Li, Yun R.; Liu, Kiang; McDonough, Caitrin W.; Meijs, Matthijs F.L.; Middelberg, Rita P.S.; Musunuru, Kiran; Nelson, Christopher P.; O’Connell, Jeffery R.; Padmanabhan, Sandosh; Pankow, James S.; Pankratz, Nathan; Rafelt, Suzanne; Rajagopalan, Ramakrishnan; Romaine, Simon P.R.; Schork, Nicholas J.; Shaffer, Jonathan; Shen, Haiqing; Smith, Erin N.; Tischfield, Sam E.; van der Most, Peter J.; van Vliet-Ostaptchouk, Jana V.; Verweij, Niek; Volcik, Kelly A.; Zhang, Li; Bailey, Kent R.; Bailey, Kristian M.; Bauer, Florianne; Boer, Jolanda M.A.; Braund, Peter S.; Burt, Amber; Burton, Paul R.; Buxbaum, Sarah G.; Chen, Wei; Cooper-DeHoff, Rhonda M.; Cupples, L. Adrienne; deJong, Jonas S.; Delles, Christian; Duggan, David; Fornage, Myriam; Furlong, Clement E.; Glazer, Nicole; Gums, John G.; Hastie, Claire; Holmes, Michael V.; Illig, Thomas; Kirkland, Susan A.; Kivimaki, Mika; Klein, Ronald; Klein, Barbara E.; Kooperberg, Charles; Kottke-Marchant, Kandice; Kumari, Meena; LaCroix, Andrea Z.; Mallela, Laya; Murugesan, Gurunathan; Ordovas, Jose; Ouwehand, Willem H.; Post, Wendy S.; Saxena, Richa; Scharnagl, Hubert; Schreiner, Pamela J.; Shah, Tina; Shields, Denis C.; Shimbo, Daichi; Srinivasan, Sathanur R.; Stolk, Ronald P.; Swerdlow, Daniel I.; Taylor, Herman A.; Topol, Eric J.; Toskala, Elina; van Pelt, Joost L.; van Setten, Jessica; Yusuf, Salim; Whittaker, John C.; Zwinderman, A.H.; Anand, Sonia S.; Balmforth, Anthony J.; Berenson, Gerald S.; Bezzina, Connie R.; Boehm, Bernhard O.; Boerwinkle, Eric; Casas, Juan P.; Caulfield, Mark J.; Clarke, Robert; Connell, John M.; Cruickshanks, Karen J.; Davidson, Karina W.; Day, Ian N.M.; de Bakker, Paul I.W.; Doevendans, Pieter A.; Dominiczak, Anna F.; Hall, Alistair S.; Hartman, Catharina A.; Hengstenberg, Christian; Hillege, Hans L.; Hofker, Marten H.; Humphries, Steve E.; Jarvik, Gail P.; Johnson, Julie A.; Kaess, Bernhard M.; Kathiresan, Sekar; Koenig, Wolfgang; Lawlor, Debbie A.; März, Winfried; Melander, Olle; Mitchell, Braxton D.; Montgomery, Grant W.; Munroe, Patricia B.; Murray, Sarah S.; Newhouse, Stephen J.; Onland-Moret, N. Charlotte; Poulter, Neil; Psaty, Bruce; Redline, Susan; Rich, Stephen S.; Rotter, Jerome I.; Schunkert, Heribert; Sever, Peter; Shuldiner, Alan R.; Silverstein, Roy L.; Stanton, Alice; Thorand, Barbara; Trip, Mieke D.; Tsai, Michael Y.; van der Harst, Pim; van der Schoot, Ellen; van der Schouw, Yvonne T.; Verschuren, W.M. Monique; Watkins, Hugh; Wilde, Arthur A.M.; Wolffenbuttel, Bruce H.R.; Whitfield, John B.; Hovingh, G. Kees; Ballantyne, Christie M.; Wijmenga, Cisca; Reilly, Muredach P.; Martin, Nicholas G.; Wilson, James G.; Rader, Daniel J.; Samani, Nilesh J.; Reiner, Alex P.; Hegele, Robert A.; Kastelein, John J.P.; Hingorani, Aroon D.; Talmud, Philippa J.; Hakonarson, Hakon; Elbers, Clara C.; Keating, Brendan J.; Drenos, Fotios

    2012-01-01

    Genome-wide association studies (GWASs) have identified many SNPs underlying variations in plasma-lipid levels. We explore whether additional loci associated with plasma-lipid phenotypes, such as high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), total cholesterol (TC), and triglycerides (TGs), can be identified by a dense gene-centric approach. Our meta-analysis of 32 studies in 66,240 individuals of European ancestry was based on the custom ∼50,000 SNP genotyping array (the ITMAT-Broad-CARe array) covering ∼2,000 candidate genes. SNP-lipid associations were replicated either in a cohort comprising an additional 24,736 samples or within the Global Lipid Genetic Consortium. We identified four, six, ten, and four unreported SNPs in established lipid genes for HDL-C, LDL-C, TC, and TGs, respectively. We also identified several lipid-related SNPs in previously unreported genes: DGAT2, HCAR2, GPIHBP1, PPARG, and FTO for HDL-C; SOCS3, APOH, SPTY2D1, BRCA2, and VLDLR for LDL-C; SOCS3, UGT1A1, BRCA2, UBE3B, FCGR2A, CHUK, and INSIG2 for TC; and SERPINF2, C4B, GCK, GATA4, INSR, and LPAL2 for TGs. The proportion of explained phenotypic variance in the subset of studies providing individual-level data was 9.9% for HDL-C, 9.5% for LDL-C, 10.3% for TC, and 8.0% for TGs. This large meta-analysis of lipid phenotypes with the use of a dense gene-centric approach identified multiple SNPs not previously described in established lipid genes and several previously unknown loci. The explained phenotypic variance from this approach was comparable to that from a meta-analysis of GWAS data, suggesting that a focused genotyping approach can further increase the understanding of heritability of plasma lipids. PMID:23063622

  20. Genome Screen to Identify Susceptibility Genes for Parkinson Disease in a Sample without parkin Mutations

    PubMed Central

    Pankratz, Nathan; Nichols, William C.; Uniacke, Sean K.; Halter, Cheryl; Rudolph, Alice; Shults, Cliff; Conneally, P. Michael; Foroud, Tatiana

    2002-01-01

    Parkinson disease (PD) is a common neurodegenerative disorder characterized by bradykinesia, resting tremor, muscular rigidity, and postural instability, as well as by a clinically significant response to treatment with levodopa. Mutations in the α-synuclein gene have been found to result in autosomal dominant PD, and mutations in the parkin gene produce autosomal recessive juvenile-onset PD. We have studied 203 sibling pairs with PD who were evaluated by a rigorous neurological assessment based on (a) inclusion criteria consisting of clinical features highly associated with autopsy-confirmed PD and (b) exclusion criteria highly associated with other, non-PD pathological diagnoses. Families with positive LOD scores for a marker in an intron of the parkin gene were prioritized for parkin-gene testing, and mutations in the parkin gene were identified in 22 families. To reduce genetic heterogeneity, these families were not included in subsequent genome-screen analysis. Thus, a total of 160 multiplex families without evidence of a parkin mutation were used in multipoint nonparametric linkage analysis to identify PD-susceptibility genes. Two models of PD affection status were considered: model I included only those individuals with a more stringent diagnosis of verified PD (96 sibling pairs from 90 families), whereas model II included all examined individuals as affected, regardless of their final diagnostic classification (170 sibling pairs from 160 families). Under model I, the highest LOD scores were observed on chromosome X (LOD score 2.1) and on chromosome 2 (LOD score 1.9). Analyses performed with all available sibling pairs (model II) found even greater evidence of linkage to chromosome X (LOD score 2.7) and to chromosome 2 (LOD score 2.5). Evidence of linkage was also found to chromosomes 4, 5, and 13 (LOD scores >1.5). Our findings are consistent with those of other linkage studies that have reported linkage to chromosomes 5 and X. PMID:12058349

  1. Transcriptional profiling identifies differentially expressed genes in developing turkey skeletal muscle

    PubMed Central

    2011-01-01

    involved in extracellular matrix regulation, cell death/apoptosis, and calcium signaling/muscle function, as well as genes with miscellaneous function was confirmed by qPCR. Conclusions The current study identified gene pathways and uncovered novel genes important in turkey muscle growth and development. Future experiments will focus further on several of these candidate genes and the expression and mechanism of action of their protein products. PMID:21385442

  2. A Genome-Wide Screen for Dendritically Localized RNAs Identifies Genes Required for Dendrite Morphogenesis

    PubMed Central

    Misra, Mala; Edmund, Hendia; Ennis, Darragh; Schlueter, Marissa A.; Marot, Jessica E.; Tambasco, Janet; Barlow, Ida; Sigurbjornsdottir, Sara; Mathew, Renjith; Vallés, Ana Maria; Wojciech, Waldemar; Roth, Siegfried; Davis, Ilan; Leptin, Maria; Gavis, Elizabeth R.

    2016-01-01

    Localizing messenger RNAs at specific subcellular sites is a conserved mechanism for targeting the synthesis of cytoplasmic proteins to distinct subcellular domains, thereby generating the asymmetric protein distributions necessary for cellular and developmental polarity. However, the full range of transcripts that are asymmetrically distributed in specialized cell types, and the significance of their localization, especially in the nervous system, are not known. We used the EP-MS2 method, which combines EP transposon insertion with the MS2/MCP in vivo fluorescent labeling system, to screen for novel localized transcripts in polarized cells, focusing on the highly branched Drosophila class IV dendritic arborization neurons. Of a total of 541 lines screened, we identified 55 EP-MS2 insertions producing transcripts that were enriched in neuronal processes, particularly in dendrites. The 47 genes identified by these insertions encode molecularly diverse proteins, and are enriched for genes that function in neuronal development and physiology. RNAi-mediated knockdown confirmed roles for many of the candidate genes in dendrite morphogenesis. We propose that the transport of mRNAs encoded by these genes into the dendrites allows their expression to be regulated on a local scale during the dynamic developmental processes of dendrite outgrowth, branching, and/or remodeling. PMID:27260999

  3. A Genome-Wide Screen for Dendritically Localized RNAs Identifies Genes Required for Dendrite Morphogenesis.

    PubMed

    Misra, Mala; Edmund, Hendia; Ennis, Darragh; Schlueter, Marissa A; Marot, Jessica E; Tambasco, Janet; Barlow, Ida; Sigurbjornsdottir, Sara; Mathew, Renjith; Vallés, Ana Maria; Wojciech, Waldemar; Roth, Siegfried; Davis, Ilan; Leptin, Maria; Gavis, Elizabeth R

    2016-08-09

    Localizing messenger RNAs at specific subcellular sites is a conserved mechanism for targeting the synthesis of cytoplasmic proteins to distinct subcellular domains, thereby generating the asymmetric protein distributions necessary for cellular and developmental polarity. However, the full range of transcripts that are asymmetrically distributed in specialized cell types, and the significance of their localization, especially in the nervous system, are not known. We used the EP-MS2 method, which combines EP transposon insertion with the MS2/MCP in vivo fluorescent labeling system, to screen for novel localized transcripts in polarized cells, focusing on the highly branched Drosophila class IV dendritic arborization neurons. Of a total of 541 lines screened, we identified 55 EP-MS2 insertions producing transcripts that were enriched in neuronal processes, particularly in dendrites. The 47 genes identified by these insertions encode molecularly diverse proteins, and are enriched for genes that function in neuronal development and physiology. RNAi-mediated knockdown confirmed roles for many of the candidate genes in dendrite morphogenesis. We propose that the transport of mRNAs encoded by these genes into the dendrites allows their expression to be regulated on a local scale during the dynamic developmental processes of dendrite outgrowth, branching, and/or remodeling.

  4. Genetic Susceptibility to Vitiligo: GWAS Approaches for Identifying Vitiligo Susceptibility Genes and Loci

    PubMed Central

    Shen, Changbing; Gao, Jing; Sheng, Yujun; Dou, Jinfa; Zhou, Fusheng; Zheng, Xiaodong; Ko, Randy; Tang, Xianfa; Zhu, Caihong; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Zhang, Xuejun

    2016-01-01

    Vitiligo is an autoimmune disease with a strong genetic component, characterized by areas of depigmented skin resulting from loss of epidermal melanocytes. Genetic factors are known to play key roles in vitiligo through discoveries in association studies and family studies. Previously, vitiligo susceptibility genes were mainly revealed through linkage analysis and candidate gene studies. Recently, our understanding of the genetic basis of vitiligo has been rapidly advancing through genome-wide association study (GWAS). More than 40 robust susceptible loci have been identified and confirmed to be associated with vitiligo by using GWAS. Most of these associated genes participate in important pathways involved in the pathogenesis of vitiligo. Many susceptible loci with unknown functions in the pathogenesis of vitiligo have also been identified, indicating that additional molecular mechanisms may contribute to the risk of developing vitiligo. In this review, we summarize the key loci that are of genome-wide significance, which have been shown to influence vitiligo risk. These genetic loci may help build the foundation for genetic diagnosis and personalize treatment for patients with vitiligo in the future. However, substantial additional studies, including gene-targeted and functional studies, are required to confirm the causality of the genetic variants and their biological relevance in the development of vitiligo. PMID:26870082

  5. Regulatory elements of the floral homeotic gene AGAMOUS identified by phylogenetic footprinting and shadowing.

    SciTech Connect

    Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.

    2003-06-01

    OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally important for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.

  6. Evolutionary Inference across Eukaryotes Identifies Specific Pressures Favoring Mitochondrial Gene Retention.

    PubMed

    Johnston, Iain G; Williams, Ben P

    2016-02-24

    Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modeling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondrial genomes, we inferred evolutionary trajectories of mtDNA gene loss across the eukaryotic tree of life. We find that proteins comprising the structural cores of the electron transport chain are preferentially encoded within mitochondrial genomes across eukaryotes. A combination of high GC content and high protein hydrophobicity is required to explain patterns of mtDNA gene retention; a model that accounts for these selective pressures can also predict the success of artificial gene transfer experiments in vivo. This work provides a general method for data-driven inference of the ordering of evolutionary and progressive events, here identifying the distinct features shaping mitochondrial genomes of present-day species.

  7. A genome-wide screen identifies genes that affect somatic homolog pairing in Drosophila.

    PubMed

    Bateman, Jack R; Larschan, Erica; D'Souza, Ryan; Marshall, Lauren S; Dempsey, Kyle E; Johnson, Justine E; Mellone, Barbara G; Kuroda, Mitzi I

    2012-07-01

    In Drosophila and other Dipterans, homologous chromosomes are in close contact in virtually all nuclei, a phenomenon known as somatic homolog pairing. Although homolog pairing has been recognized for over a century, relatively little is known about its regulation. We performed a genome-wide RNAi-based screen that monitored the X-specific localization of the male-specific lethal (MSL) complex, and we identified 59 candidate genes whose knockdown via RNAi causes a change in the pattern of MSL staining that is consistent with a disruption of X-chromosomal homolog pairing. Using DNA fluorescent in situ hybridization (FISH), we confirmed that knockdown of 17 of these genes has a dramatic effect on pairing of the 359 bp repeat at the base of the X. Furthermore, dsRNAs targeting Pr-set7, which encodes an H4K20 methyltransferase, cause a modest disruption in somatic homolog pairing. Consistent with our results in cultured cells, a classical mutation in one of the strongest candidate genes, pebble (pbl), causes a decrease in somatic homolog pairing in developing embryos. Interestingly, many of the genes identified by our screen have known roles in diverse cell-cycle events, suggesting an important link between somatic homolog pairing and the choreography of chromosomes during the cell cycle.

  8. High Throughput Gene Expression Analysis Identifies Reliable Expression Markers of Human Corneal Endothelial Cells

    PubMed Central

    Chng, Zhenzhi; Peh, Gary S. L.; Herath, Wishva B.; Cheng, Terence Y. D.; Ang, Heng-Pei; Toh, Kah-Peng; Robson, Paul; Mehta, Jodhbir S.; Colman, Alan

    2013-01-01

    Considerable interest has been generated for the development of suitable corneal endothelial graft alternatives through cell-tissue engineering, which can potentially alleviate the shortage of corneal transplant material. The advent of less invasive suture-less key-hole surgery options such as Descemet’s Stripping Endothelial Keratoplasty (DSEK) and Descemet’s Membrane Endothelial Keratoplasty (DMEK), which involve transplantation of solely the endothelial layer instead of full thickness cornea, provide further impetus for the development of alternative endothelial grafts for clinical applications. A major challenge for this endeavor is the lack of specific markers for this cell type. To identify genes that reliably mark corneal endothelial cells (CECs) in vivo and in vitro, we performed RNA-sequencing on freshly isolated human CECs (from both young and old donors), CEC cultures, and corneal stroma. Gene expression of these corneal cell types was also compared to that of other human tissue types. Based on high throughput comparative gene expression analysis, we identified a panel of markers that are: i) highly expressed in CECs from both young donors and old donors; ii) expressed in CECs in vivo and in vitro; and iii) not expressed in corneal stroma keratocytes and the activated corneal stroma fibroblasts. These were SLC4A11, COL8A2 and CYYR1. The use of this panel of genes in combination reliably ascertains the identity of the CEC cell type. PMID:23844023

  9. A New Glabrous Gene (csgl3) Identified in Trichome Development in Cucumber (Cucumis sativus L.)

    PubMed Central

    Ding, Li-Hong; Wehner, Todd C.; Liu, Pan-Na; Wang, Ye; Zhang, Sheng-Ping; Gu, Xing-Fang

    2016-01-01

    Spines or trichomes on the fruit of cucumbers enhance their commercial value in China. In addition, glabrous mutants exhibit resistance to aphids and therefore their use by growers can reduce pesticide residues. Previous studies have reported two glabrous mutant plants containing the genes, csgl1 and csgl2. In the present study, a new glabrous mutant, NCG157, was identified showing a gene interaction effect with csgl1 and csgl2. This mutant showed the glabrous character on stems, leaves, tendrils, receptacles and ovaries, and there were no spines or tumors on the fruit surface. Inheritance analysis showed that a single recessive gene, named csgl3, determined the glabrous trait. An F2 population derived from the cross of two inbred lines 9930 (a fresh market type from Northern China that exhibits trichomes) and NCG157 (an American processing type with glabrous surfaces) was used for genetic mapping of the csgl3 gene. By combining bulked segregant analysis (BAS) with molecular markers, 18 markers, including two simple sequence repeats (SSR), nine insertion deletions (InDel) and seven derived cleaved amplified polymorphism sequences (dCAPs), were identified to link to the csgl3 gene. All of the linked markers were used as anchor loci to locate the csgl3 gene on cucumber chromosome 6. The csgl3 gene was mapped between the dCAPs markers dCAPs-21 and dCAPs-19, at genetic distances of 0.05 cM and 0.15 cM, respectively. The physical distance of this region was 19.6 kb. Three markers, InDel-19, dCAPs-2 and dCAPs-11, co-segregated with csgl3. There were two candidate genes in the region, Csa6M514860 and Csa6M514870. Quantitative real-time PCR showed that the expression of Csa6M514870 was higher in the tissues of 9930 than that of NCG157, and this was consistent with their phenotypic characters. Csa6M514870 is therefore postulated to be the candidate gene for the development of trichomes in cucumber. This study will facilitate marker-assisted selection (MAS) of the smooth

  10. Large-Scale Gene-Centric Analysis Identifies Novel Variants for Coronary Artery Disease

    PubMed Central

    2011-01-01

    Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants. We examined 49,094 genetic variants in ∼2,100 genes of cardiovascular relevance, using a customised gene array in 15,596 CAD cases and 34,992 controls (11,202 cases and 30,733 controls of European descent; 4,394 cases and 4,259 controls of South Asian origin). We attempted to replicate putative novel associations in an additional 17,121 CAD cases and 40,473 controls. Potential mechanisms through which the novel variants could affect CAD risk were explored through association tests with vascular risk factors and gene expression. We confirmed associations of several previously known CAD susceptibility loci (eg, 9p21.3:p<10−33; LPA:p<10−19; 1p13.3:p<10−17) as well as three recently discovered loci (COL4A1/COL4A2, ZC3HC1, CYP17A1:p<5×10−7). However, we found essentially null results for most previously suggested CAD candidate genes. In our replication study of 24 promising common variants, we identified novel associations of variants in or near LIPA, IL5, TRIB1, and ABCG5/ABCG8, with per-allele odds ratios for CAD risk with each of the novel variants ranging from 1.06–1.09. Associations with variants at LIPA, TRIB1, and ABCG5/ABCG8 were supported by gene expression data or effects on lipid levels. Apart from the previously reported variants in LPA, none of the other ∼4,500 low frequency and functional variants showed a strong effect. Associations in South Asians did not differ appreciably from those in Europeans, except for 9p21.3 (per-allele odds ratio: 1.14 versus 1.27 respectively; P for heterogeneity = 0.003). This large-scale gene-centric analysis has identified several novel genes for CAD that relate to diverse biochemical and cellular functions and

  11. A New Glabrous Gene (csgl3) Identified in Trichome Development in Cucumber (Cucumis sativus L.).

    PubMed

    Cui, Jin-Ying; Miao, Han; Ding, Li-Hong; Wehner, Todd C; Liu, Pan-Na; Wang, Ye; Zhang, Sheng-Ping; Gu, Xing-Fang

    2016-01-01

    Spines or trichomes on the fruit of cucumbers enhance their commercial value in China. In addition, glabrous mutants exhibit resistance to aphids and therefore their use by growers can reduce pesticide residues. Previous studies have reported two glabrous mutant plants containing the genes, csgl1 and csgl2. In the present study, a new glabrous mutant, NCG157, was identified showing a gene interaction effect with csgl1 and csgl2. This mutant showed the glabrous character on stems, leaves, tendrils, receptacles and ovaries, and there were no spines or tumors on the fruit surface. Inheritance analysis showed that a single recessive gene, named csgl3, determined the glabrous trait. An F2 population derived from the cross of two inbred lines 9930 (a fresh market type from Northern China that exhibits trichomes) and NCG157 (an American processing type with glabrous surfaces) was used for genetic mapping of the csgl3 gene. By combining bulked segregant analysis (BAS) with molecular markers, 18 markers, including two simple sequence repeats (SSR), nine insertion deletions (InDel) and seven derived cleaved amplified polymorphism sequences (dCAPs), were identified to link to the csgl3 gene. All of the linked markers were used as anchor loci to locate the csgl3 gene on cucumber chromosome 6. The csgl3 gene was mapped between the dCAPs markers dCAPs-21 and dCAPs-19, at genetic distances of 0.05 cM and 0.15 cM, respectively. The physical distance of this region was 19.6 kb. Three markers, InDel-19, dCAPs-2 and dCAPs-11, co-segregated with csgl3. There were two candidate genes in the region, Csa6M514860 and Csa6M514870. Quantitative real-time PCR showed that the expression of Csa6M514870 was higher in the tissues of 9930 than that of NCG157, and this was consistent with their phenotypic characters. Csa6M514870 is therefore postulated to be the candidate gene for the development of trichomes in cucumber. This study will facilitate marker-assisted selection (MAS) of the smooth

  12. Transcriptional Shift Identifies a Set of Genes Driving Breast Cancer Chemoresistance

    PubMed Central

    Vera-Ramirez, Laura; Sanchez-Rovira, Pedro; Ramirez-Tortosa, Cesar L.; Quiles, Jose L.; Ramirez-Tortosa, MCarmen; Lorente, Jose A.

    2013-01-01

    Background Distant recurrences after antineoplastic treatment remain a serious problem for breast cancer clinical management, which threats patients’ life. Systemic therapy is administered to eradicate cancer cells from the organism, both at the site of the primary tumor and at any other potential location. Despite this intervention, a significant proportion of breast cancer patients relapse even many years after their primary tumor has been successfully treated according to current clinical standards, evidencing the existence of a chemoresistant cell subpopulation originating from the primary tumor. Methods/Findings To identify key molecules and signaling pathways which drive breast cancer chemoresistance we performed gene expression analysis before and after anthracycline and taxane-based chemotherapy and compared the results between different histopathological response groups (good-, mid- and bad-response), established according to the Miller & Payne grading system. Two cohorts of 33 and 73 breast cancer patients receiving neoadjuvant chemotherapy were recruited for whole-genome expression analysis and validation assay, respectively. Identified genes were subjected to a bioinformatic analysis in order to ascertain the molecular function of the proteins they encode and the signaling in which they participate. High throughput technologies identified 65 gene sequences which were over-expressed in all groups (P ≤ 0·05 Bonferroni test). Notably we found that, after chemotherapy, a significant proportion of these genes were over-expressed in the good responders group, making their tumors indistinguishable from those of the bad responders in their expression profile (P ≤ 0.05 Benjamini-Hochgerg`s method). Conclusions These data identify a set of key molecular pathways selectively up-regulated in post-chemotherapy cancer cells, which may become appropriate targets for the development of future directed therapies against breast cancer. PMID:23326553

  13. Genetic and expression analysis of cattle identifies candidate genes in pathways responding to Trypanosoma congolense infection

    PubMed Central

    Noyes, Harry; Brass, Andy; Obara, Isaiah; Anderson, Susan; Archibald, Alan L.; Bradley, Dan G.; Fisher, Paul; Freeman, Abigail; Gibson, John; Gicheru, Michael; Hall, Laurence; Hanotte, Olivier; Hulme, Helen; McKeever, Declan; Murray, Caitriona; Oh, Sung Jung; Tate, Catriona; Smith, Ken; Tapio, Miika; Wambugu, John; Williams, Diana J.; Agaba, Morris; Kemp, Stephen J.

    2011-01-01

    African bovine trypanosomiasis caused by Trypanosoma sp., is a major constraint on cattle productivity in sub-Saharan Africa. Some African Bos taurus breeds are highly tolerant of infection, but the potentially more productive Bos indicus zebu breeds are much more susceptible. Zebu cattle are well adapted for plowing and haulage, and increasing their tolerance of trypanosomiasis could have a major impact on crop cultivation as well as dairy and beef production. We used three strategies to obtain short lists of candidate genes within QTL that were previously shown to regulate response to infection. We analyzed the transcriptomes of trypanotolerant N'Dama and susceptible Boran cattle after infection with Trypanosoma congolense. We sequenced EST libraries from these two breeds to identify polymorphisms that might underlie previously identified quantitative trait loci (QTL), and we assessed QTL regions and candidate loci for evidence of selective sweeps. The scan of the EST sequences identified a previously undescribed polymorphism in ARHGAP15 in the Bta2 trypanotolerance QTL. The polymorphism affects gene function in vitro and could contribute to the observed differences in expression of the MAPK pathway in vivo. The expression data showed that TLR and MAPK pathways responded to infection, and the former contained TICAM1, which is within a QTL on Bta7. Genetic analyses showed that selective sweeps had occurred at TICAM1 and ARHGAP15 loci in African taurine cattle, making them strong candidates for the genes underlying the QTL. Candidate QTL genes were identified in other QTL by their expression profile and the pathways in which they participate. PMID:21593421

  14. Combined Gene Expression and RNAi Screening to Identify Alkylation Damage Survival Pathways from Fly to Human.

    PubMed

    Zanotto-Filho, Alfeu; Dashnamoorthy, Ravi; Loranc, Eva; de Souza, Luis H T; Moreira, José C F; Suresh, Uthra; Chen, Yidong; Bishop, Alexander J R

    2016-01-01

    Alkylating agents are a key component of cancer chemotherapy. Several cellular mechanisms are known to be important for its survival, particularly DNA repair and xenobiotic detoxification, yet genomic screens indicate that additional cellular components may be involved. Elucidating these components has value in either identifying key processes that can be modulated to improve chemotherapeutic efficacy or may be altered in some cancers to confer chemoresistance. We therefore set out to reevaluate our prior Drosophila RNAi screening data by comparison to gene expression arrays in order to determine if we could identify any novel processes in alkylation damage survival. We noted a consistent conservation of alkylation survival pathways across platforms and species when the analysis was conducted on a pathway/process level rather than at an individual gene level. Better results were obtained when combining gene lists from two datasets (RNAi screen plus microarray) prior to analysis. In addition to previously identified DNA damage responses (p53 signaling and Nucleotide Excision Repair), DNA-mRNA-protein metabolism (transcription/translation) and proteasome machinery, we also noted a highly conserved cross-species requirement for NRF2, glutathione (GSH)-mediated drug detoxification and Endoplasmic Reticulum stress (ER stress)/Unfolded Protein Responses (UPR) in cells exposed to alkylation. The requirement for GSH, NRF2 and UPR in alkylation survival was validated by metabolomics, protein studies and functional cell assays. From this we conclude that RNAi/gene expression fusion is a valid strategy to rapidly identify key processes that may be extendable to other contexts beyond damage survival.

  15. Live-cell monitoring of periodic gene expression in synchronous human cells identifies Forkhead genes involved in cell cycle control.

    PubMed

    Grant, Gavin D; Gamsby, Joshua; Martyanov, Viktor; Brooks, Lionel; George, Lacy K; Mahoney, J Matthew; Loros, Jennifer J; Dunlap, Jay C; Whitfield, Michael L

    2012-08-01

    We developed a system to monitor periodic luciferase activity from cell cycle-regulated promoters in synchronous cells. Reporters were driven by a minimal human E2F1 promoter with peak expression in G1/S or a basal promoter with six Forkhead DNA-binding sites with peak expression at G2/M. After cell cycle synchronization, luciferase activity was measured in live cells at 10-min intervals across three to four synchronous cell cycles, allowing unprecedented resolution of cell cycle-regulated gene expression. We used this assay to screen Forkhead transcription factors for control of periodic gene expression. We confirmed a role for FOXM1 and identified two novel cell cycle regulators, FOXJ3 and FOXK1. Knockdown of FOXJ3 and FOXK1 eliminated cell cycle-dependent oscillations and resulted in decreased cell proliferation rates. Analysis of genes regulated by FOXJ3 and FOXK1 showed that FOXJ3 may regulate a network of zinc finger proteins and that FOXK1 binds to the promoter and regulates DHFR, TYMS, GSDMD, and the E2F binding partner TFDP1. Chromatin immunoprecipitation followed by high-throughput sequencing analysis identified 4329 genomic loci bound by FOXK1, 83% of which contained a FOXK1-binding motif. We verified that a subset of these loci are activated by wild-type FOXK1 but not by a FOXK1 (H355A) DNA-binding mutant.

  16. An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora

    PubMed Central

    2011-01-01

    Background Coffee is one of the world's most important crops; it is consumed worldwide and plays a significant role in the economy of producing countries. Coffea arabica and C. canephora are responsible for 70 and 30% of commercial production, respectively. C. arabica is an allotetraploid from a recent hybridization of the diploid species, C. canephora and C. eugenioides. C. arabica has lower genetic diversity and results in a higher quality beverage than C. canephora. Research initiatives have been launched to produce genomic and transcriptomic data about Coffea spp. as a strategy to improve breeding efficiency. Results Assembling the expressed sequence tags (ESTs) of C. arabica and C. canephora produced by the Brazilian Coffee Genome Project and the Nestlé-Cornell Consortium revealed 32,007 clusters of C. arabica and 16,665 clusters of C. canephora. We detected different GC3 profiles between these species that are related to their genome structure and mating system. BLAST analysis revealed similarities between coffee and grape (Vitis vinifera) genes. Using KA/KS analysis, we identified coffee genes under purifying and positive selection. Protein domain and gene ontology analyses suggested differences between Coffea spp. data, mainly in relation to complex sugar synthases and nucleotide binding proteins. OrthoMCL was used to identify specific and prevalent coffee protein families when compared to five other plant species. Among the interesting families annotated are new cystatins, glycine-rich proteins and RALF-like peptides. Hierarchical clustering was used to independently group C. arabica and C. canephora expression clusters according to expression data extracted from EST libraries, resulting in the identification of differentially expressed genes. Based on these results, we emphasize gene annotation and discuss plant defenses, abiotic stress and cup quality-related functional categories. Conclusion We present the first comprehensive genome-wide transcript

  17. Two novel mutations of ornithine transcarbamylase gene identified from three Chinese neonates with ornithine transcarbamylase deficiency.

    PubMed

    Liu, Jing; Dong, Lei; Wang, Yan; Zhang, Mei

    2015-01-01

    We aim to analyze the blood metabolic profiling and the gene mutation of ornithine transcarbamylase (OTC) in three neonates with ornithine transcarbamylase deficiency (OTCD). Three neonates with OTCD were included in this study. The profiling of amino acids and acylcarnitine was determined using MS-MS assay. The OTC exons were amplified using PCR amplification. DNA sequencing was performed, based on which mutation analysis of OTC genes was carried out. For the clinical symptoms, all the three neonates showed poor reaction and feeding. In addition, convulsion and neonatal infection were noticed. A remarkable decrease of citrulline concentration was revealed by MS-MS assay. In case 1, a 548A > G substitution was identified in exon 6, which resulted in replacement of cysteine by tyrosine in codon 183. In case 2, a 1016T > G substitution was identified in exon 10, leading to replacement of valine by glycine in codon 339. In case 3, a 995G > C mutation was noted in exon 9, resulting in missense mutation of tryptophane to serine in codon332. Three types of OTC gene mutations were identified in Chinese neonates with OTC deficiency, among which two novel mutations, including 1016T > G and 995G > C, are presented uniquely in our study.

  18. A novel homozygous mutation in SUCLA2 gene identified by exome sequencing

    PubMed Central

    Lamperti, Costanza; Fang, Mingyan; Invernizzi, Federica; Liu, Xuanzhu; Wang, Hairong; Zhang, Qing; Carrara, Franco; Moroni, Isabella; Zeviani, Massimo; Zhang, Jianguo; Ghezzi, Daniele

    2012-01-01

    Mitochondrial disorders with multiple mitochondrial respiratory chain (MRC) enzyme deficiency and depletion of mitochondrial DNA (mtDNA) are autosomal recessive conditions due to mutations in several nuclear genes necessary for proper mtDNA maintenance. In this report, we describe two Italian siblings presenting with encephalomyopathy and mtDNA depletion in muscle. By whole exome-sequencing and prioritization of candidate genes, we identified a novel homozygous missense mutation in the SUCLA2 gene in a highly conserved aminoacid residue. Although a recurrent mutation in the SUCLA2 gene is relatively frequent in the Faroe Islands, mutations in other populations are extremely rare. In contrast with what has been reported in other patients, methyl-malonic aciduria, a biomarker for this genetic defect, was absent in our proband and very mildly elevated in her affected sister. This report demonstrates that next-generation technologies, particularly exome-sequencing, are user friendly, powerful means for the identification of disease genes in genetically and clinically heterogeneous inherited conditions, such as mitochondrial disorders. PMID:23010432

  19. Yeast model identifies ENTPD6 as a potential non-obstructive azoospermia pathogenic gene

    PubMed Central

    Wang, Qian; Liu, Chao; Tang, Chaoming; Guo, Huiping; Liu, Yujiao; Wang, Lina; Zhao, Haichao; Shang, Yongliang; Wen, Yang; Lin, Yuan; Zhou, Tao; Zhou, Zuomin; Dong, Wen; Hu, Zhibin; Guo, Xuejiang; Sha, Jiahao; Li, Wei

    2015-01-01

    Approximately ten percent of male infertility is caused by non-obstructive azoospermia (NOA), but the etiologies of many NOA remain elusive. Recently, a genome-wide association study (GWAS) of NOA in Han Chinese men was conducted, and only a few genetic variants associated with NOA were found, which might have resulted from genetic heterogeneity. However, those variants that lack genome-wide significance might still be essential for fertility. Functional analysis of genes surrounding these variants in Drosophila identified some spermatogenesis-essential genes. As a complementary method of Drosophila screening, SK1 background Saccharomvces cerevisiae was used as a model to screen meiosis-related genes from the NOA GWAS data in this study. After functional screening, GDA1 (orthologous to humanENTPD6) was found to be a novel meiosis-related gene. The deletion of GDA1 resulted in the failure of yeast sporulation. Further investigations showed that Gda1p was important for pre-meiotic S phase entry. Interestingly, the meiotic role of Gda1p was dependent on its guanosine diphosphatase activity, but not it’s cytoplasmic, transmembrane or stem domains. These yeast data suggest that ENTPD6 may be a novel meiosis-associated NOA-related gene, and the yeast model provides a good approach to analyze GWAS results of NOA. PMID:26152596

  20. A network-based phenotype mapping approach to identify genes that modulate drug response phenotypes

    PubMed Central

    Cairns, Junmei; Ung, Choong Yong; da Rocha, Edroaldo Lummertz; Zhang, Cheng; Correia, Cristina; Weinshilboum, Richard; Wang, Liewei; Li, Hu

    2016-01-01

    To better address the problem of drug resistance during cancer chemotherapy and explore the possibility of manipulating drug response phenotypes, we developed a network-based phenotype mapping approach (P-Map) to identify gene candidates that upon perturbed can alter sensitivity to drugs. We used basal transcriptomics data from a panel of human lymphoblastoid cell lines (LCL) to infer drug response networks (DRNs) that are responsible for conferring response phenotypes for anthracycline and taxane, two common anticancer agents use in clinics. We further tested selected gene candidates that interact with phenotypic differentially expressed genes (PDEGs), which are up-regulated genes in LCL for a given class of drug response phenotype in triple-negative breast cancer (TNBC) cells. Our results indicate that it is possible to manipulate a drug response phenotype, from resistant to sensitive or vice versa, by perturbing gene candidates in DRNs and suggest plausible mechanisms regulating directionality of drug response sensitivity. More important, the current work highlights a new way to formulate systems-based therapeutic design: supplementing therapeutics that aim to target disease culprits with phenotypic modulators capable of altering DRN properties with the goal to re-sensitize resistant phenotypes. PMID:27841317

  1. A systems approach identifies networks and genes linking sleep and stress: implications for neuropsychiatric disorders.

    PubMed

    Jiang, Peng; Scarpa, Joseph R; Fitzpatrick, Karrie; Losic, Bojan; Gao, Vance D; Hao, Ke; Summa, Keith C; Yang, He S; Zhang, Bin; Allada, Ravi; Vitaterna, Martha H; Turek, Fred W; Kasarskis, Andrew

    2015-05-05

    Sleep dysfunction and stress susceptibility are comorbid complex traits that often precede and predispose patients to a variety of neuropsychiatric diseases. Here, we demonstrate multilevel organizations of genetic landscape, candidate genes, and molecular networks associated with 328 stress and sleep traits in a chronically stressed population of 338 (C57BL/6J × A/J) F2 mice. We constructed striatal gene co-expression networks, revealing functionally and cell-type-specific gene co-regulations important for stress and sleep. Using a composite ranking system, we identified network modules most relevant for 15 independent phenotypic categories, highlighting a mitochondria/synaptic module that links sleep and stress. The key network regulators of this module are overrepresented with genes implicated in neuropsychiatric diseases. Our work suggests that the interplay among sleep, stress, and neuropathology emerges from genetic influences on gene expression and their collective organization through complex molecular networks, providing a framework for interrogating the mechanisms underlying sleep, stress susceptibility, and related neuropsychiatric disorders.

  2. Yeast model identifies ENTPD6 as a potential non-obstructive azoospermia pathogenic gene.

    PubMed

    Wang, Qian; Liu, Chao; Tang, Chaoming; Guo, Huiping; Liu, Yujiao; Wang, Lina; Zhao, Haichao; Shang, Yongliang; Wen, Yang; Lin, Yuan; Zhou, Tao; Zhou, Zuomin; Dong, Wen; Hu, Zhibin; Guo, Xuejiang; Sha, Jiahao; Li, Wei

    2015-07-08

    Approximately ten percent of male infertility is caused by non-obstructive azoospermia (NOA), but the etiologies of many NOA remain elusive. Recently, a genome-wide association study (GWAS) of NOA in Han Chinese men was conducted, and only a few genetic variants associated with NOA were found, which might have resulted from genetic heterogeneity. However, those variants that lack genome-wide significance might still be essential for fertility. Functional analysis of genes surrounding these variants in Drosophila identified some spermatogenesis-essential genes. As a complementary method of Drosophila screening, SK1 background Saccharomvces cerevisiae was used as a model to screen meiosis-related genes from the NOA GWAS data in this study. After functional screening, GDA1 (orthologous to humanENTPD6) was found to be a novel meiosis-related gene. The deletion of GDA1 resulted in the failure of yeast sporulation. Further investigations showed that Gda1p was important for pre-meiotic S phase entry. Interestingly, the meiotic role of Gda1p was dependent on its guanosine diphosphatase activity, but not it's cytoplasmic, transmembrane or stem domains. These yeast data suggest that ENTPD6 may be a novel meiosis-associated NOA-related gene, and the yeast model provides a good approach to analyze GWAS results of NOA.

  3. An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways

    PubMed Central

    Ander, Bradley P.; Zhang, Xiaoshuai; Xue, Fuzhong; Sharp, Frank R.; Yang, Xiaowei

    2013-01-01

    The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. PMID:23844055

  4. Differential expression of genes identified by suppression subtractive hybridization in petals of opening carnation flowers.

    PubMed

    Harada, Taro; Torii, Yuka; Morita, Shigeto; Masumura, Takehiro; Satoh, Shigeru

    2010-05-01

    Flower opening is an event accompanied by morphological changes in petals which include elongation, expansion, and outward-curving. Petal cell growth is a fundamental process that underlies such phenomena, but its molecular mechanism remains largely unknown. Suppression subtractive hybridization was performed between petals during the early elongation period (stage 1) and during the opening period (stage 5) in carnation flowers and a pair of subtraction libraries abundant in differentially expressed genes was constructed at each stage. 393 cDNA clones picked up by differential screening out of 1728 clones were sequenced and 235 different cDNA fragments were identified, among which 211 did not match any known nucleotide sequence of carnation genes in the databases. BLASTX search of nucleotide sequences revealed that putative functions of the translational products can be classified into several categories including transcription, signalling, cell wall modification, lipid metabolism, and transport. Open reading frames of 15 selected genes were successfully determined by rapid amplification of cDNA ends (RACE). Time-course analysis of these genes by real-time RT-PCR showed that transcript levels of several genes correlatively fluctuate in petals of opening carnation flowers, suggesting an association with the morphological changes by elongation or curving. Based on the results, it is suggested that the growth of carnation petals is controlled by co-ordinated gene expression during the progress of flower opening. In addition, the possible roles of some key genes in the initiation of cell growth, the construction of the cell wall and cuticle, and transport across membranes were discussed.

  5. Gene Expression Profiling Identifies Molecular Pathways Associated with Collagen VI Deficiency and Provides Novel Therapeutic Targets

    PubMed Central

    Paco, Sonia; Kalko, Susana G.; Jou, Cristina; Rodríguez, María A.; Corbera, Joan; Muntoni, Francesco; Feng, Lucy; Rivas, Eloy; Torner, Ferran; Gualandi, Francesca; Gomez-Foix, Anna M.; Ferrer, Anna; Ortez, Carlos; Nascimento, Andrés; Colomer, Jaume; Jimenez-Mallebrera, Cecilia

    2013-01-01

    Ullrich congenital muscular dystrophy (UCMD), caused by collagen VI deficiency, is a common congenital muscular dystrophy. At present, the role of collagen VI in muscle and the mechanism of disease are not fully understood. To address this we have applied microarrays to analyse the transcriptome of UCMD muscle and compare it to healthy muscle and other muscular dystrophies. We identified 389 genes which are differentially regulated in UCMD relative to controls. In addition, there were 718 genes differentially expressed between UCMD and dystrophin deficient muscle. In contrast, only 29 genes were altered relative to other congenital muscular dystrophies. Changes in gene expression were confirmed by real-time PCR. The set of regulated genes was analysed by Gene Ontology, KEGG pathways and Ingenuity Pathway analysis to reveal the molecular functions and gene networks associated with collagen VI defects. The most significantly regulated pathways were those involved in muscle regeneration, extracellular matrix remodelling and inflammation. We characterised the immune response in UCMD biopsies as being mainly mediated via M2 macrophages and the complement pathway indicating that anti-inflammatory treatment may be beneficial to UCMD as for other dystrophies. We studied the immunolocalisation of ECM components and found that biglycan, a collagen VI interacting proteoglycan, was reduced in the basal lamina of UCMD patients. We propose that biglycan reduction is secondary to collagen VI loss and that it may be contributing towards UCMD pathophysiology. Consequently, strategies aimed at over-expressing biglycan and restore the link between the muscle cell surface and the extracellular matrix should be considered. PMID:24223098

  6. Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability.

    PubMed

    Riazuddin, S; Hussain, M; Razzaq, A; Iqbal, Z; Shahzad, M; Polla, D L; Song, Y; van Beusekom, E; Khan, A A; Tomas-Roca, L; Rashid, M; Zahoor, M Y; Wissink-Lindhout, W M; Basra, M A R; Ansar, M; Agha, Z; van Heeswijk, K; Rasheed, F; Van de Vorst, M; Veltman, J A; Gilissen, C; Akram, J; Kleefstra, T; Assir, M Z; Grozeva, D; Carss, K; Raymond, F L; O'Connor, T D; Riazuddin, S A; Khan, S N; Ahmed, Z M; de Brouwer, A P M; van Bokhoven, H; Riazuddin, S

    2016-07-26

    Intellectual disability (ID) is a clinically and genetically heterogeneous disorder, affecting 1-3% of the general population. Although research into the genetic causes of ID has recently gained momentum, identification of pathogenic mutations that cause autosomal recessive ID (ARID) has lagged behind, predominantly due to non-availability of sizeable families. Here we present the results of exome sequencing in 121 large consanguineous Pakistani ID families. In 60 families, we identified homozygous or compound heterozygous DNA variants in a single gene, 30 affecting reported ID genes and 30 affecting novel candidate ID genes. Potential pathogenicity of these alleles was supported by co-segregation with the phenotype, low frequency in control populations and the application of stringent bioinformatics analyses. In another eight families segregation of multiple pathogenic variants was observed, affecting 19 genes that were either known or are novel candidates for ID. Transcriptome profiles of normal human brain tissues showed that the novel candidate ID genes formed a network significantly enriched for transcriptional co-expression (P<0.0001) in the frontal cortex during fetal development and in the temporal-parietal and sub-cortex during infancy through adulthood. In addition, proteins encoded by 12 novel ID genes directly interact with previously reported ID proteins in six known pathways essential for cognitive function (P<0.0001). These results suggest that disruptions of temporal parietal and sub-cortical neurogenesis during infancy are critical to the pathophysiology of ID. These findings further expand the existing repertoire of genes involved in ARID, and provide new insights into the molecular mechanisms and the transcriptome map of ID.Molecular Psychiatry advance online publication, 26 July 2016; doi:10.1038/mp.2016.109.

  7. Genome-wide RNAi screening identifies protein damage as a regulator of osmoprotective gene expression

    PubMed Central

    Lamitina, Todd; Huang, Chunyi George; Strange, Kevin

    2006-01-01

    The detection, stabilization, and repair of stress-induced damage are essential requirements for cellular life. All cells respond to osmotic stress-induced water loss with increased expression of genes that mediate accumulation of organic osmolytes, solutes that function as chemical chaperones and restore osmotic homeostasis. The signals and signaling mechanisms that regulate osmoprotective gene expression in animal cells are poorly understood. Here, we show that gpdh-1 and gpdh-2, genes that mediate the accumulation of the organic osmolyte glycerol, are essential for survival of the nematode Caenorhabditis elegans during osmotic stress. Expression of GFP driven by the gpdh-1 promoter (Pgpdh-1::GFP) is detected only during hypertonic stress but is not induced by other stressors. Using Pgpdh-1::GFP expression as a phenotype, we screened ≈16,000 genes by RNAi feeding and identified 122 that cause constitutive activation of gpdh-1 expression and glycerol accumulation. Many of these genes function to regulate protein translation and cotranslational protein folding and to target and degrade denatured proteins, suggesting that the accumulation of misfolded proteins functions as a signal to activate osmoprotective gene expression and organic osmolyte accumulation in animal cells. Consistent with this hypothesis, 73% of these protein-homeostasis genes have been shown to slow age-dependent protein aggregation in C. elegans. Because diverse environmental stressors and numerous disease states result in protein misfolding, mechanisms must exist that discriminate between osmotically induced and other forms of stress-induced protein damage. Our findings provide a foundation for understanding how these damage-selectivity mechanisms function. PMID:16880390

  8. Comparative Transcriptome Analysis Identifies CCDC80 as a Novel Gene Associated with Pulmonary Arterial Hypertension

    PubMed Central

    Sasagawa, Shota; Nishimura, Yuhei; Sawada, Hirofumi; Zhang, Erquan; Okabe, Shiko; Murakami, Soichiro; Ashikawa, Yoshifumi; Yuge, Mizuki; Kawaguchi, Koki; Kawase, Reiko; Mitani, Yoshihide; Maruyama, Kazuo; Tanaka, Toshio

    2016-01-01

    Pulmonary arterial hypertension (PAH) is a heterogeneous disorder associated with a progressive increase in pulmonary artery resistance and pressure. Although various therapies have been developed, the 5-year survival rate of PAH patients remains low. There is thus an important need to identify novel genes that are commonly dysregulated in PAH of various etiologies and could be used as biomarkers and/or therapeutic targets. In this study, we performed comparative transcriptome analysis of five mammalian PAH datasets downloaded from a public database. We identified 228 differentially expressed genes (DEGs) from a rat PAH model caused by inhibition of vascular endothelial growth factor receptor under hypoxic conditions, 379 DEGs from a mouse PAH model associated with systemic sclerosis, 850 DEGs from a mouse PAH model associated with schistosomiasis, 1598 DEGs from one cohort of human PAH patients, and 4260 DEGs from a second cohort of human PAH patients. Gene-by-gene comparison identified four genes that were differentially upregulated or downregulated in parallel in all five sets of DEGs. Expression of coiled-coil domain containing 80 (CCDC80) and anterior gradient two genes was significantly increased in the five datasets, whereas expression of SMAD family member six and granzyme A was significantly decreased. Weighted gene co-expression network analysis revealed a connection between CCDC80 and collagen type I alpha 1 (COL1A1) expression. To validate the function of CCDC80 in vivo, we knocked out ccdc80 in zebrafish using the clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 system. In vivo imaging of zebrafish expressing a fluorescent protein in endothelial cells showed that ccdc80 deletion significantly increased the diameter of the ventral artery, a vessel supplying blood to the gills. We also demonstrated that expression of col1a1 and endothelin-1 mRNA was significantly decreased in the ccdc80-knockout zebrafish. Finally, we examined Ccdc

  9. Large-Scale Transposition Mutagenesis of Streptomyces coelicolor Identifies Hundreds of Genes Influencing Antibiotic Biosynthesis.

    PubMed

    Xu, Zhong; Wang, Yemin; Chater, Keith F; Ou, Hong-Yu; Xu, H Howard; Deng, Zixin; Tao, Meifeng

    2017-03-15

    Gram-positive Streptomyces bacteria produce thousands of bioactive secondary metabolites, including antibiotics. To systematically investigate genes affecting secondary metabolism, we developed a hyperactive transposase-based Tn5 transposition system and employed it to mutagenize the model species Streptomyces coelicolor, leading to the identification of 51,443 transposition insertions. These insertions were distributed randomly along the chromosome except for some preferred regions associated with relatively low GC content in the chromosomal core. The base composition of the insertion site and its flanking sequences compiled from the 51,443 insertions implied a 19-bp expanded target site surrounding the insertion site, with a slight nucleic acid base preference in some positions, suggesting a relative randomness of Tn5 transposition targeting in the high-GC Streptomyces genome. From the mutagenesis library, 724 mutants involving 365 genes had altered levels of production of the tripyrrole antibiotic undecylprodigiosin (RED), including 17 genes in the RED biosynthetic gene cluster. Genetic complementation revealed that most of the insertions (more than two-thirds) were responsible for the changed antibiotic production. Genes associated with branched-chain amino acid biosynthesis, DNA metabolism, and protein modification affected RED production, and genes involved in signaling, stress, and transcriptional regulation were overrepresented. Some insertions caused dramatic changes in RED production, identifying future targets for strain improvement.IMPORTANCE High-GC Gram-positive streptomycetes and related actinomycetes have provided more than 100 clinical drugs used as antibiotics, immunosuppressants, and antitumor drugs. Their genomes harbor biosynthetic genes for many more unknown compounds with potential as future drugs. Here we developed a useful genome-wide mutagenesis tool based on the transposon Tn5 for the study of secondary metabolism and its regulation

  10. Transcriptome Analysis of Recurrently Deregulated Genes across Multiple Cancers Identifies New Pan-Cancer Biomarkers.

    PubMed

    Kaczkowski, Bogumil; Tanaka, Yuji; Kawaji, Hideya; Sandelin, Albin; Andersson, Robin; Itoh, Masayoshi; Lassmann, Timo; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R R

    2016-01-15

    Genes that are commonly deregulated in cancer are clinically attractive as candidate pan-diagnostic markers and therapeutic targets. To globally identify such targets, we compared Cap Analysis of Gene Expression profiles from 225 different cancer cell lines and 339 corresponding primary cell samples to identify transcripts that are deregulated recurrently in a broad range of cancer types. Comparing RNA-seq data from 4,055 tumors and 563 normal tissues profiled in the The Cancer Genome Atlas and FANTOM5 datasets, we identified a core transcript set with theranostic potential. Our analyses also revealed enhancer RNAs, which are upregulated in cancer, defining promoters that overlap with repetitive elements (especially SINE/Alu and LTR/ERV1 elements) that are often upregulated in cancer. Lastly, we documented for the first time upregulation of multiple copies of the REP522 interspersed repeat in cancer. Overall, our genome-wide expression profiling approach identified a comprehensive set of candidate biomarkers with pan-cancer potential, and extended the perspective and pathogenic significance of repetitive elements that are frequently activated during cancer progression.

  11. Candidate genes and functional noncoding variants identified in a canine model of obsessive-compulsive disorder

    PubMed Central

    2014-01-01

    Background Obsessive-compulsive disorder (OCD), a severe mental disease manifested in time-consuming repetition of behaviors, affects 1 to 3% of the human population. While highly heritable, complex genetics has hampered attempts to elucidate OCD etiology. Dogs suffer from naturally occurring compulsive disorders that closely model human OCD, manifested as an excessive repetition of normal canine behaviors that only partially responds to drug therapy. The limited diversity within dog breeds makes identifying underlying genetic factors easier. Results We use genome-wide association of 87 Doberman Pinscher cases and 63 controls to identify genomic loci associated with OCD and sequence these regions in 8 affected dogs from high-risk breeds and 8 breed-matched controls. We find 119 variants in evolutionarily conserved sites that are specific to dogs with OCD. These case-only variants are significantly more common in high OCD risk breeds compared to breeds with no known psychiatric problems. Four genes, all with synaptic function, have the most case-only variation: neuronal cadherin (CDH2), catenin alpha2 (CTNNA2), ataxin-1 (ATXN1), and plasma glutamate carboxypeptidase (PGCP). In the 2 Mb gene desert between the cadherin genes CDH2 and DSC3, we find two different variants found only in dogs with OCD that disrupt the same highly conserved regulatory element. These variants cause significant changes in gene expression in a human neuroblastoma cell line, likely due to disrupted transcription factor binding. Conclusions The limited genetic diversity of dog breeds facilitates identification of genes, functional variants and regulatory pathways underlying complex psychiatric disorders that are mechanistically similar in dogs and humans. PMID:24995881

  12. A Genome-wide Association Study Identifies LIPA as a Susceptibility Gene for Coronary Artery Disease

    PubMed Central

    Wild, Philipp S; Zeller, Tanja; Schillert, Arne; Szymczak, Silke; Sinning, Christoph R; Deiseroth, Arne; Schnabel, Renate B; Lubos, Edith; Keller, Till; Eleftheriadis, Medea S; Bickel, Christoph; Rupprecht, Hans J; Wilde, Sandra; Rossmann, Heidi; Diemert, Patrick; Cupples, L Adrienne; Perret, Claire; Erdmann, Jeanette; Stark, Klaus; Kleber, Marcus E; Epstein, Stephen E; Voight, Benjamin F; Kuulasmaa, Kari; Li, Mingyao; Schäfer, Arne S; Klopp, Norman; Braund, Peter S; Sager, Hendrik B; Demissie, Serkalem; Proust, Carole; König, Inke R; Wichmann, Heinz-Erich; Reinhard, Wibke; Hoffmann, Michael M; Virtamo, Jarmo; Burnett, Mary Susan; Siscovick, David; Wiklund, Per Gunnar; Qu, Liming; El Mokthari, Nour Eddine; Thompson, John R; Peters, Annette; Smith, Albert V; Yon, Emmanuelle; Baumert, Jens; Hengstenberg, Christian; März, Winfried; Amouyel, Philippe; Devaney, Joseph; Schwartz, Stephen M; Saarela, Olli; Mehta, Nehal N; Rubin, Diana; Silander, Kaisa; Hall, Alistair S; Ferrieres, Jean; Harris, Tamara B; Melander, Olle; Kee, Frank; Hakonarson, Hakon; Schrezenmeir, Juergen; Gudnason, Vilmundur; Elosua, Roberto; Arveiler, Dominique; Evans, Alun; Rader, Daniel J; Illig, Thomas; Schreiber, Stefan; Bis, Joshua C; Altshuler, David; Kavousi, Maryam; Witteman, Jaqueline CM; Uitterlinden, Andre G; Hofman, Albert; Folsom, Aaron R; Barbalic, Maja; Boerwinkle, Eric; Kathiresan, Sekar; Reilly, Muredach P; O'Donnell, Christopher J; Samani, Nilesh J; Schunkert, Heribert; Cambien, Francois; Lackner, Karl J; Tiret, Laurence; Salomaa, Veikko; Munzel, Thomas; Ziegler, Andreas; Blankenberg, Stefan

    2011-01-01

    Background eQTL analyses are important to improve the understanding of genetic association results. Here, we performed a genome-wide association and global gene expression study to identify functionally relevant variants affecting the risk of coronary artery disease (CAD). Methods and Results In a genome-wide association analysis of 2,078 CAD cases and 2,953 controls, we identified 950 single nucleotide polymorphisms (SNPs) that were associated with CAD at P<10-3. Subsequent in silico and wet-lab replication stages and a final meta-analysis of 21,428 CAD cases and 38,361 controls revealed a novel association signal at chromosome 10q23.31 within the LIPA (Lysosomal Acid Lipase A) gene (P=3.7×10-8; OR 1.1; 95% CI: 1.07-1.14). The association of this locus with global gene expression was assessed by genome-wide expression analyses in the monocyte transcriptome of 1,494 individuals. The results showed a strong association of this locus with expression of the LIPA transcript (P=1.3×10-96). An assessment of LIPA SNPs and transcript with cardiovascular phenotypes revealed an association of LIPA transcript levels with impaired endothelial function (P=4.4×10-3). Conclusions The use of data on genetic variants and the addition of data on global monocytic gene expression led to the identification of the novel functional CAD susceptibility locus LIPA, located on chromosome 10q23.31. The respective eSNPs associated with CAD strongly affect LIPA gene expression level, which itself was related to endothelial dysfunction, a precursor of CAD. PMID:21606135

  13. Gene expression profiling identifies Fibronectin 1 and CXCL9 as candidate biomarkers for breast cancer screening

    PubMed Central

    Ruiz-Garcia, E; Scott, V; Machavoine, C; Bidart, J M; Lacroix, L; Delaloge, S; Andre, F

    2010-01-01

    Background: There is a need to develop blood-based bioassays for breast cancer (BC) screening. In this study, differential gene expression between BC samples and benign tumours was used to identify candidate biomarkers for blood-based screening. Methods: We identified two proteins (Fibronectin 1 and CXCL9) from a gene expression data set that included 120 BC samples and 45 benign lesions. These proteins fulfil the following criteria: differential gene expression between cancer and benign lesion, protein released in the extracellular medium and stable in the serum, commercially available ELISA kit, ELISA accuracy in a feasibility study. Protein concentrations were determined by ELISA. Blood samples were from normal volunteers (n=119) and early BC patients (n=133). Results: Seventy-three per cent of patients had cT1-T2 tumour. Patients had higher CXCL9 and Fibronectin 1 concentrations than volunteers. CXCL9 mean concentration was 851 and 635 pg ml−1 for patients and volunteers respectively (P=0.013). CXCL9 concentration was significantly higher in patients with estrogen receptor (ER)-negative compared with volunteers (P=0.003), data consistent with gene expression profile. Fibronectin 1 mean concentration was 190 μg ml−1 for patients and 125 μg ml−1 for volunteers (P<0.001). Areas under the curve for BC diagnosis were 0.78 and 0.62 for Fibronectin 1 and CXCL9 respectively. A combined score including Fibronectin 1 and CXCL9 dosages presented 53% of sensitivity and 98% of specificity. Similar performances were observed for ER-negative tumours. Conclusions: This study suggests that Fibronectin 1/CXCL9 dosage in serum could screen a significant rate of BC, including ER-negative, and that differential gene expression analysis is a good approach to select candidate biomarkers to set up blood assays cancer screening. PMID:20068563

  14. Analysis of genomic aberrations and gene expression profiling identifies novel lesions and pathways in myeloproliferative neoplasms

    PubMed Central

    Rice, K L; Lin, X; Wolniak, K; Ebert, B L; Berkofsky-Fessler, W; Buzzai, M; Sun, Y; Xi, C; Elkin, P; Levine, R; Golub, T; Gilliland, D G; Crispino, J D; Licht, J D; Zhang, W

    2011-01-01

    Polycythemia vera (PV), essential thrombocythemia and primary myelofibrosis, are myeloproliferative neoplasms (MPNs) with distinct clinical features and are associated with the JAK2V617F mutation. To identify genomic anomalies involved in the pathogenesis of these disorders, we profiled 87 MPN patients using Affymetrix 250K single-nucleotide polymorphism (SNP) arrays. Aberrations affecting chr9 were the most frequently observed and included 9pLOH (n=16), trisomy 9 (n=6) and amplifications of 9p13.3–23.3 (n=1), 9q33.1–34.13 (n=1) and 9q34.13 (n=6). Patients with trisomy 9 were associated with elevated JAK2V617F mutant allele burden, suggesting that gain of chr9 represents an alternative mechanism for increasing JAK2V617F dosage. Gene expression profiling of patients with and without chr9 abnormalities (+9, 9pLOH), identified genes potentially involved in disease pathogenesis including JAK2, STAT5B and MAPK14. We also observed recurrent gains of 1p36.31–36.33 (n=6), 17q21.2–q21.31 (n=5) and 17q25.1–25.3 (n=5) and deletions affecting 18p11.31–11.32 (n=8). Combined SNP and gene expression analysis identified aberrations affecting components of a non-canonical PRC2 complex (EZH1, SUZ12 and JARID2) and genes comprising a ‘HSC signature' (MLLT3, SMARCA2 and PBX1). We show that NFIB, which is amplified in 7/87 MPN patients and upregulated in PV CD34+ cells, protects cells from apoptosis induced by cytokine withdrawal. PMID:22829077

  15. A General Approach for Identifying Distant Regulatory Elements Applied to the Gdf6 Gene

    PubMed Central

    Mortlock, Douglas P.; Guenther, Catherine; Kingsley, David M.

    2003-01-01

    Regulatory sequences in higher genomes can map large distances from gene coding regions, and cannot yet be identified by simple inspection of primary DNA sequence information. Here we describe an efficient method of surveying large genomic regions for gene regulatory information, and subdividing complex sets of distant regulatory elements into smaller intervals for detailed study. The mouse Gdf6 gene is expressed in a number of distinct embryonic locations that are involved in the patterning of skeletal and soft tissues. To identify sequences responsible for Gdf6 regulation, we first isolated a series of overlapping bacterial artificial chromosomes (BACs) that extend varying distances upstream and downstream of the gene. A LacZ reporter cassette was integrated into the Gdf6 transcription unit of each BAC using homologous recombination in bacteria. Each modified BAC was injected into fertilized mouse eggs, and founder transgenic embryos were analyzed for LacZ expression mid-gestation. The overlapping segments defined by the BAC clones revealed five separate regulatory regions that drive LacZ expression in 11 distinct anatomical locations. To further localize sequences that control expression in developing skeletal joints, we created a series of BAC constructs with precise deletions across a putative joint-control region. This approach further narrowed the critical control region to an area containing several stretches of sequence that are highly conserved between mice and humans. A distant 2.9-kilobase fragment containing the highly conserved regions is able to direct very specific expression of a minimal promoter/LacZ reporter in proximal limb joints. These results demonstrate that even distant, complex regulatory sequences can be identified using a combination of BAC scanning, BAC deletion, and comparative sequencing approaches. PMID:12915490

  16. Phenotype Sequencing: Identifying the Genes That Cause a Phenotype Directly from Pooled Sequencing of Independent Mutants

    PubMed Central

    Harper, Marc A.; Chen, Zugen; Toy, Traci; Machado, Iara M. P.; Nelson, Stanley F.; Liao, James C.; Lee, Christopher J.

    2011-01-01

    Random mutagenesis and phenotype screening provide a powerful method for dissecting microbial functions, but their results can be laborious to analyze experimentally. Each mutant strain may contain 50–100 random mutations, necessitating extensive functional experiments to determine which one causes the selected phenotype. To solve this problem, we propose a “Phenotype Sequencing” approach in which genes causing the phenotype can be identified directly from sequencing of multiple independent mutants. We developed a new computational analysis method showing that 1. causal genes can be identified with high probability from even a modest number of mutant genomes; 2. costs can be cut many-fold compared with a conventional genome sequencing approach via an optimized strategy of library-pooling (multiple strains per library) and tag-pooling (multiple tagged libraries per sequencing lane). We have performed extensive validation experiments on a set of E. coli mutants with increased isobutanol biofuel tolerance. We generated a range of sequencing experiments varying from 3 to 32 mutant strains, with pooling on 1 to 3 sequencing lanes. Our statistical analysis of these data (4099 mutations from 32 mutant genomes) successfully identified 3 genes (acrB, marC, acrA) that have been independently validated as causing this experimental phenotype. It must be emphasized that our approach reduces mutant sequencing costs enormously. Whereas a conventional genome sequencing experiment would have cost $7,200 in reagents alone, our Phenotype Sequencing design yielded the same information value for only $1200. In fact, our smallest experiments reliably identified acrB and marC at a cost of only $110–$340. PMID:21364744

  17. Nucleotide sequence variation of the envelope protein gene identifies two distinct genotypes of yellow fever virus.

    PubMed Central

    Chang, G J; Cropp, B C; Kinney, R M; Trent, D W; Gubler, D J

    1995-01-01

    The evolution of yellow fever virus over 67 years was investigated by comparing the nucleotide sequences of the envelope (E) protein genes of 20 viruses isolated in Africa, the Caribbean, and South America. Uniformly weighted parsimony algorithm analysis defined two major evolutionary yellow fever virus lineages designated E genotypes I and II. E genotype I contained viruses isolated from East and Central Africa. E genotype II viruses were divided into two sublineages: IIA viruses from West Africa and IIB viruses from America, except for a 1979 virus isolated from Trinidad (TRINID79A). Unique signature patterns were identified at 111 nucleotide and 12 amino acid positions within the yellow fever virus E gene by signature pattern analysis. Yellow fever viruses from East and Central Africa contained unique signatures at 60 nucleotide and five amino acid positions, those from West Africa contained unique signatures at 25 nucleotide and two amino acid positions, and viruses from America contained such signatures at 30 nucleotide and five amino acid positions in the E gene. The dissemination of yellow fever viruses from Africa to the Americas is supported by the close genetic relatedness of genotype IIA and IIB viruses and genetic evidence of a possible second introduction of yellow fever virus from West Africa, as illustrated by the TRINID79A virus isolate. The E protein genes of American IIB yellow fever viruses had higher frequencies of amino acid substitutions than did genes of yellow fever viruses of genotypes I and IIA on the basis of comparisons with a consensus amino acid sequence for the yellow fever E gene. The great variation in the E proteins of American yellow fever virus probably results from positive selection imposed by virus interaction with different species of mosquitoes or nonhuman primates in the Americas. PMID:7637022

  18. Gene expression profiling of M. truncatula transcription factors identifies putative regulators of grain legume seed filling.

    PubMed

    Verdier, Jérôme; Kakar, Klementina; Gallardo, Karine; Le Signor, Christine; Aubert, Grégoire; Schlereth, Armin; Town, Christopher D; Udvardi, Michael K; Thompson, Richard D

    2008-08-01

    Legume seeds represent a major source of proteins for human and livestock diets. The model legume Medicago truncatula is characterized by a process of seed development very similar to that of other legumes, involving the interplay of sets of transcription factors (TFs). Here, we report the first expression profiling of over 700 M. truncatula genes encoding putative TFs throughout seven stages of seed development, obtained using real-time quantitative RT-PCR. A total of 169 TFs were selected which were expressed at late embryogenesis, seed filling or desiccation. The site of expression within the seed was examined for 41 highly expressed transcription factors out of the 169. To identify possible target genes for these TFs, the data were combined with a microarray-derived transcriptome dataset. This study identified 17 TFs preferentially expressed in individual seed tissues and 135 corresponding co-expressed genes, including possible targets. Certain of the TFs co-expressed with storage protein mRNAs correspond to those already known to regulate seed storage protein synthesis in Arabidopsis, whereas the timing of expression of others may be more specifically related to the delayed expression of the legumin-class storage proteins observed in legumes.

  19. DNA methylation map of mouse and human brain identifies target genes in Alzheimer’s disease

    PubMed Central

    Sanchez-Mut, Jose V.; Aso, Ester; Panayotis, Nicolas; Lott, Ira; Dierssen, Mara; Rabano, Alberto; Urdinguio, Rocio G.; Fernandez, Agustin F.; Astudillo, Aurora; Martin-Subero, Jose I.; Balint, Balazs; Fraga, Mario F.; Gomez, Antonio; Gurnot, Cecile; Roux, Jean-Christophe; Avila, Jesus; Hensch, Takao K.; Ferrer, Isidre

    2013-01-01

    The central nervous system has a pattern of gene expression that is closely regulated with respect to functional and anatomical regions. DNA methylation is a major regulator of transcriptional activity, and aberrations in the distribution of this epigenetic mark may be involved in many neurological disorders, such as Alzheimer’s disease. Herein, we have analysed 12 distinct mouse brain regions according to their CpG 5’-end gene methylation patterns and observed their unique epigenetic landscapes. The DNA methylomes obtained from the cerebral cortex were used to identify aberrant DNA methylation changes that occurred in two mouse models of Alzheimer’s disease. We were able to translate these findings to patients with Alzheimer’s disease, identifying DNA methylation-associated silencing of three targets genes: thromboxane A2 receptor (TBXA2R), sorbin and SH3 domain containing 3 (SORBS3) and spectrin beta 4 (SPTBN4). These hypermethylation targets indicate that the cyclic AMP response element-binding protein (CREB) activation pathway and the axon initial segment could contribute to the disease. PMID:24030951

  20. DNA methylation map of mouse and human brain identifies target genes in Alzheimer's disease.

    PubMed

    Sanchez-Mut, Jose V; Aso, Ester; Panayotis, Nicolas; Lott, Ira; Dierssen, Mara; Rabano, Alberto; Urdinguio, Rocio G; Fernandez, Agustin F; Astudillo, Aurora; Martin-Subero, Jose I; Balint, Balazs; Fraga, Mario F; Gomez, Antonio; Gurnot, Cecile; Roux, Jean-Christophe; Avila, Jesus; Hensch, Takao K; Ferrer, Isidre; Esteller, Manel

    2013-10-01

    The central nervous system has a pattern of gene expression that is closely regulated with respect to functional and anatomical regions. DNA methylation is a major regulator of transcriptional activity, and aberrations in the distribution of this epigenetic mark may be involved in many neurological disorders, such as Alzheimer's disease. Herein, we have analysed 12 distinct mouse brain regions according to their CpG 5'-end gene methylation patterns and observed their unique epigenetic landscapes. The DNA methylomes obtained from the cerebral cortex were used to identify aberrant DNA methylation changes that occurred in two mouse models of Alzheimer's disease. We were able to translate these findings to patients with Alzheimer's disease, identifying DNA methylation-associated silencing of three targets genes: thromboxane A2 receptor (TBXA2R), sorbin and SH3 domain containing 3 (SORBS3) and spectrin beta 4 (SPTBN4). These hypermethylation targets indicate that the cyclic AMP response element-binding protein (CREB) activation pathway and the axon initial segment could contribute to the disease.

  1. An ENU mutagenesis screen identifies novel and known genes involved in epigenetic processes in the mouse

    PubMed Central

    2013-01-01

    Background We have used a sensitized ENU mutagenesis screen to produce mouse lines that carry mutations in genes required for epigenetic regulation. We call these lines Modifiers of murine metastable epialleles (Mommes). Results We report a basic molecular and phenotypic characterization for twenty of the Momme mouse lines, and in each case we also identify the causative mutation. Three of the lines carry a mutation in a novel epigenetic modifier, Rearranged L-myc fusion (Rlf), and one gene, Rap-interacting factor 1 (Rif1), has not previously been reported to be involved in transcriptional regulation in mammals. Many of the other lines are novel alleles of known epigenetic regulators. For two genes, Rlf and Widely-interspaced zinc finger (Wiz), we describe the first mouse mutants. All of the Momme mutants show some degree of homozygous embryonic lethality, emphasizing the importance of epigenetic processes. The penetrance of lethality is incomplete in a number of cases. Similarly, abnormalities in phenotype seen in the heterozygous individuals of some lines occur with incomplete penetrance. Conclusions Recent advances in sequencing enhance the power of sensitized mutagenesis screens to identify the function of previously uncharacterized factors and to discover additional functions for previously characterized proteins. The observation of incomplete penetrance of phenotypes in these inbred mutant mice, at various stages of development, is of interest. Overall, the Momme collection of mouse mutants provides a valuable resource for researchers across many disciplines. PMID:24025402

  2. A genetic pedigree analysis to identify gene mutations involved in femoral head necrosis.

    PubMed

    Wang, Lin; Pan, Hehai; Zhu, Zhen-An

    2014-10-01

    The present study presents results from a linkage and mutation screening analysis aiming to identify the causative gene of femoral head necrosis, also known as osteonecrosis of femoral head (ONFH), in a Chinese pedigree. We collected clinical data on the osteonecrosis pedigree, and extracted blood and genomic DNA from the family members. Polymerase chain reaction (PCR) and direct sequencing allowed to identify a mutation in the COL2A1 gene of the proband; the clinical manifestations of the proband meet the criteria for osteonecrosis. The exons of COL2A1 were amplified by polymerase chain reaction and mutation screening was conducted by direct sequencing in all the family members. The locus was also sequenced in 50 unrelated healthy controls. The c.3665G>A heterozygous mutation was detected in patients of the pedigree, but not in healthy individuals. We conclude that a mutation in the COL2A1 gene is the causative agent of ONFH in this family. Therefore, this mutation may be associated with osteonecrosis in Chinese populations.

  3. A Genome-wide CRISPR Screen in Toxoplasma Identifies Essential Apicomplexan Genes.

    PubMed

    Sidik, Saima M; Huet, Diego; Ganesan, Suresh M; Huynh, My-Hang; Wang, Tim; Nasamu, Armiyaw S; Thiru, Prathapan; Saeij, Jeroen P J; Carruthers, Vern B; Niles, Jacquin C; Lourido, Sebastian

    2016-09-08

    Apicomplexan parasites are leading causes of human and livestock diseases such as malaria and toxoplasmosis, yet most of their genes remain uncharacterized. Here, we present the first genome-wide genetic screen of an apicomplexan. We adapted CRISPR/Cas9 to assess the contribution of each gene from the parasite Toxoplasma gondii during infection of human fibroblasts. Our analysis defines ∼200 previously uncharacterized, fitness-conferring genes unique to the phylum, from which 16 were investigated, revealing essential functions during infection of human cells. Secondary screens identify as an invasion factor the claudin-like apicomplexan microneme protein (CLAMP), which resembles mammalian tight-junction proteins and localizes to secretory organelles, making it critical to the initiation of infection. CLAMP is present throughout sequenced apicomplexan genomes and is essential during the asexual stages of the malaria parasite Plasmodium falciparum. These results provide broad-based functional information on T. gondii genes and will facilitate future approaches to expand the horizon of antiparasitic interventions.

  4. Mismatch repair genes identified using genetic screens in Blm-deficient embryonic stem cells.

    PubMed

    Guo, Ge; Wang, Wei; Bradley, Allan

    2004-06-24

    Phenotype-driven recessive genetic screens in diploid organisms require a strategy to render the mutation homozygous. Although homozygous mutant mice can be generated by breeding, a reliable method to make homozygous mutations in cultured cells has not been available, limiting recessive screens in culture. Cultured embryonic stem (ES) cells provide access to all of the genes required to elaborate the fundamental components and physiological systems of a mammalian cell. Here we have exploited the high rate of mitotic recombination in Bloom's syndrome protein (Blm)-deficient ES cells to generate a genome-wide library of homozygous mutant cells from heterozygous mutations induced with a revertible gene trap retrovirus. We have screened this library for cells with defects in DNA mismatch repair (MMR), a system that detects and repairs base-base mismatches. We demonstrate the recovery of cells with homozygous mutations in known and novel MMR genes. We identified Dnmt1(ref. 5) as a novel MMR gene and confirmed that Dnmt1-deficient ES cells exhibit micro-satellite instability, providing a mechanistic explanation for the role of Dnmt1 in cancer. The combination of insertional mutagenesis in Blm-deficient ES cells establishes a new approach for phenotype-based recessive genetic screens in ES cells.

  5. Exome sequencing identifies KIAA1377 and C5orf42 as susceptibility genes for monomelic amyotrophy.

    PubMed

    Lim, Young-Min; Koh, Insong; Park, Young-Mi; Kim, Jae-Jung; Kim, Dae-Seong; Kim, Hyo-Jin; Baik, Kyu-Heum; Choi, Hye-Yeon; Yang, Gap-Seok; Also-Rallo, Eva; Tizzano, Eduardo F; Gamez, Josep; Park, Kiejung; Yoo, Han-Wook; Lee, Jong-Keuk; Kim, Kwang-Kuk

    2012-05-01

    Precise topographic localization, predominance in males mostly of Asian origin, and existence of some familial cases suggest a genetic background for monomelic amyotrophy. To identify susceptibility genes for monomelic amyotrophy, we performed whole-exome sequencing of four unrelated patients with monomelic amyotrophy and detected a total of 45 novel nonsynonymous single-nucleotide polymorphisms as unique variants to monomelic amyotrophy compared to control exomes. Genetic association analysis showed significant association with monomelic amyotrophy in the Gly668Ser variant of the KIAA1377 gene (odds ratio=4.62, P-value=0.0040) and the Pro1794Leu variant of the C5orf42 gene (odds ratio=4.63, P-value=0.0040). Moreover, the combination of two variants increased the risk of monomelic amyotrophy (P=1.4×10(-5), OR=61.69, 95% confidence interval=9.62-394.94, in case of combination of two heterozygotes). These data suggest that KIAA1377 and C5orf42 synergistically play a role as susceptibility genes for monomelic amyotrophy.

  6. The spxB gene as a target to identify Lactobacillus casei group species in cheese.

    PubMed

    Savo Sardaro, Maria Luisa; Levante, Alessia; Bernini, Valentina; Gatti, Monica; Neviani, Erasmo; Lazzi, Camilla

    2016-10-01

    This study focused on the spxB gene, which encodes for pyruvate oxidase. The presence of spxB in the genome and its transcription could be a way to produce energy and allow bacterial growth during carbohydrate starvation. In addition, the activity of pyruvate oxidase, which produces hydrogen peroxide, could be a mechanism for interspecies competition. Because this gene seems to provide advantages for the encoding species for adaptation in complex ecosystems, we studied spxB in a large set of cheese isolates belonging to the Lactobacillus casei group. Through this study, we demonstrated that this gene is widely found in the genomes of members of the L. casei group and shows variability useful for taxonomic studies. In particular, the HRM analysis method allowed for a specific discrimination between Lactobacillus rhamnosus, Lactobacillus paracasei and L. casei. Regarding the coding region, the spxB functionality in cheese was shown for the first time by real-time PCR, and by exploiting the heterogeneity between the L. casei group species, we identified the bacterial communities encoding the spxB gene in this ecosystem. This study allowed for monitoring of the active bacterial community involved in different stages of ripening by following the POX pathway.

  7. Candidate gene analysis of BRCA1/2 mutation-negative high-risk Russian breast cancer patients.

    PubMed

    Sokolenko, Anna P; Preobrazhenskaya, Elena V; Aleksakhina, Svetlana N; Iyevleva, Aglaya G; Mitiushkina, Natalia V; Zaitseva, Olga A; Yatsuk, Olga S; Tiurin, Vladislav I; Strelkova, Tatiana N; Togo, Alexandr V; Imyanitov, Evgeny N

    2015-04-10

    Twenty one DNA repair genes were analyzed in a group of 95 BC patients, who displayed clinical features of hereditary disease predisposition but turned out to be negative for mutations in BRCA1 and BRCA2 entire coding region as well as for founder disease-predisposing alleles in CHEK2, NBN/NBS1 and ATM genes. Full-length sequencing of CHEK2 and NBN/NBS1 failed to identify non-founder mutations. The analysis of TP53 revealed a woman carrying the R282W allele; further testing of additional 108 BC patients characterized by a very young age at onset (35 years or earlier) detected one more carrier of the TP53 germ-line defect. In addition, this study confirmed non-random occurrence of PALB2 truncating mutations in Russian hereditary BC patients. None of the studied cases carried germ-line defects in recently discovered hereditary BC genes, BRIP1, FANCC, MRE11A and RAD51C. The analysis of genes with yet unproven BC-predisposing significance (BARD1, BRD7, CHEK1, DDB2, ERCC1, EXO1, FANCG, PARP1, PARP2, RAD51, RNF8, WRN) identified single women carrying a protein-truncating allele, WRN R1406X. DNA sequencing of another set of 95 hereditary BC cases failed to reveal additional WRN heterozygous genotypes. Since WRN is functionally similar to the known BC-predisposing gene, BLM, it deserves to be analyzed in future hereditary BC studies. Furthermore, this investigation revealed a number of rare missense germ-line variants, which are classified as probably protein-damaging by online in silico tools and therefore may require further consideration.

  8. IDENTIFYING GENES CONTROLLING FERULATE CROSS-LINKING FORMATION IN GRASS CELL WALLS

    SciTech Connect

    de O Buanafina, Marcia Maria

    2013-10-16

    DESCRIPTION/ABSTRACT This proposal focuses on cell wall feruloylation and our long term goal is to identify and isolate novel genes controlling feruloylation and to characterize the phenotype of mutants in this pathway, with a spotlight on cell wall properties. Currently, the genes underlying AX feruloylation have not been identified and the isolation of such genes could be of great importance in manipulating ferulates accretion to the wall. Mutation of the feruloyl transferase gene(s) should lead to less ferulates secreted to the cell wall and reduced ferulate cross-linking. Our current research is based on the hypothesis that controlling the level of total feruloylation will have a direct impact on the level of cross-linking and in turn impact biomass utility for forage and biofuel production. Our results/accomplishments for this project so far include: 1. Mutagenised Brachypodium population. We have developed EMS mutagenized populations of model grass species Brachypodium distachyon. EMS populations have been developed from over 28,000 mutagenized seeds generating 5,184 M2 families. A total of 20,793 plants have been screened and 1,233 were originally selected. 2. Selected Brachypodium mutants: Potential mutants on their levels of cell wall ferulates and cell wall AX ? have been selected from 708 M2 families. A total of 303 back-crosses to no-mutagenized parental stock have been done, followed by selfing selected genotypes in order to confirm heritability of traits and to remove extraneous mutations generated by EMS mutagenesis. We are currently growing 12 F5 and F6 populations in order to assess CW composition. If low level of ferulates are confirmed in the candidate lines selected the mutation could be altered in different in one or several kinds of genes such as genes encoding an AX feruloyl transferase; genes encoding the arabinosyl transferase; genes encoding the synthesis of the xylan backbone; genes encoding enzymes of the monolignol pathway affecting FA

  9. Cognitive change in aging: identifying gene-environment correlation and nonshared environment mechanisms.

    PubMed

    Deater-Deckard, Kirby; Mayr, Ulrich

    2005-03-01

    We describe gene-environment processes that may help account for individual differences in successful aging. Our emphasis is on successful aging in the cognitive domain, wherein individuals come to use a variety of strategies to cope with changes in cognitive capacities. We focus on the role of executive control in particular and define gene-environment correlation and nonshared environmental mechanisms. The quantitative genetic methods used to identify these mechanisms are described, with examples from research in childhood, where such studies are now common. Future work will be most effective if it is guided by life-span development frameworks that address these processes, such as the developmental genotype-->environment theory of Scarr and McCartney and the selection/optimization/compensation theory of Baltes and Baltes.

  10. Identifying Liver Cancer and Its Relations with Diseases, Drugs, and Genes: A Literature-Based Approach

    PubMed Central

    Song, Min

    2016-01-01

    In biomedicine, scientific literature is a valuable source for knowledge discovery. Mining knowledge from textual data has become an ever important task as the volume of scientific literature is growing unprecedentedly. In this paper, we propose a framework for examining a certain disease based on existing information provided by scientific literature. Disease-related entities that include diseases, drugs, and genes are systematically extracted and analyzed using a three-level network-based approach. A paper-entity network and an entity co-occurrence network (macro-level) are explored and used to construct six entity specific networks (meso-level). Important diseases, drugs, and genes as well as salient entity relations (micro-level) are identified from these networks. Results obtained from the literature-based literature mining can serve to assist clinical applications. PMID:27195695

  11. First case of L1CAM gene mutation identified in MASA syndrome in Asia.

    PubMed

    Kanemura, Yonehiro; Takuma, Yuuichi; Kamiguchi, Hiroyuki; Yamasaki, Mami

    2005-06-01

    We report here the first case of an L1CAM gene mutation identified in mental retardation, adducted thumbs, shuffling gait, and aphasia (MASA) syndrome in Japan. The patient was a 10-year-old boy with mild mental retardation, bilateral adducted thumbs and corpus callosum hypoplasia. His family had no history of MASA syndrome. The L1CAM gene contained a nonsense mutation (R1166X) in exon 26 in the cytoplasmic domain. No mutation was found in the extracellular and transmembrane domains of L1CAM. The abnormal development of axon tracts resulting in the corpus callosum hypoplasia and adducted thumbs appears to be caused by malfunction of the cytoplasmic domain of L1CAM.

  12. A profile-based method for identifying functional divergence of orthologous genes in bacterial genomes

    PubMed Central

    Wheeler, Nicole E.; Barquist, Lars; Kingsley, Robert A.; Gardner, Paul P.

    2016-01-01

    Motivation: Next generation sequencing technologies have provided us with a wealth of information on genetic variation, but predicting the functional significance of this variation is a difficult task. While many comparative genomics studies have focused on gene flux and large scale changes, relatively little attention has been paid to quantifying the effects of single nucleotide polymorphisms and indels on protein function, particularly in bacterial genomics. Results: We present a hidden Markov model based approach we call delta-bitscore (DBS) for identifying orthologous proteins that have diverged at the amino acid sequence level in a way that is likely to impact biological function. We benchmark this approach with several widely used datasets and apply it to a proof-of-concept study of orthologous proteomes in an investigation of host adaptation in Salmonella enterica. We highlight the value of the method in identifying functional divergence of genes, and suggest that this tool may be a better approach than the commonly used dN/dS metric for identifying functionally significant genetic changes occurring in recently diverged organisms. Availability and Implementation: A program implementing DBS for pairwise genome comparisons is freely available at: https://github.com/UCanCompBio/deltaBS. Contact: nicole.wheeler@pg.canterbury.ac.nz or lars.barquist@uni-wuerzburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27503221

  13. A modifier screen identifies DNAJB6 as a cardiomyopathy susceptibility gene

    PubMed Central

    Ding, Yonghe; Long, Pamela A.; Bos, J. Martijn; Shih, Yu-Huan; Ma, Xiao; Sundsbak, Rhianna S.; Chen, Jianhua; Zhao, Liqun; Hu, Xinyang; Wang, Jianan; Shi, Yongyong; Ackerman, Michael J.; Lin, Xueying; Ekker, Stephen C.; Redfield, Margaret M.; Olson, Timothy M.

    2016-01-01

    Mutagenesis screening is a powerful forward genetic approach that has been successfully applied in lower-model organisms to discover genetic factors for biological processes. This phenotype-based approach has yet to be established in vertebrates for probing major human diseases, largely because of the complexity of colony management. Herein, we report a rapid strategy for identifying genetic modifiers of cardiomyopathy (CM). Based on the application of doxorubicin stress to zebrafish insertional cardiac (ZIC) mutants, we identified 4 candidate CM-modifying genes, of which 3 have been linked previously to CM. The long isoform of DnaJ (Hsp40) homolog, subfamily B, member 6b (dnajb6b(L)) was identified as a CM susceptibility gene, supported by identification of rare variants in its human ortholog DNAJB6 from CM patients. Mechanistic studies indicated that the deleterious, loss-of-function modifying effects of dnajb6b(L) can be ameliorated by inhibition of ER stress. In contrast, overexpression of dnajb6(L) exerts cardioprotective effects on both fish and mouse CM models. Together, our findings establish a mutagenesis screening strategy that is scalable for systematic identification of genetic modifiers of CM, feasible to suggest therapeutic targets, and expandable to other major human diseases. PMID:27642634

  14. Gene expression signature based screening identifies ribonucleotide reductase as a candidate therapeutic target in Ewing sarcoma

    PubMed Central

    Goss, Kelli L.; Gordon, David J.

    2016-01-01

    There is a critical need in cancer therapeutics to identify targeted therapies that will improve outcomes and decrease toxicities compared to conventional, cytotoxic chemotherapy. Ewing sarcoma is a highly aggressive bone and soft tissue cancer that is caused by the EWS-FLI1 fusion protein. Although EWS-FLI1 is specific for cancer cells, and required for tumorigenesis, directly targeting this transcription factor has proven challenging. Consequently, targeting unique dependencies or key downstream mediators of EWS-FLI1 represent important alternative strategies. We used gene expression data derived from a genetically defined model of Ewing sarcoma to interrogate the Connectivity Map and identify a class of drugs, iron chelators, that downregulate a significant number of EWS-FLI1 target genes. We then identified ribonucleotide reductase M2 (RRM2), the iron-dependent subunit of ribonucleotide reductase (RNR), as one mediator of iron chelator toxicity in Ewing sarcoma cells. Inhibition of RNR in Ewing sarcoma cells caused apoptosis in vitro and attenuated tumor growth in an in vivo, xenograft model. Additionally, we discovered that the sensitivity of Ewing sarcoma cells to inhibition or suppression of RNR is mediated, in part, by high levels of SLFN11, a protein that sensitizes cells to DNA damage. This work demonstrates a unique dependency of Ewing sarcoma cells on RNR and supports further investigation of RNR inhibitors, which are currently used in clinical practice, as a novel approach for treating Ewing sarcoma. PMID:27557498

  15. Using Transcriptomics to Identify Differential Gene Expression in Response to Salinity among Australian Phragmites australis Clones

    PubMed Central

    Holmes, Gareth D.; Hall, Nathan E.; Gendall, Anthony R.; Boon, Paul I.; James, Elizabeth A.

    2016-01-01

    Common Reed (Phragmites australis) is a frequent component of inland and coastal wetlands in temperate zones worldwide. Ongoing environmental changes have resulted in the decline of this species in many areas and invasive expansion in others. In the Gippsland Lakes coastal waterway system in south-eastern Australia, increasing salinity is thought to have contributed to the loss of fringing P. australis reed beds leading to increased shoreline erosion. A major goal of restoration in this waterway is to address the effect of salinity by planting a genetically diverse range of salt-tolerant P. australis plants. This has prompted an interest in examining the variation in salinity tolerance among clones and the underlying basis of this variation. Transcriptomics is an approach for identifying variation in genes and their expression levels associated with the exposure of plants to environmental stressors. In this paper we present initial results of the first comparative culm transcriptome analysis of P. australis clones. After sampling plants from sites of varied surface water salinity across the Gippsland Lakes, replicates from three clones from highly saline sites (>18 g L-1 TDS) and three from low salinity sites (<6 g L-1) were grown in containers irrigated with either fresh (<0.1 g L-1) or saline water (16 g L-1). An RNA-Seq protocol was used to generate sequence data from culm tissues from the 12 samples allowing an analysis of differential gene expression. Among the key findings, we identified several genes uniquely up- or down-regulated in clones from highly saline sites when irrigated with saline water relative to clones from low salinity sites. These included the higher relative expression levels of genes associated with photosynthesis and lignan biosynthesis indicative of a greater ability of these clones to maintain growth under saline conditions. Combined with growth data from a parallel study, our data suggests local adaptation of certain clones to salinity

  16. Using Transcriptomics to Identify Differential Gene Expression in Response to Salinity among Australian Phragmites australis Clones.

    PubMed

    Holmes, Gareth D; Hall, Nathan E; Gendall, Anthony R; Boon, Paul I; James, Elizabeth A

    2016-01-01

    Common Reed (Phragmites australis) is a frequent component of inland and coastal wetlands in temperate zones worldwide. Ongoing environmental changes have resulted in the decline of this species in many areas and invasive expansion in others. In the Gippsland Lakes coastal waterway system in south-eastern Australia, increasing salinity is thought to have contributed to the loss of fringing P. australis reed beds leading to increased shoreline erosion. A major goal of restoration in this waterway is to address the effect of salinity by planting a genetically diverse range of salt-tolerant P. australis plants. This has prompted an interest in examining the variation in salinity tolerance among clones and the underlying basis of this variation. Transcriptomics is an approach for identifying variation in genes and their expression levels associated with the exposure of plants to environmental stressors. In this paper we present initial results of the first comparative culm transcriptome analysis of P. australis clones. After sampling plants from sites of varied surface water salinity across the Gippsland Lakes, replicates from three clones from highly saline sites (>18 g L(-1) TDS) and three from low salinity sites (<6 g L(-1)) were grown in containers irrigated with either fresh (<0.1 g L(-1)) or saline water (16 g L(-1)). An RNA-Seq protocol was used to generate sequence data from culm tissues from the 12 samples allowing an analysis of differential gene expression. Among the key findings, we identified several genes uniquely up- or down-regulated in clones from highly saline sites when irrigated with saline water relative to clones from low salinity sites. These included the higher relative expression levels of genes associated with photosynthesis and lignan biosynthesis indicative of a greater ability of these clones to maintain growth under saline conditions. Combined with growth data from a parallel study, our data suggests local adaptation of certain clones to

  17. Meta-analysis of transcriptome data identifies a novel 5-gene pancreatic adenocarcinoma classifier

    PubMed Central

    Bhasin, Manoj K.; Ndebele, Kenneth; Bucur, Octavian; Yee, Eric U.; Otu, Hasan H.; Plati, Jessica; Bullock, Andrea; Gu, Xuesong; Castan, Eduardo; Zhang, Peng; Najarian, Robert; Muraru, Maria S.

    2016-01-01

    Purpose Pancreatic ductal adenocarcinoma (PDAC) is largely incurable due to late diagnosis. Superior early detection biomarkers are critical to improving PDAC survival and risk stratification. Experimental Design Optimized meta-analysis of PDAC transcriptome datasets identified and validated key PDAC biomarkers. PDAC-specific expression of a 5-gene biomarker panel was measured by qRT-PCR in microdissected patient-derived FFPE tissues. Cell-based assays assessed impact of two of these biomarkers, TMPRSS4 and ECT2, on PDAC cells. Results A 5-gene PDAC classifier (TMPRSS4, AHNAK2, POSTN, ECT2, SERPINB5) achieved on average 95% sensitivity and 89% specificity in discriminating PDAC from non-tumor samples in four training sets and similar performance (sensitivity = 94%, specificity = 89.6%) in five independent validation datasets. This classifier accurately discriminated PDAC from chronic pancreatitis (AUC = 0.83), other cancers (AUC = 0.89), and non-tumor from PDAC precursors (AUC = 0.92) in three independent datasets. Importantly, the classifier distinguished PanIN from healthy pancreas in the PDX1-Cre;LSL-KrasG12D PDAC mouse model. Discriminatory expression of the PDAC classifier genes was confirmed in microdissected FFPE samples of PDAC and matched surrounding non-tumor pancreas or pancreatitis. Notably, knock-down of TMPRSS4 and ECT2 reduced PDAC soft agar growth and cell viability and TMPRSS4 knockdown also blocked PDAC migration and invasion. Conclusions This study identified and validated a highly accurate 5-gene PDAC classifier for discriminating PDAC and early precursor lesions from non-malignant tissue that may facilitate early diagnosis and risk stratification upon validation in prospective clinical trials. Cell-based experiments of two overexpressed proteins encoded by the panel, TMPRSS4 and ECT2, suggest a causal link to PDAC development and progression, confirming them as potential therapeutic targets. PMID:26993610

  18. The BridgeDb framework: standardized access to gene, protein and metabolite identifier mapping services

    PubMed Central

    2010-01-01

    Background Many complementary solutions are available for the identifier mapping problem. This creates an opportunity for bioinformatics tool developers. Tools can be made to flexibly support multiple mapping services or mapping services could be combined to get broader coverage. This approach requires an interface layer between tools and mapping services. Results Here we present BridgeDb, a software framework for gene, protein and metabolite identifier mapping. This framework provides a standardized interface layer through which bioinformatics tools can be connected to different identifier mapping services. This approach makes it easier for tool developers to support identifier mapping. Mapping services can be combined or merged to support multi-omics experiments or to integrate custom microarray annotations. BridgeDb provides its own ready-to-go mapping services, both in webservice and local database forms. However, the framework is intended for customization and adaptation to any identifier mapping service. BridgeDb has already been integrated into several bioinformatics applications. Conclusion By uncoupling bioinformatics tools from mapping services, BridgeDb improves capability and flexibility of those tools. All described software is open source and available at http://www.bridgedb.org. PMID:20047655

  19. Microarray Expression Data Identify DCC as a Candidate Gene for Early Meningioma Progression.

    PubMed

    Schulten, Hans-Juergen; Hussein, Deema; Al-Adwani, Fatima; Karim, Sajjad; Al-Maghrabi, Jaudah; Al-Sharif, Mona; Jamal, Awatif; Al-Ghamdi, Fahad; Baeesa, Saleh S; Bangash, Mohammed; Chaudhary, Adeel; Al-Qahtani, Mohammed

    2016-01-01

    Meningiomas are the most common primary brain tumors bearing in a minority of cases an aggressive phenotype. Although meningiomas are stratified according to their histology and clinical behavior, the underlying molecular genetics predicting aggressiveness are not thoroughly understood. We performed whole transcript expression profiling in 10 grade I and four grade II meningiomas, three of which invaded the brain. Microarray expression analysis identified deleted in colorectal cancer (DCC) as a differentially expressed gene (DEG) enabling us to cluster meningiomas into DCC low expression (3 grade I and 3 grade II tumors), DCC medium expression (2 grade I and 1 grade II tumors), and DCC high expression (5 grade I tumors) groups. Comparison between the DCC low expression and DCC high expression groups resulted in 416 DEGs (p-value<0.05; fold change>2). The most significantly downregulated genes in the DCC low expression group comprised DCC, phosphodiesterase 1C (PDE1C), calmodulin-dependent 70kDa olfactomedin 2 (OLFM2), glutathione S-transferase mu 5 (GSTM5), phosphotyrosine interaction domain containing 1 (PID1), sema domain, transmembrane domain (TM) and cytoplasmic domain, (semaphorin) 6D (SEMA6D), and indolethylamine N-methyltransferase (INMT). The most significantly upregulated genes comprised chromosome 5 open reading frame 63 (C5orf63), homeodomain interacting protein kinase 2 (HIPK2), and basic helix-loop-helix family, member e40 (BHLHE40). Biofunctional analysis identified as predicted top upstream regulators beta-estradiol, TGFB1, Tgf beta complex, LY294002, and dexamethasone and as predicted top regulator effectors NFkB, PIK3R1, and CREBBP. The microarray expression data served also for a comparison between meningiomas from female and male patients and for a comparison between brain invasive and non-invasive meningiomas resulting in a number of significant DEGs and related biofunctions. In conclusion, based on its expression levels, DCC may constitute a

  20. A genome-wide survey of CD4+ lymphocyte regulatory genetic variants identifies novel asthma genes

    PubMed Central

    Sharma, Sunita; Zhou, Xiaobo; Thibault, Derek M.; Himes, Blanca E.; Liu, Andy; Szefler, Stanley J.; Strunk, Robert; Castro, Mario; Hansel, Nadia N.; Diette, Gregory B.; Vonakis, Becky M.; Adkinson, N. Franklin; Avila, Lydiana; Soto-Quiros, Manuel; Barraza-Villareal, Albino; Lemanske, Robert F.; Solway, Julian; Krishnan, Jerry; White, Steven R.; Cheadle, Chris; Berger, Alan E.; Fan, Jinshui; Boorgula, Meher Preethi; Nicolae, Dan; Gilliland, Frank; Barnes, Kathleen; London, Stephanie J.; Martinez, Fernando; Ober, Carole; Celedón, Juan C.; Carey, Vincent J.; Weiss, Scott T.; Raby, Benjamin A.

    2014-01-01

    Background Genome-wide association studies have yet to identify the majority of genetic variants involved in asthma. We hypothesized that expression quantitative trait locus (eQTL) mapping can identify novel asthma genes by enabling prioritization of putative functional variants for association testing. Objective We evaluated 6,706 cis-acting expression-associated variants (eSNP) identified through a genome-wide eQTL survey of CD4+ lymphocytes for association with asthma. Methods eSNP were tested for association with asthma in 359 asthma cases and 846 controls from the Childhood Asthma Management Program, with verification using family-based testing. Significant associations were tested for replication in 579 parent-child trios with asthma from Costa Rica. Further functional validation was performed by Formaldehyde Assisted Isolation of Regulatory Elements (FAIRE)-qPCR and Chromatin-Immunoprecipitation (ChIP)-PCR in lung derived epithelial cell lines (Beas-2B and A549) and Jurkat cells, a leukemia cell line derived from T lymphocytes. Results Cis-acting eSNP demonstrated associations with asthma in both cohorts. We confirmed the previously-reported association of ORMDL3/GSDMB variants with asthma (combined p=2.9 × 108). Reproducible associations were also observed for eSNP in three additional genes: FADS2 (p=0.002), NAGA (p=0.0002), and F13A1 (p=0.0001). We subsequently demonstrated that FADS2 mRNA is increased in CD4+ lymphocytes in asthmatics, and that the associated eSNPs reside within DNA segments with histone modifications that denote open chromatin status and confer enhancer activity. Conclusions Our results demonstrate the utility of eQTL mapping in the identification of novel asthma genes, and provide evidence for the importance of FADS2, NAGA, and F13A1 in the pathogenesis of asthma. PMID:24934276

  1. Microarray Expression Data Identify DCC as a Candidate Gene for Early Meningioma Progression

    PubMed Central

    Schulten, Hans-Juergen; Hussein, Deema; Al-Adwani, Fatima; Karim, Sajjad; Al-Maghrabi, Jaudah; Al-Sharif, Mona; Jamal, Awatif; Al-Ghamdi, Fahad; Baeesa, Saleh S.; Bangash, Mohammed; Chaudhary, Adeel; Al-Qahtani, Mohammed

    2016-01-01

    Meningiomas are the most common primary brain tumors bearing in a minority of cases an aggressive phenotype. Although meningiomas are stratified according to their histology and clinical behavior, the underlying molecular genetics predicting aggressiveness are not thoroughly understood. We performed whole transcript expression profiling in 10 grade I and four grade II meningiomas, three of which invaded the brain. Microarray expression analysis identified deleted in colorectal cancer (DCC) as a differentially expressed gene (DEG) enabling us to cluster meningiomas into DCC low expression (3 grade I and 3 grade II tumors), DCC medium expression (2 grade I and 1 grade II tumors), and DCC high expression (5 grade I tumors) groups. Comparison between the DCC low expression and DCC high expression groups resulted in 416 DEGs (p-value < 0.05; fold change > 2). The most significantly downregulated genes in the DCC low expression group comprised DCC, phosphodiesterase 1C (PDE1C), calmodulin-dependent 70kDa olfactomedin 2 (OLFM2), glutathione S-transferase mu 5 (GSTM5), phosphotyrosine interaction domain containing 1 (PID1), sema domain, transmembrane domain (TM) and cytoplasmic domain, (semaphorin) 6D (SEMA6D), and indolethylamine N-methyltransferase (INMT). The most significantly upregulated genes comprised chromosome 5 open reading frame 63 (C5orf63), homeodomain interacting protein kinase 2 (HIPK2), and basic helix-loop-helix family, member e40 (BHLHE40). Biofunctional analysis identified as predicted top upstream regulators beta-estradiol, TGFB1, Tgf beta complex, LY294002, and dexamethasone and as predicted top regulator effectors NFkB, PIK3R1, and CREBBP. The microarray expression data served also for a comparison between meningiomas from female and male patients and for a comparison between brain invasive and non-invasive meningiomas resulting in a number of significant DEGs and related biofunctions. In conclusion, based on its expression levels, DCC may constitute

  2. Oil palm (Elaeis guineensis Jacq.) tissue culture ESTs: Identifying genes associated with callogenesis and embryogenesis

    PubMed Central

    Low, Eng-Ti L; Alias, Halimah; Boon, Soo-Heong; Shariff, Elyana M; Tan, Chi-Yee A; Ooi, Leslie CL; Cheah, Suan-Choo; Raha, Abdul-Rahim; Wan, Kiew-Lian; Singh, Rajinder

    2008-01-01

    Background Oil palm (Elaeis guineensis Jacq.) is one of the most important oil bearing crops in the world. However, genetic improvement of oil palm through conventional breeding is extremely slow and costly, as the breeding cycle can take up to 10 years. This has brought about interest in vegetative propagation of oil palm. Since the introduction of oil palm tissue culture in the 1970s, clonal propagation has proven to be useful, not only in producing uniform planting materials, but also in the development of the genetic engineering programme. Despite considerable progress in improving the tissue culture techniques, the callusing and embryogenesis rates from proliferating callus cultures remain very low. Thus, understanding the gene diversity and expression profiles in oil palm tissue culture is critical in increasing the efficiency of these processes. Results A total of 12 standard cDNA libraries, representing three main developmental stages in oil palm tissue culture, were generated in this study. Random sequencing of clones from these cDNA libraries generated 17,599 expressed sequence tags (ESTs). The ESTs were analysed, annotated and assembled to generate 9,584 putative unigenes distributed in 3,268 consensi and 6,316 singletons. These unigenes were assigned putative functions based on similarity and gene ontology annotations. Cluster analysis, which surveyed the relatedness of each library based on the abundance of ESTs in each consensus, revealed that lipid transfer proteins were highly expressed in embryogenic tissues. A glutathione S-transferase was found to be highly expressed in non-embryogenic callus. Further analysis of the unigenes identified 648 non-redundant simple sequence repeats and 211 putative full-length open reading frames. Conclusion This study has provided an overview of genes expressed during oil palm tissue culture. Candidate genes with expression that are modulated during tissue culture were identified. However, in order to confirm

  3. Transcriptome Analysis of Syringa oblata Lindl. Inflorescence Identifies Genes Associated with Pigment Biosynthesis and Scent Metabolism

    PubMed Central

    Zheng, Jian; Hu, Zenghui; Guan, Xuelian; Dou, Dequan; Bai, Guo; Wang, Yu; Guo, Yingtian; Li, Wei; Leng, Pingsheng

    2015-01-01

    Syringa oblata Lindl. is a woody ornamental plant with high economic value and characteristics that include early flowering, multiple flower colors, and strong fragrance. Despite a long history of cultivation, the genetics and molecular biology of S. oblata are poorly understood. Transcriptome and expression profiling data are needed to identify genes and to better understand the biological mechanisms of floral pigments and scents in this species. Nine cDNA libraries were obtained from three replicates of three developmental stages: inflorescence with enlarged flower buds not protruded, inflorescence with corolla lobes not displayed, and inflorescence with flowers fully opened and emitting strong fragrance. Using the Illumina RNA-Seq technique, 319,425,972 clean reads were obtained and were assembled into 104,691 final unigenes (average length of 853 bp), 41.75% of which were annotated in the NCBI non-redundant protein database. Among the annotated unigenes, 36,967 were assigned to gene ontology categories and 19,956 were assigned to eukaryoticorthologous groups. Using the Kyoto Encyclopedia of Genes and Genomes pathway database, 12,388 unigenes were sorted into 286 pathways. Based on these transcriptomic data, we obtained a large number of candidate genes that were differentially expressed at different flower stages and that were related to floral pigment biosynthesis and fragrance metabolism. This comprehensive transcriptomic analysis provides fundamental information on the genes and pathways involved in flower secondary metabolism and development in S. oblata, providing a useful database for further research on S. oblata and other plants of genus Syringa. PMID:26587670

  4. Genome-Wide Association Study to Identify Genes Related to Renal Mercury Concentrations in Mice

    PubMed Central

    Alkaissi, Hammoudi; Ekstrand, Jimmy; Jawad, Aksa; Nielsen, Jesper Bo; Havarinasab, Said; Soderkvist, Peter; Hultman, Per

    2016-01-01

    , Hultman P. 2016. Genome-wide association study to identify genes related to renal mercury concentrations in mice. Environ Health Perspect 124:920–926; http://dx.doi.org/10.1289/ehp.1409284 PMID:26942574

  5. Onto-CC: a web server for identifying Gene Ontology conceptual clusters

    PubMed Central

    Romero-Zaliz, R.; del Val, C.; Cobb, J. P.; Zwir, I.

    2008-01-01

    The Gene Ontology (GO) vocabulary has been extensively explored to analyze the functions of coexpressed genes. However, despite its extended use in Biology and Medical Sciences, there are still high levels of uncertainty about which ontology (i.e. Molecular Process, Cellular Component or Molecular Function) should be used, and at which level of specificity. Moreover, the GO database can contain incomplete information resulting from human annotations, or highly influenced by the available knowledge about a specific branch in an ontology. In spite of these drawbacks, there is a trend to ignore these problems and even use GO terms to conduct searches of gene expression profiles (i.e. expression + GO) instead of more cautious approaches that just consider them as an independent source of validation (i.e. expression versus GO). Consequently, propagating the uncertainty and producing biased analysis of the required gene grouping hypotheses. We proposed a web tool, Onto-CC, as an automatic method specially suited for independent explanation/validation of gene grouping hypotheses (e.g. coexpressed genes) based on GO clusters (i.e. expression versus GO). Onto-CC approach reduces the uncertainty of the queries by identifying optimal conceptual clusters that combine terms from different ontologies simultaneously, as well as terms defined at different levels of specificity in the GO hierarchy. To do so, we implemented the EMO-CC methodology to find clusters in structural databases [GO Directed acyclic Graph (DAG) tree], inspired on Conceptual Clustering algorithms. This approach allows the management of optimal cluster sets as potential parallel hypotheses, guided by multiobjective/multimodal optimization techniques. Therefore, we can generate alternative and, still, optimal explanations of queries that can provide new insights for a given problem. Onto-CC has been successfully used to test different medical and biological hypotheses including the explanation and prediction of

  6. Identifying key stage-specific genes and transcription factors for gastric cancer based on RNA-sequencing data

    PubMed Central

    Wang, Yan

    2017-01-01

    Abstract Background: To identify gastric cancer (GC)-associated genes and transcription factors (TFs) using RNA-sequencing (RNA-seq) data of Asians. Materials and methods: The RNA-seq data (GSE36968) were downloaded from Gene Expression Omnibus database, including 6 noncancerous gastric tissue samples, 5 stage I GC samples, 5 stage II GC samples, 8 stage III GC samples, and 6 stage IV GC samples. The gene expression values in each sample were calculated using Cuffdiff. Following, stage-specific genes were identified by 1-way analysis of variance and hierarchical clustering analysis. Upstream TFs were identified using Seqpos. Besides, functional enrichment analysis of stage-specific genes was performed by DAVID. In addition, the underlying protein–protein interactions (PPIs) information among stage IV-specific genes were extracted from STRING database and PPI network was constructed using Cytoscape software. Results: A total of 3576 stage-specific genes were identified, including 813 specifically up-regulated genes in the normal gastric tissues, 2224 stage I and II-specific genes, and 539 stage IV-specific genes. Also, a total of 9 and 11 up-regulated TFs were identified for the stage I and II-specific genes and stage IV-specific genes, respectively. Functional enrichment showed SPARC, MMP17, and COL6A3 were related to extracellular matrix. Notably, 2 regulatory pathways HOXA4-GLI3-RUNX2-FGF2 and HMGA2-PRKCA were obtained from the PPI network for stage IV-specific genes. In the PPI network, TFs HOXA4 and HMGA2 might function via mediating other genes. Conclusion: These stage-specific genes and TFs might act in the pathogenesis of GC in Asians. PMID:28121923

  7. Expression profiling of Crambe abyssinica under arsenate stress identifies genes and gene networks involved in arsenic metabolism and detoxification

    PubMed Central

    2010-01-01

    Background Arsenic contamination is widespread throughout the world and this toxic metalloid is known to cause cancers of organs such as liver, kidney, skin, and lung in human. In spite of a recent surge in arsenic related studies, we are still far from a comprehensive understanding of arsenic uptake, detoxification, and sequestration in plants. Crambe abyssinica, commonly known as 'abyssinian mustard', is a non-food, high biomass oil seed crop that is naturally tolerant to heavy metals. Moreover, it accumulates significantly higher levels of arsenic as compared to other species of the Brassicaceae family. Thus, C. abyssinica has great potential to be utilized as an ideal inedible crop for phytoremediation of heavy metals and metalloids. However, the mechanism of arsenic metabolism in higher plants, including C. abyssinica, remains elusive. Results To identify the differentially expressed transcripts and the pathways involved in arsenic metabolism and detoxification, C. abyssinica plants were subjected to arsenate stress and a PCR-Select Suppression Subtraction Hybridization (SSH) approach was employed. A total of 105 differentially expressed subtracted cDNAs were sequenced which were found to represent 38 genes. Those genes encode proteins functioning as antioxidants, metal transporters, reductases, enzymes involved in the protein degradation pathway, and several novel uncharacterized proteins. The transcripts corresponding to the subtracted cDNAs showed strong upregulation by arsenate stress as confirmed by the semi-quantitative RT-PCR. Conclusions Our study revealed novel insights into the plant defense mechanisms and the regulation of genes and gene networks in response to arsenate toxicity. The differential expression of transcripts encoding glutathione-S-transferases, antioxidants, sulfur metabolism, heat-shock proteins, metal transporters, and enzymes in the ubiquitination pathway of protein degradation as well as several unknown novel proteins serve as

  8. Controllability analysis of the directed human protein interaction network identifies disease genes and drug targets

    PubMed Central

    Vinayagam, Arunachalam; Gibson, Travis E.; Lee, Ho-Joon; Yilmazel, Bahar; Roesel, Charles; Hu, Yanhui; Kwon, Young; Sharma, Amitabh; Liu, Yang-Yu; Perrimon, Norbert; Barabási, Albert-László

    2016-01-01

    The protein–protein interaction (PPI) network is crucial for cellular information processing and decision-making. With suitable inputs, PPI networks drive the cells to diverse functional outcomes such as cell proliferation or cell death. Here, we characterize the structural controllability of a large directed human PPI network comprising 6,339 proteins and 34,813 interactions. This network allows us to classify proteins as “indispensable,” “neutral,” or “dispensable,” which correlates to increasing, no effect, or decreasing the number of driver nodes in the network upon removal of that protein. We find that 21% of the proteins in the PPI network are indispensable. Interestingly, these indispensable proteins are the primary targets of disease-causing mutations, human viruses, and drugs, suggesting that altering a network’s control property is critical for the transition between healthy and disease states. Furthermore, analyzing copy number alterations data from 1,547 cancer patients reveals that 56 genes that are frequently amplified or deleted in nine different cancers are indispensable. Among the 56 genes, 46 of them have not been previously associated with cancer. This suggests that controllability analysis is very useful in identifying novel disease genes and potential drug targets. PMID:27091990

  9. Gene expression profiling in Entamoeba histolytica identifies key components in iron uptake and metabolism.

    PubMed

    Hernández-Cuevas, Nora Adriana; Weber, Christian; Hon, Chung-Chau; Guillen, Nancy

    2014-01-01

    Entamoeba histolytica is an ameboid parasite that causes colonic dysentery and liver abscesses in humans. The parasite encounters dramatic changes in iron concentration during its invasion of the host, with relatively low levels in the intestinal lumen and then relatively high levels in the blood and liver. The liver notably contains sources of iron; therefore, the parasite's ability to use these sources might be relevant to its survival in the liver and thus the pathogenesis of liver abscesses. The objective of the present study was to identify factors involved in iron uptake, use and storage in E. histolytica. We compared the respective transcriptomes of E. histolytica trophozoites grown in normal medium (containing around 169 µM iron), low-iron medium (around 123 µM iron), iron-deficient medium (around 91 µM iron), and iron-deficient medium replenished with hemoglobin. The differentially expressed genes included those coding for the ATP-binding cassette transporters and major facilitator transporters (which share homology with bacterial siderophores and heme transporters) and genes involved in heme biosynthesis and degradation. Iron deficiency was associated with increased transcription of genes encoding a subset of cell signaling molecules, some of which have previously been linked to adaptation to the intestinal environment and virulence. The present study is the first to have assessed the transcriptome of E. histolytica grown under various iron concentrations. Our results provide insights into the pathways involved in iron uptake and metabolism in this parasite.

  10. Probability fold change: a robust computational approach for identifying differentially expressed gene lists.

    PubMed

    Deng, Xutao; Xu, Jun; Hui, James; Wang, Charles

    2009-02-01

    Identifying genes that are differentially expressed under different experimental conditions is a fundamental task in microarray studies. However, different ranking methods generate very different gene lists, and this could profoundly impact follow-up analyses and biological interpretation. Therefore, developing improved ranking methods are critical in microarray data analysis. We developed a new algorithm, the probabilistic fold change (PFC), which ranks genes based on a confidence interval estimate of fold change. We performed extensive testing using multiple benchmark data sources including the MicroArray Quality Control (MAQC) data sets. We corroborated our observations with MAQC data sets using qRT-PCR data sets and Latin square spike-in data sets. Along with PFC, we tested six other popular ranking algorithms including Mean Fold Change (FC), SAM, t-statistic (T), Bayesian-t (BAYT), Intensity-Conditional Fold Change (CFC), and Rank Product (RP). PFC achieved reproducibility and accuracy that are consistently among the best of the seven ranking algorithms while other ranking algorithms would show weakness in some cases. Contrary to common belief, our results demonstrated that statistical accuracy will not translate to biological reproducibility and therefore both quality aspects need to be evaluated.

  11. Non-ribosomal peptide synthetases: Identifying the cryptic gene clusters and decoding the natural product.

    PubMed

    Singh, Mangal; Chaudhary, Sandeep; Sareen, Dipti

    2017-03-01

    Non-ribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs) present in bacteria and fungi are the major multi-modular enzyme complexes which synthesize secondary metabolites like the pharmacologically important antibiotics and siderophores. Each of the multiple modules of an NRPS activates a different amino or aryl acid, followed by their condensation to synthesize a linear or cyclic natural product. The studies on NRPS domains, the knowledge of their gene cluster architecture and tailoring enzymes have helped in the in silico genetic screening of the ever-expanding sequenced microbial genomic data for the identification of novel NRPS/PKS clusters and thus deciphering novel non-ribosomal peptides (NRPs). Adenylation domain is an integral part of the NRPSs and is the substrate selecting unit for the final assembled NRP. In some cases, it also requires a small protein, the MbtH homolog, for its optimum activity. The presence of putative adenylation domain and MbtH homologs in a sequenced genome can help identify the novel secondary metabolite producers. The role of the adenylation domain in the NRPS gene clusters and its characterization as a tool for the discovery of novel cryptic NRPS gene clusters are discussed.

  12. Identifying driving gene clusters in complex diseases through critical transition theory

    NASA Astrophysics Data System (ADS)

    Wolanyk, Nathaniel; Wang, Xujing; Hessner, Martin; Gao, Shouguo; Chen, Ye; Jia, Shuang

    A novel approach of looking at the human body using critical transition theory has yielded positive results: clusters of genes that act in tandem to drive complex disease progression. This cluster of genes can be thought of as the first part of a large genetic force that pushes the body from a curable, but sick, point to an incurable diseased point through a catastrophic bifurcation. The data analyzed is time course microarray blood assay data of 7 high risk individuals for Type 1 Diabetes who progressed into a clinical onset, with an additional larger study requested to be presented at the conference. The normalized data is 25,000 genes strong, which were narrowed down based on statistical metrics, and finally a machine learning algorithm using critical transition metrics found the driving network. This approach was created to be repeatable across multiple complex diseases with only progression time course data needed so that it would be applicable to identifying when an individual is at risk of developing a complex disease. Thusly, preventative measures can be enacted, and in the longer term, offers a possible solution to prevent all Type 1 Diabetes.

  13. Exploiting Genomics Resources to Identify Candidate Genes Underlying Antioxidants Content in Tomato Fruit

    PubMed Central

    Calafiore, Roberta; Ruggieri, Valentino; Raiola, Assunta; Rigano, Maria M.; Sacco, Adriana; Hassan, Mohamed I.; Frusciante, Luigi; Barone, Amalia

    2016-01-01

    The tomato is a model species for fleshy fruit development and ripening, as well as for genomics studies of others Solanaceae. Many genetic and genomics resources, including databases for sequencing, transcriptomics and metabolomics data, have been developed and are today available. The purpose of the present work was to uncover new genes and/or alleles that determine ascorbic acid and carotenoids accumulation, by exploiting one Solanum pennellii introgression lines (IL7-3) harboring quantitative trait loci (QTL) that increase the content of these metabolites in the fruit. The higher ascorbic acid and carotenoids content in IL7-3 was confirmed at three fruit developmental stages. The tomato genome reference sequence and the recently released S. pennellii genome sequence were investigated to identify candidate genes (CGs) that might control ascorbic acid and carotenoids accumulation. First of all, a refinement of the wild region borders in the IL7-3 was achieved by analyzing CAPS markers designed in our laboratory. Afterward, six CGs associated to ascorbic acid and one with carotenoids metabolism were identified exploring the annotation and the Gene Ontology terms of genes included in the region. Variants between the sequence of the wild and the cultivated alleles of these genes were investigated for their functional relevance and their potential effects on the protein sequences were predicted. Transcriptional levels of CGs in the introgression region were extracted from RNA-Seq data available for the entire S. pennellii introgression lines collection and verified by Real-Time qPCR. Finally, seven IL7-3 sub-lines were genotyped using 28 species-specific markers and then were evaluated for metabolites content. These analyses evidenced a significant decrease in transcript abundance for one 9-cis-epoxycarotenoid dioxygenase and one L-ascorbate oxidase homolog, whose role in the accumulation of carotenoids and ascorbic acid is discussed. Comprehensively, the reported

  14. Large-Scale Evaluation of Candidate Genes Identifies Associations between VEGF Polymorphisms and Bladder Cancer Risk

    PubMed Central

    García-Closas, Montserrat; Malats, Núria; Real, Francisco X; Yeager, Meredith; Welch, Robert; Silverman, Debra; Kogevinas, Manolis; Dosemeci, Mustafa; Figueroa, Jonine; Chatterjee, Nilanjan; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Murta-Nascimento, Cristiane; Rothman, Nathaniel; Chanock, Stephen J

    2007-01-01

    Common genetic variation could alter the risk for developing bladder cancer. We conducted a large-scale evaluation of single nucleotide polymorphisms (SNPs) in candidate genes for cancer to identify common variants that influence bladder cancer risk. An Illumina GoldenGate assay was used to genotype 1,433 SNPs within or near 386 genes in 1,086 cases and 1,033 controls in Spain. The most significant finding was in the 5′ UTR of VEGF (rs25648, p for likelihood ratio test, 2 degrees of freedom = 1 × 10−5). To further investigate the region, we analyzed 29 additional SNPs in VEGF, selected to saturate the promoter and 5′ UTR and to tag common genetic variation in this gene. Three additional SNPs in the promoter region (rs833052, rs1109324, and rs1547651) were associated with increased risk for bladder cancer: odds ratio (95% confidence interval): 2.52 (1.06–5.97), 2.74 (1.26–5.98), and 3.02 (1.36–6.63), respectively; and a polymorphism in intron 2 (rs3024994) was associated with reduced risk: 0.65 (0.46–0.91). Two of the promoter SNPs and the intron 2 SNP showed linkage disequilibrium with rs25648. Haplotype analyses revealed three blocks of linkage disequilibrium with significant associations for two blocks including the promoter and 5′ UTR (global p = 0.02 and 0.009, respectively). These findings are biologically plausible since VEGF is critical in angiogenesis, which is important for tumor growth, its elevated expression in bladder tumors correlates with tumor progression, and specific 5′ UTR haplotypes have been shown to influence promoter activity. Associations between bladder cancer risk and other genes in this report were not robust based on false discovery rate calculations. In conclusion, this large-scale evaluation of candidate cancer genes has identified common genetic variants in the regulatory regions of VEGF that could be associated with bladder cancer risk. PMID:17319747

  15. Cross-Species Functional Genomic Analysis Identifies Resistance Genes of the Histone Deacetylase Inhibitor Valproic Acid

    PubMed Central

    Forthun, Rakel Brendsdal; SenGupta, Tanima; Skjeldam, Hanne Kim; Lindvall, Jessica Margareta; McCormack, Emmet; Gjertsen, Bjørn Tore; Nilsen, Hilde

    2012-01-01

    The mechanisms of successful epigenetic reprogramming in cancer are not well characterized as they involve coordinated removal of repressive marks and deposition of activating marks by a large number of histone and DNA modification enzymes. Here, we have used a cross-species functional genomic approach to identify conserved genetic interactions to improve therapeutic effect of the histone deacetylase inhibitor (HDACi) valproic acid, which increases survival in more than 20% of patients with advanced acute myeloid leukemia (AML). Using a bidirectional synthetic lethality screen revealing genes that increased or decreased VPA sensitivity in C. elegans, we identified novel conserved sensitizers and synthetic lethal interactors of VPA. One sensitizer identified as a conserved determinant of therapeutic success of HDACi was UTX (KDM6A), which demonstrates a functional relationship between protein acetylation and lysine-specific methylation. The synthetic lethal screen identified resistance programs that compensated for the HDACi-induced global hyper-acetylation, and confirmed MAPKAPK2, HSP90AA1, HSP90AB1 and ACTB as conserved hubs in a resistance program for HDACi that are drugable in human AML cell lines. Hence, these resistance hubs represent promising novel targets for refinement of combinatorial epigenetic anti-cancer therapy. PMID:23155442

  16. Expression Quantitative Trait Loci Analysis Identifies Associations Between Genotype and Gene Expression in Human Intestine

    PubMed Central

    KABAKCHIEV, BOYKO; SILVERBERG, MARK S.

    2013-01-01

    BACKGROUND & AIMS Genome-wide association studies have greatly increased our understanding of intestinal disease. However, little is known about how genetic variations result in phenotypic changes. Some polymorphisms have been shown to modulate quantifiable phenotypic traits; these are called quantitative trait loci. Quantitative trait loci that affect levels of gene expression are called expression quantitative trait loci (eQTL), which can provide insight into the biological relevance of data from genome-wide association studies. We performed a comprehensive eQTL scan of intestinal tissue. METHODS Total RNA was extracted from ileal biopsy specimens and genomic DNA was obtained from whole-blood samples from the same cohort of individuals. Cis- and trans-eQTL analyses were performed using a custom software pipeline for samples from 173 subjects. The analyses determined the expression levels of 19,047 unique autosomal genes listed in the US National Center for Biotechnology Information database and more than 580,000 variants from the Single Nucleotide Polymorphism database. RESULTS The presence of more than 15,000 cis- and trans-eQTL was detected with statistical significance. eQTL associated with the same expression trait were in high linkage disequilibrium. Comparative analysis with previous eQTL studies showed that 30% to 40% of genes identified as eQTL in monocytes, liver tissue, lymphoblastoid cell lines, T cells, and fibroblasts are also eQTL in ileal tissue. Conversely, most of the significant eQTL have not been previously identified and could be tissue specific. These are involved in many cell functions, including division and antigen processing and presentation. Our analysis confirmed that previously published cis-eQTL are single nucleotide polymorphisms associated with inflammatory bowel disease: rs2298428/UBE2L3, rs1050152/SLC22A4, and SLC22A5. We identified many new associations between inflammatory bowel disease susceptibility loci and gene expression

  17. Identifying genes associated with a quantitative trait or quantitative trait locus via selective transcriptional profiling.

    PubMed

    Wang, Dong; Nettleton, Dan

    2006-06-01

    Genetical genomics is an approach that blends the mapping of quantitative trait loci (QTL) with microarray analysis. The approach can be used to identify associations between the allelic state of a genomic region and a gene's transcript abundance. However, the large number of microarrays required for adequate power results in high material and labor costs that prevent wide adoption of the genetical genomics strategy outside of some well-funded laboratories. We present a method called selective transcriptional profiling that involves selecting an optimal subset of individuals to microarray from a larger set of individuals for which relatively inexpensive quantitative trait and molecular marker data are available. We show how to use microarray data from the selected individuals, along with the trait and marker data from all individuals, to identify genes whose transcript abundance is associated with a quantitative trait of interest through linkage to a trait QTL or correlation with the trait. Our methods for selection and analysis are derived within a missing data framework.

  18. A gain-of-function screen to identify genes that reduce lifespan in the adult of Drosophila melanogaster

    PubMed Central

    2014-01-01

    Background Several lines of evidence associate misregulated genetic expression with risk factors for diabetes, Alzheimer’s, and other diseases that sporadically develop in healthy adults with no background of hereditary disorders. Thus, we are interested in genes that may be expressed normally through parts of an individual’s life, but can cause physiological defects and disease when misexpressed in adulthood. Results We attempted to identify these genes in a model organism by arbitrarily misexpressing specific genes in adult Drosophila melanogaster, using 14,133 Gene Search lines. We identified 39 “reduced-lifespan genes” that, when misexpressed in adulthood, shortened the flies’ lifespan to less than 30% of that of control flies. About half of these genes have human orthologs that are known to be involved in human diseases. For about one-fourth of the reduced-lifespan genes, suppressing apoptosis restored the lifespan shortened by their misexpression. We determined the organs responsible for reduced lifespan when these genes were misexpressed specifically in adulthood, and found that while some genes induced reduced lifespan only when misexpressed in specific adult organs, others could induce reduced lifespan when misexpressed in various organs. This finding suggests that tissue-specific dysfunction may be involved in reduced lifespan related to gene misexpression. Gene ontology analysis showed that reduced-lifespan genes are biased toward genes related to development. Conclusions We identified 39 genes that, when misexpressed in adulthood, shortened the lifespan of adult flies. Suppressing apoptosis rescued this shortened lifespan for only a subset of the reduced-lifespan genes. The adult tissues in which gene misexpression caused early death differed among the reduced-lifespan genes. These results suggest that the cause of reduced lifespan upon misexpression differed among the genes. PMID:24739137

  19. LSAMP, a novel candidate tumor suppressor gene in human osteosarcomas, identified by array comparative genomic hybridization.

    PubMed

    Kresse, Stine H; Ohnstad, Hege O; Paulsen, Erik B; Bjerkehagen, Bodil; Szuhai, Karoly; Serra, Massimo; Schaefer, Karl-Ludwig; Myklebost, Ola; Meza-Zepeda, Leonardo A

    2009-08-01

    Osteosarcomas are the most common primary malignant tumor of bone, and almost all conventional osteosarcomas are high-grade tumors with complex karyotypes. We have examined DNA copy number changes in 36 osteosarcoma tumors and 20 cell lines using microarray-based comparative genomic hybridization. The most frequent minimal recurrent regions of gain identified in the tumor samples were in 1q21.2-q21.3 (78% of the samples), 1q21.3-q22 (78%), and 8q22.1 (72%). Minimal recurrent regions in 10q22.1-q22.2 (81%), 6q16.1 (67%), 13q14.2 (67%), and 13q21.1 (67%) were most frequently lost. A small region in 3q13.31 (2.1 Mb) containing the gene limbic system-associated membrane protein (LSAMP) was frequently deleted (56%). LSAMP has previously been reported to be a candidate tumor suppressor gene in other cancer types. The deletion was validated using fluorescence in situ hybridization, and the expression level and promoter methylation status of LSAMP were investigated using quantitative real-time reverse transcription PCR and methylation-specific PCR, respectively. LSAMP showed low expression compared to two normal bone samples in 6/15 tumors and 5/9 cell lines with deletion of 3q13.31, and also in 5/14 tumors and 3/11 cell lines with normal copy number or gain. Partial or full methylation of the investigated CpG island was identified in 3/30 tumors and 7/20 cell lines. Statistical analyses revealed that loss of 11p15.4-p15.3 and low expression of LSAMP (both P = 0.011) were significantly associated with poor survival. Our results show that LSAMP is a novel candidate tumor suppressor gene in osteosarcomas.

  20. Fraction of cases of acquired immunodeficiency syndrome prevented by the interactions of identified restriction gene variants.

    PubMed

    Silverberg, M J; Smith, M W; Chmiel, J S; Detels, R; Margolick, J B; Rinaldo, C R; O'Brien, S J; Muñoz, A

    2004-02-01

    Previous research has demonstrated isolated effects of host genetic factors on the progression of human immunodeficiency virus type 1 (HIV-1) infection. In this paper, the authors present a novel use of multivariable methods for estimating the prevented fraction of acquired immunodeficiency syndrome (AIDS) cases attributable to six restriction genes after accounting for their epidemiologic interactions. The methods presented will never yield a prevented fraction above 1. The study population consisted of a well-characterized cohort of 525 US men with HIV-1 seroconversion documented during follow-up (1984-1996). On the basis of a regression tree approach using a Cox proportional hazards model for times to clinical AIDS, the combinations of genes associated with the greatest protection, relative to the lack of a protective genotype, consisted of: 1) C-C chemokine receptor 5 (CCR5)-Delta 32 and C-C chemokine receptor 2 (CCR2)-64I (relative hazard = 0.44); 2) interleukin 10 (IL10)-+/+ in combination with CCR5-Delta 32 or CCR2-64I (relative hazard = 0.45); and 3) IL10-+/+ in combination with stromal-derived factor (SDF1)-3 'A and CCR5 promoter P1/approximately P1 (relative hazard = 0.37). Overall, 30% of potential AIDS cases were prevented by the observed combinations of restriction genes (95% confidence interval: 7, 47). However, the combined effect was confined to the first 4 years following HIV-1 seroconversion. Additional research is needed to identify AIDS restriction genes with stronger and long-lasting protection to better characterize the genetic epidemiology of HIV-1.

  1. A Sparse Reconstruction Approach for Identifying Gene Regulatory Networks Using Steady-State Experiment Data

    PubMed Central

    Zhang, Wanhong; Zhou, Tong

    2015-01-01

    Motivation Identifying gene regulatory networks (GRNs) which consist of a large number of interacting units has become a problem of paramount importance in systems biology. Situations exist extensively in which causal interacting relationships among these units are required to be reconstructed from measured expression data and other a priori information. Though numerous classical methods have been developed to unravel the interactions of GRNs, these methods either have higher computing complexities or have lower estimation accuracies. Note that great similarities exist between identification of genes that directly regulate a specific gene and a sparse vector reconstruction, which often relates to the determination of the number, location and magnitude of nonzero entries of an unknown vector by solving an underdetermined system of linear equations y = Φx. Based on these similarities, we propose a novel framework of sparse reconstruction to identify the structure of a GRN, so as to increase accuracy of causal regulation estimations, as well as to reduce their computational complexity. Results In this paper, a sparse reconstruction framework is proposed on basis of steady-state experiment data to identify GRN structure. Different from traditional methods, this approach is adopted which is well suitable for a large-scale underdetermined problem in inferring a sparse vector. We investigate how to combine the noisy steady-state experiment data and a sparse reconstruction algorithm to identify causal relationships. Efficiency of this method is tested by an artificial linear network, a mitogen-activated protein kinase (MAPK) pathway network and the in silico networks of the DREAM challenges. The performance of the suggested approach is compared with two state-of-the-art algorithms, the widely adopted total least-squares (TLS) method and those available results on the DREAM project. Actual results show that, with a lower computational cost, the proposed method can

  2. Gene Expression Profiling of Muscle Stem Cells Identifies Novel Regulators of Postnatal Myogenesis

    PubMed Central

    Alonso-Martin, Sonia; Rochat, Anne; Mademtzoglou, Despoina; Morais, Jessica; de Reyniès, Aurélien; Auradé, Frédéric; Chang, Ted Hung-Tse; Zammit, Peter S.; Relaix, Frédéric

    2016-01-01

    Skeletal muscle growth and regeneration require a population of muscle stem cells, the satellite cells, located in close contact to the myofiber. These cells are specified during fetal and early postnatal development in mice from a Pax3/7 population of embryonic progenitor cells. As little is known about the genetic control of their formation and maintenance, we performed a genome-wide chronological expression profile identifying the dynamic transcriptomic changes involved in establishment of muscle stem cells through life, and acquisition of muscle stem cell properties. We have identified multiple genes and pathways associated with satellite cell formation, including set of genes specifically induced (EphA1, EphA2, EfnA1, EphB1, Zbtb4, Zbtb20) or inhibited (EphA3, EphA4, EphA7, EfnA2, EfnA3, EfnA4, EfnA5, EphB2, EphB3, EphB4, EfnBs, Zfp354c, Zcchc5, Hmga2) in adult stem cells. Ephrin receptors and ephrins ligands have been implicated in cell migration and guidance in many tissues including skeletal muscle. Here we show that Ephrin receptors and ephrins ligands are also involved in regulating the adult myogenic program. Strikingly, impairment of EPHB1 function in satellite cells leads to increased differentiation at the expense of self-renewal in isolated myofiber cultures. In addition, we identified new transcription factors, including several zinc finger proteins. ZFP354C and ZCCHC5 decreased self-renewal capacity when overexpressed, whereas ZBTB4 increased it, and ZBTB20 induced myogenic progression. The architectural and transcriptional regulator HMGA2 was involved in satellite cell activation. Together, our study shows that transcriptome profiling coupled with myofiber culture analysis, provides an efficient system to identify and validate candidate genes implicated in establishment/maintenance of muscle stem cells. Furthermore, tour de force transcriptomic profiling provides a wealth of data to inform for future stem cell-based muscle therapies. PMID:27446912

  3. Whole-exome sequencing identifies MST1R as a genetic susceptibility gene in nasopharyngeal carcinoma

    PubMed Central

    Dai, Wei; Zheng, Hong; Cheung, Arthur Kwok Leung; Tang, Clara Sze-man; Ko, Josephine Mun Yee; Wong, Bonnie Wing Yan; Leong, Merrin Man Long; Sham, Pak Chung; Cheung, Florence; Kwong, Dora Lai-Wan; Ngan, Roger Kai Cheong; Ng, Wai Tong; Yau, Chun Chung; Pan, Jianji; Peng, Xun; Tung, Stewart; Zhang, Zengfeng; Ji, Mingfang; Chiang, Alan Kwok-Shing; Lee, Anne Wing-Mui; Lee, Victor Ho-fun; Lam, Ka-On; Au, Kwok Hung; Cheng, Hoi Ching; Yiu, Harry Ho-Yin; Lung, Maria Li

    2016-01-01

    Multiple factors, including host genetics, environmental factors, and Epstein–Barr virus (EBV) infection, contribute to nasopharyngeal carcinoma (NPC) development. To identify genetic susceptibility genes for NPC, a whole-exome sequencing (WES) study was performed in 161 NPC cases and 895 controls of Southern Chinese descent. The gene-based burden test discovered an association between macrophage-stimulating 1 receptor (MST1R) and NPC. We identified 13 independent cases carrying the MST1R pathogenic heterozygous germ-line variants, and 53.8% of these cases were diagnosed with NPC aged at or even younger than 20 y, indicating that MST1R germ-line variants are relevant to disease early-age onset (EAO) (age of ≤20 y). In total, five MST1R missense variants were found in EAO cases but were rare in controls (EAO vs. control, 17.9% vs. 1.2%, P = 7.94 × 10−12). The validation study, including 2,160 cases and 2,433 controls, showed that the MST1R variant c.G917A:p.R306H is highly associated with NPC (odds ratio of 9.0). MST1R is predominantly expressed in the tissue-resident macrophages and is critical for innate immunity that protects organs from tissue damage and inflammation. Importantly, MST1R expression is detected in the ciliated epithelial cells in normal nasopharyngeal mucosa and plays a role in the cilia motility important for host defense. Although no somatic mutation of MST1R was identified in the sporadic NPC tumors, copy number alterations and promoter hypermethylation at MST1R were often observed. Our findings provide new insights into the pathogenesis of NPC by highlighting the involvement of the MST1R-mediated signaling pathways. PMID:26951679

  4. Semantic interrogation of a multi knowledge domain ontological model of tendinopathy identifies four strong candidate risk genes

    PubMed Central

    Saunders, Colleen J.; Jalali Sefid Dashti, Mahjoubeh; Gamieldien, Junaid

    2016-01-01

    Tendinopathy is a multifactorial syndrome characterised by tendon pain and thickening, and impaired performance during activity. Candidate gene association studies have identified genetic factors that contribute to intrinsic risk of developing tendinopathy upon exposure to extrinsic factors. Bioinformatics approaches that data-mine existing knowledge for biological relationships may assist with the identification of candidate genes. The aim of this study was to data-mine functional annotation of human genes and identify candidate genes by ontology-seeded queries capturing the features of tendinopathy. Our BioOntological Relationship Graph database (BORG) integrates multiple sources of genomic and biomedical knowledge into an on-disk semantic network where human genes and their orthologs in mouse and rat are central concepts mapped to ontology terms. The BORG was used to screen all human genes for potential links to tendinopathy. Following further prioritisation, four strong candidate genes (COL11A2, ELN, ITGB3, LOX) were identified. These genes are differentially expressed in tendinopathy, functionally linked to features of tendinopathy and previously implicated in other connective tissue diseases. In conclusion, cross-domain semantic integration of multiple sources of biomedical knowledge, and interrogation of phenotypes and gene functions associated with disease, may significantly increase the probability of identifying strong and unobvious candidate genes in genetic association studies. PMID:26804977

  5. Semantic interrogation of a multi knowledge domain ontological model of tendinopathy identifies four strong candidate risk genes.

    PubMed

    Saunders, Colleen J; Jalali Sefid Dashti, Mahjoubeh; Gamieldien, Junaid

    2016-01-25

    Tendinopathy is a multifactorial syndrome characterised by tendon pain and thickening, and impaired performance during activity. Candidate gene association studies have identified genetic factors that contribute to intrinsic risk of developing tendinopathy upon exposure to extrinsic factors. Bioinformatics approaches that data-mine existing knowledge for biological relationships may assist with the identification of candidate genes. The aim of this study was to data-mine functional annotation of human genes and identify candidate genes by ontology-seeded queries capturing the features of tendinopathy. Our BioOntological Relationship Graph database (BORG) integrates multiple sources of genomic and biomedical knowledge into an on-disk semantic network where human genes and their orthologs in mouse and rat are central concepts mapped to ontology terms. The BORG was used to screen all human genes for potential links to tendinopathy. Following further prioritisation, four strong candidate genes (COL11A2, ELN, ITGB3, LOX) were identified. These genes are differentially expressed in tendinopathy, functionally linked to features of tendinopathy and previously implicated in other connective tissue diseases. In conclusion, cross-domain semantic integration of multiple sources of biomedical knowledge, and interrogation of phenotypes and gene functions associated with disease, may significantly increase the probability of identifying strong and unobvious candidate genes in genetic association studies.

  6. CoGA: An R Package to Identify Differentially Co-Expressed Gene Sets by Analyzing the Graph Spectra.

    PubMed

    Santos, Suzana de Siqueira; Galatro, Thais Fernanda de Almeida; Watanabe, Rodrigo Akira; Oba-Shinjo, Sueli Mieko; Nagahashi Marie, Suely Kazue; Fujita, André

    2015-01-01

    Gene set analysis aims to identify predefined sets of functionally related genes that are differentially expressed between two conditions. Although gene set analysis has been very successful, by incorporating biological knowledge about the gene sets and enhancing statistical power over gene-by-gene analyses, it does not take into account the correlation (association) structure among the genes. In this work, we present CoGA (Co-expression Graph Analyzer), an R package for the identification of groups of differentially associated genes between two phenotypes. The analysis is based on concepts of Information Theory applied to the spectral distributions of the gene co-expression graphs, such as the spectral entropy to measure the randomness of a graph structure and the Jensen-Shannon divergence to discriminate classes of graphs. The package also includes common measures to compare gene co-expression networks in terms of their structural properties, such as centrality, degree distribution, shortest path length, and clustering coefficient. Besides the structural analyses, CoGA also includes graphical interfaces for visual inspection of the networks, ranking of genes according to their "importance" in the network, and the standard differential expression analysis. We show by both simulation experiments and analyses of real data that the statistical tests performed by CoGA indeed control the rate of false positives and is able to identify differentially co-expressed genes that other methods failed.

  7. ToP: a trend-of-disease-progression procedure works well for identifying cancer genes from multi-state cohort gene expression data for human colorectal cancer.

    PubMed

    Chung, Feng-Hsiang; Lee, Henry Hsin-Chung; Lee, Hoong-Chien

    2013-01-01

    Significantly expressed genes extracted from microarray gene expression data have proved very useful for identifying genetic biomarkers of diseases, including cancer. However, deriving a disease related inference from a list of differentially expressed genes has proven less than straightforward. In a systems disease such as cancer, how genes interact with each other should matter just as much as the level of gene expression. Here, in a novel approach, we used the network and disease progression properties of individual genes in state-specific gene-gene interaction networks (GGINs) to select cancer genes for human colorectal cancer (CRC) and obtain a much higher hit rate of known cancer genes when compared with methods not based on network theory. We constructed GGINs by integrating gene expression microarray data from multiple states--healthy control (Nor), adenoma (Ade), inflammatory bowel disease (IBD) and CRC--with protein-protein interaction database and Gene Ontology. We tracked changes in the network degrees and clustering coefficients of individual genes in the GGINs as the disease state changed from one to another. From these we inferred the state sequences Nor-Ade-CRC and Nor-IBD-CRC both exhibited a trend of (disease) progression (ToP) toward CRC, and devised a ToP procedure for selecting cancer genes for CRC. Of the 141 candidates selected using ToP, ∼50% had literature support as cancer genes, compared to hit rates of 20% to 30% for standard methods using only gene expression data. Among the 16 candidate cancer genes that encoded transcription factors, 13 were known to be tumorigenic and three were novel: CDK1, SNRPF, and ILF2. We identified 13 of the 141 predicted cancer genes as candidate markers for early detection of CRC, 11 and 2 at the Ade and IBD states, respectively.

  8. Meta-analysis of clinical data using human meiotic genes identifies a novel cohort of highly restricted cancer-specific marker genes.

    PubMed

    Feichtinger, Julia; Aldeailej, Ibrahim; Anderson, Rebecca; Almutairi, Mikhlid; Almatrafi, Ahmed; Alsiwiehri, Naif; Griffiths, Keith; Stuart, Nicholas; Wakeman, Jane A; Larcombe, Lee; McFarlane, Ramsay J

    2012-08-01

    Identifying cancer-specific biomarkers represents an ongoing challenge to the development of novel cancer diagnostic, prognostic and therapeutic strategies. Cancer/testis (CT) genes are an important gene family with expression tightly restricted to the testis in normal individuals but which can also be activated in cancers. Here we develop a pipeline to identify new CT genes. We analysed and validated expression profiles of human meiotic genes in normal and cancerous tissue followed by meta-analyses of clinical data sets from a range of tumour types resulting in the identification of a large cohort of highly specific cancer biomarker genes, including the recombination hot spot activator PRDM9 and the meiotic cohesin genes SMC1beta and RAD21L. These genes not only provide excellent cancer biomarkers for diagnostics and prognostics, but may serve as oncogenes and have excellent drug targeting potential.

  9. Focal Chromosomal Copy Number Aberrations Identify CMTM8 and GPR177 as New Candidate Driver Genes in Osteosarcoma

    PubMed Central

    Bras, Johannes; Schaap, Gerard R.; Baas, Frank; Ylstra, Bauke; Hulsebos, Theo J. M.

    2014-01-01

    Osteosarcoma is an aggressive bone tumor that preferentially develops in adolescents. The tumor is characterized by an abundance of genomic aberrations, which hampers the identification of the driver genes involved in osteosarcoma tumorigenesis. Our study aims to identify these genes by the investigation of focal copy number aberrations (CNAs, <3 Mb). For this purpose, we subjected 26 primary tumors of osteosarcoma patients to high-resolution single nucleotide polymorphism array analyses and identified 139 somatic focal CNAs. Of these, 72 had at least one gene located within or overlapping the focal CNA, with a total of 94 genes. For 84 of these genes, the expression status in 31 osteosarcoma samples was determined by expression microarray analysis. This enabled us to identify the genes of which the over- or underexpression was in more than 35% of cases in accordance to their copy number status (gain or loss). These candidate genes were subsequently validated in an independent set and furthermore corroborated as driver genes by verifying their role in other tumor types. We identified CMTM8 as a new candidate tumor suppressor gene and GPR177 as a new candidate oncogene in osteosarcoma. In osteosarcoma, CMTM8 has been shown to suppress EGFR signaling. In other tumor types, CMTM8 is known to suppress the activity of the oncogenic protein c-Met and GPR177 is known as an overexpressed upstream regulator of the Wnt-pathway. Further studies are needed to determine whether these proteins also exert the latter functions in osteosarcoma tumorigenesis. PMID:25551557

  10. Focal chromosomal copy number aberrations identify CMTM8 and GPR177 as new candidate driver genes in osteosarcoma.

    PubMed

    Both, Joeri; Krijgsman, Oscar; Bras, Johannes; Schaap, Gerard R; Baas, Frank; Ylstra, Bauke; Hulsebos, Theo J M

    2014-01-01

    Osteosarcoma is an aggressive bone tumor that preferentially develops in adolescents. The tumor is characterized by an abundance of genomic aberrations, which hampers the identification of the driver genes involved in osteosarcoma tumorigenesis. Our study aims to identify these genes by the investigation of focal copy number aberrations (CNAs, <3 Mb). For this purpose, we subjected 26 primary tumors of osteosarcoma patients to high-resolution single nucleotide polymorphism array analyses and identified 139 somatic focal CNAs. Of these, 72 had at least one gene located within or overlapping the focal CNA, with a total of 94 genes. For 84 of these genes, the expression status in 31 osteosarcoma samples was determined by expression microarray analysis. This enabled us to identify the genes of which the over- or underexpression was in more than 35% of cases in accordance to their copy number status (gain or loss). These candidate genes were subsequently validated in an independent set and furthermore corroborated as driver genes by verifying their role in other tumor types. We identified CMTM8 as a new candidate tumor suppressor gene and GPR177 as a new candidate oncogene in osteosarcoma. In osteosarcoma, CMTM8 has been shown to suppress EGFR signaling. In other tumor types, CMTM8 is known to suppress the activity of the oncogenic protein c-Met and GPR177 is known as an overexpressed upstream regulator of the Wnt-pathway. Further studies are needed to determine whether these proteins also exert the latter functions in osteosarcoma tumorigenesis.

  11. An Efficient Method for Identifying Gene Fusions by Targeted RNA Sequencing from Fresh Frozen and FFPE Samples.

    PubMed

    Scolnick, Jonathan A; Dimon, Michelle; Wang, I-Ching; Huelga, Stephanie C; Amorese, Douglas A

    2015-01-01

    Fusion genes are known to be key drivers of tumor growth in several types of cancer. Traditionally, detecting fusion genes has been a difficult task based on fluorescent in situ hybridization to detect chromosomal abnormalities. More recently, RNA sequencing has enabled an increased pace of fusion gene identification. However, RNA-Seq is inefficient for the identification of fusion genes due to the high number of sequencing reads needed to detect the small number of fusion transcripts present in cells of interest. Here we describe a method, Single Primer Enrichment Technology (SPET), for targeted RNA sequencing that is customizable to any target genes, is simple to use, and efficiently detects gene fusions. Using SPET to target 5701 exons of 401 known cancer fusion genes for sequencing, we were able to identify known and previously unreported gene fusions from both fresh-frozen and formalin-fixed paraffin-embedded (FFPE) tissue RNA in both normal tissue and cancer cells.

  12. An Efficient Method for Identifying Gene Fusions by Targeted RNA Sequencing from Fresh Frozen and FFPE Samples

    PubMed Central

    Scolnick, Jonathan A.; Dimon, Michelle; Wang, I-Ching; Huelga, Stephanie C.; Amorese, Douglas A.

    2015-01-01

    Fusion genes are known to be key drivers of tumor growth in several types of cancer. Traditionally, detecting fusion genes has been a difficult task based on fluorescent in situ hybridization to detect chromosomal abnormalities. More recently, RNA sequencing has enabled an increased pace of fusion gene identification. However, RNA-Seq is inefficient for the identification of fusion genes due to the high number of sequencing reads needed to detect the small number of fusion transcripts present in cells of interest. Here we describe a method, Single Primer Enrichment Technology (SPET), for targeted RNA sequencing that is customizable to any target genes, is simple to use, and efficiently detects gene fusions. Using SPET to target 5701 exons of 401 known cancer fusion genes for sequencing, we were able to identify known and previously unreported gene fusions from both fresh-frozen and formalin-fixed paraffin-embedded (FFPE) tissue RNA in both normal tissue and cancer cells. PMID:26132974

  13. Identifying Tinnitus-Related Genes Based on a Side-Effect Network Analysis

    PubMed Central

    Elgoyhen, A B; Langguth, B; Nowak, W; Schecklmann, M; De Ridder, D; Vanneste, S

    2014-01-01

    Tinnitus, phantom sound perception, is a worldwide highly prevalent disorder for which no clear underlying pathology has been established and for which no approved drug is on the market. Thus, there is an urgent need for new approaches to understand this condition. We used a network pharmacology side-effect analysis to search for genes that are involved in tinnitus generation. We analyzed a network of 1,313 drug–target pairs, based on 275 compounds that elicit tinnitus as side effect and their targets reported in databases, and used a quantitative score to identify emergent significant targets that were more common than expected at random. Cyclooxigenase 1 and 2 were significant, which validates our approach, since salicylate is a known tinnitus generator. More importantly, we predict previously unknown tinnitus-related targets. The present results have important implications toward understanding tinnitus pathophysiology and might pave the way toward the design of novel pharmacotherapies. PMID:24477090

  14. Epigenetic landscapes explain partially reprogrammed cells and identify key reprogramming gene

    NASA Astrophysics Data System (ADS)

    Lang, Alex; Li, Hu; Collins, James; Mehta, Pankaj

    2013-03-01

    A common metaphor for describing development is a rugged epigenetic landscape where cell fates are represented as attracting valleys resulting from a complex regulatory network. Here, we introduce a framework for explicitly constructing epigenetic landscapes that combines genomic data with techniques from physics, specifically Hopfield neural networks. Each cell fate is a dynamic attractor, yet cells can change fate in response to external signals. Our model suggests that partially reprogrammed cells (cells found in reprogramming experiments but not in vivo) are a natural consequence of high-dimensional landscapes and predicts that partially reprogrammed cells should be hybrids that coexpress genes from multiple cell fates. We verify this prediction by reanalyzing existing data sets. Our model reproduces known reprogramming protocols and identifies candidate transcription factors for reprogramming to novel cell fates, suggesting epigenetic landscapes are a powerful paradigm for understanding cellular identity.

  15. Genetic similarity between cancers and comorbid Mendelian diseases identifies candidate driver genes

    PubMed Central

    Melamed, Rachel D.; Emmett, Kevin J.; Madubata, Chioma; Rzhetsky, Andrey; Rabadan, Raul

    2015-01-01

    Despite large-scale cancer genomics studies, key somatic mutations driving cancer, and their functional roles, remain elusive. Here we propose that analysis of comorbidities of Mendelian diseases with cancers provides a novel, systematic way to discover new cancer genes. If germline genetic variation in Mendelian loci predisposes bearers to common cancers, the same loci may harbor cancer-associated somatic variation. Compilations of clinical records spanning over 100 million patients provide an unprecedented opportunity to assess clinical associations between Mendelian diseases and cancers. We systematically compare these comorbidities against recurrent somatic mutations from more than five thousand patients across many cancers. Using multiple measures of genetic similarity, we show that a Mendelian disease and comorbid cancer indeed have genetic alterations of significant functional similarity. This result provides a basis to identify candidate drivers in cancers including melanoma and glioblastoma. Some Mendelian diseases demonstrate “pan-cancer” comorbidity and shared genetics across cancers. PMID:25926297

  16. Epigenetic Landscapes Explain Partially Reprogrammed Cells and Identify Key Reprogramming Genes

    PubMed Central

    Lang, Alex H.; Li, Hu; Collins, James J.; Mehta, Pankaj

    2014-01-01

    A common metaphor for describing development is a rugged “epigenetic landscape” where cell fates are represented as attracting valleys resulting from a complex regulatory network. Here, we introduce a framework for explicitly constructing epigenetic landscapes that combines genomic data with techniques from spin-glass physics. Each cell fate is a dynamic attractor, yet cells can change fate in response to external signals. Our model suggests that partially reprogrammed cells are a natural consequence of high-dimensional landscapes, and predicts that partially reprogrammed cells should be hybrids that co-express genes from multiple cell fates. We verify this prediction by reanalyzing existing datasets. Our model reproduces known reprogramming protocols and identifies candidate transcription factors for reprogramming to novel cell fates, suggesting epigenetic landscapes are a powerful paradigm for understanding cellular identity. PMID:25122086

  17. Integrative analysis of DNA methylation and gene expression data identifies EPAS1 as a key regulator of COPD.

    PubMed

    Yoo, Seungyeul; Takikawa, Sachiko; Geraghty, Patrick; Argmann, Carmen; Campbell, Joshua; Lin, Luan; Huang, Tao; Tu, Zhidong; Foronjy, Robert F; Feronjy, Robert; Spira, Avrum; Schadt, Eric E; Powell, Charles A; Zhu, Jun

    2015-01-01

    Chronic Obstructive Pulmonary Disease (COPD) is a complex disease. Genetic, epigenetic, and environmental factors are known to contribute to COPD risk and disease progression. Therefore we developed a systematic approach to identify key regulators of COPD that integrates genome-wide DNA methylation, gene expression, and phenotype data in lung tissue from COPD and control samples. Our integrative analysis identified 126 key regulators of COPD. We identified EPAS1 as the only key regulator whose downstream genes significantly overlapped with multiple genes sets associated with COPD disease severity. EPAS1 is distinct in comparison with other key regulators in terms of methylation profile and downstream target genes. Genes predicted to be regulated by EPAS1 were enriched for biological processes including signaling, cell communications, and system development. We confirmed that EPAS1 protein levels are lower in human COPD lung tissue compared to non-disease controls and that Epas1 gene expression is reduced in mice chronically exposed to cigarette smoke. As EPAS1 downstream genes were significantly enriched for hypoxia responsive genes in endothelial cells, we tested EPAS1 function in human endothelial cells. EPAS1 knockdown by siRNA in endothelial cells impacted genes that significantly overlapped with EPAS1 downstream genes in lung tissue including hypoxia responsive genes, and genes associated with emphysema severity. Our first integrative analysis of genome-wide DNA methylation and gene expression profiles illustrates that not only does DNA methylation play a 'causal' role in the molecular pathophysiology of COPD, but it can be leveraged to directly identify novel key mediators of this pathophysiology.

  18. Identifying human disease genes: advances in molecular genetics and computational approaches.

    PubMed

    Bakhtiar, S M; Ali, A; Baig, S M; Barh, D; Miyoshi, A; Azevedo, V

    2014-07-04

    The human genome project is one of the significant achievements that have provided detailed insight into our genetic legacy. During the last two decades, biomedical investigations have gathered a considerable body of evidence by detecting more than 2000 disease genes. Despite the imperative advances in the genetic understanding of various diseases, the pathogenesis of many others remains obscure. With recent advances, the laborious methodologies used to identify DNA variations are replaced by direct sequencing of genomic DNA to detect genetic changes. The ability to perform such studies depends equally on the development of high-throughput and economical genotyping methods. Currently, basically for every disease whose origen is still unknown, genetic approaches are available which could be pedigree-dependent or -independent with the capacity to elucidate fundamental disease mechanisms. Computer algorithms and programs for linkage analysis have formed the foundation for many disease gene detection projects, similarly databases of clinical findings have been widely used to support diagnostic decisions in dysmorphology and general human disease. For every disease type, genome sequence variations, particularly single nucleotide polymorphisms are mapped by comparing the genetic makeup of case and control groups. Methods that predict the effects of polymorphisms on protein stability are useful for the identification of possible disease associations, whereas structural effects can be assessed using methods to predict stability changes in proteins using sequence and/or structural information.

  19. Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes.

    PubMed

    Yasui, Yasuo; Hirakawa, Hideki; Ueno, Mariko; Matsui, Katsuhiro; Katsube-Tanaka, Tomoyuki; Yang, Soo Jung; Aii, Jotaro; Sato, Shingo; Mori, Masashi

    2016-06-01

    Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits.

  20. A dehydration-inducible gene in the truffle Tuber borchii identifies a novel group of dehydrins

    PubMed Central

    Abba', Simona; Ghignone, Stefano; Bonfante, Paola

    2006-01-01

    Background The expressed sequence tag M6G10 was originally isolated from a screening for differentially expressed transcripts during the reproductive stage of the white truffle Tuber borchii. mRNA levels for M6G10 increased dramatically during fruiting body maturation compared to the vegetative mycelial stage. Results Bioinformatics tools, phylogenetic analysis and expression studies were used to support the hypothesis that this sequence, named TbDHN1, is the first dehydrin (DHN)-like coding gene isolated in fungi. Homologs of this gene, all defined as "coding for hypothetical proteins" in public databases, were exclusively found in ascomycetous fungi and in plants. Although complete (or almost complete) fungal genomes and EST collections of some Basidiomycota and Glomeromycota are already available, DHN-like proteins appear to be represented only in Ascomycota. A new and previously uncharacterized conserved signature pattern was identified and proposed to Uniprot database as the main distinguishing feature of this new group of DHNs. Expression studies provide experimental evidence of a transcript induction of TbDHN1 during cellular dehydration. Conclusion Expression pattern and sequence similarities to known plant DHNs indicate that TbDHN1 is the first characterized DHN-like protein in fungi. The high similarity of TbDHN1 with homolog coding sequences implies the existence of a novel fungal/plant group of LEA Class II proteins characterized by a previously undescribed signature pattern. PMID:16512918

  1. Identifying the genes underlying quantitative traits: a rationale for the QTN programme

    PubMed Central

    Lee, Young Wha; Gould, Billie A.; Stinchcombe, John R.

    2014-01-01

    The goal of identifying the genes or even nucleotides underlying quantitative and adaptive traits has been characterized as the ‘QTN programme’ and has recently come under severe criticism. Part of the reason for this criticism is that much of the QTN programme has asserted that finding the genes and nucleotides for adaptive and quantitative traits is a fundamental goal, without explaining why it is such a hallowed goal. Here we outline motivations for the QTN programme that offer general insight, regardless of whether QTNs are of large or small effect, and that aid our understanding of the mechanistic dynamics of adaptive evolution. We focus on five areas: (i) vertical integration of insight across different levels of biological organization, (ii) genetic parallelism and the role of pleiotropy in shaping evolutionary dynamics, (iii) understanding the forces maintaining genetic variation in populations, (iv) distinguishing between adaptation from standing variation and new mutation, and (v) the role of genomic architecture in facilitating adaptation. We argue that rather than abandoning the QTN programme, we should refocus our efforts on topics where molecular data will be the most effective for testing hypotheses about phenotypic evolution. PMID:24790125

  2. Genetic and functional analyses identify DISC1 as a novel callosal agenesis candidate gene.

    PubMed

    Osbun, Nathan; Li, Jiang; O'Driscoll, Mary C; Strominger, Zoe; Wakahiro, Mari; Rider, Eric; Bukshpun, Polina; Boland, Elena; Spurrell, Cailyn H; Schackwitz, Wendy; Pennacchio, Len A; Dobyns, William B; Black, Graeme C M; Sherr, Elliott H

    2011-08-01

    Agenesis of the corpus callosum (AgCC) is a congenital brain malformation that occurs in approximately 1:1,000-1:6,000 births. Several syndromes associated with AgCC have been traced to single gene mutations; however, the majority of AgCC causes remain unidentified. We investigated a mother and two children who all shared complete AgCC and a chromosomal deletion at 1q42. We fine mapped this deletion and show that it includes Disrupted-in-Schizophrenia 1 (DISC1), a gene implicated in schizophrenia and other psychiatric disorders. Furthermore, we report a de novo chromosomal deletion at 1q42.13 to q44, which includes DISC1, in another individual with AgCC. We resequenced DISC1 in a cohort of 144 well-characterized AgCC individuals and identified 20 sequence changes, of which 4 are rare potentially pathogenic variants. Two of these variants were undetected in 768 control chromosomes. One of these is a splice site mutation at the 5' boundary of exon 11 that dramatically reduces full-length mRNA expression of DISC1, but not of shorter forms. We investigated the developmental expression of mouse DISC1 and find that it is highly expressed in the embryonic corpus callosum at a critical time for callosal formation. Taken together our results suggest a significant role for DISC1 in corpus callosum development.

  3. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder.

    PubMed

    C Yuen, Ryan K; Merico, Daniele; Bookman, Matt; L Howe, Jennifer; Thiruvahindrapuram, Bhooma; Patel, Rohan V; Whitney, Joe; Deflaux, Nicole; Bingham, Jonathan; Wang, Zhuozhi; Pellecchia, Giovanna; Buchanan, Janet A; Walker, Susan; Marshall, Christian R; Uddin, Mohammed; Zarrei, Mehdi; Deneault, Eric; D'Abate, Lia; Chan, Ada J S; Koyanagi, Stephanie; Paton, Tara; Pereira, Sergio L; Hoang, Ny; Engchuan, Worrawat; Higginbotham, Edward J; Ho, Karen; Lamoureux, Sylvia; Li, Weili; MacDonald, Jeffrey R; Nalpathamkalam, Thomas; Sung, Wilson W L; Tsoi, Fiona J; Wei, John; Xu, Lizhen; Tasse, Anne-Marie; Kirby, Emily; Van Etten, William; Twigger, Simon; Roberts, Wendy; Drmic, Irene; Jilderda, Sanne; Modi, Bonnie MacKinnon; Kellam, Barbara; Szego, Michael; Cytrynbaum, Cheryl; Weksberg, Rosanna; Zwaigenbaum, Lonnie; Woodbury-Smith, Marc; Brian, Jessica; Senman, Lili; Iaboni, Alana; Doyle-Thomas, Krissy; Thompson, Ann; Chrysler, Christina; Leef, Jonathan; Savion-Lemieux, Tal; Smith, Isabel M; Liu, Xudong; Nicolson, Rob; Seifer, Vicki; Fedele, Angie; Cook, Edwin H; Dager, Stephen; Estes, Annette; Gallagher, Louise; Malow, Beth A; Parr, Jeremy R; Spence, Sarah J; Vorstman, Jacob; Frey, Brendan J; Robinson, James T; Strug, Lisa J; Fernandez, Bridget A; Elsabbagh, Mayada; Carter, Melissa T; Hallmayer, Joachim; Knoppers, Bartha M; Anagnostou, Evdokia; Szatmari, Peter; Ring, Robert H; Glazer, David; Pletcher, Mathew T; Scherer, Stephen W

    2017-04-01

    We are performing whole-genome sequencing of families with autism spectrum disorder (ASD) to build a resource (MSSNG) for subcategorizing the phenotypes and underlying genetic factors involved. Here we report sequencing of 5,205 samples from families with ASD, accompanied by clinical information, creating a database accessible on a cloud platform and through a controlled-access internet portal. We found an average of 73.8 de novo single nucleotide variants and 12.6 de novo insertions and deletions or copy number variations per ASD subject. We identified 18 new candidate ASD-risk genes and found that participants bearing mutations in susceptibility genes had significantly lower adaptive ability (P = 6 × 10(-4)). In 294 of 2,620 (11.2%) of ASD cases, a molecular basis could be determined and 7.2% of these carried copy number variations and/or chromosomal abnormalities, emphasizing the importance of detecting all forms of genetic variation as diagnostic and therapeutic targets in ASD.

  4. Identifying Intracellular pDNA Losses From a Model of Nonviral Gene Delivery.

    PubMed

    Martin, Timothy; Wysocki, Beata; Wysocki, Tadeusz; Pannier, Angela

    2015-01-23

    Nonviral gene delivery systems are a type of nano-communication system that transmit plasmid packets (i.e. pDNA packets) that are programmed at the nanoscale to biological systems at the microscopic cellular level. This engineered nano-communication system suffers large pDNA losses during transmission of the genetically encoded information, preventing its use in biotechnological and medical applications. The pDNA losses largely remain uncharacterized, and the ramifications of reducing pDNA loss from newly designed gene delivery systems remain difficult to predict. Here, the pDNA losses during primary and secondary transmission chains were identified utilizing a MATLAB model employing queuing theory simulating delivery of pEGFPLuc transgene to HeLa cells carried by Lipofectamine 2000 nonviral DNA carrier. Minimizing pDNA loss during endosomal escape of the primary transmission process results in increased number of pDNA in the nucleus with increased transfection, but with increased probability of cell death. The number of pDNA copies in the nucleus and the amount of time the pDNAs are in the nucleus directly correlates to improved transfection efficiency. During secondary transmission, pDNAs are degraded during distribution to daughter cells. Reducing pDNA losses improves transfection, but a balance in quantity of nuclear pDNA, mitosis, and toxicity must be considered in order to achieve therapeutically relevant transfection levels.

  5. Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes

    PubMed Central

    Yasui, Yasuo; Hirakawa, Hideki; Ueno, Mariko; Matsui, Katsuhiro; Katsube-Tanaka, Tomoyuki; Yang, Soo Jung; Aii, Jotaro; Sato, Shingo; Mori, Masashi

    2016-01-01

    Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits. PMID:27037832

  6. Computational identifying and characterizing circular RNAs and their associated genes in hepatocellular carcinoma

    PubMed Central

    Li, Yan; Dong, Yongcheng; Huang, Ziyan; Kuang, Qifan; Wu, Yiming; Li, Yizhou; Li, Menglong

    2017-01-01

    Hepatocellular carcinoma (HCC) is currently still a major factor leading to death, lacking of reliable biomarkers. Therefore, deep understanding the pathogenesis for HCC is of great importance. The emergence of circular RNA (circRNA) provides a new way to study the pathogenesis of human disease. Here, we employed the prediction tool to identify circRNAs based on RNA-seq data. Then, to investigate the biological function of the circRNA, the candidate circRNAs were associated with the protein-coding genes (PCGs) by GREAT. We found significant candidate circRNAs expression alterations between normal and tumor samples. Additionally, the PCGs associated with these candidate circRNAs were also found have discriminative expression patterns between normal and tumor samples. The enrichment analysis illustrated that these PCGs were predominantly enriched for liver/cardiovascular-related diseases such as atherosclerosis, myocardial ischemia and coronary heart disease, and participated in various metabolic processes. Together, a further network analysis indicated that these PCGs play important roles in the regulatory and the PPI network. Finally, we built a classification model to distinguish normal and tumor samples by using candidate circRNAs and their associated genes, respectively. Both of them obtained satisfactory results (~ 0.99 of AUC for circRNA and PCG). Our findings suggested that the circRNA could be a critical factor in HCC, providing a useful resource to explore the pathogenesis of HCC. PMID:28346469

  7. Newborn Screening Quality Assurance Program for CFTR Mutation Detection and Gene Sequencing to Identify Cystic Fibrosis

    PubMed Central

    Hendrix, Miyono M.; Foster, Stephanie L.; Cordovado, Suzanne K.

    2016-01-01

    All newborn screening laboratories in the United States and many worldwide screen for cystic fibrosis. Most laboratories use a second-tier genotyping assay to identify a panel of mutations in the CF transmembrane regulator (CFTR) gene. Centers for Disease Control and Prevention’s Newborn Screening Quality Assurance Program houses a dried blood spot repository of samples containing CFTR mutations to assist newborn screening laboratories and ensure high-quality mutation detection in a high-throughput environment. Recently, CFTR mutation detection has increased in complexity with expanded genotyping panels and gene sequencing. To accommodate the growing quality assurance needs, the repository samples were characterized with several multiplex genotyping methods, Sanger sequencing, and 3 next-generation sequencing assays using a high-throughput, low-concentration DNA extraction method. The samples performed well in all of the assays, providing newborn screening laboratories with a resource for complex CFTR mutation detection and next-generation sequencing as they transition to new methods. PMID:28261631

  8. Integrating QTL mapping and transcriptomics identifies candidate genes underlying QTLs associated with soybean tolerance to low-phosphorus stress.

    PubMed

    Zhang, Dan; Zhang, Hengyou; Chu, Shanshan; Li, Hongyan; Chi, Yingjun; Triebwasser-Freese, Daniella; Lv, Haiyan; Yu, Deyue

    2017-01-01

    Soybean is a high phosphorus (P) demand species that is sensitive to low-P stress. Although many quantitative trait loci (QTL) for P efficiency have been identified in soybean, but few of these have been cloned and agriculturally applied mainly due to various limitations on identifying suitable P efficiency candidate genes. Here, we combined QTL mapping, transcriptome profiling, and plant transformation to identify candidate genes underlying QTLs associated with low-P tolerance and response mechanisms to low-P stress in soybean. By performing QTL linkage mapping using 152 recombinant inbred lines (RILs) that were derived from a cross between a P-efficient variety, Nannong 94-156, and P-sensitive Bogao, we identified four major QTLs underlying P efficiency. Within these four QTL regions, 34/81 candidate genes in roots/leaves were identified using comparative transcriptome analysis between two transgressive RILs, low-P tolerant genotype B20 and sensitive B18. A total of 22 phosphatase family genes were up-regulated significantly under low-P condition in B20. Overexpression of an acid phosphatase candidate gene, GmACP2, in soybean hairy roots increased P efficiency by 15.43-24.54 % compared with that in controls. Our results suggest that integrating QTL mapping and transcriptome profiling could be useful for rapidly identifying candidate genes underlying complex traits, and phosphatase-encoding genes, such as GmACP2, play important roles involving in low-P stress tolerance in soybean.

  9. Epigenomic Elements Analyses for Promoters Identify ESRRG as a New Susceptibility Gene for Obesity-related Traits

    PubMed Central

    Dong, Shan-Shan; Guo, Yan; Zhu, Dong-Li; Chen, Xiao-Feng; Wu, Xiao-Ming; Shen, Hui; Chen, Xiang-Ding; Tan, Li-Jun; Tian, Qing; Deng, Hong-Wen; Yang, Tie-Lin

    2016-01-01

    OBJECTIVES With ENCODE epigenomic data and results from published genome-wide association studies (GWASs), we aimed to find regulatory signatures of obesity genes and discover novel susceptibility genes. METHODS Obesity genes were obtained from public GWASs databases and their promoters were annotated based on the regulatory elements information. Significantly enriched or depleted epigenomic elements in the promoters of obesity genes were evaluated and all human genes were then prioritized according to the existence of the selected elements to predict new candidate genes. Top ranked genes were subsequently applied to validate their associations with obesity-related traits in three independent in-house GWASs samples. RESULTS We identified RAD21 and EZH2 as over-represented, STAT2 and IRF3 as depleted transcription factors. Histone modification of H3K9me3 and chromatin state segmentation of “poised promoter” and “repressed” were overrepresented. All genes were prioritized and we selected the top five genes for validation at population level. Combined results from the three GWASs samples, rs7522101 in ESRRG remained significantly associated with BMI after multiple testing corrections (P = 7.25 × 10−5). It was also associated with β-cell function (P = 1.99 × 10−3) and fasting glucose level (P < 0.05) in the meta-analyses of glucose and insulin-related traits consortium (MAGIC) dataset. CONCLUSIONS In summary, we identified epigenomic characteristics for obesity genes and suggested ESRRG as a novel obesity susceptibility gene. PMID:27113491

  10. Differential Gene Expression in the Meristem and during Early Fruit Growth of Pisum sativum L. Identifies Potential Targets for Breeding

    PubMed Central

    Smitha Ninan, Annu; Shah, Anish; Song, Jiancheng; Jameson, Paula E.

    2017-01-01

    For successful molecular breeding it is important to identify targets to the gene family level, and in the specific species of interest, in this case Pisum sativum L. The cytokinins have been identified as a key breeding target due to their influence on plant architecture, and on seed size and sink activity. We focused on the cytokinin biosynthetic gene family (the IPTs) and the gene family key to the destruction of cytokinins (the CKXs), as well as other gene families potentially affected by changing cytokinin levels. These included key meristem genes (WUS and BAM1) and the transporter gene families, sucrose transporters (SUTs) and amino acid permeases (AAPs). We used reverse transcription quantitative PCR (RT-qPCR) to monitor gene expression in the vegetative meristem and in pre- and post-fertilisation young pea fruits. PsWUS expression was specific to the shoot apical meristem while PsBAM1 was highly expressed in the shoot apical meristem (SAM) but was also expressed at a low level in the young fruit. Differential expression was shown between genes and within gene families for IPT, CKX, SUT, and AAP. PsCKX7 showed strong gene family member-specific expression in the SAM, and was also expressed in young pea fruits. We suggest that PsCKX7 is a potential target for downregulation via molecular breeding or gene editing. PMID:28212324

  11. Potential New Genes for Resistance to Mycosphaerella Graminicola Identified in Triticum Aestivum x Lophopyrum Elongatum Disomic Substitution Lines

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Lophopyrum species carry many desirable agronomic traits, including disease resistance, which can be transferred to wheat by interspecific hybridizations. To identify potentially new genes for disease and insect resistance carried by individual Lophopyrum chromosomes, 19 of 21 possible wheat cultiv...

  12. Using expression profiles of Caenorhabditis elegans neurons to identify genes that mediate synaptic connectivity.

    PubMed

    Baruch, Leehod; Itzkovitz, Shalev; Golan-Mashiach, Michal; Shapiro, Ehud; Segal, Eran

    2008-07-11

    Synaptic wiring of neurons in Caenorhabditis elegans is largely invariable between animals. It has been suggested that this feature stems from genetically encoded molecular markers that guide the neurons in the final stage of synaptic formation. Identifying these markers and unraveling the logic by which they direct synapse formation is a key challenge. Here, we address this task by constructing a probabilistic model that attempts to explain the neuronal connectivity diagram of C. elegans as a function of the expression patterns of its neurons. By only considering neuron pairs that are known to be connected by chemical or electrical synapses, we focus on the final stage of synapse formation, in which neurons identify their designated partners. Our results show that for many neurons the neuronal expression map of C. elegans can be used to accurately predict the subset of adjacent neurons that will be chosen as its postsynaptic partners. Notably, these predictions can be achieved using the expression patterns of only a small number of specific genes that interact in a combinatorial fashion.

  13. A functional siRNA screen identifies genes modulating angiotensin II-mediated EGFR transactivation

    PubMed Central

    George, Amee J.; Purdue, Brooke W.; Gould, Cathryn M.; Thomas, Daniel W.; Handoko, Yanny; Qian, Hongwei; Quaife-Ryan, Gregory A.; Morgan, Kylie A.; Simpson, Kaylene J.; Thomas, Walter G.; Hannan, Ross D.

    2013-01-01

    Summary The angiotensin type 1 receptor (AT1R) transactivates the epidermal growth factor receptor (EGFR) to mediate cellular growth, however, the molecular mechanisms involved have not yet been resolved. To address this, we performed a functional siRNA screen of the human kinome in human mammary epithelial cells that demonstrate a robust AT1R–EGFR transactivation. We identified a suite of genes encoding proteins that both positively and negatively regulate AT1R–EGFR transactivation. Many candidates are components of EGFR signalling networks, whereas others, including TRIO, BMX and CHKA, have not been previously linked to EGFR transactivation. Individual knockdown of TRIO, BMX or CHKA attenuated tyrosine phosphorylation of the EGFR by angiotensin II stimulation, but this did not occur following direct stimulation of the EGFR with EGF, indicating that these proteins function between the activated AT1R and the EGFR. Further investigation of TRIO and CHKA revealed that their activity is likely to be required for AT1R–EGFR transactivation. CHKA also mediated EGFR transactivation in response to another G protein-coupled receptor (GPCR) ligand, thrombin, indicating a pervasive role for CHKA in GPCR–EGFR crosstalk. Our study reveals the power of unbiased, functional genomic screens to identify new signalling mediators important for tissue remodelling in cardiovascular disease and cancer. PMID:24046455

  14. Using Expression Profiles of Caenorhabditis elegans Neurons To Identify Genes That Mediate Synaptic Connectivity

    PubMed Central

    Baruch, Leehod; Itzkovitz, Shalev; Golan-Mashiach, Michal; Shapiro, Ehud; Segal, Eran

    2008-01-01

    Synaptic wiring of neurons in Caenorhabditis elegans is largely invariable between animals. It has been suggested that this feature stems from genetically encoded molecular markers that guide the neurons in the final stage of synaptic formation. Identifying these markers and unraveling the logic by which they direct synapse formation is a key challenge. Here, we address this task by constructing a probabilistic model that attempts to explain the neuronal connectivity diagram of C. elegans as a function of the expression patterns of its neurons. By only considering neuron pairs that are known to be connected by chemical or electrical synapses, we focus on the final stage of synapse formation, in which neurons identify their designated partners. Our results show that for many neurons the neuronal expression map of C. elegans can be used to accurately predict the subset of adjacent neurons that will be chosen as its postsynaptic partners. Notably, these predictions can be achieved using the expression patterns of only a small number of specific genes that interact in a combinatorial fashion. PMID:18711638

  15. Thirty-nine novel neurofibromatosis 1 (NF1) gene mutations identified in Slovak patients.

    PubMed

    Nemethova, Martina; Bolcekova, Anna; Ilencikova, Denisa; Durovcikova, Darina; Hlinkova, Katarina; Hlavata, Anna; Kovacs, Laszlo; Kadasi, Ludevit; Zatkova, Andrea

    2013-09-01

    We performed a complex analysis of the neurofibromatosis type 1 (NF1) gene in Slovakia based on direct cDNA sequencing supplemented by multiple ligation dependent probe amplification (MLPA) analysis. All 108 patients had café-au-lait spots, 85% had axilary and/or inguinal freckling, 61% neurofibromas, 36% Lisch nodules of the iris and 31% optic pathway glioma, 5% suffered from typical skeletal disorders, and 51% of patients had family members with NF1. In 78 of the 86 (90.7%) index patients our analysis revealed the presence of NF1 mutations, 68 of which were small changes (87.2%), including 39 (50%) novel. Among the identified mutations the most prevalent were small deletions and insertions causing frameshift (42.3%), followed by nonsense (14.1%), missense (12.8%), and typical splicing (11.5%) mutations. Type 1 NF1 deletions and intragenic deletions/duplication were identified in five cases each (6.4%). Interestingly, in five other cases nontypical splicing variants were found, whose real effect on NF1 transcript would have remained undetected if using a DNA-based method alone, thus underlying the advantage of using the cDNA-based sequencing. We show that Slovak NF1 patients have a similar repertoire of NF1 germline mutations compared to other populations, with some prevalence of small deletions/insertions and a decreased proportion of nonsense mutations.

  16. Genetic interactions affecting human gene expression identified by variance association mapping

    PubMed Central

    Brown, Andrew Anand; Buil, Alfonso; Viñuela, Ana; Lappalainen, Tuuli; Zheng, Hou-Feng; Richards, J Brent; Small, Kerrin S; Spector, Timothy D; Dermitzakis, Emmanouil T; Durbin, Richard

    2014-01-01

    Non-additive interaction between genetic variants, or epistasis, is a possible explanation for the gap between heritability of complex traits and the variation explained by identified genetic loci. Interactions give rise to genotype dependent variance, and therefore the identification of variance quantitative trait loci can be an intermediate step to discover both epistasis and gene by environment effects (GxE). Using RNA-sequence data from lymphoblastoid cell lines (LCLs) from the TwinsUK cohort, we identify a candidate set of 508 variance associated SNPs. Exploiting the twin design we show that GxE plays a role in ∼70% of these associations. Further investigation of these loci reveals 57 epistatic interactions that replicated in a smaller dataset, explaining on average 4.3% of phenotypic variance. In 24 cases, more variance is explained by the interaction than their additive contributions. Using molecular phenotypes in this way may provide a route to uncovering genetic interactions underlying more complex traits. DOI: http://dx.doi.org/10.7554/eLife.01381.001 PMID:24771767

  17. RNAi screens identify CHD4 as an essential gene in breast cancer growth

    PubMed Central

    Cicalese, Angelo; Fornasari, Lorenzo; Furia, Laura; Riva, Laura; Carugo, Alessandro; Curigliano, Giuseppe; Criscitiello, Carmen; Pruneri, Giancarlo; Pelicci, Pier Giuseppe; Faretta, Mario; Bossi, Daniela; Lanfrancone, Luisa

    2016-01-01

    Epigenetic regulation plays an essential role in tumor development and epigenetic modifiers are considered optimal potential druggable candidates. In order to identify new breast cancer vulnerabilities and improve therapeutic chances for patients, we performed in vivo and in vitro shRNA screens in a human breast cancer cell model (MCF10DCIS.com cell line) using epigenetic libraries. Among the genes identified in our screening, we deeply investigated the role of Chromodomain Helicase DNA binding Protein 4 (CHD4) in breast cancer tumorigenesis. CHD4 silencing significantly reduced tumor growth in vivo and proliferation in vitro of MCF10DCIS.com cells. Similarly, in vivo breast cancer growth was decreased in a spontaneous mouse model of breast carcinoma (MMTV-NeuT system) and in metastatic patient-derived xenograft models. Conversely, no reduction in proliferative ability of non-transformed mammary epithelial cells (MCF10A) was detected. Moreover, we showed that CHD4 depletion arrests proliferation by inducing a G0/G1 block of cell cycle associated with up-regulation of CDKN1A (p21). These results highlight the relevance of genetic screens in the identification of tumor frailties and the role of CHD4 as a potential pharmacological target to inhibit breast cancer growth. PMID:27779108

  18. Expression analysis of the genes identified in GWAS of the postmortem brain tissues from patients with schizophrenia.

    PubMed

    Umeda-Yano, Satomi; Hashimoto, Ryota; Yamamori, Hidenaga; Weickert, Cynthia Shannon; Yasuda, Yuka; Ohi, Kazutaka; Fujimoto, Michiko; Ito, Akira; Takeda, Masatoshi

    2014-05-07

    Many gene expression studies have examined postmortem brain tissues of patients with schizophrenia. However, only a few expression studies of the genes identified in genome-wide association study (GWAS) have been published to date. We measured the expression levels of the genes identified in GWAS (ZNF804A, OPCML, RPGRIP1L, NRGN, and TCF4) of the postmortem brain tissues of patients with schizophrenia and controls from two separate sample sets (i.e., the Australian Tissue Resource Center and Stanley Medical Research Institute). We also determined whether the single-nucleotide polymorphisms (SNPs) identified in the GWAS were related to the gene expression changes in the prefrontal cortex. No difference was observed between the patients with schizophrenia and controls from the Australian Tissue Resource Center samples in the mRNA levels of ZNF804A, OPCML, RPGRIP1L, NRGN, or TCF4. The lack of mRNA change for these five transcripts was also found in the brain samples from the Stanley Medical Research Institute. In addition, no relationship between the schizophrenia-associated SNPs identified in the GWAS and the corresponding gene expression was observed in either sample set. Our results suggest that major changes in the transcript levels of the five candidate genes identified in the GWAS may not occur in adult patients with schizophrenia. The lack of linkage between the risk gene polymorphisms and the expression levels of their major transcripts suggests that the control of pan mRNA levels may not be a prominent mechanism by which the genes identified in the GWAS contribute to the pathophysiology of schizophrenia. Further studies are needed to examine how the genes identified in the GWAS contribute to the pathophysiology of schizophrenia.

  19. Discrimination of germline V genes at different sequencing lengths and mutational burdens: A new tool for identifying and evaluating the reliability of V gene assignment.

    PubMed

    Zhang, Bochao; Meng, Wenzhao; Prak, Eline T Luning; Hershberg, Uri

    2015-12-01

    Immune repertoires are collections of lymphocytes that express diverse antigen receptor gene rearrangements consisting of Variable (V), (Diversity (D) in the case of heavy chains) and Joining (J) gene segments. Clonally related cells typically share the same germline gene segments and have highly similar junctional sequences within their third complementarity determining regions. Identifying clonal relatedness of sequences is a key step in the analysis of immune repertoires. The V gene is the most important for clone identification because it has the longest sequence and the greatest number of sequence variants. However, accurate identification of a clone's germline V gene source is challenging because there is a high degree of similarity between different germline V genes. This difficulty is compounded in antibodies, which can undergo somatic hypermutation. Furthermore, high-throughput sequencing experiments often generate partial sequences and have significant error rates. To address these issues, we describe a novel method to estimate which germline V genes (or alleles) cannot be discriminated under different conditions (read lengths, sequencing errors or somatic hypermutation frequencies). Starting with any set of germline V genes, this method measures their similarity using different sequencing lengths and calculates their likelihood of unambiguous assignment under different levels of mutation. Hence, one can identify, under different experimental and biological conditions, the germline V genes (or alleles) that cannot be uniquely identified and bundle them together into groups of specific V genes with highly similar sequences.

  20. A framework to identify gene expression profiles in a model of inflammation induced by lipopolysaccharide after treatment with thalidomide

    PubMed Central

    2012-01-01

    Background Thalidomide is an anti-inflammatory and anti-angiogenic drug currently used for the treatment of several diseases, including erythema nodosum leprosum, which occurs in patients with lepromatous leprosy. In this research, we use DNA microarray analysis to identify the impact of thalidomide on gene expression responses in human cells after lipopolysaccharide (LPS) stimulation. We employed a two-stage framework. Initially, we identified 1584 altered genes in response to LPS. Modulation of this set of genes was then analyzed in the LPS stimulated cells treated with thalidomide. Results We identified 64 genes with altered expression induced by thalidomide using the rank product method. In addition, the lists of up-regulated and down-regulated genes were investigated by means of bioinformatics functional analysis, which allowed for the identification of biological processes affected by thalidomide. Confirmatory analysis was done in five of the identified genes using real time PCR. Conclusions The results showed some genes that can further our understanding of the biological mechanisms in the action of thalidomide. Of the five genes evaluated with real time PCR, three were down regulated and two were up regulated confirming the initial results of the microarray analysis. PMID:22695124

  1. A new point mutation in the ND1 mitochondrial gene identified in a type II diabetic patient

    SciTech Connect

    Kalinin, V.N.; Schmidt, W.; Olek, K.

    1995-08-01

    A novel mutation in a mitochondrial gene was identified in a patient with type II diabetes mellitus. G-to-A transition was localized at the nt3316 position of gene ND1 and resulted in alanine threonine replacement at position 4 of mitochondrial NAD-H-dehydrogenase. 6 refs., 2 figs.

  2. An in vivo hypoxia metagene identifies the novel hypoxia inducible factor target gene SLCO1B3.

    PubMed

    Ramachandran, Anassuya; Betts, Guy; Bhana, Sara; Helme, Gemma; Blick, Christopher; Moller-Levet, Carla; Saunders, Emma; Valentine, Helen; Pepper, Stuart; Miller, Crispin J; Buffa, Francesca; Harris, Adrian L; West, Catharine M L

    2013-05-01

    A hypoxia-associated gene signature (metagene) was previously derived via in vivo data-mining. In this study, we aimed to investigate whether this approach could identify novel hypoxia regulated genes. From an initial list of nine genes, three were selected for further study (BCAR1, IGF2BP2 and SLCO1B3). Ten cell lines were exposed to hypoxia and interrogated for the expression of the three genes. All three genes were hypoxia inducible in at least one of the 10 cell lines with SLCO1B3 induced in seven. SLCO1B3 was studied further using chromatin immunoprecipitation and luciferase assays to investigate hypoxia inducible factor (HIF) dependent transcription. Two functional HIF response elements were identified within intron 1 of the gene. The functional importance of SLCO1B3 was studied by gene knockdown experiments followed by cell growth assays, flow cytometry and Western blotting. SLCO1B3 knockdown reduced cell size and 3-dimensional spheroid volume, which was associated with decreased activation of the mammalian target of rapamycin (mTOR) pathway. Finally, Oncomine analysis revealed that head and neck and colorectal tumours had higher levels of SLCO1B3 compared to normal tissue. Thus, the knowledge based approach for deriving gene signatures can identify novel biologically relevant genes.

  3. Use of Redundant Exclusion PCR To Identify a Novel Bacillus thuringiensis Cry8 Toxin Gene from Pooled Genomic DNA

    PubMed Central

    Zhang, Fengjiao; Shu, Changlong; Crickmore, Neil; Li, Yanqiu; Song, Fuping; Liu, Chunqin

    2016-01-01

    ABSTRACT With the aim of optimizing the cloning of novel genes from a genomic pool containing many previously identified homologous genes, we designed a redundant exclusion PCR (RE-PCR) technique. In RE-PCR, a pair of generic amplification primers are combined with additional primers that are designed to specifically bind to redundant, unwanted genes that are a subset of those copied by the amplification primers. During RE-PCR, the specific primer blocks amplification of the full-length redundant gene. Using this method, we managed to clone a number of cry8 or cry9 toxin genes from a pool of Bacillus thuringiensis genomic DNA while excluding amplicons for cry9Da, cry9Ea, and cry9Eb. The method proved to be very efficient at increasing the number of rare genes in the resulting library. One such rare (and novel) cry8-like gene was expressed, and the encoded toxin was shown to be toxic to Anomala corpulenta. IMPORTANCE Protein toxins from the bacterium Bacillus thuringiensis are being increasingly used as biopesticides against a wide range of insect pests, yet the search for new or improved toxins is becoming more difficult, as traditional methods for gene discovery routinely isolate previously identified clones. This paper describes an approach that we have developed to increase the success rate for novel toxin gene identification through reducing or eliminating the cloning of previously characterized genes. PMID:27084017

  4. Genomic analysis of human lung fibroblasts exposed to vanadium pentoxide to identify candidate genes for occupational bronchitis

    PubMed Central

    Ingram, Jennifer L; Antao-Menezes, Aurita; Turpin, Elizabeth A; Wallace, Duncan G; Mangum, James B; Pluta, Linda J; Thomas, Russell S; Bonner, James C

    2007-01-01

    Background Exposure to vanadium pentoxide (V2O5) is a cause of occupational bronchitis. We evaluated gene expression profiles in cultured human lung fibroblasts exposed to V2O5 in vitro in order to identify candidate genes that could play a role in inflammation, fibrosis, and repair during the pathogenesis of V2O5-induced bronchitis. Methods Normal human lung fibroblasts were exposed to V2O5 in a time course experiment. Gene expression was measured at various time points over a 24 hr period using the Affymetrix Human Genome U133A 2.0 Array. Selected genes that were significantly changed in the microarray experiment were validated by RT-PCR. Results V2O5 altered more than 1,400 genes, of which ~300 were induced while >1,100 genes were suppressed. Gene ontology categories (GO) categories unique to induced genes included inflammatory response and immune response, while GO catogories unique to suppressed genes included ubiquitin cycle and cell cycle. A dozen genes were validated by RT-PCR, including growth factors (HBEGF, VEGF, CTGF), chemokines (IL8, CXCL9, CXCL10), oxidative stress response genes (SOD2, PIPOX, OXR1), and DNA-binding proteins (GAS1, STAT1). Conclusion Our study identified a variety of genes that could play pivotal roles in inflammation, fibrosis and repair during V2O5-induced bronchitis. The induction of genes that mediate inflammation and immune responses, as well as suppression of genes involved in growth arrest appear to be important to the lung fibrotic reaction to V2O5. PMID:17459161

  5. ALK1 signalling analysis identifies angiogenesis related genes and reveals disparity between TGF-β and constitutively active receptor induced gene expression

    PubMed Central

    Lux, Andreas; Salway, Fiona; Dressman, Holly K; Kröner-Lux, Gabriele; Hafner, Mathias; Day, Philip JR; Marchuk, Douglas A; Garland, John

    2006-01-01

    Background TGF-β1 is an important angiogenic factor involved in the different aspects of angiogenesis and vessel maintenance. TGF-β signalling is mediated by the TβRII/ALK5 receptor complex activating the Smad2/Smad3 pathway. In endothelial cells TGF-β utilizes a second type I receptor, ALK1, activating the Smad1/Smad5 pathway. Consequently, a perturbance of ALK1, ALK5 or TβRII activity leads to vascular defects. Mutations in ALK1 cause the vascular disorder hereditary hemorrhagic telangiectasia (HHT). Methods The identification of ALK1 and not ALK5 regulated genes in endothelial cells, might help to better understand the development of HHT. Therefore, the human microvascular endothelial cell line HMEC-1 was infected with a recombinant constitutively active ALK1 adenovirus, and gene expression was studied by using gene arrays and quantitative real-time PCR analysis. Results After 24 hours, 34 genes were identified to be up-regulated by ALK1 signalling. Analysing ALK1 regulated gene expression after 4 hours revealed 13 genes to be up- and 2 to be down-regulated. Several of these genes, including IL-8, ET-1, ID1, HPTPη and TEAD4 are reported to be involved in angiogenesis. Evaluation of ALK1 regulated gene expression in different human endothelial cell types was not in complete agreement. Further on, disparity between constitutively active ALK1 and TGF-β1 induced gene expression in HMEC-1 cells and primary HUVECs was observed. Conclusion Gene array analysis identified 49 genes to be regulated by ALK1 signalling and at least 14 genes are reported to be involved in angiogenesis. There was substantial agreement between the gene array and quantitative real-time PCR data. The angiogenesis related genes might be potential HHT modifier genes. In addition, the results suggest endothelial cell type specific ALK1 and TGF-β signalling. PMID:16594992

  6. Exome sequencing coupled with mRNA analysis identifies NDUFAF6 as a Leigh gene.

    PubMed

    Bianciardi, Laura; Imperatore, Valentina; Fernandez-Vizarra, Erika; Lopomo, Angela; Falabella, Micol; Furini, Simone; Galluzzi, Paolo; Grosso, Salvatore; Zeviani, Massimo; Renieri, Alessandra; Mari, Francesca; Frullanti, Elisa

    2016-11-01

    We report here the case of a young male who started to show verbal fluency disturbance, clumsiness and gait anomalies at the age of 3.5years and presented bilateral striatal necrosis. Clinically, the diagnosis was compatible with Leigh syndrome but the underlying molecular defect remained elusive even after exome analysis using autosomal/X-linked recessive or de novo models. Dosage of respiratory chain activity on fibroblasts, but not in muscle, underlined a deficit in complex I. Re-analysis of heterozygous probably pathogenic variants, inherited from one healthy parent, identified the p.Ala178Pro in NDUFAF6, a complex I assembly factor. RNA analysis showed an almost mono-allelic expression of the mutated allele in blood and fibroblasts and puromycin treatment on cultured fibroblasts did not lead to the rescue of the maternal allele expression, not supporting the involvement of nonsense-mediated RNA decay mechanism. Complementation assay underlined a recovery of complex I activity after transduction of the wild-type gene. Since the second mutation was not detected and promoter methylation analysis resulted normal, we hypothesized a non-exonic event in the maternal allele affecting a regulatory element that, in conjunction with the paternal mutation, leads to the autosomal recessive disorder and the different allele expression in various tissues. This paper confirms NDUFAF6 as a genuine morbid gene and proposes the coupling of exome sequencing with mRNA analysis as a method useful for enhancing the exome sequencing detection rate when the simple application of classical inheritance models fails.

  7. The phenotypic patterns of essential hypertension are the key to identifying "high blood pressure" genes.

    PubMed

    Korner, P I

    2010-01-01

    hypertension provide targets for identifying high BP genes. Reading the genome from the phenotype will require new approaches, such as those used in developmental genetics. In addition, transgenic technology may help verify hypotheses and examine whether an observed effect is through single or multiple mechanisms. To obtain answers will require substantial collaborative efforts between physiologists and geneticists.

  8. Transcriptome Sequencing of Chemically Induced Aquilaria sinensis to Identify Genes Related to Agarwood Formation

    PubMed Central

    Ye, Wei; Wu, Hongqing; He, Xin; Wang, Lei; Zhang, Weimin; Li, Haohua; Fan, Yunfei; Tan, Guohui; Liu, Taomei; Gao, Xiaoxia

    2016-01-01

    Background Agarwood is a traditional Chinese medicine used as a clinical sedative, carminative, and antiemetic drug. Agarwood is formed in Aquilaria sinensis when A. sinensis trees are threatened by external physical, chemical injury or endophytic fungal irritation. However, the mechanism of agarwood formation via chemical induction remains unclear. In this study, we characterized the transcriptome of different parts of a chemically induced A. sinensis trunk sample with agarwood. The Illumina sequencing platform was used to identify the genes involved in agarwood formation. Methodology/Principal Findings A five-year-old Aquilaria sinensis treated by formic acid was selected. The white wood part (B1 sample), the transition part between agarwood and white wood (W2 sample), the agarwood part (J3 sample), and the rotten wood part (F5 sample) were collected for transcriptome sequencing. Accordingly, 54,685,634 clean reads, which were assembled into 83,467 unigenes, were obtained with a Q20 value of 97.5%. A total of 50,565 unigenes were annotated using the Nr, Nt, SWISS-PROT, KEGG, COG, and GO databases. In particular, 171,331,352 unigenes were annotated by various pathways, including the sesquiterpenoid (ko00909) and plant–pathogen interaction (ko03040) pathways. These pathways were related to sesquiterpenoid biosynthesis and defensive responses to chemical stimulation. Conclusions/Significance The transcriptome data of the different parts of the chemically induced A. sinensis trunk provide a rich source of materials for discovering and identifying the genes involved in sesquiterpenoid production and in defensive responses to chemical stimulation. This study is the first to use de novo sequencing and transcriptome assembly for different parts of chemically induced A. sinensis. Results demonstrate that the sesquiterpenoid biosynthesis pathway and WRKY transcription factor play important roles in agarwood formation via chemical induction. The comparative analysis of

  9. Genome-wide association study identifies PERLD1 as asthma candidate gene

    PubMed Central

    2011-01-01

    Background Recent genome-wide association studies (GWAS) for asthma have been successful in identifying novel associations which have been well replicated. The aim of this study is to identify the genetic variants that influence predisposition towards asthma in an ethnic Chinese population in Singapore using a GWAS approach. Methods A two-stage GWAS was performed in case samples with allergic asthma, and in control samples without asthma and atopy. In the discovery stage, 490 case and 490 control samples were analysed by pooled genotyping. Significant associations from the first stage were evaluated in a replication cohort of 521 case and 524 control samples in the second stage. The same 980 samples used in the discovery phase were also individually genotyped for purposes of a combined analysis. An additional 1445 non-asthmatic atopic control samples were also genotyped. Results 19 promising SNPs which passed our genome-wide P value threshold of 5.52 × 10-8 were individually genotyped. In the combined analysis of 1011 case and 1014 control samples, SNP rs2941504 in PERLD1 on chromosome 17q12 was found to be significantly associated with asthma at the genotypic level (P = 1.48 × 10-6, ORAG = 0.526 (0.369-0.700), ORAA = 0.480 (0.361-0.639)) and at the allelic level (P = 9.56 × 10-6, OR = 0.745 (0.654-0.848)). These findings were found to be replicated in 3 other asthma GWAS studies, thus validating our own results. Analysis against the atopy control samples suggested that the SNP was associated with allergic asthma and not to either the asthma or allergy components. Genotyping of additional SNPs in 100 kb flanking rs2941504 further confirmed that the association was indeed to PERLD1. PERLD1 is involved in the modification of the glycosylphosphatidylinositol anchors for cell surface markers such as CD48 and CD59 which are known to play multiple roles in T-cell activation and proliferation. Conclusions These findings reveal the association of a PERLD1 as a novel

  10. Gene expression profiling to identify the toxicities and potentially relevant human disease outcomes associated with environmental heavy metal exposure.

    PubMed

    Korashy, Hesham M; Attafi, Ibraheem M; Famulski, Konrad S; Bakheet, Saleh A; Hafez, Mohammed M; Alsaad, Abdulaziz M S; Al-Ghadeer, Abdul Rahman M

    2017-02-01

    Heavy metals are the most commonly encountered toxic substances that increase susceptibility to various diseases after prolonged exposure. We have previously shown that healthy volunteers living near a mining area had significant contamination with heavy metals associated with significant changes in the expression of some detoxifying genes, xenobiotic metabolizing enzymes, and DNA repair genes. However, alterations of most of the molecular target genes associated with diseases are still unknown. Thus, the aims of this study were to (a) evaluate the gene expression profile and (b) identify the toxicities and potentially relevant human disease outcomes associated with long-term human exposure to environmental heavy metals in mining area using microarray analysis. For this purpose, 40 healthy male volunteers who were residents of a heavy metal-polluted area (Mahd Al-Dhahab city, Saudi Arabia) and 20 healthy male volunteers who were residents of a non-heavy metal-polluted area were included in the study. Total RNA was isolated from whole blood using PAXgene Blood RNA tubes and then reversed transcribed and hybridized to the gene array using the Affymetrix U219 GeneChip. Microarray analysis showed about 2129 genes were identified and differentially altered, among which a shared set of 425 genes was differentially expressed in the heavy metal-exposed groups. Ingenuity pathway analysis revealed that the most altered gene-regulated diseases in heavy metal-exposed groups included hematological and developmental disorders and mostly renal and urological diseases. Quantitative real-time polymerase chain reaction closely matched the microarray data for some genes tested. Importantly, changes in gene-related diseases were attributed to alterations in the genes encoded for protein synthesis. Renal and urological diseases were the diseases that were most frequently associated with the heavy metal-exposed group. Therefore, there is a need for further studies to validate these

  11. Molecular profiling of low grade serous ovarian tumours identifies novel candidate driver genes.

    PubMed

    Hunter, Sally M; Anglesio, Michael S; Ryland, Georgina L; Sharma, Raghwa; Chiew, Yoke-Eng; Rowley, Simone M; Doyle, Maria A; Li, Jason; Gilks, C Blake; Moss, Phillip; Allan, Prue E; Stephens, Andrew N; Huntsman, David G; deFazio, Anna; Bowtell, David D; Gorringe, Kylie L; Campbell, Ian G

    2015-11-10

    Low grade serous ovarian tumours are a rare and under-characterised histological subtype of epithelial ovarian tumours, with little known of the molecular drivers and facilitators of tumorigenesis beyond classic oncogenic RAS/RAF mutations. With a move towards targeted therapies due to the chemoresistant nature of this subtype, it is pertinent to more fully characterise the genetic events driving this tumour type, some of which may influence response to therapy and/or development of drug resistance. We performed genome-wide high-resolution genomic copy number analysis (Affymetrix SNP6.0) and mutation hotspot screening (KRAS, BRAF, NRAS, HRAS, ERBB2 and TP53) to compare a large cohort of ovarian serous borderline tumours (SBTs, n = 57) with low grade serous carcinomas (LGSCs, n = 19). Whole exome sequencing was performed for 13 SBTs, nine LGSCs and one mixed low/high grade carcinoma. Copy number aberrations were detected in 61% (35/57) of SBTs, compared to 100% (19/19) of LGSCs. Oncogenic RAS/RAF/ERBB2 mutations were detected in 82.5% (47/57) of SBTs compared to 63% (12/19) of LGSCs, with NRAS mutations detected only in LGSC. Some copy number aberrations appeared to be enriched in LGSC, most significantly loss of 9p and homozygous deletions of the CDKN2A/2B locus. Exome sequencing identified BRAF, KRAS, NRAS, USP9X and EIF1AX as the most frequently mutated genes. We have identified markers of progression from borderline to LGSC and novel drivers of LGSC. USP9X and EIF1AX have both been linked to regulation of mTOR, suggesting that mTOR inhibitors may be a key companion treatment for targeted therapy trials of MEK and RAF inhibitors.

  12. Molecular profiling of low grade serous ovarian tumours identifies novel candidate driver genes

    PubMed Central

    Hunter, Sally M.; Anglesio, Michael S.; Ryland, Georgina L.; Sharma, Raghwa; Chiew, Yoke-Eng; Rowley, Simone M.; Doyle, Maria A.; Li, Jason; Gilks, C. Blake; Moss, Phillip; Allan, Prue E.; Stephens, Andrew N.; Huntsman, David G.; deFazio, Anna; Bowtell, David D.

    2015-01-01

    Low grade serous ovarian tumours are a rare and under-characterised histological subtype of epithelial ovarian tumours, with little known of the molecular drivers and facilitators of tumorigenesis beyond classic oncogenic RAS/RAF mutations. With a move towards targeted therapies due to the chemoresistant nature of this subtype, it is pertinent to more fully characterise the genetic events driving this tumour type, some of which may influence response to therapy and/or development of drug resistance. We performed genome-wide high-resolution genomic copy number analysis (Affymetrix SNP6.0) and mutation hotspot screening (KRAS, BRAF, NRAS, HRAS, ERBB2 and TP53) to compare a large cohort of ovarian serous borderline tumours (SBTs, n = 57) with low grade serous carcinomas (LGSCs, n = 19). Whole exome sequencing was performed for 13 SBTs, nine LGSCs and one mixed low/high grade carcinoma. Copy number aberrations were detected in 61% (35/57) of SBTs, compared to 100% (19/19) of LGSCs. Oncogenic RAS/RAF/ERBB2 mutations were detected in 82.5% (47/57) of SBTs compared to 63% (12/19) of LGSCs, with NRAS mutations detected only in LGSC. Some copy number aberrations appeared to be enriched in LGSC, most significantly loss of 9p and homozygous deletions of the CDKN2A/2B locus. Exome sequencing identified BRAF, KRAS, NRAS, USP9X and EIF1AX as the most frequently mutated genes. We have identified markers of progression from borderline to LGSC and novel drivers of LGSC. USP9X and EIF1AX have both been linked to regulation of mTOR, suggesting that mTOR inhibitors may be a key companion treatment for targeted therapy trials of MEK and RAF inhibitors. PMID:26506417

  13. Comparison of Expression Profiles in Ovarian Epithelium In Vivo and Ovarian Cancer Identifies Novel Candidate Genes Involved in Disease Pathogenesis

    PubMed Central

    Emmanuel, Catherine; Gava, Natalie; Kennedy, Catherine; Balleine, Rosemary L.; Sharma, Raghwa; Wain, Gerard; Brand, Alison; Hogg, Russell; Etemadmoghadam, Dariush; George, Joshy; Birrer, Michael J.; Clarke, Christine L.; Chenevix-Trench, Georgia; Bowtell, David D. L.; Harnett, Paul R.; deFazio, Anna

    2011-01-01

    Molecular events leading to epithelial ovarian cancer are poorly understood but ovulatory hormones and a high number of life-time ovulations with concomitant proliferation, apoptosis, and inflammation, increases risk. We identified genes that are regulated during the estrous cycle in murine ovarian surface epithelium and analysed these profiles to identify genes dysregulated in human ovarian cancer, using publically available datasets. We identified 338 genes that are regulated in murine ovarian surface epithelium during the estrous cycle and dysregulated in ovarian cancer. Six of seven candidates selected for immunohistochemical validation were expressed in serous ovarian cancer, inclusion cysts, ovarian surface epithelium and in fallopian tube epithelium. Most were overexpressed in ovarian cancer compared with ovarian surface epithelium and/or inclusion cysts (EpCAM, EZH2, BIRC5) although BIRC5 and EZH2 were expressed as highly in fallopian tube epithelium as in ovarian cancer. We prioritised the 338 genes for those likely to be important for ovarian cancer development by in silico analyses of copy number aberration and mutation using publically available datasets and identified genes with established roles in ovarian cancer as well as novel genes for which we have evidence for involvement in ovarian cancer. Chromosome segregation emerged as an important process in which genes from our list of 338 were over-represented including two (BUB1, NCAPD2) for which there is evidence of amplification and mutation. NUAK2, upregulated in ovarian surface epithelium in proestrus and predicted to have a driver mutation in ovarian cancer, was examined in a larger cohort of serous ovarian cancer where patients with lower NUAK2 expression had shorter overall survival. In conclusion, defining genes that are activated in normal epithelium in the course of ovulation that are also dysregulated in cancer has identified a number of pathways and novel candidate genes that may contribute

  14. Analysis of CATMA transcriptome data identifies hundreds of novel functional genes and improves gene models in the Arabidopsis genome

    PubMed Central

    Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Brunaud, Véronique; Taconnat, Ludivine; Bitton, Frédérique; Balzergue, Sandrine; Jullien, Pauline E; Ingouff, Mathieu; Thareau, Vincent; Schiex, Thomas; Lecharny, Alain; Renou, Jean-Pierre

    2007-01-01

    Background Since the finishing of the sequencing of the Arabidopsis thaliana genome, the Arabidopsis community and the annotator centers have been working on the improvement of gene annotation at the structural and functional levels. In this context, we have used the large CATMA resource on the Arabidopsis transcriptome to search for genes missed by different annotation processes. Probes on the CATMA microarrays are specific gene sequence tags (GSTs) based on the CDS models predicted by the Eugene software. Among the 24 576 CATMA v2 GSTs, 677 are in regions considered as intergenic by the TAIR annotation. We analyzed the cognate transcriptome data in the CATMA resource and carried out data-mining to characterize novel genes and improve gene models. Results The statistical analysis of the results of more than 500 hybridized samples distributed among 12 organs provides an experimental validation for 465 novel genes. The hybridization evidence was confirmed by RT-PCR approaches for 88% of the 465 novel genes. Comparisons with the current annotation show that these novel genes often encode small proteins, with an average size of 137 aa. Our approach has also led to the improvement of pre-existing gene models through both the extension of 16 CDS and the identification of 13 gene models erroneously constituted of two merged CDS. Conclusion This work is a noticeable step forward in the improvement of the Arabidopsis genome annotation. We increased the number of Arabidopsis validated genes by 465 novel transcribed genes to which we associated several functional annotations such as expression profiles, sequence conservation in plants, cognate transcripts and protein motifs. PMID:17980019

  15. Connectivity mapping using a combined gene signature from multiple colorectal cancer datasets identified candidate drugs including existing chemotherapies

    PubMed Central

    2015-01-01

    Background While the discovery of new drugs is a complex, lengthy and costly process, identifying new uses for existing drugs is a cost-effective approach to therapeutic discovery. Connectivity mapping integrates gene expression profiling with advanced algorithms to connect genes, diseases and small molecule compounds and has been applied in a large number of studies to identify potential drugs, particularly to facilitate drug repurposing. Colorectal cancer (CRC) is a commonly diagnosed cancer with high mortality rates, presenting a worldwide health problem. With the advancement of high throughput omics technologies, a number of large scale gene expression profiling studies have been conducted on CRCs, providing multiple datasets in gene expression data repositories. In this work, we systematically apply gene expression connectivity mapping to multiple CRC datasets to identify candidate therapeutics to this disease. Results We developed a robust method to compile a combined gene signature for colorectal cancer across multiple datasets. Connectivity mapping analysis with this signature of 148 genes identified 10 candidate compounds, including irinotecan and etoposide, which are chemotherapy drugs currently used to treat CRCs. These results indicate that we have discovered high quality connections between the CRC disease state and the candidate compounds, and that the gene signature we created may be used as a potential therapeutic target in treating the disease. The method we proposed is highly effective in generating quality gene signature through multiple datasets; the publication of the combined CRC gene signature and the list of candidate compounds from this work will benefit both cancer and systems biology research communities for further development and investigations. PMID:26356760

  16. Identifying arsenic trioxide (ATO) functions in leukemia cells by using time series gene expression profiles.

    PubMed

    Yang, Hong; Lin, Shan; Cui, Jingru

    2014-02-10

    Arsenic trioxide (ATO) is presently the most active single agent in the treatment of acute promyelocytic leukemia (APL). In order to explore the molecular mechanism of ATO in leukemia cells with time series, we adopted bioinformatics strategy to analyze expression changing patterns and changes in transcription regulation modules of time series genes filtered from Gene Expression Omnibus database (GSE24946). We totally screened out 1847 time series genes for subsequent analysis. The KEGG (Kyoto encyclopedia of genes and genomes) pathways enrichment analysis of these genes showed that oxidative phosphorylation and ribosome were the top 2 significantly enriched pathways. STEM software was employed to compare changing patterns of gene expression with assigned 50 expression patterns. We screened out 7 significantly enriched patterns and 4 tendency charts of time series genes. The result of Gene Ontology showed that functions of times series genes mainly distributed in profiles 41, 40, 39 and 38. Seven genes with positive regulation of cell adhesion function were enriched in profile 40, and presented the same first increased model then decreased model as profile 40. The transcription module analysis showed that they mainly involved in oxidative phosphorylation pathway and ribosome pathway. Overall, our data summarized the gene expression changes in ATO treated K562-r cell lines with time and suggested that time series genes mainly regulated cell adhesive. Furthermore, our result may provide theoretical basis of molecular biology in treating acute promyelocytic leukemia.

  17. Marker genes identify three somatic cell types in the fetal mouse ovary.

    PubMed

    Rastetter, Raphael H; Bernard, Pascal; Palmer, James S; Chassot, Anne-Amandine; Chen, Huijun; Western, Patrick S; Ramsay, Robert G; Chaboissier, Marie-Christine; Wilhelm, Dagmar

    2014-10-15

    The two main functions of the ovary are the production of oocytes, which allows the continuation of the species, and secretion of female sex hormones, which control many aspects of female development and physiology. Normal development of the ovaries during embryogenesis is critical for their function and the health of the individual in later life. Although the adult ovary has been investigated in great detail, we are only starting to understand the cellular and molecular biology of early ovarian development. Here we show that the adult stem cell marker Lgr5 is expressed in the cortical region of the fetal ovary and this expression is mutually exclusive to FOXL2. Strikingly, a third somatic cell population can be identified, marked by the expression of NR2F2, which is expressed in LGR5- and FOXL2 double-negative ovarian somatic cells. Together, these three marker genes label distinct ovarian somatic cell types. Using lineage tracing in mice, we show that Lgr5-positive cells give rise to adult cortical granulosa cells, which form the follicles of the definitive reserve. Moreover, LGR5 is required for correct timing of germ cell differentiation as evidenced by a delay of entry into meiosis in Lgr5 loss-of-function mutants, demonstrating a key role for LGR5 in the differentiation of pre-granulosa cells, which ensure the differentiation of oogonia, the formation of the definitive follicle reserve, and long-term female fertility.

  18. Comparative Transcriptome Analysis Identifies Candidate Genes Related to Skin Color Differentiation in Red Tilapia

    PubMed Central

    Zhu, Wenbin; Wang, Lanmei; Dong, Zaijie; Chen, Xingting; Song, Feibiao; Liu, Nian; Yang, Hui; Fu, Jianjun

    2016-01-01

    Red tilapia is becoming more popular for aquaculture production in China in recent years. However, the pigmentation differentiation in genetic breeding is the main problem limiting its development of commercial red tilapia culture and the genetic basis of skin color variation is still unknown. In this study, we conducted Illumina sequencing of transcriptome on three color variety red tilapia. A total of 224,895,758 reads were generated, resulting in 160,762 assembled contigs that were used as reference contigs. The contigs of red tilapia transcriptome had hits in the range of 53.4% to 86.7% of the unique proteins of zebrafish, fugu, medaka, three-spined stickleback and tilapia. And 44,723 contigs containing 77,423 simple sequence repeats (SSRs) were identified, with 16,646 contigs containing more than one SSR. Three skin transcriptomes were compared pairwise and the results revealed that there were 148 common significantly differentially expressed unigenes and several key genes related to pigment synthesis, i.e. tyr, tyrp1, silv, sox10, slc24a5, cbs and slc7a11, were included. The results will facilitate understanding the molecular mechanisms of skin pigmentation differentiation in red tilapia and accelerate the molecular selection of the specific strain with consistent skin colors. PMID:27511178

  19. A Fluorescent Probe for Detecting Mycobacterium tuberculosis and Identifying Genes Critical for Cell Entry

    PubMed Central

    Yang, Dong; Ding, Feng; Mitachi, Katsuhiko; Kurosu, Michio; Lee, Richard E.; Kong, Ying

    2016-01-01

    The conventional method for quantitating Mycobacterium tuberculosis (Mtb) in vitro and in vivo relies on bacterial colony forming unit (CFU) enumeration on agar plates. Due to the slow growth rate of Mtb, it takes 3–6 weeks to observe visible colonies on agar plates. Imaging technologies that are capable of quickly quantitating both active and dormant tubercle bacilli in vitro and in vivo would accelerate research toward the development of anti-TB chemotherapies and vaccines. We have developed a fluorescent probe that can directly label the Mtb cell wall components. The fluorescent probe, designated as DLF-1, has a strong affinity to the D-Ala-D-Ala unit of the late peptidoglycan intermediates in the bacterial cell wall. We demonstrate that DLF-1 is capable of detecting Mtb in both the actively replicating and dormant states in vitro at 100 nM without inhibiting bacterial growth. The DLF-1 fluorescence signal correlated well with CFU of the labeled bacteria (R2 = 1 and 0.99 for actively replicating and dormant Mtb, respectively). DLF-1 can also quantitate labeled Mtb inside of cells. The utility of DLF-1 probe to quantitate Mtb was successfully applied to identify genes critical for cell invasion. In conclusion, this novel near infrared imaging probe provides a powerful new tool for enumerating Mtb with potential future use in bacterial virulence study. PMID:28066347

  20. Transcriptome analyses and virus induced gene silencing identify genes in the Rpp4-mediated Asian soybean rust resistance pathway

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Rpp4 (Resistance to Phakopsora pachyrhizi 4) confers resistance to P. pachyrhizi, the causal agent of Asian soybean rust (ASR). By combining expression profiling and virus induced gene silencing (VIGS), we are developing a genetic framework for Rpp4-mediated resistance. We measured gene expression i...

  1. Featured Article: Transcriptional landscape analysis identifies differently expressed genes involved in follicle-stimulating hormone induced postmenopausal osteoporosis.

    PubMed

    Maasalu, Katre; Laius, Ott; Zhytnik, Lidiia; Kõks, Sulev; Prans, Ele; Reimann, Ene; Märtson, Aare

    2017-01-01

    Osteoporosis is a disorder associated with bone tissue reorganization, bone mass, and mineral density. Osteoporosis can severely affect postmenopausal women, causing bone fragility and osteoporotic fractures. The aim of the current study was to compare blood mRNA profiles of postmenopausal women with and without osteoporosis, with the aim of finding different gene expressions and thus targets for future osteoporosis biomarker studies. Our study consisted of transcriptome analysis of whole blood serum from 12 elderly female osteoporotic patients and 12 non-osteoporotic elderly female controls. The transcriptome analysis was performed with RNA sequencing technology. For data analysis, the edgeR package of R Bioconductor was used. Two hundred and fourteen genes were expressed differently in osteoporotic compared with non-osteoporotic patients. Statistical analysis revealed 20 differently expressed genes with a false discovery rate of less than 1.47 × 10(-4) among osteoporotic patients. The expression of 10 genes were up-regulated and 10 down-regulated. Further statistical analysis identified a potential osteoporosis mRNA biomarker pattern consisting of six genes: CACNA1G, ALG13, SBK1, GGT7, MBNL3, and RIOK3. Functional ingenuity pathway analysis identified the strongest candidate genes with regard to potential involvement in a follicle-stimulating hormone activated network of increased osteoclast activity and hypogonadal bone loss. The differentially expressed genes identified in this study may contribute to future research of postmenopausal osteoporosis blood biomarkers.

  2. A genome–wide screen to identify genes controlling the rate of entry into mitosis in fission yeast

    PubMed Central

    Moris, Naomi; Nurse, Paul

    2016-01-01

    ABSTRACT We have carried out a haploinsufficiency (HI) screen in fission yeast using heterozygous deletion diploid mutants of a genome-wide set of cell cycle genes to identify genes encoding products whose level determines the rate of progression through the cell cycle. Cell size at division was used as a measure of advancement or delay of the G2-M transition of rod-shaped fission yeast cells. We found that 13 mutants were significantly longer or shorter (greater than 10%) than control cells at cell division. These included mutants of the cdc2, cdc25, wee1 and pom1 genes, which have previously been shown to play a role in the timing of entry into mitosis, and which validate this approach. Seven of these genes are involved in regulation of the G2-M transition, 5 for nuclear transport and one for nucleotide metabolism. In addition we identified 4 more genes that were 8–10% longer or shorter than the control that also had roles in regulation of the G2-M transition or in nuclear transport. The genes identified here are all conserved in human cells, suggesting that this dataset will be useful as a basis for further studies to identify rate-limiting steps for progression through the cell cycle in other eukaryotes. PMID:27736299

  3. A genome-wide screen to identify genes controlling the rate of entry into mitosis in fission yeast.

    PubMed

    Moris, Naomi; Shrivastava, Jaya; Jeffery, Linda; Li, Juan-Juan; Hayles, Jacqueline; Nurse, Paul

    2016-11-16

    We have carried out a haploinsufficiency (HI) screen in fission yeast using heterozygous deletion diploid mutants of a genome-wide set of cell cycle genes to identify genes encoding products whose level determines the rate of progression through the cell cycle. Cell size at division was used as a measure of advancement or delay of the G2-M transition of rod-shaped fission yeast cells. We found that 13 mutants were significantly longer or shorter (greater than 10%) than control cells at cell division. These included mutants of the cdc2, cdc25, wee1 and pom1 genes, which have previously been shown to play a role in the timing of entry into mitosis, and which validate this approach. Seven of these genes are involved in regulation of the G2-M transition, 5 for nuclear transport and one for nucleotide metabolism. In addition we identified 4 more genes that were 8-10% longer or shorter than the control that also had roles in regulation of the G2-M transition or in nuclear transport. The genes identified here are all conserved in human cells, suggesting that this dataset will be useful as a basis for further studies to identify rate-limiting steps for progression through the cell cycle in other eukaryotes.

  4. Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression.

    PubMed

    Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady

    2017-02-01

    Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.

  5. Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression

    PubMed Central

    Poole, William; Leinonen, Kalle; Shmulevich, Ilya

    2017-01-01

    Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390

  6. Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts

    PubMed Central

    Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni

    2013-01-01

    Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune

  7. Identifying Neisseria Species by Use of the 50S Ribosomal Protein L6 (rplF) Gene

    PubMed Central

    Bennett, Julia S.; Watkins, Eleanor R.; Jolley, Keith A.; Harrison, Odile B.

    2014-01-01

    The comparison of 16S rRNA gene sequences is widely used to differentiate bacteria; however, this gene can lack resolution among closely related but distinct members of the same genus. This is a problem in clinical situations in those genera, such as Neisseria, where some species are associated with disease while others are not. Here, we identified and validated an alternative genetic target common to all Neisseria species which can be readily sequenced to provide an assay that rapidly and accurately discriminates among members of the genus. Ribosomal multilocus sequence typing (rMLST) using ribosomal protein genes has been shown to unambiguously identify these bacteria. The PubMLST Neisseria database (http://pubmlst.org/neisseria/) was queried to extract the 53 ribosomal protein gene sequences from 44 genomes from diverse species. Phylogenies reconstructed from these genes were examined, and a single 413-bp fragment of the 50S ribosomal protein L6 (rplF) gene was identified which produced a phylogeny that was congruent with the phylogeny reconstructed from concatenated ribosomal protein genes. Primers that enabled the amplification and direct sequencing of the rplF gene fragment were designed to validate the assay in vitro and in silico. Allele sequences were defined for the gene fragment, associated with particular species names, and stored o